Skip to content

kb22/Web-Scraping-using-Python

Repository files navigation

Web-Scraping-using-Python

A Jupyter notebook to scrape Wikipedia webpages using Python to create a dataset.

The complete project is detailed as a two part series:

  1. Part 1: Describes how web scraping can be used to fetch data from a website.
  2. Part 2: Describes how collected data can be cleaned before actual use.

NOTE: This project is for understanding how web scraping works on actual websites. If however, web scraping is needed on a website, proper permissions must be taken and terms and conditions must be followed.

About

This project scrapes Wikipedia for its articles using BeautifulSoup to create a dataset and then draws analysis on the collected data.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published
pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy