BeCode project on web scraping
- Author: Vitaly Shalem
- Repository: WikipediaScraper
- Type of challenge: Individual learning
- Duration: 3 days
This project focuses on webscraping techniques in Python.
Apply recently acquired knowledge on various webscraping techniques.
- Day 1:
- Creating clean environment
- Implementting various parts and converting them into functions
- Day 2:
- Putting together pieces of code
- Creating MVP
- Day 3:
- Test & Fix bugs, if any
- Add improvemnts
- Write documentation
- This is a Python based project
- Please install Python v3.6 or higher
- PLease istall the following Python libraries:
- Requests:
pip install requests
- Beautiful Soup:
pip install beautifulsoup4
- Requests:
- Clone WikipediaScraper repository to your machine:
git clone https://github.com/vitaly-shalem/WikipediaScraper.git <clone path>
- In Command Line / Git Bash navigate to the project folder
<clone path>
- Run
python main.py
- Check project folder for
leaders.json
file
- Developemnt and MVP implementation: 2 days
- Wrap-up (testing, fixing, code cleaning) and documentation: 1 day
- This project was done as part of the AI Boocamp at BeCode.org
- LinkedIn: Vitaly Shalem