A web scraper built with Python and Beautiful Soup for collecting data of the Top 75 Masters in Engineering in the US, according to the USNews rankings.
The features collected are as follows:
- Rank
- School Name
- Location
- Application deadline for international applicants
- Application fee for international applicants
- Size of full-time faculty
- Size of full-time student body
- Full-time tuition
- Link for each school's engineering department
Feel free to take a look at the code and tweak it to fit your own needs. Beautiful Soup provides a very neat and intuitive way of parsing HTML files.
The scraper was built using Python 3, so make sure you have that installed on your machine before running the script.
- Press the green Clone or download button in this repository;
- Decide if you want to Clone the repo or download it as .zip (I'd suggest downloading as .zip if you're not familiar with Github);
- Open the project folder and install project requirements by running
pip install -r 'requirements.txt'
in your terminal; - Run
python script.py
in your terminal; - Voilà, the file
database.csv
should be up-to-date. You can open it with your favourite spreadsheet software as well.