This repository contains scripts that scrape data from USOS for the PromoCHATor project in the Solvro ML section. The scripts collect data about university teachers, their scientific achievements, and abstracts of students' scientific papers. The project will be developed further as needed.
The project uses the following languages and technologies:
- Python 3.9.13
-
Clone the repository:
git clone https://github.com/Solvro/script-promochator-usos-scraper.git
-
Change directory:
cd script-promochator-usos-scraper
-
Create new virtual environment:
python -m venv <your_env_name>
-
Activate the environment (Windows; on Linux or macOS run source <your_env_name>/bin/activate instead):
./<your_env_name>/Scripts/activate
-
Install the required modules:
python -m pip install -r requirements.txt
-
Sign up for an API key:
https://apps.usos.pwr.edu.pl/developers/
-
Change directory:
cd usos-teachers-scraper
-
Create a config.json file. Then come up with a secret key. Finally, paste your Consumer Key and Consumer Secret:
{
  "secret_key": "<your-secret-key>",
  "consumer_key": "<generated-consumer-key>",
  "consumer_secret": "<generated-consumer-secret>"
}
-
Run the script:
python ./usos-teachers-scraper.py
-
Visit the USOS authorization page:
http://127.0.0.1:5000/start_oauth
-
Fetch the teachers' data by visiting:
http://127.0.0.1:5000/fetch_staff
-
Change directory:
cd usos-abstracts-scraper
-
Run the script:
python ./usos-abstracts-scraper.py
-
Enter the initial thesis ID and the final thesis ID.
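A missing or half-filled config.json is an easy mistake to make when setting up the teachers scraper. The snippet below is a minimal sketch of a pre-flight check, not part of the repository: the function name load_config and the rule that a value still wrapped in angle brackets counts as unfilled are assumptions made for illustration. Only the three key names come from the template above.

```python
import json
from pathlib import Path

# Key names taken from the config.json template in this README.
REQUIRED_KEYS = ("secret_key", "consumer_key", "consumer_secret")


def load_config(path="config.json"):
    """Load config.json and fail early if a required key is missing or unfilled.

    Hypothetical helper for illustration; the scraper itself may read the
    file differently.
    """
    config = json.loads(Path(path).read_text(encoding="utf-8"))
    if not isinstance(config, dict):
        raise ValueError("config.json must contain a JSON object")
    for key in REQUIRED_KEYS:
        value = config.get(key)
        # Assumption: empty strings and untouched "<...>" placeholders are unset.
        is_placeholder = (
            isinstance(value, str)
            and value.startswith("<")
            and value.endswith(">")
        )
        if not isinstance(value, str) or not value or is_placeholder:
            raise ValueError(f"config.json: '{key}' is missing or not filled in")
    return config
```

Calling load_config() from the usos-teachers-scraper directory before starting the script should raise a ValueError that names the first key still needing attention.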
Once you have assigned yourself to a new task, follow these steps:
git checkout main
Check out the main branch.
git pull origin main
Pull the current changes from the main branch.
git fetch
Get up to date with the remote branches.
git checkout -b type/task
Create a new task branch.
git add .
Stage all the changes you have made.
git commit -m "My changes description"
Commit the changes with a proper description.
git push origin type/task
Push your changes to the remote branch.
- On GitHub, open a Pull Request (PR) from your remote branch.
Warning
Do not push changes directly to the main branch.
For further information, read the Solvro handbook:
GitHub Solvro Handbook 🔥 - https://docs.google.com/document/d/1Sb5lYqYLnYuecS1Essn3YwietsbuLPCTsTuW0EMpG5o/edit?usp
This is our current team
- @LukiLenkiewicz - Tech Lead
- @Micz26 - ML Engineer
- @farqlia - ML Engineer
- @AgataGro - ML Engineer
- @dekompot - ML Engineer
- @b4rt4s - ML Engineer
- @Woleek - ML Engineer
- @WiktoriaFrost - ML Engineer
- @Barionetta - Project Manager