LPSE Parser - as reflected by its name - is a data parser for LPSE websites
Developed for KPP Pratama Kotabumi.
- Python3.5.3+
- urllib
- requests
- ssl
- BeautifulSoup4
In vars.py
, several parameters need to be configured
govName
is the name of the website (without the trailing go.id or any subdomain)staticCode
is the last 3 numbers inKode
column of/eproc4/lelang
lowNum
andhighNum
is the lower and upper bounds for the iteration.
Use python main.py
to scrape several information from pengumumanlelang
page.
The result will be recorded in results
folder in csv format.
I am not responsible to any damage to the server due to the excessive requests sent from the script.
The project is under development where other pages will be scraped in addition to the existing ones. A database might also be needed to manage complex data structure, rather than the current mechanism of storing the data to csv format.
- Create other pages' scrapers
- Build a database system
- Build a reporting system (POD might be interesting)
- Auto update for new data