- Clone the repository
- Cd into the project directory
- Run
pip install -r .\requirements.txt
- Run
main.py
- Should take around an hour for 500k addresses. Go grab a coffee! ☕ Or some boba tea ♨️, that's pretty good too.
- There is a dataclass class in
api_wrapper.py
namedEntityParsedResults
. This class represents the result of a single API call to the address analysis endpoint. You can use this object to easily access the results of the API call without having to go through the documentation! - in
main.py
, each batch of API calls generates a list ofEntityParsedResults
. An empty functionhandle_results(results: list[EntityParsedResults])
is provided so that you can handle actions like saving the results to a dataframe, JSON, or SQL database. The implementation of this function is on you! - The
test_addresses.csv
is used for our benchmarking. Simply provide a CSV with a single row calledaddresses
as input and the script will handle the rest. Thetest_addresses.csv
is provided as an example. (ignore the ID row) - The main function automatically prints progress for you, with a basic ETA. No need to panic!