This repository has been archived by the owner on Dec 18, 2023. It is now read-only.
We need to filter the iDigBio information to include only angiosperm data (among other things). Two web pages have fairly exhaustive lists of this information, along with some alternate names. Alternate names will be very helpful for older data because names change over time.
We need to write a script that downloads the pages, extracts the data from them, and writes it to a text file that can be read programmatically. I'm OK with either CSV or JSON-lines output. Links to the pages:
We need to be able to run this script (or Jupyter notebook) repeatedly. More accurately, all of us need to be able to run it to gather the information ourselves.
Note that I have had success with the BeautifulSoup4 Python library. I just used it to parse a checklist for lice hosts.
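To make the request concrete, here is a rough sketch of the download, extract, and write steps with JSON-lines output. It is only a starting point under several assumptions: the pages are assumed to present the names in an HTML table whose first row is a header, which has not been checked against the real pages, and every function and class name (`TableRows`, `extract_records`, `write_jsonl`, `scrape`) is hypothetical. It uses the standard library's `html.parser` rather than BeautifulSoup4 so that it runs without extra installs; BeautifulSoup4 would likely make the extraction step shorter.

```python
import json
from html.parser import HTMLParser
from urllib.request import urlopen


class TableRows(HTMLParser):
    """Collect the text of every <td>/<th> cell, grouped by <tr> row."""

    def __init__(self):
        super().__init__()
        self.rows = []     # finished rows, each a list of cell strings
        self._row = None   # cells of the row being read, or None
        self._cell = None  # text fragments of the cell being read, or None

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            # Join the fragments and collapse runs of whitespace.
            self._row.append(" ".join("".join(self._cell).split()))
            self._cell = None
        elif tag == "tr" and self._row is not None:
            if self._row:
                self.rows.append(self._row)
            self._row = None
            self._cell = None


def extract_records(html):
    """Turn table rows into dicts keyed by the header row (assumed layout)."""
    parser = TableRows()
    parser.feed(html)
    parser.close()
    if not parser.rows:
        return []
    header, *body = parser.rows
    return [dict(zip(header, row)) for row in body]


def write_jsonl(records, path):
    """Write one JSON object per line."""
    with open(path, "w", encoding="utf-8") as out:
        for record in records:
            out.write(json.dumps(record, ensure_ascii=False) + "\n")


def scrape(url, path):
    """Download one page, extract its records, and save them as JSON lines."""
    with urlopen(url) as response:
        html = response.read().decode("utf-8", errors="replace")
    records = extract_records(html)
    write_jsonl(records, path)
    return len(records)
```

Calling `scrape(url, "angiosperms.jsonl")` once per page, with the real links filled in, would let anyone regenerate the files. The output filename here is made up, and the real pages may need a different extraction rule than a single table.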