Script for scraping LinkedIn company profiles and posts. The project was created to extract data on IT consultation companies in the Greater Saint Louis area which could then be compared and analyzed to garner insights. The files in this repository use Daugherty Business Solutions, Slalom, 1904labs, and World Wide Technology as examples.
This project is deprecated and will not be getting ongoing support. Considering my experiences with scraping LinkedIn, the scripts will be outdated sooner than later.
# Followers
# Employees on LinkedIn
Content of each post
# Likes for each post
# Comments for each post
# Date post was published in LinkedIn format
Example output can be found in the "data" folder (Note: Certain scraping attempts featured limited post activity which might be reflected in the datasets).
- Python
- RegEx
- Jupyter Notebook
- Pandas
- Splinter
- chromedriver
- pg8000
- PostgreSQL