Non API. Crawling post (photo, likes, comments, date ...) by username, hashtags
- Make sure you have Chrome browser installed.
- Download chromedriver and put it into driver folder:
./driver/chromedriver
- Install requirements
pip install -r requirements.txt
Results: under the ./data folder
Download the first 10 photos and information from username "instagram"
$ python3 crawl.py -q 'instagram' -n 10
Download the first 7 photos and information(all comments) from hashtags #hello, #hi
$ python3 crawl.py -q '#hello, #hi' --a -n 7
likes:
5,326
comments:
923
caption:
Art of @kendricklamar by @illestration Bold, bright and colorful.
commentMessages:
tttt: Amazing 😉
this_is_t.rs: my name says it all
dateTime:
2018-05-30T19:42:03.000Z
Usage:
crawl.py [-q QUERY] [-n NUMBER] [--a] [-h HELP]
Options:
-q QUERY username, add '#' to search for hashtags, e.g. 'username' or '#hashtag'
For multiple query seperate with comma, e.g. 'username1, username2, #hashtag'
-n NUM number of returned posts [default: 10000]
--a collect all comments
-h HELP show this help message and exit
This is an open source project so feel free to contribute by