Skip to content

Non API. Crawling post (photo, likes, comments, date ...) by username, hashtags

License

Notifications You must be signed in to change notification settings

hehpollon/Instagram-Crawler

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

41 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Instagram-Crawler

Non API. Crawling post (photo, likes, comments, date ...) by username, hashtags

Installation

  1. Make sure you have Chrome browser installed.
  2. Download chromedriver and put it into driver folder: ./driver/chromedriver
  3. Install requirements pip install -r requirements.txt

Examples:

Results: under the ./data folder

Download the first 10 photos and information from username "instagram"

$ python3 crawl.py -q 'instagram' -n 10

Download the first 7 photos and information(all comments) from hashtags #hello, #hi

$ python3 crawl.py -q '#hello, #hi' --a -n 7
you can enter multiple username or hashtags by separating them with commas

Example of a files data

likes: 
5,326

comments: 
923

caption: 
Art of @kendricklamar by @illestration
Bold, bright and colorful. 

commentMessages: 
tttt: Amazing 😉
this_is_t.rs: my name says it all

dateTime: 
2018-05-30T19:42:03.000Z
photo of post will download in ./data folder

Usage

Usage:
    crawl.py [-q QUERY] [-n NUMBER] [--a] [-h HELP]
    
Options:
    -q QUERY  username, add '#' to search for hashtags, e.g. 'username' or '#hashtag'
                  For multiple query seperate with comma, e.g. 'username1, username2, #hashtag'

    -n NUM    number of returned posts [default: 10000]

    --a       collect all comments
    
    -h HELP   show this help message and exit

Contributing

This is an open source project so feel free to contribute by

About

Non API. Crawling post (photo, likes, comments, date ...) by username, hashtags

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages