Deleted tweets monitoring

This is a collection of tools to monitor deleted tweets, automate screenshotting, and archive the results.

  • streaming.py and save_to_db.py work together to grab a real-time streamed timeline from Twitter and save all the results in a database (a minimal sketch of this is shown after this list).
  • All the tweets in the database are then screenshotted by screenshot.py.
  • Finally, the monitoring.py worker crawls through the database and checks whether the tweets have been deleted.
  • I included get_user_ids.py, as the Twitter API often requires the ID, not the screen name (e.g. not "@basilesimon").
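
To give an idea of how the first two scripts fit together, here is a minimal sketch of a streaming listener handing tweets to a stand-in save step. It assumes the tweepy 3.x StreamListener API; the structure of the actual streaming.py and save_to_db.py may differ.

    import json
    import tweepy

    # Placeholder credentials -- pop your own keys in here
    CONSUMER_KEY = 'xxx'
    CONSUMER_SECRET = 'xxx'
    ACCESS_TOKEN = 'xxx'
    ACCESS_TOKEN_SECRET = 'xxx'

    auth = tweepy.OAuthHandler(CONSUMER_KEY, CONSUMER_SECRET)
    auth.set_access_token(ACCESS_TOKEN, ACCESS_TOKEN_SECRET)

    class SaveListener(tweepy.StreamListener):
        def on_data(self, raw_data):
            tweet = json.loads(raw_data)
            # stand-in for the database write that save_to_db.py performs
            print(tweet.get('id_str'), tweet.get('text'))
            return True

        def on_error(self, status_code):
            print(status_code)
            return True  # keep the stream alive on errors

    stream = tweepy.Stream(auth, SaveListener())
    stream.filter(follow=['1234567890'])  # user IDs you want to track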

Dependencies and install

  • git clone this repo
  • wget https://raw.githubusercontent.com/pypa/pip/master/contrib/get-pip.py then sudo python get-pip.py
  • pip install tweepy
  • pip install MySQL-python (but you might need to apt-get install build-essential python-dev libmysqlclient-dev. I read it's easy to install on Mac OS, with Homebrew) - see the sketch after this list for what this library is used for
  • pip install needle
  • apt-get install mysql-server nodejs-legacy nodejs npm
  • sudo apt-get install build-essential chrpath git-core libssl-dev libfontconfig1-dev libxft-dev
  • sudo npm -g install phantomjs
  • You will need a comma-separated list of user IDs, or a list of keywords you want to track. See all the other options in the Docs.
  • Obviously, you will also need your developer access keys and things. Pop them into the placeholders in each file.
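
For context on the MySQL-python dependency: a minimal sketch of the kind of write save_to_db.py performs might look like this. The connection settings, table and column names are assumptions, not necessarily what the actual script uses.

    import MySQLdb

    # Connection settings are assumptions -- adjust to your own MySQL setup
    db = MySQLdb.connect(host='localhost', user='root', passwd='secret',
                         db='tweets', charset='utf8')
    cursor = db.cursor()

    # Hypothetical table layout: one row per tweet
    cursor.execute(
        "INSERT INTO tweets (tweet_id, screen_name, text, created_at) "
        "VALUES (%s, %s, %s, %s)",
        ('1234567890', 'basilesimon', 'hello world',
         'Mon Oct 06 10:00:00 +0000 2014'))
    db.commit()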

Comma-separated list of user IDs

I use the wonderful t from sferik, a command-line tool for Twitter shenanigans. Usually, I have an account following all the people I want to track - but it also works with lists.

  • $ t followings [account] > list.csv
  • python get_user_ids.py > ids.csv
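
For reference, here is a rough sketch of what a script like get_user_ids.py can do: read screen names from list.csv and print the matching numeric IDs. It assumes tweepy 3.x and its lookup_users call; the actual script may differ.

    import tweepy

    # Placeholder credentials -- pop your own keys in here
    CONSUMER_KEY = 'xxx'
    CONSUMER_SECRET = 'xxx'
    ACCESS_TOKEN = 'xxx'
    ACCESS_TOKEN_SECRET = 'xxx'

    auth = tweepy.OAuthHandler(CONSUMER_KEY, CONSUMER_SECRET)
    auth.set_access_token(ACCESS_TOKEN, ACCESS_TOKEN_SECRET)
    api = tweepy.API(auth)

    with open('list.csv') as f:
        names = [line.strip().lstrip('@') for line in f if line.strip()]

    # users/lookup accepts at most 100 screen names per request
    for i in range(0, len(names), 100):
        for user in api.lookup_users(screen_names=names[i:i + 100]):
            print(user.id)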

Run the program

Then to run it:

  • Run streaming.py. Constantly. If it doesn't run, you're not saving the tweets.
  • Run nosetests screenshot.py --with-save-baseline --nocapture periodically to grab the screenshots.
  • Run monitoring.py periodically to check for deleted tweets.
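
For reference, the deletion check itself boils down to asking the API for each stored tweet and watching for a "not found" error. A minimal sketch, assuming tweepy 3.x and hard-coded IDs standing in for the database lookup:

    import tweepy

    # Placeholder credentials -- pop your own keys in here
    auth = tweepy.OAuthHandler('xxx', 'xxx')
    auth.set_access_token('xxx', 'xxx')
    api = tweepy.API(auth)

    tweet_ids = ['1234567890']  # in practice, read these from the database

    for tweet_id in tweet_ids:
        try:
            api.get_status(tweet_id)
        except tweepy.TweepError as e:
            # Twitter error 144 means "No status found with that ID",
            # i.e. the tweet has most likely been deleted
            if getattr(e, 'api_code', None) == 144:
                print('deleted: %s' % tweet_id)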

You might want to consider running all these with cron on a server. Just saying.

Integration with ElasticSearch

  • wget https://download.elasticsearch.org/elasticsearch/elasticsearch/elasticsearch-1.3.4.tar.gz
  • tar -xvf elasticsearch-1.3.4.tar.gz
  • sudo apt-get install default-jre default-jdk to install Java
  • Start an ES instance with bin/elasticsearch in the directory where you extracted ElasticSearch

Then uncomment lines 2 and 34-40 in save_to_db.py.
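
Those commented-out lines presumably import the Elasticsearch client and index each saved tweet. A minimal sketch of that kind of call, assuming the elasticsearch Python client (pip install elasticsearch) and made-up index and field names:

    from elasticsearch import Elasticsearch

    es = Elasticsearch()  # defaults to localhost:9200, where bin/elasticsearch listens

    doc = {
        'text': 'hello world',
        'screen_name': 'basilesimon',
        'created_at': 'Mon Oct 06 10:00:00 +0000 2014',
    }
    # index and doc_type names are assumptions, not necessarily those in save_to_db.py
    es.index(index='tweets', doc_type='tweet', id='1234567890', body=doc)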

License

PDD/ Unlicense
