This release gives two additions which were requested by researchers.
-
You can now collect images in URLs which are displayed when a url is posted on social media platforms like Twitter and Facebook using the
html_utils.get_webpage_display_image
and alsohtml_utils.get_page_meta
. -
Stripping URLs from Tweets (via
tweet_utils.get_link
) only worked for original tweets, but now it also works for retweets and quoted tweets. A new key was added to the returned dictionary calledtweet_type
to denote this distinction.