Text Mining

This repository contains the analysis of three politicians from Rio de Janeiro, based on their tweets.

The paper has been published and can be downloaded at https://periodicos.uff.br/anaisdoser/article/view/29333.

The text analysis followed these steps:

Twitter

Tweets are retrieved from each user's account through the Twitter API (package twitteR).

Manipulate data

Convert to lowercase
Tokenize
Remove punctuation
Remove stopwords
Stem (so that tokens can be joined with the sentiment lexicon)
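
The steps above can be sketched as follows. This is a minimal illustration in Python, not the repository's own code (which relies on R packages such as twitteR). The stopword list, the suffix list, and the function names `toy_stem` and `preprocess` are hypothetical placeholders; a real analysis would use a full Portuguese stopword list and a proper stemmer.

```python
import string

# Hypothetical, deliberately tiny stopword list (assumption for illustration).
STOPWORDS = {"a", "o", "as", "os", "de", "da", "do", "e", "em", "para", "que", "um", "uma"}

# Hypothetical suffixes for a toy stemmer, longest first (assumption for illustration).
SUFFIXES = ("mente", "ções", "ção", "ando", "endo", "indo",
            "ar", "er", "ir", "as", "os", "a", "o", "s")


def toy_stem(token):
    """Strip the first matching suffix, keeping at least three characters."""
    for suffix in SUFFIXES:
        if token.endswith(suffix) and len(token) - len(suffix) >= 3:
            return token[: -len(suffix)]
    return token


def preprocess(tweet):
    """Apply the five steps in order and return a list of stems."""
    text = tweet.lower()                                     # 1. lowercase
    tokens = text.split()                                    # 2. tokenize on whitespace
    tokens = [t.strip(string.punctuation) for t in tokens]   # 3. strip ASCII punctuation
    tokens = [t for t in tokens if t]                        #    drop tokens left empty
    tokens = [t for t in tokens if t not in STOPWORDS]       # 4. remove stopwords
    return [toy_stem(t) for t in tokens]                     # 5. stem


print(preprocess("Votando, votar e votos!"))  # ['vot', 'vot', 'vot']
```

Stemming comes last so that inflected forms collapse to one key, which is what makes the later join with the lexicon possible.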

Sentiment Analysis

Tweets are scored using the sentiLex_lem_PT02 dictionary, a sentiment lexicon for Portuguese.
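
A lexicon-based score can be sketched as below. This is an illustrative Python example under stated assumptions: the `LEXICON` entries, the stem-to-polarity layout, and the functions `sentiment_score` and `label` are hypothetical and do not reproduce the actual contents or file format of sentiLex_lem_PT02.

```python
# Hypothetical lexicon: stem -> polarity (+1 positive, -1 negative).
# These entries are made up for illustration only.
LEXICON = {"bom": 1, "otim": 1, "ruim": -1, "pessim": -1}


def sentiment_score(stems, lexicon=LEXICON):
    """Sum the polarity of every stem found in the lexicon; unknown stems count as 0."""
    return sum(lexicon.get(stem, 0) for stem in stems)


def label(score):
    """Map a numeric score to a coarse sentiment label."""
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"


stems = ["bom", "dia", "otim"]
print(sentiment_score(stems), label(sentiment_score(stems)))  # 2 positive
```

Summing polarities is only one possible aggregation; averaging over the number of matched stems is a common alternative when tweets differ a lot in length.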

TF-IDF

Identify the most important terms for each politician. This technique weighs a term's frequency against its inverse document frequency, so terms that one politician uses often but the others rarely use score highest.
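
The weighting can be sketched as below, assuming that all tweets from one politician form a single document, that term frequency is the count divided by the document length, and that inverse document frequency is the natural log of (number of documents / documents containing the term). These choices, the function name `tf_idf`, and the sample tokens are assumptions for illustration; the source does not state the exact variant used.

```python
import math
from collections import Counter


def tf_idf(docs):
    """docs: mapping of document name -> list of tokens.

    Returns a mapping of document name -> {term: tf-idf score}.
    """
    n_docs = len(docs)
    counts = {name: Counter(tokens) for name, tokens in docs.items()}

    # Document frequency: in how many documents each term appears.
    doc_freq = Counter()
    for term_counts in counts.values():
        doc_freq.update(term_counts.keys())

    scores = {}
    for name, term_counts in counts.items():
        total = sum(term_counts.values())
        scores[name] = {
            term: (n / total) * math.log(n_docs / doc_freq[term])
            for term, n in term_counts.items()
        }
    return scores


# Hypothetical tokens for two politicians.
docs = {
    "politician_a": ["saude", "saude", "rio"],
    "politician_b": ["rio", "transporte"],
}
scores = tf_idf(docs)
print(scores["politician_a"])
```

A term that appears in every document, such as "rio" here, gets an inverse document frequency of zero and therefore a score of zero, which is why shared vocabulary drops out of each politician's top terms.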

Topic Modelling

We used the LDA (Latent Dirichlet Allocation) algorithm to fit a topic model and assign each tweet to a topic group.
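
To make the idea concrete, here is a small sketch of LDA fitted by collapsed Gibbs sampling, written in plain Python. The source does not say which fitting method, library, or hyperparameters were used, so the sampler, the values of `alpha` and `beta`, the function names `lda_gibbs` and `dominant_topic`, and the sample documents are all assumptions for illustration.

```python
import random


def lda_gibbs(docs, n_topics, n_iter=200, alpha=0.1, beta=0.01, seed=0):
    """Fit LDA with collapsed Gibbs sampling.

    docs: list of token lists. Returns (doc_topic, topic_word, vocab), where
    doc_topic[d][k] counts tokens of document d assigned to topic k and
    topic_word[k][w] counts assignments of vocabulary word w to topic k.
    """
    rng = random.Random(seed)
    vocab = sorted({w for doc in docs for w in doc})
    word_id = {w: i for i, w in enumerate(vocab)}
    n_words = len(vocab)

    doc_topic = [[0] * n_topics for _ in docs]
    topic_word = [[0] * n_words for _ in range(n_topics)]
    topic_total = [0] * n_topics
    assignments = []

    # Random initial topic for every token.
    for d, doc in enumerate(docs):
        z_doc = []
        for w in doc:
            z = rng.randrange(n_topics)
            z_doc.append(z)
            doc_topic[d][z] += 1
            topic_word[z][word_id[w]] += 1
            topic_total[z] += 1
        assignments.append(z_doc)

    for _ in range(n_iter):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                wid = word_id[w]
                z = assignments[d][i]
                # Remove the token's current assignment from the counts.
                doc_topic[d][z] -= 1
                topic_word[z][wid] -= 1
                topic_total[z] -= 1
                # Conditional probability of each topic, up to a constant.
                weights = [
                    (doc_topic[d][k] + alpha)
                    * (topic_word[k][wid] + beta)
                    / (topic_total[k] + n_words * beta)
                    for k in range(n_topics)
                ]
                z = rng.choices(range(n_topics), weights=weights)[0]
                assignments[d][i] = z
                doc_topic[d][z] += 1
                topic_word[z][wid] += 1
                topic_total[z] += 1

    return doc_topic, topic_word, vocab


def dominant_topic(topic_counts):
    """Index of the topic with the most assigned tokens in one document."""
    return max(range(len(topic_counts)), key=lambda k: topic_counts[k])


# Hypothetical preprocessed tweets.
tweets = [
    ["saude", "hospital", "saude"],
    ["hospital", "medico", "saude"],
    ["onibus", "transporte", "metro"],
    ["metro", "transporte", "onibus"],
]
doc_topic, topic_word, vocab = lda_gibbs(tweets, n_topics=2)
print([dominant_topic(counts) for counts in doc_topic])
```

Assigning each tweet to its dominant topic is one simple way to turn the fitted mixture into a group label. Topic numbering is arbitrary, so the indices carry no meaning beyond grouping tweets together.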

