Unlabeled directed graph mining project from Co-occurrence graph of Document using gSpan algorithm based on Apache Spark
-
Updated
Jan 10, 2018 - Scala
Unlabeled directed graph mining project from Co-occurrence graph of Document using gSpan algorithm based on Apache Spark
😅 A topic model of reddit.com/r/EmojiPasta trained with Spark and an LDA model (NSFW) - Trigger Warning: The r/emojipasta subreddit posts controversial content and anything I have crawled is to provide visibility of a topic modeling some of this controversial content. Unfortunately there is also discriminatory speech which must be called out!
Topic modeling of twitter data and web page collections using Latent Dirichlet Allocation [LDA] in Scala
Measuring career preparedness with topic modeling.
A lightweight Akka stream PubSub engine for distributing data to multiple consumers.
Topic Modeler based on Latent Semantic Analysis. Used to identify commonalities and differences within a corpus of documents. Identifies the top 5% of keywords for each topic that appears in the set of documents and identifies the documents with the strongest association to each topic. Our corpus consists of sell-side reports on many companies p…
Apache Spark application, which analyses Twitter data using LDA Topic Modelling algorithm
Add a description, image, and links to the topic-modeling topic page so that developers can more easily learn about it.
To associate your repository with the topic-modeling topic, visit your repo's landing page and select "manage topics."