Topic-Modeling-Theories

This repository explores several topic modeling techniques in depth, with implementations in Python that run in Jupyter Notebook or JupyterLab. The following models are covered:

  • Latent Dirichlet Allocation (LDA)
  • Dynamic Topic Model (DTM)
  • Topics Over Time (ToT)
  • Correlated Topic Model (CTM)
  • Structural Topic Model (STM)
  • Biterm Topic Model (BTM)
  • BERTopic

Getting Started

Prerequisites

Before setting up the environment, ensure that Make is installed on your system. For Windows users, please refer to this guide to install Make.
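As a quick way to confirm the prerequisite, the following minimal sketch checks whether the command-line tools used in the setup steps are on your PATH. The helper name `missing_tools` and the tool list are illustrative assumptions, not part of this repository.

```python
import shutil


def missing_tools(tools):
    """Return the names in `tools` that cannot be found on PATH."""
    return [tool for tool in tools if shutil.which(tool) is None]


# Assumption: git and make are the only command-line tools the setup needs.
REQUIRED_TOOLS = ["git", "make"]

if __name__ == "__main__":
    missing = missing_tools(REQUIRED_TOOLS)
    if missing:
        print("Missing from PATH:", ", ".join(missing))
    else:
        print("All required tools found.")
```

If `make` is reported as missing, install it before running the installation steps.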

Installation

  1. Clone this repository:

    git clone https://github.com/sorrychoe/topic-modeling-theory.git
    cd topic-modeling-theory
  2. Initialize the setup:

    make init
  3. Install dependencies:

    make install

Now you're ready to start exploring the topic modeling notebooks.

Available Notebooks

Each model is implemented in a separate Jupyter Notebook, providing explanations and code walkthroughs. You can start by running any of the notebooks to see how the models work and modify the code to fit your specific use case.

  1. notebook/Latent_Dirichlet_Allocation.ipynb:
    • Theory & Python Sample Code of Latent Dirichlet Allocation.
  2. notebook/Dynamic_Topic_Model.ipynb:
    • Theory & Python Sample Code of Dynamic Topic Model.
  3. notebook/Topic_over_Time.ipynb:
    • Theory about Topics Over Time.
  4. notebook/Correlated_Topic_Model.ipynb:
    • Theory & Python Sample Code of Correlated Topic Model.
  5. notebook/Structural_Topic_Model.ipynb:
    • Theory & R Sample Code of Structural Topic Model.
  6. notebook/Biterm_Topic_Model.ipynb:
    • Theory & Python Sample Code of Biterm Topic Model.
  7. notebook/BERTopic.ipynb:
    • Theory & Python Sample Code of BERTopic.
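
To preview what a notebook covers before opening it, you can read the `.ipynb` file directly, since notebooks are stored as JSON. The sketch below counts cells by type and collects markdown headings. The function name `summarize_notebook` is hypothetical, and the heading detection is a rough heuristic (any markdown line starting with `#`), so treat the output as an outline rather than an exact table of contents.

```python
import json
from collections import Counter
from pathlib import Path


def summarize_notebook(path):
    """Count cells by type and collect markdown headings from an .ipynb file."""
    notebook = json.loads(Path(path).read_text(encoding="utf-8"))
    counts = Counter()
    headings = []
    for cell in notebook.get("cells", []):
        counts[cell.get("cell_type", "unknown")] += 1
        source = cell.get("source", "")
        # The notebook format stores source as a string or a list of line strings.
        text = "".join(source) if isinstance(source, list) else source
        if cell.get("cell_type") == "markdown":
            headings.extend(
                line.lstrip("#").strip()
                for line in text.splitlines()
                if line.startswith("#")
            )
    return dict(counts), headings


if __name__ == "__main__":
    # Path taken from the notebook list above; run from the repository root.
    target = Path("notebook/Latent_Dirichlet_Allocation.ipynb")
    if target.exists():
        counts, headings = summarize_notebook(target)
        print(counts)
        for heading in headings:
            print("-", heading)
    else:
        print(f"{target} not found; run this from the repository root.")
```

Swap in any of the notebook paths listed above to outline a different model.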

Authors

Have an issue?

If you run into a problem while using this repository, please open an issue in the Issues section on GitHub.