DAPPER Topic Model

Dynamic Author-Persona Performed Exceedingly Rapidly

The DAPPER topic model is designed for multi-author corpora in which authors write over time. Other temporal topic models (e.g. Dynamic Topic Models) model how the language associated with a topic changes over time. DAPPER instead models the trajectory of topics that each author discusses over time.

Introduction

See /docs/dap_2018_arxiv.pdf for technical information on the dynamic author-persona topic model (DAP).

Getting Started

Clone the repo:

```
cd ~
git clone https://github.com/robert-giaquinto/dapper.git
```

Virtual environments.

It may be easiest to install dependencies into a virtual environment. For Python 3+, run:
```
cd dapper
python -m venv ./venv
```
To activate the virtual environment, run:
```
source ~/dapper/venv/bin/activate
```
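As a quick sanity check after activation, the short Python sketch below reports whether the running interpreter belongs to a virtual environment. It is not part of the repository; the helper name `in_virtualenv` is made up for illustration, and the check relies on environments created with `python -m venv`, as above.

```python
import sys


def in_virtualenv() -> bool:
    """Return True when the running interpreter belongs to a virtual environment."""
    # Inside a venv, sys.prefix points at the environment directory, while
    # sys.base_prefix still points at the base Python installation.
    return sys.prefix != sys.base_prefix


if __name__ == "__main__":
    print("virtual environment active:", in_virtualenv())
    print("interpreter prefix:", sys.prefix)
```

If the first line prints `False`, the activation step probably did not take effect in the current shell.
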
Installing the necessary Python packages.

A requirements.txt file listing all packages used for this project is included in the repository. To install them, first make sure your virtual environment is activated, then run the following commands:
```
pip install --upgrade pip
```
```
pip install -r ~/dapper/requirements.txt
```
Install the dapper package.

This allows absolute imports, which make it easy to load Python files that are spread across different folders. To do this, navigate to the ~/dapper directory and run:
```
python setup.py develop
```
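One way to confirm that the development install worked is to check that the package can be located by the import system. The sketch below is only an illustration: the helper `is_importable` is hypothetical, and the package name `dapper` is an assumption. Replace it with whatever name setup.py actually declares.

```python
import importlib.util


def is_importable(package_name: str) -> bool:
    """Return True if a top-level package can be found by the import system."""
    # find_spec returns None when no top-level module or package
    # with this name is available on sys.path.
    return importlib.util.find_spec(package_name) is not None


if __name__ == "__main__":
    # "dapper" is an assumed package name, not confirmed by the README.
    name = "dapper"
    print(f"{name} importable:", is_importable(name))
```

Run this from a directory other than ~/dapper so that the result reflects the installed package rather than the current working directory.
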
Preparing data for the model

See Signal Media 1M to download the Signal Media dataset.

See /src/preprocessing/preprocess_signalmedia.py for tools to prepare the Signal Media data. Alternatively, use the preprocessed data included in this repository.
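
To give a rough idea of what preparing a multi-author, time-ordered corpus involves, the sketch below groups articles by source and orders each source's articles by publication time. It is not the repository's preprocessing script. It assumes the raw data is a JSON Lines file whose records carry `source`, `published`, and `content` fields, and that each source plays the role of an author; both are assumptions to verify against the dataset you download. The function name `group_by_source` is hypothetical.

```python
import json
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple


def group_by_source(lines: Iterable[str]) -> Dict[str, List[Tuple[str, str]]]:
    """Group (published, content) pairs by source, ordered by timestamp."""
    by_source = defaultdict(list)
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        record = json.loads(line)
        # Field names "source", "published", "content" are assumptions.
        by_source[record["source"]].append((record["published"], record["content"]))
    # Timestamps that share one ISO 8601 format sort chronologically as strings.
    return {source: sorted(docs) for source, docs in by_source.items()}


if __name__ == "__main__":
    sample = [
        '{"source": "A", "published": "2015-09-02T00:00:00Z", "content": "second"}',
        '{"source": "A", "published": "2015-09-01T00:00:00Z", "content": "first"}',
    ]
    for source, docs in group_by_source(sample).items():
        print(source, [content for _, content in docs])
```

For a real file, pass an open file handle as `lines` so the records are read one at a time rather than loaded all at once.
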

Running the model

See /scripts/ for examples of running the model and setting various model parameters.

Project Structure

docs/ - Documentation on the model, including derivation and papers related to the model.
log/ - Log files from running the programs.
scripts/ - Bash scripts for running programs.
src/ - Directory containing various sub-packages of the project and any files shared across sub-packages.
