-
Notifications
You must be signed in to change notification settings - Fork 1
/
README
30 lines (21 loc) · 967 Bytes
/
README
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
A simple implementation of Gibbs-Lda algo written in CoffeeScript running on node.js.
== Routine ==
Please compile all the .coffee files for the js modules, also including
the files under langcode/
$ coffee -c *.coffee
Orgnize all the data files under 'data/corpus', then run
$ node token_main.js
which will tokenize all the documents under data/corpus for each file as
a single document, and create the docuemnts array and vocabulary file.
then,
$ node lda_main.js
to run lda process, you can tune the parameters in lda_main.coffee
the performance depends on #document * #kTopic, #voc * #kTopic
once it's done, the model's phi and theta matrix will be under data/ dir
then,
$ node render_topic.js
to create the topics/words/docs json file for data-server
$ node data_server.js
data server is simple browsing server for the data, but is very coupled to
my own business, so make your own, :)
for any information, you can contact me at winters.mi(at)gmail.com