This file provides information about running the Dynamic Topic Model
or the Document Influence Model. It gives two command-line examples
for running the software and several example commands in R for reading
output files.
Dynamic topic models and the influence model have been implemented
here in C / C++. This implementation takes two input files:
(a) foo-mult.dat, which is one-doc-per-line, each line of the form
unique_word_count index1:count1 index2:count2 ... indexn:countn
where each index is an integer corresponding to a unique word.
(b) foo-seq.dat, which is of the form
Number_Timestamps
number_docs_time_1
...
number_docs_time_i
...
number_docs_time_NumberTimestamps
- The docs in foo-mult.dat should be ordered by date, with the first
docs from time1, the next from time2, ..., and the last docs from
timen.
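Read literally, the description above means the counts in foo-seq.dat split the lines of foo-mult.dat into consecutive blocks: the first number_docs_time_1 lines belong to time 1, the next number_docs_time_2 lines to time 2, and so on. The following is a minimal Python sketch of that reading, not code from the DTM distribution. The function names (parse_mult_line, parse_seq, time_slice_of_each_doc) are hypothetical and the sample data is made up for illustration.

```python
def parse_mult_line(line):
    """Parse 'unique_word_count index1:count1 ...' into {word_index: count}."""
    fields = line.split()
    n_unique = int(fields[0])
    counts = {}
    for pair in fields[1:]:
        index, count = pair.split(":")
        counts[int(index)] = int(count)
    if len(counts) != n_unique:
        raise ValueError(f"expected {n_unique} unique words, found {len(counts)}")
    return counts


def parse_seq(text):
    """Parse the contents of a -seq.dat file into per-time-slice doc counts."""
    numbers = [int(token) for token in text.split()]
    n_times, docs_per_time = numbers[0], numbers[1:]
    if len(docs_per_time) != n_times:
        raise ValueError(f"header says {n_times} time slices, found {len(docs_per_time)}")
    return docs_per_time


def time_slice_of_each_doc(docs_per_time):
    """Return the 0-based time slice of every document, in file order."""
    slices = []
    for t, n_docs in enumerate(docs_per_time):
        slices.extend([t] * n_docs)
    return slices


# Made-up example: 3 time slices holding 2, 1 and 3 documents.
seq_text = "3\n2\n1\n3\n"
mult_lines = [
    "2 0:1 4:2",
    "1 3:5",
    "3 0:1 1:1 2:1",
    "1 4:7",
    "2 1:2 3:1",
    "1 0:9",
]

docs_per_time = parse_seq(seq_text)
slices = time_slice_of_each_doc(docs_per_time)
docs = [parse_mult_line(line) for line in mult_lines]

# The counts in the seq file must add up to the number of lines in the mult file.
assert len(slices) == len(docs)
```

Under this reading, a seq file whose first two counts are 10 and 25 would place lines 1 to 10 of the mult file in time period 1 and lines 11 to 35 in time period 2.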
I don't understand how the time correspondence is defined between test-mult.dat (which has 1 document per line) and test-seq.dat which has the number of docs per time-period (in this case 10 time periods). Can someone clarify for me how the input data should be formatted? Are we assuming the first 10 documents in test-mult.dat correspond to time period 1, the next 25 documents correspond to time period 2, etc?
For context: the input format quoted above is the one described by example.sh. In the example data, test-mult.dat has 1000 lines and test-seq.dat has 10 lines.