just compile all the java files.
>>javac *.java
a. Run the main entry:
>> java main [dictionary] [corpus] [output]
e.g.
>> java main words.txt enron-skilling.txt output.txt
when you enter the main system, there are four choices.
- output wrong words.
- output the answers of 2-gram.
- output the answers of editex.
- output the answers of edit-distance.
(The answer means that if a word is a misspelling word, it will output the alternatives, otherwise output the new word information.)
b. Run each algorithm to make a query:
>> java [algorithm] [dictionary]
e.g.
>> java N_Gram words.txt
or >> java Editex words.txt
or >> java Edit_distance words.txt
and then, you can input words to get the answers.
precision, recall, average precision.
e.g. the "query_test.txt" contains 50 misspelling words, every line one word. And then you can use the pooling to compute precision, recall, average precision for three techniques.
>> java Pooling [dictionary] [query]
e.g.
>> java Pooling words.txt query_test.txt