Derivational_Task_MLP

We learn a multilayer perceptron model that generates vectors for the derived words, when given the vector for source word and the target affix. We also report the accuracy of the model after performing 5-fold cross validation. The glove vectors used in the task have been downloaded from http://nlp.stanford.edu/projects/glove/ (we have used the file with 6 billion tokens).

The files output by the program are :

AnsFastText.txt - fastText vectors of derived words in wordList.csv
AnsLzaridou.txt - Lazaridou vectors of the derived words in wordList.csv
AnsModel.txt - Vectors for derived words as provided by the model

The function 'derivedWordTask' returns 2 values : averaged cosine similarity between the corresponding words from output files 1 and 3, as well as 2 and 3.

The files used for the task are :

Vector_lazaridou.txt : Word vectors for source and derived words as per the distributional space described in “Compositional-ly Derived Representations of Morphologically Complex Words in Distributional Semantics”
fastText_vectors.txt : Word vectors for source and derived words a per the fastText model
wordList.csv : CSV file containing the triplets (Source word, derived word and the affix)

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
Derivational_Task.py		Derivational_Task.py
README.md		README.md
fastText_vectors.txt		fastText_vectors.txt
vector_lazaridou.txt		vector_lazaridou.txt
wordList.csv		wordList.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Derivational_Task_MLP

About

Releases

Packages

Languages

chinmayapancholi13/Derivational_Task_MLP

Folders and files

Latest commit

History

Repository files navigation

Derivational_Task_MLP

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages