Corpus and Vocabulary Preprocessing Utilities for Natural Language Pipelines
Updated Jun 6, 2021 - C++
Skipgram with Hierarchical Softmax
fastText v0.9.3 (C++ port)
Fast word-like N-gram embeddings
Bilingual word embedding mapping using fastText
Lossless Compression Techniques for Embedding Tables in Substantial Deep Learning-Based Recommendation Systems
Golang "native" implementation of word2vec algorithm (word2vec++ port)
Distributed Representations of Words using word2vec
Distributed Representations of Sentences and Documents
R package to Embed All the Things! using StarSpace
R wrapper for fastText
Reference implementation of the paper VERSE: Versatile Graph Embeddings from Similarity Measures
Epsilla is a high performance Vector Database Management System
cw2vec: Learning Chinese Word Embeddings with Stroke n-gram Information