Popular repositories Loading
-
tfidf
tfidf PublicTF-IDF is a method, which allows get matrix tokens for your list of text. It creates n * m matrix, where n - quantity of texts and m - quantity of unique words - tokens in texts vocabulary. IMPORTA…
Python
-
word2vec
word2vec PublicWord2Vec is an algorythm of word representation in embeddings. This repo contains a code about word2vec only.
Python
-
bpe-dropout
bpe-dropout PublicClass allows you to create BPE tokens with dropout or not. Implements Sentencepeace lib with easy fit predict way.
Python
-
-
-
stratify
stratify PublicCode allows you to strtify dataframe in different convenient ways. ReadMe shows how.
Jupyter Notebook
If the problem persists, check the GitHub status page or contact support.