Skip to content

Fanchouille/ovnlp

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

24 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

ovnlp

A toolkit to download, train, use fastText word vectors on text data. Also lets you deduplicate data based on TF IDF representation (see txtMatcher) Developed under MIT license by Openvalue : http://openvalue.co

Fasttext

Installation

OVNLP runs on Python 3.6 ONLY.

Just run

> pip install ovnlp

Usage

See demo_notebook.ipynb for usage examples

FT Weights source

Pretrained weights from FB :

Feel free to change weightsource.json to add data sources if needed.