-
Notifications
You must be signed in to change notification settings - Fork 8
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Find/Create function for text tokenzation #5
Comments
Exaple of this function: (on their basis you can write test fo this function)
|
I take it |
Also sugest add lemmatization option to lemmatize tokens, as independant function or as part of tokenize function |
Looks like there is no ready to use library for lemmatization written in ruby, so we will focus on tokenzation in this issue and extract lemmatization as a separate issue. |
Find/Create function for sepatating text string to tokens(words).
Function must get text string and return list of string tokens. Function also should not return tokens
containing digits and punctuation (you can do this with regular expressions).
Function must have optional parameters:
The text was updated successfully, but these errors were encountered: