This repository contains a list of stopwords for the Tatar language.
The list was compiled manually from word distributions observed in news texts. It consists mostly of function words (conjunctions, postpositions, interjections), along with pronouns, numerals, some high-frequency verbs such as "диде" ("said"), and a few parenthetical words.
Current count: 1006 wordforms (~300 unique lemmata).
Some rare function words were taken from Apertium. Additional surface wordforms were generated automatically, also with Apertium.
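
A minimal sketch of how a list like this might be used to filter text. It assumes the list is stored as plain text with one wordform per line; this README does not state the file name or layout, so `load_stopwords`, `remove_stopwords`, and the three sample entries are hypothetical. Only "диде" is mentioned above; the other two entries are illustrative.

```python
import re


def load_stopwords(lines):
    """Build a set of lowercased stopwords from an iterable of lines.

    Assumes one wordform per line; blank lines are skipped.
    """
    return {line.strip().lower() for line in lines if line.strip()}


def remove_stopwords(text, stopwords):
    """Lowercase the text, split it into word tokens, and drop stopwords."""
    tokens = re.findall(r"\w+", text.lower())
    return [token for token in tokens if token not in stopwords]


# In-memory stand-in for the real list (illustrative entries only).
sample_lines = ["һәм\n", "диде\n", "бу\n"]
stopwords = load_stopwords(sample_lines)

filtered = remove_stopwords("Бу китап һәм дәфтәр, диде ул.", stopwords)
print(filtered)
```

To use the actual list, pass an open file object to `load_stopwords` instead of `sample_lines`, opening the file with `encoding="utf-8"`. Because the list already contains inflected surface wordforms, exact matching on lowercased tokens may be enough for many uses, with no lemmatizer needed.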