v0.2.2: Add sentence rewriting and polish documentation.
This release adds:
- Sentence rewriting extractor and packer to generate mixcase datasets. Unlike gap and masking, this approach selects a set of sentences from the documents and asks the LLM to rewrite them in its own words.
- Argument validation in extractors.
- Removal of private methods from the documentation.
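
To illustrate the sentence rewriting idea, here is a minimal sketch. It is not this library's API: `split_sentences`, `rewrite_sentences`, the `fraction` and `seed` parameters, and the `rewrite` callback (a stand-in for the LLM call) are all hypothetical names chosen for the example. It also shows one plausible form of argument validation.

```python
import random
import re
from typing import Callable, List, Tuple


def split_sentences(text: str) -> List[str]:
    # Naive splitter (assumption): break after '.', '!' or '?' followed by whitespace.
    return [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]


def rewrite_sentences(
    text: str,
    rewrite: Callable[[str], str],
    fraction: float = 0.5,
    seed: int = 0,
) -> Tuple[str, List[int]]:
    """Replace a random subset of sentences with rewritten versions.

    `rewrite` stands in for the LLM call. Returns the mixed text and the
    indices of the sentences that were rewritten.
    """
    # Argument validation: reject fractions outside (0, 1].
    if not 0.0 < fraction <= 1.0:
        raise ValueError("fraction must be in the interval (0, 1]")

    sentences = split_sentences(text)
    if not sentences:
        return text, []

    rng = random.Random(seed)
    # Rewrite at least one sentence, and never more than are available.
    k = max(1, round(len(sentences) * fraction))
    chosen = sorted(rng.sample(range(len(sentences)), k))
    for i in chosen:
        sentences[i] = rewrite(sentences[i])
    return " ".join(sentences), chosen


if __name__ == "__main__":
    doc = "A one. B two. C three. D four."
    # str.upper is a placeholder for a real LLM rewriting function.
    mixed, idx = rewrite_sentences(doc, str.upper, fraction=0.5)
    print(mixed)
    print(idx)
```

The returned indices mark which sentences are machine-rewritten, which is the kind of label a mixcase dataset would need alongside the text.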