A collection of scripts and sources for the generation and gathering of a comprehensive text corpus.
TODO: Running tests
TODO: Use case of copying, or using as a submodule
The CMake build process first looks for the external dependencies on the system. Any dependency that cannot be found is automatically downloaded and built from its official repository, so installing the dependencies beforehand is optional.
The external dependencies are:
- Google Logging (glog) (0.3.4 or later).
- Google Test (1.7.0 or later), only needed to run the unit tests.
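The find-or-download behaviour described above can be sketched in CMake roughly as follows. This is a minimal illustration and not this repository's actual build script: it assumes CMake 3.14 or later and the `FetchContent` module, the project name and the pinned tag are hypothetical, and the real build may use a different mechanism (for example `ExternalProject`).

```cmake
# Sketch only: the project name and the pinned tag are assumptions,
# not values taken from this repository.
cmake_minimum_required(VERSION 3.14)
project(dependency_fallback_sketch CXX)

include(FetchContent)

# Look for an installed glog first; QUIET suppresses the not-found message.
find_package(glog QUIET)

if(NOT glog_FOUND)
  # Not installed: download glog from its upstream repository and
  # build it as part of this project.
  FetchContent_Declare(
    glog
    GIT_REPOSITORY https://github.com/google/glog.git
    GIT_TAG        v0.6.0  # assumed tag; pin whichever release you need
  )
  FetchContent_MakeAvailable(glog)
endif()
```

The same pattern would apply to Google Test, guarded by whatever option enables the unit tests.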
The code in this repository is published under the Apache License 2.0.