Commit

tell the reader that corenlp can be auto downloaded
daniel committed Apr 29, 2016
1 parent 9ffd4ae commit 64c9e30
Showing 2 changed files with 2 additions and 2 deletions.
2 changes: 1 addition & 1 deletion README.md
@@ -413,7 +413,7 @@ Again, common sense dictates what is possible. When searching trees, only trees,
<a name="building-corpora"></a>
## Building corpora

*corpkit*'s `Corpus()` class contains `parse()` and `tokenise()`, modest methods for created parsed and/or tokenised corpora. The main thing you need is **a folder, containing either text files, or subfolders that contain text files**. If you want to parse the corpus, you'll also need to have downloaded and unzipped [Stanford CoreNLP](http://nlp.stanford.edu/software/corenlp.shtml). If you're tokenising, you'll need to make sure you have NLTK's tokeniser data. You can then run:
*corpkit*'s `Corpus()` class contains `parse()` and `tokenise()`, methods for created parsed and/or tokenised corpora. The main thing you need is **a folder, containing either text files, or subfolders that contain text files**. [Stanford CoreNLP](http://nlp.stanford.edu/software/corenlp.shtml) is required to parse corpora. If you don't have it, *corpkit* can download and install it for you. If you're tokenising, you'll need to make sure you have NLTK's tokeniser data. You can then run:

```python
>>> unparsed = Corpus('path/to/unparsed/files')
2 changes: 1 addition & 1 deletion index.rst
@@ -95,7 +95,7 @@ via Git:
cd corpkit
python setup.py install
Parsing and interrogation of parse trees will also require *Stanford CoreNLP*.
Parsing and interrogation of parse trees will also require *Stanford CoreNLP*. *corpkit* can download and install it for you automatically.

.. rubric:: Graphical interface
