Skip to content

Latest commit

 

History

History
73 lines (46 loc) · 2.27 KB

README.md

File metadata and controls

73 lines (46 loc) · 2.27 KB

WordChecker

WordChecker is simple python word checker made as an exercise in style:

the goal is explained here:

Write a program that reads a large list of English words (e.g. from
/usr/share/dict/words on a unix system) into memory, and then reads
words from stdin, and prints either the best spelling suggestion, or
"NO SUGGESTION" if no suggestion can be found. The program should print ">" as a prompt before reading each word, and should loop until killed.

Your solution should be faster than O(n) per word checked, where n is
the length of the dictionary. That is to say, you can't scan the
dictionary every time you want to spellcheck a word.

For example:

> sheeeeep
sheep
> peepple
people
> sheeple
NO SUGGESTION

The class of spelling mistakes to be corrected is as follows:

Case (upper/lower) errors: "inSIDE" => "inside"
Repeated letters: "jjoobbb" => "job"
Incorrect vowels: "weke" => "wake"
Any combination of the above types of error in a single word should be corrected (e.g. "CUNsperrICY" => "conspiracy").

If there are many possible corrections of an input word, your program
can choose one in any way you like. It just has to be an English word
that is a spelling correction of the input by the above rules.

Final step: Write a second program that *generates* words with spelling mistakes of the above form, starting with correctly spelled English words. Pipe its output into the first program and verify that there are no occurrences of "NO SUGGESTION" in the output.

Install

Clone the repository and just launch the word checker program:

python word_checker

It will ask you a word and check the existence of this word in your "/usr/share/dict/words" by default you can run the program in virtualenv with:

source bootstrap
python word_checker

you can pipe the mistake_generator with:

python mistakes_generator | python word_checker	

or simple run the mistake generator:

python mistakes_generator

both script accepte a -f word/dictionary/path, default is "/usr/share/dict/words":

python word_checker -f test_words.text
python mistakes_generator -f test_words.text

Tests

You can run the unit tests with:

python word_checker_tests