Skip to content

Commit

Permalink
Cleaned up improvements list and added some useful info to NOTES
Browse files Browse the repository at this point in the history
  • Loading branch information
tkedwards committed Dec 30, 2016
1 parent 0ea9f31 commit 1f29b4a
Show file tree
Hide file tree
Showing 2 changed files with 10 additions and 18 deletions.
7 changes: 7 additions & 0 deletions NOTES
Original file line number Diff line number Diff line change
Expand Up @@ -15,6 +15,13 @@ https://help.github.com/articles/configuring-a-publishing-source-for-github-page

Note that the old 'Automatic Page Generator' has been deprecated: Github web UI->(repository)Settings->Automatic Page Generator (see https://help.github.com/articles/what-is-github-pages/)

**Dictionaries and Formats**
To see how many 'headwords' are in a dictionary count the number of lines in its .index file, e.g:
wc -l wikt-deu-eng.index
62824 wikt-deu-eng.index

Dictionaries are stored under /usr/share/dictd

**Packaging**
To create Debian packages for a newly added language, e.g. French (ISO code fra):
- Copy one of the existing stanzas in debian/control for both wikt-eng-fra and wikt-fra-eng
Expand Down
21 changes: 3 additions & 18 deletions TODO.txt
Original file line number Diff line number Diff line change
@@ -1,21 +1,7 @@
**Non-Program Improvements**
- Create an Ubuntu PPA for the .deb files

- upload packages to Debian project and to website. Sign packages with [email protected] gpg key (gpg currently defaults to old tkedwards key)

**Bug Fix improvements**
- Catch exceptions and print the data that was being processed at the time, e.g this happened while processing for Norwegian:
File "/home/tim/devel/wiktionarytodict/wiktionarytodict.py", line 46, in process
TextElement
self.parseTranslations(content)
File "/home/tim/devel/wiktionarytodict/wiktionarytodict.py", line 67, in parseTr
anslations
self.parseTranslationline(aline, currentMeaning)
File "/home/tim/devel/wiktionarytodict/wiktionarytodict.py", line 75, in parseTr
anslationline
rawtranslation = aline.split(':')[2].lstrip()
IndexError: list index out of range
- upload packages to Debian project and to website. Sign packages with [email protected] gpg key

**Improvements**
- better formatting:
* when going from a language that has noun genders such as Spanish back to English it should show the noun gender on its own line. Look at the dictionary format from trans-de-en e.g:
Haus
Expand All @@ -29,8 +15,6 @@ IndexError: list index out of range
(a place for recreation)
parque (m)

**New Features requiring Significant Change or Work**

- deal with {{qualifier|someword}} properly

- instead of just using wiktionary data bring in data from other dictionaries, e.g Freedict, or even other language Wiktionaries (e.g. fuse data from Spanish and English wiktionaries to make Eng<->Spa dictionary). See Quickdic Android app for an example
Expand All @@ -44,5 +28,6 @@ Examples of sub-languages are Nynorsk and Bokmål as the two alternate writing s

- have wiktionarytodict create StartDict format files http://www.stardict.org/HowToCreateDictionary

- implement a progress bar/display during creation, see python3-progressbar and https://stackoverflow.com/questions/1001871/showing-progress-of-pythons-xml-parser-when-loading-a-huge-file


0 comments on commit 1f29b4a

Please sign in to comment.