Releases: OCR-D/ocrd_all
Releases · OCR-D/ocrd_all
v2021-07-21
cor-asv-ann e919020..1b8c120
Release: v0.1.12
- 📦 0.1.12
- wrapper.evaluate: allow aggregating histograms, add parameter
- scripts.compare: allow aggregating histograms
- lib.seq2seq / scripts.eval: allow aggregating histograms
- lib.alignment: allow aggregating histograms
- wrapper.evaluate: rename reported metrics and measure WER, too; expose gt_level parameter
- scripts.compare: rename reported metrics and measure WER, too
- lib.alignment: apply equivalences when aggregating confusion, too
- lib.alignment: more level 1/2 equivalences/normalizations
- lib.alignment: remove commented experimental code for other aligners
- 📦 0.1.11
- update README
- lib.alignment: no gaps for confusion counts
core d2c6216f..271dcdf6
Release: v2.25.1
- Merge pull request #701 from OCR-D/apidocs-standalone
- 📦 v2.25.1
- Merge remote-tracking branch 'origin/gds-id'
- 📦 v2.25.0
- 📝 changelog
- Merge remote-tracking branch 'bertsky/ocrd_page_etree'
- 📝 changelog
- Merge remote-tracking branch 'bertsky/fix-recrop' ...
v2021-05-21
Hotfix (v2021-05-20 was an incomplete merge)
v2021-05-20
ocrd_anybaseocr 6fe756a..e930c93
Release: v1.6.0
- 📦 v1.6.0
- ragequit debugging block-segmentation, disable it
- replace keras.saving with the up-to-date location
- logging doesn't like standalone % signs
- keras.engine.Layer is now tensorflow.keras.layers.Layer
- fix keras imports
- 📦 v1.5.0
- Merge branch 'fix-tiseg'
- 📦 v1.4.1
- hotfix: re-introduce ocrd-fork-ocropy requirement
- 📦 v1.4.0
- cropping: edge detection: make position priority peak at 30% of margin and decrease slower inwards
- cropping: allow steeper angles for badly skewed/keystoned images
- cropping: apply detected ruler to edge detector result, too
- cropping: fix for grayscale input
- cropping: improve bombast/outdated comment header
- cropping: reduce default left/right margin zones
- cropping: use more adequate LSD parameters to make line reconstruction via line segment clustering easier
- cropping: crop in last derived image (including binarized) and use transparency and bg-filling
- cropping: add docstring
- cropping: allow plotting debug images to files
- cropping: fix 0e00b3e7 (absolute vs relative position)
- cropping: edge detection: include wider range of line segments
- cropping: edge detection: make position priority peak at 10% of margin
- cropping: fix 370005b3 (polygon ordering)
- cropping: make everything DPI-relative (assuming 300 DPI as natural scale)
- cropping: use intersections from edge detection as polygon instead of bbox, and pad it inwardly
- cropping: ignore margin-only columns before merging
- cropping: edge detection: make consistency criterion more robust
- cropping: edge detection: increase length priority over position
- cropping: edge detection: fix gap criterion
- cropping: prefer line-based detector if successful on all 4 sides
- cropping: more logging
- cropping: lower fg density text boxes, merge nearby columns before removing small ones
- cropping: ruler detection does not binarization (but mask instead of clip it)
- 📦 1.4.0
- cropping: ignore margin-only text areas
- cropping: rewrite edge detection based algorithm…
- cropping: rename and describe parameters in a sane way
- cropping: use existing parameters for same things everywhere
- cropping: for detection based on textbox estimation, use all contours recursively (not just outer) and use exact contour area (without drawing to intermediate image)
- cropping: paste improved pil2array, remove legacy ocrolib dependency
- cropping: PIL/ocrolib uses RGB (not BGR) channels
- cropping: algorithm needs raw (not binarized) images
dinglehopper bd32433..0b9af3a
- Merge pull request #58 from kba/unorderedgroupindexed
- 🚧 dinglehopper: Read PAGE UnorderedGroup in XML order
eynollah 26283c6..80795c9
Release: v0.0.5
- 📦 v0.0.5
- Merge pull request #42 from qurator-spk/ocrd-noae
- 📦 v0.0.4
- fixed #40 and separators are also written in xml
- 📦 v0.0.3
- Merge branch 'main' of https://github.com/qurator-spk/eynollah into main
- fixed #38
cor-asv-ann 6c831fa..e919020
Release: v0.1.10
- 📦 0.1.10
- forgot compare script
sbb_binarization 4a5544d..8dd0506
Release: v0.0.8
- 📦 v0.0.8
- resolving error for inputs whcih have smaller scale than model patch
- Merge pull request #28 from qurator-spk/fix-unbound-var
v2021-05-04
eynollah 12f5b29..26283c6
Release: v0.0.2
- 📦 v0.0.2
- resolving issue qurator-spk/eynollah#38
- adding the binarization model and option to binarize input document for the cases like dark, stronly bright and other ones
- strong erosion, more modification
- Merge pull request #36 from mikegerber/fix/ocrd-core-req
- Update eynollah.py
- Merge branch 'main' of https://github.com/qurator-spk/eynollah into main
- fixing again the error raised because of erosion
sbb_textline_detector 95d8f70..f1d2777
- Merge pull request #57 from mikegerber/fix/setup-namespace
v2021-04-25
v2021-03-15
core 1df3f45..1aabf82
Release: v2.23.2
- 📦 v2.23.2
- add updated sbb_binarization models, ht @vahidrezanezhad
tesseract fe614c60..c676d5bc
Release: 5.0.0-alpha-20201231
- STRING is no longer required for Visual Studio
- Remove last usage of STRING
- Remove unused parts of class STRING
- Replace remaining STRING by std::string in unittest
- Replace remaining STRING by std::string in src/training
- Replace all but one remaining STRING by std::string in src/ccstruct
- Replace remaining STRING by std::string in src/dict
- Replace remaining STRING by std::string in src/viewer and src/wordrec
- Replace remaining STRING by std::string in src/classify
- Replace remaining STRING by std::string in src/ccmain
- Remove unused file.
- Replace typedef structs with structs.
- snprintf will add '\0' at the end itself.
- [training] Modernize.
- [unittest] Use more smart ptrs, more std::make_unique instead of .reset(new T()).
- Merge pull request #3327 from Shreeshrii/patch-1
- [capi] Restore some of wrongly removed apis.
- Misc.
- Merge pull request #3325 from stweil/string
- Merge pull request #3324 from stweil/nt_symclip
- Remove unused include statements for strngs.h
- [readme] Require C++17 for building.
- Add missing export header.
- [clang-format] Format generated protobuf source.
- [clang-format] Format unit tests.
- Include missing config_auto.h
- Add missing forward decl.
- [clang-format] Execute clang format on include and src dirs.
- [clang-format] Update config.
- Revert "Format public API files with 'clang-format-11 -i include/tesseract/*.h'"
- Format public API files with 'clang-format-11 -i include/tesseract/*.h'
- Fix tatweel_test for C++-20
- Replace GenericVector by std::vector in class ImageData
- Merge pull request #3323 from Shreeshrii/ci
- autobuild: Require C++17
- Fix build (#3322)
- Fix linux build.
- Pass empty strings into Tesseract::init_tesseract().
- Pass init strings as std::string instead of const char * internally. This does not affect public APIs.
- Remove STRING::size() method.
- Remove unused STRING::strdup().
- Remove unused STRING::unsigned_size().
- Replace STRING::truncate_at() with resize().
- More std::string replacements.
- [cmake] Require C++17.
- Clean implementation for (de-)serialization of a vector
- Update .travis.yml
- Fix serialization for vector of RecodedCharID
- Travis: build linux matrix (#3320)
- Merge pull request #3318 from Shreeshrii/travis
- Add heuristic to fix swap (partially fixes issue #2586)
- Replace GenericVector by std::vector
- Fix two compiler warnings for serialis.h
v2020-03-08
core 0ca5aab..a32857d
Release: v2.23.1
- 📦 v2.23.1
- Merge pull request #679 from OCR-D/logging-test-fix
- Merge pull request #680 from OCR-D/resmgr-tess-config
- Merge pull request #677 from OCR-D/python-versions
ocrd_tesserocr fd17386..b755b26
Release: v0.12.0
ocrd_pc_segmentation 7298475..413f5d3
Release: v0.1.3
- pip install -U pip-tools for compat with pip 20.3+, fix #18
ocrd_wrap 9dfbdb3..5a501df
Release: v0.1.7
- 📦 0.1.7
- skimage-denoise: change option
protect
from bool to pt- 📦 0.1.6
- skimage_denoise: make maxsize in pt instead of pt²
- skimage_denoise: add option
protect
- 📦 0.1.5
- skimage.remove_small_objects expects boolean array
- update link labels
tesseract 51a214a..fe614c6
Release: 5.0.0-alpha-20201231
- Enable less FP exceptions for clang compiler when running tesseract
- GenericVector -> std::vector.
- Allow to serialize std vectors with classes from TFile. Implementation from GenericVector.
- Fix memory leak in DocumentCache
- Avoid fp division by 0 (fix issue #3314)
- Merge pull request #3313 from stweil/learning_rate
v2020-02-27
core fe119e4..0ca5aab
Release: v2.23.0
- 📦 v2.23.0
- 📝 changelog
- Merge branch 'calamari-models'
- 📝 changelog
- Merge branch 'merge-mets'
- generate_page: raise ValueError if no numeric part in start/end
- document .. range operator
- fix ocrdmets test
- generate_range: include end
- implement value range with .. operator for pageId
- 📦 v2.22.4
- 📝 changelog
- Merge pull request #671 from OCR-D/resmgr-cwd-again
- 📝 changelog
- Merge pull request #665 from OCR-D/gds-suppress-stdout
- Merge pull request #664 from OCR-D/fix-docstrings
- Merge pull request #666 from OCR-D/validator-log-debug
- Merge pull request #669 from OCR-D/fix-indentation
dinglehopper 8cd8314..bd32433
- 🚧 dinglehopper: Try out Drone CI
- 🚧 Replace Travis with CircleCI
- Merge pull request #50 from b2m/fix-table-extraction
ocrd_calamari f00df75..da98713
Release: v1.0.2
- 📦 v1.0.2
- 🐛 Fix word and glyph coordinates
- ✔️ Explicitly check for None
- Merge pull request #10 from bertsky/image-features
ocrd_cis 1abf38b..426a33d
Release: v0.1.5
- segment: avoid adding existing regions to RO group if they are not immediate children
ocrd_fileformat df95fd7..86763d1
Release: v0.2.1
- 📦 v0.2.1
- Merge pull request #28 from OCR-D/fix-page-schema
ocrd_pagetopdf 6047bd4..7eb2e4e
Release: v1.0.0
- Install default-jre-headless instead of openjdk-8-jre-headless
ocrd_segment 744261f..8214d35
Release: v0.1.10
- 📦 0.1.10
- extract_page: draw reading-order arrows
- extract-pages: add 'feature_filter' param
- extract-regions: apply 'feature_filter'
- 📦 0.1.9 (extract*: expose param
feature_filter
)- raise pylint karma
- 📦 0.1.8
- replace-page: fix getLogger context
- extract-pages: multi-level mask output via
plot_segmasks
, optional alpha compositing viaplot_overlay
- from-masks: pick default colordict from parameter, omit initial # sign from colorspecs
- extract-pages: expose colordict as parameter (with PageViewer scheme as default)
- extract_words, extract_glyphs: new
opencv-python bbaa777..fd4e604
Release: 48
- disable Qt on macOS for now due to multiple issues
- OpenCV 4.5.1
- fix paths
- Merge branch 'master' of https://github.com/skvark/opencv-python
- merge 3.4 changes
tesseract 5ca4e006..1ab6b0fb
Release: 5.0.0-alpha-20201231
- Merge pull request #3311 from stweil/master
- Don't compile OpenCL code when OpenCL is disabled
- Format code
- Remove blanks at line endings
- Replace GenericVector by std::vector in class WERD_CHOICE
- Update configure.ac
- Merge pull request #3309 from stweil/std
- Merge pull request #3162 from xloem/android
- Merge pull request #3305 from Shreeshrii/vcpkg-cmake-win
- Merge pull request #3302 from Shreeshrii/PR1
- Merge pull request #3300 from Shreeshrii/autocmake
- Handle file list without terminating LF (fix issue #3298)
- Update test submodule
- Update include statements for external header file allheaders.h
- Fix default value for --enable-doc
- Merge pull request #3297 from Shreeshrii/autotools
- Fix default value for --with-archive and --with-curl
- Merge pull request #3292 from stweil/master
- Remove Python training scripts which were moved to tesstrain
- Change LATIN_FONTS to use replacement fonts from TeX Gyre collection
- Merge pull request #3276 from Shreeshrii/travis
- Merge pull request #3279 from Shreeshrii/master
- Merge pull request #3275 from Shreeshrii/ci
- cmake: Use pkg_config to find required libraries
- cmake: Don't try to set compile flags for NEON support for ARM64
- cmake: Activate NEON support for ARM64
- Merge pull request #3270 from Shatur95/fix-set-and-check
- Merge pull request #3268 from Shreeshrii/ci
- Fix path for test-suite.log with flat unittest build in GitHub actions
- Fix build for flat unittest (CPPFLAGS for abseil and gtest)
- Merge pull request #3265 from Shreeshrii/master
- Merge pull request #3264 from stweil/unittest
- Replace GenericVector by std::vector for class SimpleStats
- Fix Memory leak in ligature_table_test
- Merge pull request #3259 from Shreeshrii/ci
- Merge pull request #3257 from tesseract-ocr/ci
- Replace STRING by std::string for function read_unlv_file
- Remove unneeded include statement for genericvector.h
- Remove unneeded include statement for strngs.h
- Wordrec: Modernize code
- Merge pull request #3256 from stweil/mastertrainer_test
- lstmtrainer: Modernize code
- lstmtrainer: Remove unused local variable
- Replace GenericVector by std::vector in LoadFileLinesToStrings
- Reduce number of INFO messages from lstm_test (#3250)
- Merge pull request #3252 from Shreeshrii/legacy
- Remove unused conditional OSX
- Merge pull request #3251 from stweil/master
- Merge pull request #3249 from Shreeshrii/ciupdate
- Merge pull request #3247 from Shreeshrii/ci
- Merge pull request #3244 from Shreeshrii/ci
- Update dailycmake.yml
- Merge pull request #3243 from Shreeshrii/ci
- Merge pull request #3242 from Shreeshrii/ci
- Merge pull request #3241 from Shreeshrii/ci
- Merge pull request #3239 from Shreeshrii/legacy
- Add more information shown by combine_tessdata -l
- Add new option -l for combine_tessdata to list the network string
- Replace STRING by std::string for LSTMRecognizer::network_str_
- Replace STRING by char* in LSTMRecognizer
- Merge pull request #3236 from stweil/max_iterations
- Merge pull request #3235 from Shreeshrii/ci
- Add TESS_API to instantiation of template functions
- Use explicit int32_t for serialized data type
- Replace GenericVector by std::vector
- Remove unnecessary subtest with missing input image
- Remove ANDROID_BUILD macro
- Fix memory allocation in TFile::DeSerialize(std::vector& data)
- Merge pull request #3231 from Shreeshrii/ci
- automake: Fix flat build for doc
- Don't use local CP_RESULT_STRUCT variable to initialize elements of std::vector
- Fix serialization of std::vector (fix issue #3220)
- Replace resize(0) by clear() for std::vector
- Remove --psm 6 for lstm.train in tesstrain.py
- No --psm 6 for lstm.train
- automake: Flat build for doc
- Fix NEON also tesseractmain.cpp
- Fix use of NEON on ARMv8
- Revert "Try to fix aarch64 build on macos."
- Try to fix cmake build.
- Try to fix aarch64 build on macos.
- Fix stringrenderer_test
- Fix ligature_table_test
- Fix pango_font_info_test
- Add assertion for IntCastRounded
- Update sw.yml
- Delete TESSDATA_BEST_DIR macro
- Replace snprintf by strncpy (fix compiler warning)
- Fix some format strings
- Add include path for leptonica for fuzzer build
- Merge pull request #3217 from 1480c1/unicode
- More std::vector.
- Misc.
- Move unicodes into files where they are used.
- Misc.
- Use std::bitset<16> instead of custom BITS16.
- More std::vector.
- Update sw.yml
- Update sw.yml
- Update sw.yml
- Fix brew check for GitHub actions
- Update sw.yml
- Update sw.yml
- Update sw.yml
- Remove remaining parts of emalloc (fix autotools build)
- Fix include statement for allheaders.h
- Sort entries in Makefile.am
- Remove remaining parts of tessopt (fix autotools build)
- [test] Init FC early.
- Remove emalloc. Replace it with malloc. To be replaced with new later.
- Update sw.yml
- Update sw.yml
- ...
v2021-02-03
ocrd_fileformat ae52a2e..df95fd7
Release: v0.2.0
- changelog
- 📦 v0.2.0
- Merge branch 'handle-false-positive-page-alto'
- update ocr-fileformat
sbb_binarization ea69494..4a5544d
Release: v0.0.7
sbb_textline_detector 4c498fc..95d8f70
v2021-01-30
make check
ensures executables are runnable and produce--help
output- Use
ocrd resmgr
instead of wget formake install-models*
core 135acb67..fe119e4d
Release: v2.22.3
- 📦 v2.22.3
- 🐛 resmgr: equivalent cwd bug for list_resource_candidated :/
- 🐛 resmgr: --location cwd should still include "ocrd-resources"
- Processor.init: remember old cwd before chdir
- Proceessor.resolve_resource: log/exit 1 instead of raise
- Proceessor.resolve_resource: No more downloading on-demand
- update assets
- 📦 v2.22.2
- resource list: disable < 1.0 calamari models, rename resource
- 📦 v2.22.1
- 🐛 mkdir -p $HOME/.config/ocrd
- 📦 v2.22.0
- update specs
- Merge pull request #559 from OCR-D/resolve-files
- update assets and spec to 3.11.0
- 📝 changelog
- Merge branch 'warn-unless-pageid'
- Merge pull request #660 from OCR-D/add-guess-mimetype
- Merge pull request #655 from OCR-D/rename-group
Submodule ocrd_anybaseocr contains untracked content
Submodule ocrd_anybaseocr contains modified content
ocrd_anybaseocr de27915..e63f555
Release: v1.3.0
- 📦 v1.3.0
- Merge pull request #78 from OCR-D/processor-resources
- 📦 v1.2.0
ocrd_calamari 1c7fcda..f00df75
Release: v1.0.1
- Merge pull request #55 from OCR-D/ocrd-tool-version
- 📦 v1.0.1
- Merge pull request #53 from OCR-D/resolve-resources
- 📝 README: Use new-style OCR-D parameter syntax and new-style mixed output file groups
- Merge pull request #49 from OCR-D/fix-48
- Merge pull request #52 from OCR-D/checkpoint_dir
- Merge pull request #50 from OCR-D/add-calamari-version
ocrd_cis 925d744..1abf38b
Release: v0.1.5
- re/segment: before spreading lines, assign diacritics to seeds below
- segment: ensure run-time hierarchy refs when modifying reading order elements
- segment: fix typo
- ocrd-cis-align: fix invalid TextEquiv/dataType use
- ocrd-cis-align: fix log level override
- ocrd-cis-align: fix getLogger context
ocrd_im6convert 11bfea3..105697f
Release: v0.0.5
- 📦 v0.0.5
- Merge pull request #16 from OCR-D/sharedir
ocrd_tesserocr 89b73ff..fd17386
Release: v0.11.0
- 📦 v0.11.0
- Merge pull request #166 from OCR-D/processor-resource
ocrd_typegroups_classifier 54c8a78..1ed0cb1
Release: v0.3.0
- 📦 v0.3.0
- Merge pull request #8 from OCR-D/resolve-resources