Explore projects
-
empty deriver image - @see https://git.archive.org/ia/petabox/-/blob/master/chart/build.yml
Updated -
Updated
-
combine audio and unaligned captions into aligned captions
Updated -
www / iaux-sr-only
GNU Affero General Public License v3.0a web component that visually hides something but keeps it available for screen readers. made with lit element.
Updated -
See https://git.archive.org/www/tesseract/ instead
Updated -
Updated
-
archivecd / tesseract
Apache License 2.0Updated -
Updated
-
Merlijn Wajer / Python Derivermodule
GNU Affero General Public License v3.0Shared code for Python deriver modules
Updated -
Merlijn Wajer / archive-hocr-tools
GNU Affero General Public License v3.0Updated -
Merlijn Wajer / hocr-tools
Apache License 2.0Updated -
Merlijn Wajer / archive-pdf-tools
GNU Affero General Public License v3.0Updated -
www / Tesseract
GNU Affero General Public License v3.0Tesseract deriver module used to OCR items with tesseract. Outputs hOCR and various metadata keys.
Updated -
Christian Clauss / morituri
GNU General Public License v3.0 onlyUpdated -
-
Merlijn Wajer / dhSegment
GNU General Public License v3.0 onlyUpdated -
Tesserotate (aka tesseract-baselines-rotate)
Calculates image rotation to rotate them to their proper orientation with tesseract, using the baselines as found by the OSD
Updated -
-
Machine Learning applied to deskew and autocrop books and microfilm.
Contains tools to manage datasets, also contains actual ML scripts - using tensorflow and dhSegment
Updated