Explore projects
-
Updated
-
archived 0Updated
-
BibTeX-to-web for Web Archiving and Internet Archive stuff (based on anonbib): https://bnewbold.the-nsa.org/pub/archivebib/
Updated -
Updated
-
archived 0Updated
-
Updated
-
This repository contains scripts for retrieving metadata from external sources for the acdc collection.
Updated -
-
-
Arthur Milliken / morituri
GNU General Public License v3.0 onlyUpdated -
Automatic extraction data (e.g. content, title and etc) from archived news pages
Updated -
archivecd / pagenet
BSD 3-Clause "New" or "Revised" LicenseUpdated -
Create 'perfect' CDs from CD items or CDs from LP items, or segment LP side recordings into separate tracks
Updated -
Christian Clauss / morituri
GNU General Public License v3.0 onlyUpdated -
Merlijn Wajer / hocr-tools
Apache License 2.0Updated -
Updated
-
Updated
-
See https://git.archive.org/www/tesseract/ instead
Updated -
Merlijn Wajer / ocr-dataset
GNU Affero General Public License v3.0Updated -
Temporary repository for implementations of https://arxiv.org/pdf/1905.13038.pdf
Will contain at least a Cython implementation and a C implementation (using leptonica to read images)
Updated