Explore projects
-
-
Updated
-
-
This project is also mirror in github. https://github.com/internetarchive/dwebsummit-site
Updated -
-
Dat/CDL/IA dweb preservation network pilot: scripts, notes, etc (circa 2018)
archived 0Updated -
Updated
-
Automatic extraction data (e.g. content, title and etc) from archived news pages
Updated -
Updated
-
Updated
-
Updated
-
produces ffmpeg binary into a docker run-able nvidia runtime container that can run on GPU-enabled VMs/baremetals.
Updated -
Machine Learning applied to deskew and autocrop books and microfilm.
Contains tools to manage datasets, also contains actual ML scripts - using tensorflow and dhSegment
Updated -
combine audio and unaligned captions into aligned captions
Updated -
-