Explore projects
www / www
GNU Affero General Public License v3.0
Lightweight, JS-only, slimmed-down archive.org website prototype
Merlijn Wajer / archive-pdf-tools
GNU Affero General Public License v3.0
ia / dwebsummit-site
GNU Affero General Public License v3.0
This project is also mirrored on GitHub: https://github.com/internetarchive/dwebsummit-site
This repository contains utilities for working with the 78rpm collection.
ansible-roles-contrib / ansible-role-php
MIT License
ansible-roles-contrib / ansible-wordpress
MIT License
Dat/CDL/IA dweb preservation network pilot: scripts, notes, etc. (circa 2018)
archived
ansible-roles-contrib / statsd
MIT License
archivecd / cdparanoia
GNU Lesser General Public License v2.1 only
Extract structured metadata and content from article PDFs; use this to match against databases of known identifiers.
archived
Automatic extraction of data (e.g., content, title, etc.) from archived news pages
Combine audio and unaligned captions into aligned captions
Merlijn Wajer / hocr-tools
Apache License 2.0