@internetarchive

Internet Archive

The Internet Archive is "the library of the Internet", and a big supporter of Free Software.

Pinned

  1. openlibrary Public

    One webpage for every book ever published!

    Python · 5.5k stars · 1.5k forks

  2. bookreader Public

    The Internet Archive BookReader

    JavaScript · 1k stars · 431 forks

  3. heritrix3 Public

    Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.

    Java · 2.9k stars · 763 forks

  4. cicd Public

    Build and test using the GitHub registry; deploy to Nomad clusters

    15

Repositories

Showing 10 of 255 repositories
  • brozzler Public

    brozzler - distributed browser-based web crawler

    Python · 687 stars · Apache-2.0 · 100 forks · 33 issues · 17 pull requests · Updated Mar 5, 2025
  • iaux-reviews Public

    Web component for displaying and editing Internet Archive reviews

    JavaScript · 0 stars · AGPL-3.0 · 0 forks · 1 issue · 2 pull requests · Updated Mar 5, 2025
  • openlibrary Public

    One webpage for every book ever published!

    Python · 5,496 stars · AGPL-3.0 · 1,481 forks · 790 issues (30 need help) · 142 pull requests · Updated Mar 5, 2025
  • bookreader Public

    The Internet Archive BookReader

    JavaScript · 1,029 stars · AGPL-3.0 · 431 forks · 136 issues (3 need help) · 91 pull requests · Updated Mar 5, 2025
  • trendmachine Public

    A mathematical model that calculates a normalized score quantifying the temporal resilience of a web page, treated as time-series data, based on historical observations of the page in web archives.

    Python · 7 stars · AGPL-3.0 · 1 fork · 0 issues · 0 pull requests · Updated Mar 5, 2025
  • iaux-typescript-wc-template Public template

    IAUX Typescript WebComponent Template

    TypeScript · 7 stars · AGPL-3.0 · 3 forks · 3 issues · 13 pull requests · Updated Mar 5, 2025
  • wayback-discover-diff Public Forked from ftsalamp/wayback-discover-diff

    A Python 3.6+ application that calculates and returns simhash values for Internet Archive's snapshots

    Python · 9 stars · 8 forks · 1 issue · 0 pull requests · Updated Mar 5, 2025
  • iaux Public

    Monorepo for Archive.org UX development and prototyping.

    JavaScript · 70 stars · AGPL-3.0 · 87 forks · 89 issues (5 need help) · 147 pull requests · Updated Mar 5, 2025
  • iaux-notification-toast Public

    Displays notifications and automatically clears them

    TypeScript · 0 stars · AGPL-3.0 · 0 forks · 1 issue · 12 pull requests · Updated Mar 4, 2025
  • iaux-item-metadata
    TypeScript · 0 stars · AGPL-3.0 · 0 forks · 1 issue · 0 pull requests · Updated Mar 4, 2025