Skip to content
@ArchiveBox

ArchiveBox

The self-hosted internet archiving solution maintained by @pirate. #webarchiving #internetarchiving #digipres

Pinned Loading

  1. ArchiveBox ArchiveBox Public

    🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...

    Python 22.2k 1.2k

  2. abx-spec-behaviors abx-spec-behaviors Public

    Proposal for a shared user script specification between scraping, crawling, archiving, and AI tools. Allows user scripts like "expand comments", "hide popups", "fill out this form", etc. to be reus…

    JavaScript 4

  3. abx-dl abx-dl Public

    ⬇️ A CLI tool to download all discovered content from a URL (like youtube-dl/yt-dlp, forum-dl, gallery-dl). 🎭 Uses headless Chrome to get HTML, JS, CSS, images/video/audio/subtitles, PDFs, screensh…

    Python 17 2

  4. archivebox-browser-extension archivebox-browser-extension Public

    Official ArchiveBox browser extension: automatically/manually preserve your browsing history using ArchiveBox.

    TypeScript 244 21

  5. abx-pkg abx-pkg Public

    📦 Modern Python library for managing system dependencies with package managers like apt, brew, pip, npm, etc.

    Python 14

  6. good-karma-kit good-karma-kit Public

    😇 A Docker Compose bundle to run on servers with spare CPU, RAM, disk, and bandwidth to help the world. Includes Tor, ArchiveWarrior, BOINC, and more...

    318 8

Repositories

Showing 10 of 18 repositories
  • abx-spec-behaviors Public

    Proposal for a shared user script specification between scraping, crawling, archiving, and AI tools. Allows user scripts like "expand comments", "hide popups", "fill out this form", etc. to be reusable across pure browser environments, puppeteer, playwright, extensions, and many other contexts with minimal adjustments.

    ArchiveBox/abx-spec-behaviors’s past year of commit activity
    JavaScript 4 MIT 0 0 0 Updated Nov 9, 2024
  • abx-pkg Public

    📦 Modern Python library for managing system dependencies with package managers like apt, brew, pip, npm, etc.

    ArchiveBox/abx-pkg’s past year of commit activity
    Python 14 MIT 0 0 0 Updated Nov 7, 2024
  • ArchiveBox Public

    🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...

    ArchiveBox/ArchiveBox’s past year of commit activity
  • abx-dl Public

    ⬇️ A CLI tool to download all discovered content from a URL (like youtube-dl/yt-dlp, forum-dl, gallery-dl). 🎭 Uses headless Chrome to get HTML, JS, CSS, images/video/audio/subtitles, PDFs, screenshots, article text, git srcs, and more...

    ArchiveBox/abx-dl’s past year of commit activity
    Python 17 MIT 2 0 0 Updated Oct 21, 2024
  • docker-archivebox Public

    Home of the official docker image for ArchiveBox

    ArchiveBox/docker-archivebox’s past year of commit activity
    47 GPL-3.0 12 1 1 Updated Oct 16, 2024
  • pip-archivebox Public archive

    Official Python package for ArchiveBox, the self-hosted internet archiving solution.

    ArchiveBox/pip-archivebox’s past year of commit activity
    13 GPL-3.0 2 0 7 Updated Oct 5, 2024
  • homebrew-archivebox Public archive

    Homebrew formula for the ArchiveBox self-hosted internet archiving solution.

    ArchiveBox/homebrew-archivebox’s past year of commit activity
    Ruby 26 GPL-3.0 3 0 0 Updated Oct 5, 2024
  • debian-archivebox Public archive

    Home of the official apt/deb package for Ubuntu/Debian-based systems.

    ArchiveBox/debian-archivebox’s past year of commit activity
    Python 17 GPL-3.0 5 0 1 Updated Oct 5, 2024
  • docs Public

    Source for the Github Wiki / ReadTheDocs documentation for AchiveBox, the self-hosted internet archiving solution.

    ArchiveBox/docs’s past year of commit activity
    CSS 14 4 0 1 Updated Oct 5, 2024
  • readability-extractor Public

    Javascript/Node wrapper around Mozilla's Readability library so that ArchiveBox can call it as a oneshot CLI command to extract each page's article text.

    ArchiveBox/readability-extractor’s past year of commit activity
    JavaScript 37 13 0 2 Updated Sep 16, 2024