Browsertrix is the high-fidelity, browser-based crawling service from Webrecorder designed to make web archiving easier and more accessible for all! Archive whole websites with automated crawling, analyze them with our assistive QA tools, and combine them with previously captured uploaded content to share with others.
ArchiveWeb.page is a Chrome extension and standalone desktop app that allows you to archive websites interactively as you browse. The extension works in any Chromium-based browser (Chrome, Brave, Edge), and the desktop app provides the same interactive, high-fidelity archiving functionality as a standalone application.
ReplayWeb.page provides an embeddable web archive viewer for WARC and WACZ web archives hosted on the web, your local computer, or Google Drive. ReplayWeb.page is also available as a PWA or desktop application for offline use.
pywb toolkit is a full-featured, advanced web archiving capture and replay framework for Python. It provides command-line tools and an extensible framework for high-fidelity web archive access and capture, including localization and access control. A subset of features provides the basic functionality also known as a 'wayback machine', but pywb includes additional features to create new web archives and to manage existing collections.
In addition to the key tools above, we maintain numerous smaller tools as part of the web archiving ecosystem. Take a look at these tools if you are interested in deploying web archiving tools on your own or integrating them into other projects.
All currently maintained Webrecorder tools are listed below. Select one of the
categories to further filter this list.
A set of automated behaviors for interacting with the browser, including generic behaviors (playing video, scrolling) and site-specific behaviors, such as those for social media.
A command-line tool to convert on-disk directories of web documents (commonly HTML, web assets, and any other data files) into ISO-standard web archive (WARC) files.