Archive.org – How to Archive a Whole Website

To save a single page via the Wayback Machine I can go to:

How can I archive a whole website recursively with the Wayback Machine?


There are projects such as wayback-machine-downloader, but I'm looking for a feature that lets me upload the website recursively.

Best Answer

Since the Wayback Machine doesn't provide such a feature, I've found a workaround.

  1. First, mirror the website using wget, e.g.

    wget -m https://example.com/
    
  2. Then use curl to submit each downloaded page to the Wayback Machine, one by one.

    # Start find at the mirror directory so paths begin with the hostname, not "./".
    find example.com -type f -name "*.html" -exec curl -v "https://web.archive.org/save/https://{}" ';'
    

    Note: You can change .html to .php, or match other file types instead.
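
If you want to check the URLs before submitting anything, the two steps can be wrapped in a small script. This is only a rough sketch: the names to_save_url, archive_mirror and DRY_RUN are made up for illustration, the 5-second pause is a guess rather than a documented limit, and it assumes the site is served over https and that you run it from the directory where wget created example.com/. It also strips the leading "./" that find prints when started at ".", since that prefix would otherwise end up inside the URL.

```shell
#!/bin/sh
# Sketch: turn files mirrored by `wget -m` into Wayback Machine save URLs.
# to_save_url, archive_mirror and DRY_RUN are hypothetical names.

# Map a mirrored file path such as ./example.com/about.html
# to https://web.archive.org/save/https://example.com/about.html
to_save_url() {
    path=${1#./}    # drop a leading "./" if find printed one
    printf 'https://web.archive.org/save/https://%s\n' "$path"
}

# Walk the mirror directory given as $1 and handle each .html file in turn.
archive_mirror() {
    find "$1" -type f -name '*.html' | while IFS= read -r file; do
        url=$(to_save_url "$file")
        if [ "${DRY_RUN:-1}" = 1 ]; then
            echo "$url"                     # dry run: only print the URL
        else
            curl -s -o /dev/null "$url"     # ask the Wayback Machine to save it
            sleep 5                         # assumed pause between requests
        fi
    done
}
```

Calling `archive_mirror example.com` only prints the save URLs by default; setting `DRY_RUN=0` makes it call curl for each one. Pages are submitted under the file names wget saved, so a page stored as index.html is archived as /index.html rather than /.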