Archive.org – How to Archive a Whole Website

archive.org

To save a single page with the Wayback Machine, I can go to:

http://web.archive.org/save/https://somewebsite.example.com/

How can I archive a whole website recursively with the Wayback Machine?

There are projects such as wayback-machine-downloader, but I'm looking for a feature that lets me upload the website recursively.

Best Answer

Since the Wayback Machine doesn't provide such a feature, I've found a workaround.

First, mirror the website using wget, e.g.
```
wget -m https://example.com/
```
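Then use curl to submit each downloaded page to the Wayback Machine, one by one. Run this from the directory where you ran wget, and pass find the directory that wget created (named after the host), so that each path it prints is the host followed by the page path:
```
# example.com is the mirror directory created by wget -m
find example.com -type f -name "*.html" -exec curl -v "https://web.archive.org/save/https://{}" ';'
```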
Note: You can change .html to .php, or include other file types.
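The two steps above can also be wrapped in a small script. The following is a minimal sketch, not a tested tool: the names `path_to_url` and `archive_mirror` are made up for illustration, it assumes the site is served over HTTPS, and the 5-second pause is an arbitrary courtesy delay rather than a documented limit. It also strips a leading `./` from the paths that find prints, so that the host name ends up directly after `https://`. Note that wget stores a directory URL such as `https://example.com/` as `example.com/index.html`, so the submitted URL may differ slightly from the one a visitor would use.

```shell
#!/bin/sh
# Sketch: submit every mirrored HTML page to the Wayback Machine save endpoint.
# path_to_url and archive_mirror are illustrative names, not part of any tool.

# Turn a path inside the wget mirror (e.g. ./example.com/a/index.html)
# into the URL it was fetched from, assuming the site uses HTTPS.
path_to_url() {
    printf 'https://%s\n' "${1#./}"
}

# Walk the mirror directory and request a capture for each HTML file.
# $1 is the directory created by wget -m (named after the host).
archive_mirror() {
    find "$1" -type f -name '*.html' | while IFS= read -r path; do
        url=$(path_to_url "$path")
        echo "Archiving $url"
        curl -s -o /dev/null "https://web.archive.org/save/$url"
        sleep 5  # arbitrary pause between requests (assumption)
    done
}
```

After defining these functions in your shell, or appending the call to the end of the script, run `archive_mirror example.com` from the directory where wget was run.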
