I am an owner of a now-defunct Yahoo Group. Apparently, it's been announced that Yahoo Groups is to remove all content on December 14, 2019, and I would like to save the archive of messages to that group. For posterity, or maybe for vanity, never mind.
Now, on the group page, I do have access to the message archive, by month, and then by message title. But what I want is to get all messages, at once. I'm not very picky about the exact format (e.g. separate files, one file per month, one single file), as long as there's no junk in it (ads, loads and loads of Yahoo boilerplate HTML).
Is there a way – other than crawling all the message pages myself – to download all those messages?
Best Answer
There's an option in Yahoo Groups to download Groups Data. I submitted a request but I haven't heard back yet so I can't verify if it's the solution to our problem: https://groups.yahoo.com/neo/getmydata
In the meantime, I like this script: https://github.com/IgnoredAmbience/yahoo-group-archiver (Thanks @tripleee in the comments).
This script downloads all files, photos, and more.
You'll need two Cookie values. I describe how to find them in Chrome below.
To use this new script I had to:
1. cd into the repo
2. pip install -r requirements.txt (best practice is to use a virtualenv)
3. ./yahoo.py -ct "<T_cookie>" -cy "<Y_cookie>" "<groupid>"
The <groupid> is found in the URL: https://groups.yahoo.com/neo/groups/GROUPID.
My final input looked like this:
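For illustration only (this is not my actual input), here is a minimal Python sketch of how the pieces of that command fit together. It pulls the group ID out of a group URL and assembles the argument list for yahoo.py using the -ct and -cy flags mentioned above. The helper names group_id_from_url and build_command are made up for this example, and the cookie strings are placeholders.

```python
import shlex
from typing import List
from urllib.parse import urlparse


def group_id_from_url(url: str) -> str:
    """Return the path segment that follows 'groups' in a Yahoo Groups URL.

    Assumes the URL shape shown above: /neo/groups/<groupid>[/...]
    """
    parts = [p for p in urlparse(url).path.split("/") if p]
    if "groups" not in parts:
        raise ValueError("no 'groups' segment in URL path: %r" % url)
    idx = parts.index("groups")
    if idx + 1 >= len(parts):
        raise ValueError("no group ID after 'groups' in URL path: %r" % url)
    return parts[idx + 1]


def build_command(t_cookie: str, y_cookie: str, group_id: str) -> List[str]:
    """Assemble the argument list for the archiver script (hypothetical helper)."""
    return ["./yahoo.py", "-ct", t_cookie, "-cy", y_cookie, group_id]


if __name__ == "__main__":
    gid = group_id_from_url("https://groups.yahoo.com/neo/groups/GROUPID")
    # Placeholder cookie values; substitute the real T and Y contents.
    cmd = build_command("<T_cookie>", "<Y_cookie>", gid)
    # shlex.join quotes each argument so the line can be pasted into a shell.
    print(shlex.join(cmd))
```

Passing the list to subprocess.run instead of printing it would avoid shell quoting problems altogether, since cookie values tend to contain characters such as & and =.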
How to find the cookie values wasn't obvious at first. Using Chrome, this is how I got them:
Expand the "yahoo.com" option:
Go into T and Y one at a time and copy their "Content" values to use in the CLI input above.
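If you already have the whole cookie string in hand (for example, a "Cookie:" request header copied from the browser's developer tools, which is an assumption on my part and not the route described above), a short Python sketch like the following could pick out the T and Y values. The function names and the sample header are hypothetical, and the sample values are made up.

```python
from typing import Dict, Tuple


def parse_cookie_header(header: str) -> Dict[str, str]:
    """Split a 'name=value; name=value' cookie string into a dict.

    Only the first '=' separates name from value, because the values
    themselves may contain '=' characters.
    """
    cookies = {}
    for pair in header.split(";"):
        name, sep, value = pair.strip().partition("=")
        if sep:  # ignore fragments that have no '=' at all
            cookies[name] = value
    return cookies


def t_and_y(header: str) -> Tuple[str, str]:
    """Return the (T, Y) cookie values, or raise KeyError if either is absent."""
    cookies = parse_cookie_header(header)
    missing = [name for name in ("T", "Y") if name not in cookies]
    if missing:
        raise KeyError("missing cookie(s): " + ", ".join(missing))
    return cookies["T"], cookies["Y"]


if __name__ == "__main__":
    # Made-up sample header; real values are much longer.
    sample = "B=abc123; T=z=1&a=2; Y=v=1&n=x"
    t_value, y_value = t_and_y(sample)
    print("T:", t_value)
    print("Y:", y_value)
```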
In case you're interested, one of the Yahoo Groups that I follow is considering paying Groups.io to transfer the group over to the Groups.io site. They were quoted $220.
https://groups.io/static/transfer