In case it does go down for good, I’ve taken a mirror of the current content and set up a site to host it. I wanted to post here to explain what I’ve done and answer any questions.

I set up a hosts file entry so I can access the original content, and scraped everything that’s reachable from the main page. That should be everything under /files (the FTP section) and a lot (but not all; see below) of the Web section.

**What if this is just a temporary outage?**

If the original comes back up, I’ll retire my copy and possibly set up a cron job to keep an up-to-date copy available in case we lose it again.

**Why isn’t my favourite collection there?**

There are a number of collections that aren’t reachable simply by following links from the homepage. For example, Kristen’s Collection isn’t linked anywhere. Because my scrape only followed the links from the homepage, it didn’t find these, so they’re not there. Where I’ve been made aware of one, I’ve manually scraped that collection specifically. If there’s content you enjoy that isn’t currently available on the xyz site, please let me know and I’ll scrape it.

For now, nothing. If the original is gone for good, I’ll start looking at maintaining some of the content in a couple of months. By that, I mean updating the links on the homepage to show any content that wasn’t previously linked, and fixing or removing stuff that doesn’t work on the mirror, such as search. I’m not sure what arrangements the original site had for contributors, but I’ll probably start by looking at the most recent content and reaching out to the authors, where contact information is listed, to see how we can move forward.

What I won’t be doing is adding cookies, tracking, or any of that kind of stuff. This collection is old school internet and I believe it should stay that way.
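
For anyone wondering why a scrape that only follows links from the homepage misses some collections, here is a rough sketch of the idea in Python. It is not the tool I actually used. The site map is a made-up dictionary, and every path in it other than `/files/` is hypothetical, there purely for illustration.

```python
from collections import deque


def reachable_pages(links, start):
    """Return every page reachable from `start` by following links."""
    seen = {start}
    queue = deque([start])
    while queue:
        page = queue.popleft()
        # Pages with no outgoing links simply have no entry in `links`.
        for target in links.get(page, []):
            if target not in seen:
                seen.add(target)
                queue.append(target)
    return seen


# Hypothetical site map: page -> pages it links to.
# The unlinked collection exists on the server, but nothing points at it.
site = {
    "/": ["/files/", "/web/"],
    "/files/": ["/files/a.txt"],
    "/web/": ["/web/linked-collection/"],
    "/web/unlinked-collection/": ["/web/unlinked-collection/story.txt"],
}

found = reachable_pages(site, "/")
missed = set(site) - found
print(sorted(missed))  # ['/web/unlinked-collection/']
```

A crawl like this can only ever find what some page links to, which is why collections that were never linked have to be pointed out and scraped by hand.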