
Datahoarders became essential. They are the 'Book People' of our Dark Age.
Shiny new torrents are up!!
Current data preservation torrents being seeded: #datahoarder #datapreservation #preservation #uspol #usgovernment #data #DataRescue #torrent
Just discovered ArchiveBox — FOSS, self-hosted internet archiving.
The way the web is going, with the US government redacting and outright erasing historic content, publishers segmenting content by region (and also sometimes redacting/censoring it), and CloudFlare shitting all over everything, I think it's time for me to start my #archiving and #DataHoarding journey.
Amazon will remove the ability to download the ebooks for Kindle at the end of the month. So if you ever close your amazon account, you'll no longer be able to access the books you had bought.
Let's fix that
1. Bulk Exporter: https://github.com/treetrum/amazon-kindle-bulk-downloader
2. Calibre to manage books https://calibre-ebook.com/download
3. Calibre plugin to remove DRM: https://github.com/noDRM/DeDRM_tools/releases
Source: https://bsky.app/profile/remysharp.com/post/3lihtiq2rqc22
So #Youtube should be completely destroyed. Right now, if you are a #DataHoarder, download all the YouTube videos that you consider culturally significant, turn them into torrents, mirror them on
If you're a #contentCreator, YouTube is not really a viable medium for earning revenue: most YouTubers I watch are mainly financed by Patreon. Porn sites pay significantly more and will have better reach, even for educational content. I wouldn't ask anyone to delete their YouTube channel, but please consider alternative platforms alongside it, like #PeerTube
@EposVox is right; if you have the means, it's time to start backing up the web. I'm going to see about hosting my own mirror of the latest .zim copy of Wikipedia. The current administration is going after wrong-think in all its forms, and they have the means to do a hell of a lot of damage if the community doesn't come together to protect our valuable resources.
Title: It's Time to Start Backing Up the Web.
Just wanted to post to encourage people to continue to grab whatever you fancy from government websites. Aside from anything else, if you're a US taxpayer you paid for all these reports, podcasts, blogs, educational material, &c.
The more hands this material gets into, the better.
Join up with an organized effort like #SafeguardingResearch here on the Fedi, or even just save PDFs (as it is my understanding that these can sometimes get missed).
EDIT: ARCHIVE.ORG sets have been fixed. Let the downloading commence!
My US government data hoarding page is up and ready with links and torrents. The torrents are all being seeded by my junkbox torrent server. I will continue to add torrents as I download things.
I usually leave a new laptop sticker-free for a few months. I've had this MacBook Air for over a year and have finally broken it in with a #sticker from @molly0xfff, which arrived last week and is even more timely now than when I ordered it. They are available from https://store.mollywhite.net/collections/stickers
#Archives #DigitalPreservation #DataHoarder
#TIL about the Internet History Initiative (@IHI). It's a website that focuses on historically relevant public data sets. As a #datanerd and #datahoarder of #internet data, I appreciate that something like this spun up.
However, I am shocked that I hadn't heard of it until now, even though it has been online since January 2024! Will definitely start to keep an eye on it.
Edit: Forgot to link the website: internethistoryinitiative.org
How do #windows users handle incremental backups?
A friend is asking for help. He doesn't want to use a cloud, just an external HDD. What he has done until now is copy and paste everything once every other month and hope that nothing is missing.
Any non-proprietary windows software that I can recommend to him that can handle incremental backups? Encryption would also be nice imo
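For anyone unsure what "incremental" means in practice here, a minimal Python sketch follows. It is an illustration of the idea, not a recommendation of a specific tool: the function name `incremental_backup` and the paths in the usage note are made up for this example, it assumes a file counts as changed when its size differs or its modification time is newer than the copy on the external drive, and it does no encryption and never deletes anything from the backup.

```python
import shutil
from pathlib import Path


def incremental_backup(source: Path, target: Path) -> list[Path]:
    """Copy only new or changed files from source to target.

    Hypothetical helper for illustration. Returns the relative paths
    of the files that were actually copied on this run.
    """
    copied = []
    for src_file in sorted(source.rglob("*")):
        if not src_file.is_file():
            continue  # directories are created on demand below
        rel = src_file.relative_to(source)
        dst_file = target / rel
        src_stat = src_file.stat()
        if dst_file.exists():
            dst_stat = dst_file.stat()
            # Assumption: same size and a backup copy that is not older
            # (compared at whole-second precision) means "unchanged".
            unchanged = (
                src_stat.st_size == dst_stat.st_size
                and int(src_stat.st_mtime) <= int(dst_stat.st_mtime)
            )
            if unchanged:
                continue
        dst_file.parent.mkdir(parents=True, exist_ok=True)
        # copy2 also copies the modification time, which is what lets
        # the next run recognise the file as already backed up.
        shutil.copy2(src_file, dst_file)
        copied.append(rel)
    return copied
```

Usage would look something like `incremental_backup(Path("C:/Users/friend/Documents"), Path("E:/backup/Documents"))`, with both paths being placeholders. The second run over an unchanged folder copies nothing, which is the whole point compared with copy-pasting everything every other month. A dedicated backup tool would additionally handle deletions, versioning, and encryption.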
What's the path of least resistance to archive reddit.com threads that are somehow still online?
I've seen that web.archive.org and archive.is can get blocked by reddit's bot detection, and even if not, they don't archive all the permalinks to nested comments etc.
#datahoarder #dataarchival #webarchiving
The hoard's storage. The MyBooks are primary storage and 1st mirror (104TB). The tall black unit is an air-gapped mirror (32TB). The small black unit is the new air-gapped offsite mirror (36TB). #data #datahoarder #datapreservation #backups #storage #harddrives
Another 36TB of Enterprise HDDs ordered for my data hoard. That brings me to ~182TB of raw storage. I have one live pool, 1 live mirror, 1 offline/air-gapped mirror, and 1 off-site offline mirror. No RAID, that shit takes too long on drives this big. It's an expensive hobby but data preservation is becoming more and more important as things disappear from the Internet. #datahoarder #archive #backup #preservation
couple new stickers in the store
https://store.mollywhite.net/products/im-not-hoarding-im-archiving-sticker
If you were to buy:
1. A Linux-compatible home server
2. A NAS that would be compatible with a Linux-based home setup
right now, what would they be and why? #Linux #NAS #HomeLab #DataHoarder #FOSS
Another one for the #archive #datahoarder #storage #nas bubble:
What is the ~best DVD-ROM drive to read old and likely degraded media?
The Internet at large seems devoid of any information, it's all AI and linkfarming slop.
Are #MDisc worth it? If yes, what drive, disc manufacturer etc. do I want? Are 100GB discs as reliable as 50 and 25 ones?
#HDD have shorter life? #LTO #tape seems less portable?
CC @internetarchive @textfiles @brewsterkahle
Retoots and replies *very* much appreciated.
[Long Post] Guide to Cleanly Archiving Aethy Profiles (including Text!)
https://aethy.com/@YOURNAME/hide_boosts
4a. (Optional) Turn on the Hide Images extension on this page to make future loading faster (the images will still be downloaded).
Voila, you have just archived all your text posts.
Fandom history is so important. With the state of crushing culture pushing us to further and darker corners away from each other, lost media is a heartbreaking side effect of fractured communities. If you love someone's posts, save them! You never know if one day they'll simply be gone. If you like looking back at your old time capsules, back up your activities! By keeping the memories of our resistance alive, we can keep going on.