Data being seeded by my torrent server
Help preserve data being deleted by fascists: https://lydie.cc/data.html
In the process of downloading the National Archives' YouTube channel - 2914 videos. You know the fascists will start deleting these. I will have a shiny new torrent! #datahoarder #datapreservation #preservation #uspol #usgovernment #data #DataRescue #torrent #antifa #resistance #resist
A user on the DataHoarders subreddit was recently inquiring about historical AQI data from US embassies around the world.
The historical data was hosted at dosairnowdata.org which is dead.
Webarchive only has 12 CSV files archived.
Does anyone here have any direct knowledge of an archive of this website or data?
#DataHoarders #DataHoarder #WhiteHouseWash #USGovPurge
https://apnews.com/article/us-air-quality-monitors-8270927bbd0f166238243ac9d14bce03
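For anyone who wants to check what the Wayback Machine actually holds for a dead site like this, here is a rough sketch that asks the Wayback CDX API for every capture under a domain. The function names are made up for illustration, and the `text/csv` filter is an assumption: the CSV files may have been stored under a different MIME type, so try it without the filter as well.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

CDX_ENDPOINT = "https://web.archive.org/cdx/search/cdx"


def build_cdx_url(domain, mimetype=None):
    """Build a CDX query URL listing unique captured URLs under a domain."""
    params = [
        ("url", domain),
        ("matchType", "domain"),                # include subdomains and all paths
        ("output", "json"),
        ("fl", "timestamp,original,mimetype"),  # fields to return
        ("collapse", "urlkey"),                 # one row per unique URL
    ]
    if mimetype:
        # Optional filter; the MIME type recorded by the crawler is a guess here.
        params.append(("filter", f"mimetype:{mimetype}"))
    return f"{CDX_ENDPOINT}?{urlencode(params)}"


def parse_cdx_rows(rows):
    """Turn the CDX JSON response (header row, then data rows) into dicts."""
    if not rows:
        return []
    header, *records = rows
    return [dict(zip(header, record)) for record in records]


def snapshot_url(capture):
    """URL of the raw archived file (the id_ modifier skips the Wayback toolbar)."""
    return f"https://web.archive.org/web/{capture['timestamp']}id_/{capture['original']}"


def list_captures(domain, mimetype=None):
    """Query the CDX API over the network and return the parsed captures."""
    with urlopen(build_cdx_url(domain, mimetype), timeout=60) as response:
        return parse_cdx_rows(json.load(response))
```

Calling `list_captures("dosairnowdata.org", "text/csv")` and passing each result to `snapshot_url` would give direct links to whatever was captured. It only shows what the Wayback Machine has; it won't find copies held elsewhere.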
Just discovered ArchiveBox — FOSS, self-hosted internet archiving.
The way the web is going, with the US government redacting and outright erasing historic content, publishers segmenting content by region (and also sometimes redacting/censoring it), and CloudFlare shitting all over everything, I think it's time for me to start my #archiving and #DataHoarding journey.
Amazon will remove the ability to download the ebooks for Kindle at the end of the month. So if you ever close your Amazon account, you'll no longer be able to access the books you had bought.
Let's fix that
1. Bulk Exporter: https://github.com/treetrum/amazon-kindle-bulk-downloader
2. Calibre to manage books https://calibre-ebook.com/download
3. Calibre plugin to remove DRM: https://github.com/noDRM/DeDRM_tools/releases
Source: https://bsky.app/profile/remysharp.com/post/3lihtiq2rqc22
So #Youtube should be completely destroyed. Right now, if you are a #DataHoarder, download all the YouTube videos that you consider culturally significant, turn them into torrents, mirror them on
If you're a #contentCreator, YouTube is not really a viable medium for earning revenue. Most YouTubers I watch are mainly financed by Patreon, and porn sites pay significantly more and will have better reach, even for educational content. I wouldn't ask anyone to delete their YouTube channel, but please consider also posting to alternative platforms like #PeerTube
I now have my own fully searchable mirror of #Kiwix hosted on my home server, including #Wikipedia. You can check it out at:
Content may change as time goes on since I literally "just" got it set up and working, and you should definitely prioritize the original resources and donate to the folks hosting it. But IF some of these resources become unavailable from their original source, feel free to use my mirror as long as it's up.
@EposVox is right; if you have the means, it's time to start backing up the web. I'm going to see about hosting my own mirror of the latest .zim copy of Wikipedia. The current administration is going after wrong-think in all its forms, and they have the means to do a hell of a lot of damage if the community doesn't come together to protect our valuable resources.
Title: It's Time to Start Backing Up the Web.
Department of Labor resources about Long COVID as a disability have been removed from the AskJAN website ("JAN" stands for Job Accommodation Network).
Relevant archive links in article.
Just wanted to post to encourage people to continue to grab whatever you fancy from government websites. Aside from anything else, if you're a US taxpayer you paid for all these reports, podcasts, blogs, educational material, &c.
The more hands this material gets into, the better.
Join up with an organized effort like #SafeguardingResearch here on the Fedi, or even just save PDFs (as it is my understanding that these can sometimes get missed).
Let the downloading commence!
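Since PDFs can apparently get missed, here is a small sketch for pulling every PDF link out of a page you have already saved, using only the Python standard library. The class and function names are hypothetical, and it assumes PDFs are linked through ordinary `<a href>` tags whose path ends in `.pdf`. Links built by JavaScript or served without that extension will not be found.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse


class PdfLinkParser(HTMLParser):
    """Collect absolute URLs of <a href> links whose path ends in .pdf."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.pdf_links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        href = dict(attrs).get("href")
        if not href:
            return
        # Resolve relative links against the page's own URL.
        absolute = urljoin(self.base_url, href)
        is_pdf = urlparse(absolute).path.lower().endswith(".pdf")
        if is_pdf and absolute not in self.pdf_links:
            self.pdf_links.append(absolute)


def find_pdf_links(html, base_url):
    """Return the unique PDF links found in an HTML string, in page order."""
    parser = PdfLinkParser(base_url)
    parser.feed(html)
    return parser.pdf_links
```

Feed it the HTML of a saved page plus the URL it came from, and you get a list you can hand to whatever downloader you already use.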
My US government data hoarding page is up and ready with links and torrents. The torrents are all being seeded by my junkbox torrent server. I will continue to add torrents as I download things.
Due to the current data preservation emergency, I'm pretty sure I'm going to find at least 20 terabytes to stuff into an old 8 drive tower that I'm building out of junk box parts and then I will host government data mirrors that the fascists have wiped out. #datahoarder #datapreservation #fascists
I usually leave a new laptop sticker-free for a few months. I've had this MacBook Air for over a year and have finally broken it in with a #sticker from @molly0xfff, which arrived last week and is even more timely now than when I ordered it. They are available from https://store.mollywhite.net/collections/stickers
#Archives #DigitalPreservation #DataHoarder
Time for a #datahoarder somewhere to spring into action. https://mstdn.science/@firefoxx66/113928652500028599
Are you a fellow data hoarder? Have some spare terabytes? Start here:
https://commoncrawl.org/blog/january-2025-crawl-archive-now-available
https://meta.wikimedia.org/wiki/Data_dump_torrents#English_Wikipedia
https://github.com/end-of-term/eot2024
https://github.com/internetarchive/dweb-mirror
https://archive.org/details/20250128-cdc-datasets
https://wiki.archiveteam.org/index.php/Main_Page
https://github.com/lisawilliams/NIH_Data
https://archive.org/details/academictorrents_c5bf370a90cae548d5a306c1be7d79186b9f60b9
#TIL about the Internet History Initiative (@IHI). It's a website that focuses on historically relevant public data sets. As a #datanerd and #datahoarder of #internet data, I appreciate that something like this spun up.
However, I am shocked that I hadn't heard of it until now, even though it has been online since January 2024! Will definitely start to keep an eye on it.
Edit: Forgot to link the website: internethistoryinitiative.org
Hey did anyone back this database up?
*Edit*: apparently at least one org/person might have a pretty good backup. Maybe others have, too. :)
https://lets-address-this-with-qasim-rashid.ghost.io/i-filed-a-foia-challenging-the-trump-regime/
https://www.cnn.com/2025/01/25/politics/january-6-justice-department-database/index.html
I just became aware that the NWS has been asked to stop all public outreach, school visits, SKYWARN trainings, etc.
If you're someone that's saving data or other materials (educational, podcasts, &c.) from the NWS or NOAA, I would get on it quickly if there's stuff you still want to grab.