#archival


I've mirrored a relatively simple website (redsails.org; it's mostly text, some images) for posterity via #wget. However, I also wanted to grab snapshots of any outlinks (of which there are many, as citations/references). I couldn't figure out a configuration where wget would do that out of the box without endlessly, recursively spidering the whole internet, so I ended up making a kind of poor man's #ArchiveBox instead:

for i in $(cat others.txt) ; do
  dirname=$(echo "$i" | sha256sum | cut -d' ' -f 1)   # one directory per URL, named by its hash
  mkdir -p "$dirname"
  wget --span-hosts --page-requisites --convert-links --backup-converted \
       --adjust-extension --tries=5 --warc-file="$dirname/$dirname" \
       --execute robots=off --wait 1 --waitretry 5 --timeout 60 \
       -o "$dirname/wget-$dirname.log" --directory-prefix="$dirname/" "$i"
done

Basically, there's a list of bookmarks^W URLs in others.txt that I grabbed from the initial mirror of the website with some #grep foo. I want to do as good of a mirror/snapshot of each specific URL as I can, without spidering/mirroring endlessly all over. So, I hash the URL, and kick off a specific wget job for it that will span hosts, but only for the purposes of making the specific URL as usable locally/offline as possible. I know from experience that this isn't perfect. But... it'll be good enough for my purposes. I'm also stashing a WARC file. Probably a bit overkill, but I figure it might be nice to have.
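
For reference, the others.txt list can be built with something like the following. This is only a rough sketch, not the actual grep foo used: the mirror directory name redsails.org/ and the exact patterns are assumptions.

# Rough sketch: collect external outlinks from the local mirror into others.txt.
# Assumes the initial wget mirror lives in ./redsails.org/ -- adjust paths and patterns as needed.
grep -rhoE 'https?://[^"<>) ]+' redsails.org/ \
    | grep -vE '^https?://redsails\.org' \
    | sort -u > others.txt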

Replied in thread

@FerdiMagellan @jwildeboer @soatok I use #XMPP+#OMEMO, more specifically, @monocles / #monoclesChat & @gajim / #gajim!

docs.monocles.eu/apps/chat.app/

  • Another option that may be interesting if you are a business/org & need archival support (and have existing #eMail infrastructure) is @delta / #deltaChat, which is easy to bring into #compliance with #GoBD / #HGB and similar #Archival mandates for professional/business comms.

I now have my own fully search-able mirror of #Kiwix hosted on my home server, including #Wikipedia. You can check it out at:

kiwix.marcusadams.me

Content may change as time goes on since I literally "just" got it set up and working, and you should definitely prioritize the original resources and donate to the folks hosting it. But IF some of these resources become unavailable from their original source, feel free to use my mirror as long as it's up.
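
In case anyone wants to do the same, serving the content basically comes down to pointing kiwix-serve at the downloaded ZIM files. A minimal sketch follows; the port and ZIM file name are placeholders, not the actual setup behind this mirror:

# Minimal sketch: serve one or more downloaded ZIM files over HTTP with kiwix-serve.
# The ZIM file name and port below are placeholders.
kiwix-serve --port=8080 wikipedia_en_all_maxi.zim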


The name "The Indie Archive" is taken. There is a domain name

theindiearchive.com

for a different paid service.

My project is open source and free to anyone to use.

I'm not even in beta so I'm going to rename.

So far we've come up with

Indie Archival Storage

which is an accurate descriptor but not nearly as cool as The Indie Archive.

So ...

Any ideas?

If you're interested here's the link to my podcast, What Is The Indie Archive.

hackerpublicradio.org/eps/hpr4

Thanks


“We just launched a 16TB archive of every dataset that has been available on data.gov since November. This will be updated day by day as new datasets appear. It can be freely copied, and we're sharing the code behind it to help others make their own archives of data they depend on.” Harvard Library Innovation Lab (via BlueSky)

lil.law.harvard.edu/blog/2025/

bsky.app/profile/harvardlil.bs

Replied in thread

@delta also the whole "BuT #mEtAdAtA?" discussion is completely blown out of proportion by #Signal fanboys.

In fact, I'm convinced someone already made a #delta #chat #server as an #OnionService over @torproject / #Tor just for the lulz.

  • The biggest advantage of Delta Chat is that it doesn't require yet another server but instead just uses #IMAP + #SMTP, and it can even be integrated into #corporate communications that require #archival and #indexing by merely feeding the private keys to said #eMail archival software [i.e. #benno #MailArchiv], which makes it possible to comply with regulations like #GoBD & #HGB where applicable.

Not that this is something the average user encounters, but it is a big bonus for larger organizations!

by pure chance I’ve stumbled upon a beautiful Indonesian web comic The Great Insula, 2024, which seems to be lost media :[

the host was hardly crawled by the Wayback Machine, and I can't find a mention of its author. I tried searching a few social media platforms; one chapter was shared before it disappeared.

DuckDuckGo still has some of the beautiful pages cached. I’ve saved a few of my favourites. just wanted to post here in the hopes it’s not lost forever. lossy archival </3

duckduckgo.com/?q=The+Great+In

I was squinting really hard at the files extracted from the #Nintendo #Wii #VirtualConsole version of California Games.

I was like "Wait, why is one of the files called 'califo.games_rem'. That file name doesn't really make sense. ...Wait. is that 'rem' a group tag or something? Could this be the main program??"

Drop the file into a C64 emulator.
BOOM. Enjoy the dulcet SID chip tones of the cracktro.

Nintendo didn't have the cojones to make this the first thing that will be loaded by the Virtual Console emulator, obviously. Cowards.

Hey #library, #archives, #oralHistory & #memoryWorker friends!

I'm teaching a course with @LibraryJuiceAcademy

⭐Solidarity Memory Work⭐
February 3 - March 2

This 4-week class begins with an examination of emerging #solidarity practices in allied fields, such as #journalism, academic #history, and #law. We then explore the practice of #documenting #movements and #organizations, as well as activating #archival records for a more just society.

libraryjuiceacademy.com/shop/c


calling all archivists, file hoarders and anyone else who is willing to help on fedi. i need a favour

i need books, like lots, i want to archive as many queer books as i can get my hands on. fiction, non-fiction, scientific papers. i will take anything you have. i just need files that are ready to use. i don't have the skills to automate scraping or anything

#queer #lgbt #books #archival

#fluConf2025 will include a track on independent publishing and archival. We want to hear stories of what's being done to create non-corporate spaces on the web and preserve the media big companies so often erase.

Tell us about your motivations and experience moving from big platforms like Substack, Twitter, Instagram, and Wix to self-hosted or communally-operated alternatives.

Share your insights into the world of for-profit journals in academia, and efforts to establish better options not based on extraction.

How do you adapt to challenges like the falling adoption of established syndication protocols like RSS, the costs of AI scraping, and ever-changing search engine algorithms? How do you keep up with legal requirements for content moderation and age verification?

With so many corporate platforms shutting down, changing policies on media retention, or moving to monetize content for AI training: how have you gone about archiving your media? What tools and techniques have you used to ensure it isn't lost? How do we resist corporate capture of independent media and foster conditions for more long-lived infrastructure?

Apply up until midnight of January 19th, 2025 (anywhere on Earth)

fluconf.online/apply/


Update: thanks to @bwana for helping:
discordian.social/@bwana/11370

#FollowerPower: Does anyone have the dimensions for the #packaging of #Windows7 & #WindowsVista?

  • I'm talking about those rounded boxes for the retail version (not SystemBuilder/OEM/...)

#PleaseBoost :boost_ok: :boosted: :boost:

  • I'll also accept anonymous submissions... Contact details pinned in profile / my website...

If anyone has a caliper and said box, it would be cool if they could get the measurements.

  • I could probably "guesstimate" them based on the DVD-ROM inside, but I'd rather have the correct dimensions just to be sure...

@coreysnipes

It's designed for Indie Producers, like myself, to store projects they really don't ever want to lose, like albums, videos, last year's blog posts, a novel or other writings, photos, etc.

Backups first: back up everything at least daily, automatically.

Archival storage next, for more redundancy: 7 drives on 4 systems in 3 locations, with file integrity checks, for completed projects or milestones of works in progress. Files are added to archival storage manually.
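
For the integrity checks, something as simple as a per-project checksum manifest works; here's a minimal sketch, where the archive path is a placeholder rather than the project's actual tooling:

# Minimal sketch of per-project integrity checks; /archive/album-2024 is a placeholder path.
# When a project (or milestone) is added to archival storage, record a checksum manifest:
cd /archive/album-2024 && find . -type f ! -name MANIFEST.sha256 -print0 \
    | xargs -0 sha256sum > MANIFEST.sha256

# Later, on each copy, verify that nothing has silently changed or gone missing:
cd /archive/album-2024 && sha256sum --quiet -c MANIFEST.sha256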