#Repositories out of the Blind Spot of Initiatives to Correct the Scholarly Record
by Frédérique Bordignon
Learned Publishing, 2025, 38(2)
#Repositories out of the Blind Spot of Initiatives to Correct the Scholarly Record
by Frédérique Bordignon
Learned Publishing, 2025, 38(2)
Update. "‘Omg, did #PubMed go dark?’ Blackout stokes fears about database’s future"
https://www.nature.com/articles/d41586-025-00674-3
"The NIH told Nature in an e-mail that, on 1 March 'some of NIH public-web services experienced service disruptions' but all services were restored on 2 March. 'NIH is committed to resilient and open access to PubMed and other NIH web services,' the team wrote. But [Krutika Kuppalli, an infectious-diseases physician] says any disruption — even if temporary — raises concerns about how researchers access essential scientific information. 'It also serves as a reminder of the need for contingency plans, such as alternative #repositories or offline access to critical research, to ensure that health-care providers and researchers are not left without crucial information when they need it most,' she says."
Super interesting 4-year tender...
https://bidstats.uk/tenders/2025/W07/840565060
Manchester Open Research Environment, advancing the University's green open access (OA) strategy - Request for Information
@neuralreckoning @internetarchive
Sorry if you already know this. The #NelsonMemo described #GreenOA policies. It required deposit in OA #repositories, not submission to OA #journals. Some publishers told authors that they'd have to pay #APCs to comply with the policy. But that was deception and spin. Compliance with the policy was always free of charge. When journals charge APCs to publish fed-funded research, it was to publish in those journals, not to comply with federal policy.
I started referring to the AS/AP or #ActivityStreams / #ActivityPub based #Fediverse as the "as soon as possible" #fedi.
It's the federated universe that was the lowest hanging fruit in the #FOSS community, where individual projects are great at producing compiled #code and #OpenSource #repositories to inspire others to produce more compiled code.
The fedi we can't get is the one where we all #collaborate and #cocreate at the level of the #OpenStandards #ecosystem, where the true potential is.
I’ve been thinking lately (always a mistake) about all the cultural works to which we don't have access. Everything removed from streaming; everything locked behind DRM so that most libraries and archives won't have copies which can redundantly survive disruption. Sometimes I get real sad about the future readers and historians and others who just won't be able to find copies of the incredible things made during the current digital dark age.
As ever, I try to let this radicalize me rather than lead me into despair. I know that there are lots of horrors worth raging against, but this is one I feel well-positioned to work against. It's low-stakes enough that I won't feel self-loathing if I burn out or need to take a break. It's no secret that I like to read and organize books so this is a topic close to my heart and one which can bring me joy and allow me to share it with those around me too. There is a fair bit of tech nerd stuff to it, enough that I have an opportunity to learn & practice new things, but not so much that I’m totally out of my depth. And there are plenty of communities out there to help and share strategies.
But the big thing I see missing from my understanding and many of the conversations about shadow libraries and unauthorized archivism is the social and professional practice of librarianship rather than mechanical practice of data storage. I don't have space to go to library school, but I could definitely stand to read (and archive) introductory books on the topic, or take an online class. Friends who know: what are some of the better places to get started with an introduction to library & information science and archive science? #libraries #librarian #archivist #archives #archivism #archivist #libraryScience #informationScience #archiveScience #culture #repositories
Just for those who don't know what Infoseepage is talkn bout there is:
#Kiwix is a project to make large #information #repositories available in compact #offline forms - https://kiwix.org/en/ in various languages.
Several of their downloadable #libraries are subsets and snapshots of #Wikipedia. https://library.kiwix.org/#lang=eng&q=Wikipedia
The "flagship" all-six-million-en_US article module of Wikipedia is ~110GB and I seed this torrent.
Good news from the US Repository Network (#USRN):
https://sparcopen.org/wp-content/uploads/2023/12/Outcomes-of-the-USRN-Discovery-Pilot-Project.pdf
"At the beginning of the project, about half of the #repositories did not have their #OAIPMH interface properly configured and, therefore, could not be indexed by external discovery systems. After just over a year of the pilot, all but one repositories are now OAI-PMH compliant. This has resulted in a 50% increase in indexed content, with 728,770 new records now publicly accessible."
A recent survey of faculty at an R1 #university (with most respondents from the social sciences) revealed strong objections to #APCs at #OA #journals and deep ignorance of OA #repositories.
https://www.iastatedigitalpress.com/jlsc/article/17647/galley/16356/view/
The survey did not ask whether respondents were familiar with no-APC (#DiamondOA) OA journals.
Under the 2022 #OSTP #NelsonMemo, US federal funding agencies must make agency-funded research #OpenAccess in "agency-designated" #repositories. The same was true under the 2013 #HoldrenMemo. This is the first time that any federal agency has designated a repository neither built nor hosted by government agencies — an important milestone for #GreenOA. Kudos to #KnowledgeCommons (@hello).
New study: "Articles authored or co-authored by employees of the [US] National Cancer Institute [are in the #PublicDomain but frequently] omit any assertion of public domain status, and…many of them remain inaccessible to the general public behind publisher firewalls. Medical institutional #repositories and libraries can play an important role in making this literature (both current and historical) more widely available."
https://scholarlycommons.henryford.com/mirl/2024/program/3/
#Elsevier acquired #bepress in 2017.
https://www.prnewswire.com/news-releases/elsevier-acquires-bepress-a-leading-service-provider-used-by-academic-institutions-to-showcase-their-research-638101803.html
That triggered a mini-movement — #beprexit — of universities trying to migrate from bepress to other #repositories.
https://www.google.com/search?q=beprexit
A new study shows that "the total number of R1 and R2 bepress institutions grew from 76 in 2017 to 90 in 2019, followed by declines to 86 in 2021 and 82 in 2023. From 2017 to 2023, 15 R1 or R2 institutions were observed leaving bepress, and 12 joined."
https://www.iastatedigitalpress.com/jlsc/article/id/17786/
This is an important step in adding new layers of utility to #OpenAccess #repositories.
https://coar-repositories.org/news-updates/coar-notify-functionality-now-available-to-dspace-users/
New study: "We found that nine Latin American countries carry out #OpenScience actions; eight develop research #data #repositories; eight implement technological infrastructure to develop repositories; seven collaborate with #libraries in open science actions; and seven have their repositories recorded in the Registry of Research Data Repositories [#re3data] and Dataverse, resulting in 62 research #DataRepositories in #LatinAmerica."
https://unisapressjournals.co.za/index.php/LIS/article/view/15112
> the #TeaProtocol perversely incentivizes #software #developers to exaggerate their contribution to #openSource #development. Using a modified PageRank called #teaRank, #softwareDevelopers are rewarded based on their “Proof of Contribution”... and a few software developers have spammed #foss #repositories with absurd amounts of worthless #packages.
Anyone can Access Deleted and Private Repository Data on GitHub
You can access data from deleted forks, deleted repositories and even private repositories on GitHub. And it is available forever. This is known by GitHub, and intentionally designed that way.
https://trufflesecurity.com/blog/anyone-can-access-deleted-and-private-repo-data-github
Kudos to #Japan for adopting a national #OpenAccess policy, for requiring OA through #repositories rather than #journals (#GreenOA rather than #GoldOA), and for funding to upgrade and standardize the country's institutional repositories (#IRs).
https://www.nature.com/articles/d41586-024-01493-8
Update. This new study follows up on three recommendations from #COAR on #multilingualism in #repositories — "[1] declaring the language(s) of the resource and…its #metadata, [2] writing personal name/s using the writing system used in the deposited document while providing a persistent identifier to disambiguate author/s…and…[3] enabling UTF-8 support so as to promote use of the original alphabet / the writing system whenever possible."
https://zenodo.org/records/11060284