In the Dark<p><strong>Meta Theft</strong></p><blockquote><p>Beware, all thieves and imitators of other people’s labour and talents, of laying your audacious hands upon our work. </p><p>Albrecht Dürer, 1511</p></blockquote><p>I’ve remembered that quotation since it was uttered by Inspector Morse in the episode <em>Who Killed Harry Field?</em> Albrecht Dürer wasn’t referring to Artificial Intelligence when he said it, but it does seem pertitent to what’s going on today.</p><p>There’s an <a href="https://www.theatlantic.com/technology/archive/2025/03/libgen-meta-openai/682093/" rel="nofollow noopener noreferrer" target="_blank">article in <em>The Atlantic</em> </a>about a huge database of pirated work called LibGen that has been used by Mark Zuckerberg’s corporation Meta to train its artificial intelligence system. Instead of acquiring such materials from publishers – or, Heaven forbid, authors! – they decided simply to steal it. That’s theft on a grand scale: 7.5 million books and 81 million research papers.</p><p>The piece provides <a href="https://www.theatlantic.com/technology/archive/2025/03/search-libgen-data-set/682094/" rel="nofollow noopener noreferrer" target="_blank">a link to LibGen</a> so you can search for your own work there. I searched it yesterday and found 137 works by “Peter Coles”. Not all of them are by me, as there are other authors with the same name, but all <a href="https://telescoper.blog/my-books/" rel="nofollow noopener noreferrer" target="_blank">my books</a> are there, as well as numerous research articles, reviews and other pieces:</p> <a href="https://telescoper.blog/wp-content/uploads/2025/03/libgen.jpg" rel="nofollow noopener noreferrer" target="_blank"></a> <p>I suppose many think I should be flattered that my works are deemed to be of sufficiently high quality to be used to train a large language model, but I’m afraid I don’t see it that way at all. I think, at least for the books, this is simply theft. I understand that there may be a class action in the USA against Meta for this larceny, which I hope succeeds.</p><p>I think I should make a few points about copyright and authorship. I am a firm advocate of open access to the scientific literature, so I don’t think research articles should be under copyright. Meta can access them along with everyone else on the planet. It’s not really piracy if it’s free anyways. Although it would be courteous of Meta to acknowledge its sources, lack of courtesy is not the worst of Meta’s areas of misconduct. </p><p>In a similar vein, when I started writing this blog back in 2008 I did wonder about copyright. Over the years, quite a lot of my ramblings here have been lifted by journalists, etc. Again a bit of courtesy would have been nice. I did make the decision, however, not to bother about this as (a) it would be too much hassle to chase down every plagiarist and (b) I don’t make money from this site anyway. As far as I’m concerned as soon as I put anything on here it is in the public domain. I haven’t changed that opinion with the advent of ChatGPT etc. Indeed, I am pretty sure that all 7000+ articles from this blog were <a href="https://telescoper.blog/2024/11/19/weird-statistics/" rel="nofollow noopener noreferrer" target="_blank">systematically scraped last year</a>.</p><p>Books are, however, in a different category. I have never made a living from writing books, but it is dangerous to the livelihood of those that do to have their work systematically stolen in this way. I understand that there may be a class action in the USA against Meta for this blatant larceny, which I hope succeeds.</p><p></p><p><a rel="nofollow noopener noreferrer" class="hashtag u-tag u-category" href="https://telescoper.blog/tag/artificial-intelligence/" target="_blank">#ArtificialIntelligence</a> <a rel="nofollow noopener noreferrer" class="hashtag u-tag u-category" href="https://telescoper.blog/tag/copyright/" target="_blank">#copyright</a> <a rel="nofollow noopener noreferrer" class="hashtag u-tag u-category" href="https://telescoper.blog/tag/large-language-models/" target="_blank">#LargeLanguageModels</a> <a rel="nofollow noopener noreferrer" class="hashtag u-tag u-category" href="https://telescoper.blog/tag/libgen/" target="_blank">#LibGen</a> <a rel="nofollow noopener noreferrer" class="hashtag u-tag u-category" href="https://telescoper.blog/tag/meta/" target="_blank">#meta</a> <a rel="nofollow noopener noreferrer" class="hashtag u-tag u-category" href="https://telescoper.blog/tag/the-atlantic/" target="_blank">#TheAtlantic</a> <a rel="nofollow noopener noreferrer" class="hashtag u-tag u-category" href="https://telescoper.blog/tag/theft/" target="_blank">#Theft</a></p>