clifff @clifff

**Hacker News** @h4ckernews@mastodon.social · 3d

**Debby** @debby@hear-me.social · 4d

AI’s ‘reasoning’ is more mirror than mind—but that’s okay!
This ASU study reveals that LLMs’ "chain-of-thought" abilities are pattern-based illusions, not true logic.
The authors warn of a "false aura of reliability" in AI outputs, which could mislead in fields like healthcare or finance.
While that might sound disappointing, it’s actually useful insight! Understanding these limits can help us:
Build better guardrails for AI in critical applications.
Develop tests to expose AI’s blind spots.
Shift focus from "human-like thinking" to reliable, transparent outputs.

Big thanks to @agnieszkaserafinowicz for sharing!

Read the full study:
Is Chain-of-Thought Reasoning of LLMs a Mirage?
https://arxiv.org/pdf/2508.01191

@ai@a.gup.pe @ai@misskey.io @openscience @artificial_intel @ai@newsmast.community @alphasignal.ai #AI #research #disinformation #LLM #languagemodel #thinking #Science #news #artificialIntelligence #technology #AIRisk #LLM #Science #TechDebate #falseReliability #reliability

**Europe Says** @europesays@pubeurope.com · 4d

**craque is icumen in** @dtauvdiodr@c.im · 6d

craque is icumen in @dtauvdiodr@c.im

If they're gonna use "post-mortem" then I say use "postmortem". Make it a different word, refuse to align it with death.

Post-Incident Review is always better.

Learning Review isn't always the same thing for a lot of companies, sometimes you need both.

But whatever you call it, just do it.

#SRE #IncidentResponse #Reliability

**h o ʍ l e t t** @homlett@mamot.fr · Aug 6

**craque is icumen in** @dtauvdiodr@c.im · Aug 6

Aug 6

craque is icumen in @dtauvdiodr@c.im

Staff SRE available for work!!!

I am a hard working systems thinker who has a unique balance of seasoned TechOps skills, good DevEx chops, experience designing and running SRE programs like Observability, Incidents, and CI/CD.

I was put out of work in June and I need a new gig in short order. Boosts and cross-platform posts appreciated!

#FediHire #LookingForWork #SRE

**craque is icumen in** @dtauvdiodr@c.im · Aug 2

**Europe Says** @europesays@pubeurope.com · Jul 28

**Europe Says** @europesays@pubeurope.com · Jul 17

Replied in thread

**Chris Geoghooligan** @VTDARKSIM@toot.community · Jul 15

Jul 15

Chris Geoghooligan @VTDARKSIM@toot.community

I also have a #Masters in #MarineEngineering & ~7y associated experience, with some overlap producing #FMEA’s and a 400+ page analysis report for an unmanned vessel, marrying my #marine engineering and #reliability engineering expertise. The #MarE was a little while ago but I could do it if it plays nicely with the data path.

**Hacker News** @h4ckernews@mastodon.social · Jul 15

**Jan R. Boehnke** @jrboehnke@mastodon.social · Jul 14

**Europe Says** @europesays@pubeurope.com · Jun 24

**Catherine Schmidt** @lillyfinch@mstdn.social · Jun 5

**craque is icumen in** @dtauvdiodr@c.im · May 28

**Kevin Karhan** @kkarhan@infosec.space · Apr 5 *

Apr 5 *

Kevin Karhan @kkarhan@infosec.space

#AITA or does the #Fairphone3plus's #battery tend to #bulge pretty quickly?

Cuz I shouldn't be on the 3rd battery when I bought it mere days before it got discontinued...

#Fairphone #NoticesYourBulge #BulgeGate

**craque is icumen in** @dtauvdiodr@c.im · Mar 25 *

Mar 25 *

craque is icumen in @dtauvdiodr@c.im

I really liked this informal community poll and thematic analysis on SLO usage. It does a better job at highlighting the hurdles to adopting them at a Company Who Is Not Google than a lot of "Here's how to do SLOs" pieces that just don't cover it.

If there is ever a "Seeking SLOs" book, this should be the first chapter.

https://ericmustin.substack.com/p/notes-on-service-level-objectives

A Small, Good Thing · Mar 24Notes on Service Level ObjectivesBy Eric Mustin

#SRE #SLO #Reliability

**Europe Says** @europesays@pubeurope.com · Feb 28

Continued thread

**petersuber** @petersuber@fediscience.org · Feb 14

Feb 14

petersuber @petersuber@fediscience.org

Update. From @hildabast: "What if We Can’t Rely on PubMed?"
https://absolutelymaybe.plos.org/2025/02/14/what-if-we-cant-rely-on-pubmed/

"#PubMed is incredibly reliable…That said, between the risks of an exodus of key personnel, understaffing, or goodness-knows-what vandalism when a goon squad arrives at NIH, it’s not paranoid any more to think ahead to the once-unthinkable. What would PubMed enshittification look like? Could PubMed go down more often, and for longer? Might services no longer be free? How else could the #quality and #reliability of its services be degraded?"

Absolutely Maybe · Feb 14What if We Can't Rely on PubMed? - Absolutely MaybePubMed is incredibly reliable. And a lot depends on it. It’s an ecosystem built around MEDLINE, the steady feed of new publications…

#Censorship #DefendResearch #Medicine

Recent searches

Search options

Administered by:

Server stats:

#reliability