shakedown.social is one of the many independent Mastodon servers you can use to participate in the fediverse.
A community for live music fans with roots in the jam scene. Shakedown Social is run by a team of volunteers (led by @clifff and @sethadam1) and funded by donations.

Administered by:

Server stats:

243
active users

#datascience

11 posts8 participants0 posts today

The phenomenon of data scientists using models (LLM or otherwise) on data that they don't understand -- and don't care to learn about -- is not just limited to tech bros: my own former colleagues were like that. On top of that, our technical manager, a PhD in Computer Science and expert in machine learning, perpetuated the idea that understanding the domain-specific data was NOT the job of his data scientists #datascience
propublica.org/article/inside-

ProPublicaInside the AI Prompts DOGE Used to “Munch” Contracts Related to Veterans’ Health
More from ProPublica

Dr. Ellie Murray ScD @epiellie has been breaking down the MAHA report, piece by piece.

If you enjoy looking at bad data, or are interested in learning what is in the report and would rather read a review, take a look. I have been thoroughly enjoying it, as a data nerd.

Parts I & II are currently available:
epiellie.substack.com/p/the-ma

E is for Epi · The MAHA Report: I Read it so You Don't Have to.By Ellie Murray, ScD

I asked a Harvard postdoc which skills are essential to thrive as a researcher in AI & Biomedicine:

His answer:
:blobcoffee: Cultivate abstract thinking.
:blobcoffee: Build solid foundations instead of chasing hypes.
:blobcoffee: Think independently, embrace a do-it mindset, stay curious and persistent.

Knowledge is accessible. Thinking is up to us.

What skills are you trying to develop?

Replied in thread

💬 Want to use GPT-4, Claude, Gemini, Ollama & more directly from R?
Meet {ellmer}: a powerful wrapper to access a wide range of LLM providers via a unified interface.
Includes function/tool calling, structured output, image input & streaming!

📦 install.packages("ellmer")
📘 Docs: ellmer.tidyverse.org/
#rstats #LLM #AI #OpenSource #DataScience #RPackage #NLP

ellmer.tidyverse.orgChat with Large Language ModelsChat with large language models from a range of providers including Claude <https://claude.ai>, OpenAI <https://chatgpt.com>, and more. Supports streaming, asynchronous calls, tool calling, and structured data extraction.

I definitely got out of bed the wrong way this morning, but what the heck is going on with #datascience techbros talking enthusiastically about "vibe coding" whilst laughing at those who know how to do it properly.

Did they learn anything at university? It's the kind of bullying I last saw from teenagers in school who made fun of other students who "worked too hard"

Garbage in, garbage out – even Agentic AI can’t save you from yourself.

Artificial intelligence is only as brilliant as the data it’s spoon-fed – and spoiler alert: your data is often trash.
Whether it’s traditional machine learning, generative models, or your shiny new agentic systems, the pattern remains insultingly consistent:
• Bad data? Expect bad decisions.
• Incomplete data? Enjoy half-baked ideas.
• Outdated data? Say hello to irrelevant nonsense.

I often talk about what AI can or tragically still can’t do.
But here’s the real twist: the problem isn’t the system. It’s you. Or more specifically, the glorious mess you call your “data foundation.”

You don’t have a lack of innovation.
You have a lack of clean data structures, maintained knowledge bases, and basic contextual awareness.
And then you expect the AI to magically fill gaps that should never have existed in the first place.

🛠️ Cascadia R Workshops – June 20 in Portland!

Join us the day before the conference for a full day of hands-on workshops taught by experienced R developers and educators.

🎯 Topics include:
• GitHub for R Users
• Rust for R Developers
• Intermediate Shiny
• Intro to Positron
• Intro to GIS & Mapping in R

📍 Location: Portland State University
🎟️ Workshops tickets going fast!— space is limited!

👉 Full details + tickets: cascadiarconf.com
#rstats #Portland #DataScience #OpenSource

CascadiaRConfCascadiaRConfCascadia R Conference is an R conference serving the Pacific Northwest region (Alaska/British Columbia/Washington/Oregon/California).

📝 New Bioconductor Blog Post!

We’ve just launched the first in a new series by Vince Carey, founding contributor to Bioconductor and core developer, exploring the evolving complexity and value of the Bioconductor ecosystem.

Check it out 👉 blog.bioconductor.org/posts/20

Bioconductor community blogAsk and you shall receive – Bioconductor community blogDiscussion of evolving complexity and value of the Bioconductor ecosystem.

💠 R FOR DATA SCIENCE 💠

Looking to take your #Rskills to the next level as a #UniversityofCopenhagen #researcher, employee, or #PhD student? We've got the #course for you!

💡 R for Data Science
🗓 16-18 June 2025
📍 Panum, University of Copenhagen

✏ Register here as an employee: lnkd.in/dyQx6K63
🖋️ Register here for ECTS credits as a PhD student: lnkd.in/da47HJ3f

#excel #rstudio #r #datascience #data #phd #research #datascience #UCPH