shakedown.social is one of the many independent Mastodon servers you can use to participate in the fediverse.
A community for live music fans with roots in the jam scene. Shakedown Social is run by a team of volunteers (led by @clifff and @sethadam1) and funded by donations.

Administered by:

Server stats:

266
active users

#ReinforcementLearning

0 posts0 participants0 posts today
Hacker News<p>Shoggoth Mini – A soft tentacle robot powered by GPT-4o and RL</p><p><a href="https://www.matthieulc.com/posts/shoggoth-mini" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">matthieulc.com/posts/shoggoth-</span><span class="invisible">mini</span></a></p><p><a href="https://mastodon.social/tags/HackerNews" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>HackerNews</span></a> <a href="https://mastodon.social/tags/ShoggothMini" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ShoggothMini</span></a> <a href="https://mastodon.social/tags/SoftRobot" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SoftRobot</span></a> <a href="https://mastodon.social/tags/GPT4o" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>GPT4o</span></a> <a href="https://mastodon.social/tags/ReinforcementLearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ReinforcementLearning</span></a> <a href="https://mastodon.social/tags/TechInnovation" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>TechInnovation</span></a></p>
Hacker News<p>How to scale RL to 10^26 FLOPs</p><p><a href="https://blog.jxmo.io/p/how-to-scale-rl-to-1026-flops" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">blog.jxmo.io/p/how-to-scale-rl</span><span class="invisible">-to-1026-flops</span></a></p><p><a href="https://mastodon.social/tags/HackerNews" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>HackerNews</span></a> <a href="https://mastodon.social/tags/scaleRL" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>scaleRL</span></a> <a href="https://mastodon.social/tags/FLOPs" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>FLOPs</span></a> <a href="https://mastodon.social/tags/reinforcementLearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>reinforcementLearning</span></a> <a href="https://mastodon.social/tags/AIresearch" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AIresearch</span></a> <a href="https://mastodon.social/tags/optimization" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>optimization</span></a></p>
Hacker News<p>The upcoming GPT-3 moment for RL</p><p><a href="https://www.mechanize.work/blog/the-upcoming-gpt-3-moment-for-rl/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">mechanize.work/blog/the-upcomi</span><span class="invisible">ng-gpt-3-moment-for-rl/</span></a></p><p><a href="https://mastodon.social/tags/HackerNews" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>HackerNews</span></a> <a href="https://mastodon.social/tags/GPT3" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>GPT3</span></a> <a href="https://mastodon.social/tags/RL" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>RL</span></a> <a href="https://mastodon.social/tags/ReinforcementLearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ReinforcementLearning</span></a> <a href="https://mastodon.social/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a> <a href="https://mastodon.social/tags/Innovation" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Innovation</span></a></p>
Hacker News<p>RULER – Easily apply RL to any agent</p><p><a href="https://openpipe.ai/blog/ruler" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">openpipe.ai/blog/ruler</span><span class="invisible"></span></a></p><p><a href="https://mastodon.social/tags/HackerNews" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>HackerNews</span></a> <a href="https://mastodon.social/tags/RULER" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>RULER</span></a> <a href="https://mastodon.social/tags/RL" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>RL</span></a> <a href="https://mastodon.social/tags/agents" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>agents</span></a> <a href="https://mastodon.social/tags/ReinforcementLearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ReinforcementLearning</span></a> <a href="https://mastodon.social/tags/OpenAI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>OpenAI</span></a> <a href="https://mastodon.social/tags/AItools" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AItools</span></a></p>
Erik Jonker<p>Good article how reinforcement learning improved current AI models. Also illustrates that LLMs today are not just imitating.<br><a href="https://arstechnica.com/ai/2025/07/how-a-big-shift-in-training-llms-led-to-a-capability-explosion/?utm_brand=arstechnica&amp;utm_social-type=owned&amp;utm_source=mastodon&amp;utm_medium=social" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">arstechnica.com/ai/2025/07/how</span><span class="invisible">-a-big-shift-in-training-llms-led-to-a-capability-explosion/?utm_brand=arstechnica&amp;utm_social-type=owned&amp;utm_source=mastodon&amp;utm_medium=social</span></a><br><a href="https://mastodon.social/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a> <a href="https://mastodon.social/tags/reinforcementlearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>reinforcementlearning</span></a></p>
Assn for Computing Machinery<p>"Intelligence is figuring out how the world works rather than waiting for someone to tell you how the world works."</p><p>Join us as we hear from Andrew Barto and Richard Sutton, the 2024 <a href="https://mastodon.acm.org/tags/ACMTuringAward" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ACMTuringAward</span></a> recipients as they discuss their work on <a href="https://mastodon.acm.org/tags/ReinforcementLearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ReinforcementLearning</span></a>.</p><p><a href="https://vimeo.com/1085726612" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">vimeo.com/1085726612</span><span class="invisible"></span></a></p>
Dr. Carlotta A. Berry, PhD<p><a href="https://blacktwitter.io/tags/BlackInRobotics" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>BlackInRobotics</span></a> workshop series <a href="https://blacktwitter.io/tags/ROS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ROS</span></a> <a href="https://blacktwitter.io/tags/ROS2" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ROS2</span></a> <a href="https://blacktwitter.io/tags/Robot" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Robot</span></a> <a href="https://blacktwitter.io/tags/Robotics" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Robotics</span></a> <a href="https://blacktwitter.io/tags/STEM" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>STEM</span></a> <a href="https://blacktwitter.io/tags/STEAM" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>STEAM</span></a> <a href="https://blacktwitter.io/tags/BlackSTEM" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>BlackSTEM</span></a> <a href="https://blacktwitter.io/tags/BlackSTEAM" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>BlackSTEAM</span></a> <a href="https://blacktwitter.io/tags/Drone" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Drone</span></a> <a href="https://blacktwitter.io/tags/ComputerVision" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ComputerVision</span></a> <a href="https://blacktwitter.io/tags/Drones" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Drones</span></a> <a href="https://blacktwitter.io/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a> <a href="https://blacktwitter.io/tags/MachineLearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>MachineLearning</span></a> <a href="https://blacktwitter.io/tags/Neuralnetworks" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Neuralnetworks</span></a> <a href="https://blacktwitter.io/tags/ReinforcementLearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ReinforcementLearning</span></a> <a href="https://blacktwitter.io/tags/Learning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Learning</span></a></p>
Dr. Carlotta A. Berry, PhD<p><a href="https://blacktwitter.io/tags/BlackInRobotics" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>BlackInRobotics</span></a> workshop series <a href="https://blacktwitter.io/tags/ROS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ROS</span></a> <a href="https://blacktwitter.io/tags/ROS2" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ROS2</span></a> <a href="https://blacktwitter.io/tags/Robot" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Robot</span></a> <a href="https://blacktwitter.io/tags/Robotics" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Robotics</span></a> <a href="https://blacktwitter.io/tags/STEM" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>STEM</span></a> <a href="https://blacktwitter.io/tags/STEAM" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>STEAM</span></a> <a href="https://blacktwitter.io/tags/BlackSTEM" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>BlackSTEM</span></a> <a href="https://blacktwitter.io/tags/BlackSTEAM" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>BlackSTEAM</span></a> <a href="https://blacktwitter.io/tags/Drone" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Drone</span></a> <a href="https://blacktwitter.io/tags/ComputerVision" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ComputerVision</span></a> <a href="https://blacktwitter.io/tags/Drones" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Drones</span></a> <a href="https://blacktwitter.io/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a> <a href="https://blacktwitter.io/tags/MachineLearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>MachineLearning</span></a> <a href="https://blacktwitter.io/tags/Neuralnetworks" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Neuralnetworks</span></a> <a href="https://blacktwitter.io/tags/ReinforcementLearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ReinforcementLearning</span></a> <a href="https://blacktwitter.io/tags/Learning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Learning</span></a></p>
LavX News<p>AI Progress: Understanding the Myths Behind the Intelligence Explosion</p><p>As the AI landscape evolves, the concept of an 'intelligence explosion' raises questions about the future of AI development. This article delves into the current state of AI, addressing misconceptions...</p><p><a href="https://news.lavx.hu/article/ai-progress-understanding-the-myths-behind-the-intelligence-explosion" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">news.lavx.hu/article/ai-progre</span><span class="invisible">ss-understanding-the-myths-behind-the-intelligence-explosion</span></a></p><p><a href="https://ioc.exchange/tags/news" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>news</span></a> <a href="https://ioc.exchange/tags/tech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>tech</span></a> <a href="https://ioc.exchange/tags/DataScience" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DataScience</span></a> <a href="https://ioc.exchange/tags/ReinforcementLearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ReinforcementLearning</span></a> <a href="https://ioc.exchange/tags/AI2027" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI2027</span></a></p>
Dr. Anna Latour<p>My colleagues at TU Delft are seeking to hire a postdoc to work on Applied Planning and Scheduling under Uncertainty, with applications in modelling supply chain scenarios for offshore wind farm installation: <a href="https://careers.tudelft.nl/job/Delft-Postdoc-in-Applied-Planning-and-Scheduling-under-Uncertainty-2628-CD/814890902/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">careers.tudelft.nl/job/Delft-P</span><span class="invisible">ostdoc-in-Applied-Planning-and-Scheduling-under-Uncertainty-2628-CD/814890902/</span></a></p><p><a href="https://mathstodon.xyz/tags/AcademicMastodon" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AcademicMastodon</span></a> <a href="https://mathstodon.xyz/tags/PostdocLife" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>PostdocLife</span></a> <a href="https://mathstodon.xyz/tags/Hiring" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Hiring</span></a> <a href="https://mathstodon.xyz/tags/Research" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Research</span></a> <a href="https://mathstodon.xyz/tags/Planning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Planning</span></a> <a href="https://mathstodon.xyz/tags/Scheduling" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Scheduling</span></a> <a href="https://mathstodon.xyz/tags/ReasoningUnderUncertainty" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ReasoningUnderUncertainty</span></a> <a href="https://mathstodon.xyz/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a> <a href="https://mathstodon.xyz/tags/ArtificialIntelligence" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ArtificialIntelligence</span></a> <a href="https://mathstodon.xyz/tags/JobSearch" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>JobSearch</span></a> <a href="https://mathstodon.xyz/tags/Vacancy" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Vacancy</span></a> <a href="https://mathstodon.xyz/tags/AcademicChatter" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AcademicChatter</span></a> <a href="https://mathstodon.xyz/tags/Career" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Career</span></a> <a href="https://mathstodon.xyz/tags/CombinatorialOptimisation" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>CombinatorialOptimisation</span></a> <a href="https://mathstodon.xyz/tags/Sustainability" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Sustainability</span></a> <a href="https://mathstodon.xyz/tags/EnergyTransition" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>EnergyTransition</span></a> <a href="https://mathstodon.xyz/tags/Wind" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Wind</span></a> <a href="https://mathstodon.xyz/tags/WindEnergy" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>WindEnergy</span></a> <a href="https://mathstodon.xyz/tags/ReinforcementLearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ReinforcementLearning</span></a> <a href="https://mathstodon.xyz/tags/ML" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ML</span></a> <a href="https://mathstodon.xyz/tags/MachineLearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>MachineLearning</span></a> <a href="https://mathstodon.xyz/tags/WindTurbines" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>WindTurbines</span></a> <a href="https://mathstodon.xyz/tags/ComputerScience" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ComputerScience</span></a> <a href="https://mathstodon.xyz/tags/Optimisation" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Optimisation</span></a> <a href="https://mathstodon.xyz/tags/Optimization" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Optimization</span></a> <a href="https://mathstodon.xyz/tags/CombinatorialOptimization" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>CombinatorialOptimization</span></a> <a href="https://mathstodon.xyz/tags/PostDoc" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>PostDoc</span></a> <a href="https://mathstodon.xyz/tags/AcademicCareer" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AcademicCareer</span></a> <a href="https://mathstodon.xyz/tags/Academia" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Academia</span></a> <a href="https://mathstodon.xyz/tags/AcademicJob" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AcademicJob</span></a> <a href="https://mathstodon.xyz/tags/AcademicJobs" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AcademicJobs</span></a> <a href="https://mathstodon.xyz/tags/TUDelft" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>TUDelft</span></a></p>
Sean Patrick<p>New instance, new <a href="https://wandering.shop/tags/introduction" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>introduction</span></a>! </p><p>I'm a <a href="https://wandering.shop/tags/DataScientist" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DataScientist</span></a> with a background in <a href="https://wandering.shop/tags/ReinforcementLearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ReinforcementLearning</span></a> and <a href="https://wandering.shop/tags/ElectricalEngineering" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ElectricalEngineering</span></a>. Well, that's what my resume says, but really I'm a <a href="https://wandering.shop/tags/poet" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>poet</span></a> and a SF/F <a href="https://wandering.shop/tags/writer" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>writer</span></a>. I love to play <a href="https://wandering.shop/tags/DnD" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DnD</span></a> and other <a href="https://wandering.shop/tags/TTRPGs" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>TTRPGs</span></a>.</p><p>I use they/them pronouns, and "Dr." not "Mr.", please and thank you.</p><p>I maintain a blog at www.seanpatrick.phd which includes a current list of publications, including my debut sonnet collection, "Love, Death, and Other Surprises."</p>
Dr. Carlotta A. Berry, PhD<p><a href="https://blacktwitter.io/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a> <a href="https://blacktwitter.io/tags/MachineLearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>MachineLearning</span></a> <a href="https://blacktwitter.io/tags/BiasInAI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>BiasInAI</span></a> <a href="https://blacktwitter.io/tags/STEMSaturday" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>STEMSaturday</span></a> <a href="https://blacktwitter.io/tags/DeepLearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DeepLearning</span></a> <a href="https://blacktwitter.io/tags/ComputerVision" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ComputerVision</span></a> <a href="https://blacktwitter.io/tags/Robotics" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Robotics</span></a> <a href="https://blacktwitter.io/tags/ReinforcementLearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ReinforcementLearning</span></a> </p><p>Meet the editors of "Mitigating Bias in Machine Learning" Dr. Carlotta Berry and Dr. Brandeis Hill Marshall (Brandeis Marshall, PhD) <br>This practical guide shows, step by step, how to use machine learning to carry out actionable decisions that do not discriminate based on numerous human factors, including ethnicity and gender.<br>On Sale On Amazon <a href="https://a.co/d/dtMizVH" rel="nofollow noopener" target="_blank"><span class="invisible">https://</span><span class="">a.co/d/dtMizVH</span><span class="invisible"></span></a></p>
Brandon Rohrer<p>Adding my love letter to</p><p>arxiv.org/pdf/2304.01315</p><p>Empirical Design in Reinforcement Learning<br>by<br>Andrew Patterson, Samuel Neumann, Martha White, Adam White</p><p>JMLR 25 (2024) 1-63</p><p><a href="https://recsys.social/tags/ReinforcementLearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ReinforcementLearning</span></a></p><p>These aren’t the heroes we deserve, but they are the heroes we need.</p>
Brandon Rohrer<p>If you've ever worked with a physical robot and <a href="https://recsys.social/tags/ReinforcementLearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ReinforcementLearning</span></a> you've had to deal with delays. Thinking takes time, even at computer speeds, and the world doesn't stop.</p><p>One way to minimize the delays is for the to world to act on new commands mid-cycle, rather than wait for its next turn.</p><p><a href="https://www.brandonrohrer.com/rl_noninteger_delay" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">brandonrohrer.com/rl_nonintege</span><span class="invisible">r_delay</span></a></p>
IT News<p>Supercon 2023: Teaching Robots How to Learn - Once upon a time, machine learning was an arcane field, the preserve of a precious... - <a href="https://hackaday.com/2024/09/03/supercon-2023-teaching-robots-how-to-learn/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">hackaday.com/2024/09/03/superc</span><span class="invisible">on-2023-teaching-robots-how-to-learn/</span></a> <a href="https://schleuss.online/tags/reinforcementlearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>reinforcementlearning</span></a> <a href="https://schleuss.online/tags/2023hackadaysupercon" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>2023hackadaysupercon</span></a> <a href="https://schleuss.online/tags/machinelearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>machinelearning</span></a> <a href="https://schleuss.online/tags/algorithm" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>algorithm</span></a> <a href="https://schleuss.online/tags/arduino" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>arduino</span></a> <a href="https://schleuss.online/tags/esp32s3" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>esp32s3</span></a> <a href="https://schleuss.online/tags/cons" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>cons</span></a></p>
Carl Gold, PhD<p>Someone just shared this awesome comic with me. Does anyone know the original source? (I can't read the small signature.) 3 Complaining <a href="https://sigmoid.social/tags/machinelearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>machinelearning</span></a> robots 🤖 : <a href="https://sigmoid.social/tags/SupervisedLearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SupervisedLearning</span></a> - they gave me so much to read, and test! <a href="https://sigmoid.social/tags/unsupervised" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>unsupervised</span></a> - Me too. But at least they told you the answers. <a href="https://sigmoid.social/tags/reinforcementlearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>reinforcementlearning</span></a> - At least you don't get punished for every wrong action. 😆</p>
Brandon Rohrer<p>In architecture diagrams, they are often drawn as separate boxes, but as I go to implement a handful of use cases, I’m having trouble making that abstraction.</p><p>Speculative musings welcome</p><p> <a href="https://recsys.social/tags/ReinforcementLearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ReinforcementLearning</span></a></p>
Brandon Rohrer<p>A fun part of working on a <a href="https://recsys.social/tags/ReinforcementLearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ReinforcementLearning</span></a> workbench is that I get to think about how to connect different kinds of agents to different kinds of worlds – representation, interfaces, abstraction.</p><p>Something I’m stumbling on is representing models and planners.<br>Is there such a thing as a planner distinct from a model? Or is planning just something a model does?<br>In object-oriented programming terms, would a planner be a separate class from a model? Or would it be a method in a model class?</p>
IT News<p>What kind of bug would make machine learning suddenly 40% worse at NetHack? - Enlarge (credit: Aurich Lawson) </p><p>Members of the Legendary Compu... - <a href="https://arstechnica.com/?p=2028789" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">arstechnica.com/?p=2028789</span><span class="invisible"></span></a> <a href="https://schleuss.online/tags/reinforcementlearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>reinforcementlearning</span></a> <a href="https://schleuss.online/tags/imitationlearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>imitationlearning</span></a> <a href="https://schleuss.online/tags/machinelearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>machinelearning</span></a> <a href="https://schleuss.online/tags/softwarebugs" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>softwarebugs</span></a> <a href="https://schleuss.online/tags/roguelikes" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>roguelikes</span></a> <a href="https://schleuss.online/tags/moonphase" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>moonphase</span></a> <a href="https://schleuss.online/tags/roguelike" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>roguelike</span></a> <a href="https://schleuss.online/tags/nethack" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>nethack</span></a> <a href="https://schleuss.online/tags/gaming" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>gaming</span></a> <a href="https://schleuss.online/tags/bugs" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>bugs</span></a> <a href="https://schleuss.online/tags/cuda" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>cuda</span></a> <a href="https://schleuss.online/tags/ai" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ai</span></a></p>
Scripter :verified_flashing:<p>Mini-Quadkopter lernt Fliegen in Sekunden | heise online<br><a href="https://heise.de/-9623443" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">heise.de/-9623443</span><span class="invisible"></span></a> <a href="https://social.tchncs.de/tags/DeepReinforcementLearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DeepReinforcementLearning</span></a> <a href="https://social.tchncs.de/tags/ReinforcementLearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ReinforcementLearning</span></a> <a href="https://social.tchncs.de/tags/RL" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>RL</span></a></p>