Shakedown Social

Erik JonkerTrain your own R1 reasoning model with Unsloth. "We've enhanced the entire GRPO process, making it use 80% less VRAM than Hugging Face + FA2. This allows you to reproduce R1-Zero's "aha moment" on just 7GB of VRAM using Qwen2.5 (1.5B)" <a href="https://mastodon.social/tags/ai" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#ai</a> <a href="https://mastodon.social/tags/reasoning" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#reasoning</a> <a href="https://mastodon.social/tags/unsloth" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#unsloth</a> <a href="https://mastodon.social/tags/opensource" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#opensource</a> <a href="https://mastodon.social/tags/locally" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#locally</a> <a href="https://mastodon.social/tags/grpo" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#grpo</a> <a href="https://unsloth.ai/blog/r1-reasoning" rel="nofollow noopener noreferrer" translate="no" target="_blank">https://unsloth.ai/blog/r1-reasoning</a>

Recent searches

Search options

Administered by:

Server stats:

#unsloth