unsloth: 5X faster 50% less memory LLM finetuning

Unsloth, a new technology, offers 80% faster and 50% less memory-consuming local QLoRA finetuning. It uses OpenAI’s Triton language and supports NVIDIA GPUs since 2018+. The technology maintains accuracy without requiring hardware changes and supports 4bit and 16bit LoRA finetuning. It also allows training Slim Orca fully locally in 260 hours, a significant reduction from 1301 hours. The open-source version offers 5x faster training, while Unsloth Pro and Max provide 30x faster training.
Read more at GitHub…

unsloth: 5X faster 50% less memory LLM finetuning

Related

When the Vending Machine Went Sentient

Constant-Time Breakthrough Raises the Hash-Table Speed Limit

Star Wars Reimagined: China’s Laser Satellite Outpaces Starlink

Court Rules AI’s Use of Books as Fair Use but Slams Pirated Collection Storage

Introducing the OWASP AI Testing Guide: A New Standard for AI Security Testing

The Low-Background Steel Problem of AI

Chinese AI Firms Dodge US Chip Bans with Cross-Border Data Smuggling to Malaysia

OpenAI open-sources a demo of a UI testing agent

Financial Dynamics in Agentic AI: Cursor’s Rise Versus GitHub Copilot