Supervised Fine-Tuning and Direct Preference Optimization on Intel Gaudi2


Intel has developed a top-ranked 7B chat model using Intel Gaudi2 accelerators, which are designed to speed up large language model (LLM) training and inference. The training pipeline combines supervised fine-tuning (SFT) with direct preference optimization (DPO), achieving results comparable to or better than other open-source LLMs. The Gaudi2 AI accelerator, developed by Habana Labs, targets state-of-the-art deep learning training and inference.
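As a rough illustration of the DPO step mentioned above, the sketch below computes the standard DPO loss for a single preference pair from log-probabilities of the chosen and rejected responses under the policy and a frozen reference model. The function name, arguments, and beta value are illustrative assumptions, not Intel's actual training code.

```python
import math

def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Per-example DPO loss: -log sigmoid(beta * (r_chosen - r_rejected)),
    where r = log pi_theta(y|x) - log pi_ref(y|x).
    All arguments are hypothetical precomputed log-probabilities."""
    chosen_ratio = policy_chosen_logp - ref_chosen_logp
    rejected_ratio = policy_rejected_logp - ref_rejected_logp
    margin = beta * (chosen_ratio - rejected_ratio)
    # Numerically plain sigmoid; fine for illustration
    return -math.log(1.0 / (1.0 + math.exp(-margin)))

# The loss shrinks as the policy prefers the chosen response
# (relative to the reference) more strongly than the rejected one.
good = dpo_loss(-1.0, -2.0, -1.5, -1.5)   # policy favors chosen
bad = dpo_loss(-2.0, -1.0, -1.5, -1.5)    # policy favors rejected
```

When the policy and reference agree exactly, the margin is zero and the loss equals log 2, the usual starting point before preference training shifts the policy.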
Read more at Medium…