Large language models can do jaw-dropping things. But nobody knows exactly why.

Harvard computer scientist Boaz Barak, currently with OpenAI, likens the current state of machine learning to…

German man got 217 COVID shots over 29 months—here’s how it went

A 62-year-old German man received 217 COVID-19 vaccinations over 29 months without suffering ill effects, challenging…

Unsloth Fixing Gemma bugs

Unsloth developers Daniel and Michael Han have dedicated the past week to addressing a series of…

Startling Exception Discovered to 200-Year-Old Law of Physics

Scientists have challenged a foundational concept in thermodynamics by uncovering an exception to Fourier’s law, which…

Ema, a ‘Universal AI employee,’ emerges from stealth with $25M

Millions of research papers at risk of disappearing from the Internet

A recent study reveals a concerning gap in the preservation of scholarly articles, with over a…

Introducing the next generation of Claude

Introducing the Claude 3 model family, a new benchmark in AI intelligence, with three models tailored…

We finally know why humans don’t have tails

Scientists have pinpointed a genetic mutation responsible for the loss of tails in the ancestors of…

GitHub’s Copilot Enterprise is now generally available at $39 a month

Mathematicians Have Discovered the Secret Geometry of Life

Mathematicians from Budapest University of Technology have discovered a new class of natural shapes known as…

The Dawn of 1-Bit Large Language Models

A new paper from Microsoft Research titled “The Era of 1-bit LLMs: All Large Language Models…

Klarna AI assistant handles two-thirds of customer service chats in its first month

Klarna has unveiled its AI assistant developed in collaboration with OpenAI, showcasing impressive results after just…

Mistral AI releases new model to rival GPT-4 and its own chat assistant

echo-embeddings

Echo embeddings offer a novel solution to enhance autoregressive language models by incorporating information from later…

Phind-70B closes the code quality gap with GPT-4 while running 4x faster

Phind, the startup behind the AI assistant of the same name, has released their largest language…

FireFunction V1 – Fireworks’ GPT-4-level function calling model – 4x faster than GPT-4 and open weights

Fireworks has unveiled FireFunction-v1, an enhanced function calling model that integrates external knowledge into large language…

GPT-4 developer tool can hack websites without human help

OpenAI’s GPT-4 has demonstrated the alarming ability to hack websites and extract information from databases autonomously,…

Prompt engineering is a task best left to AI models

Large language models (LLMs) have sparked a new focus on prompt engineering, a technique to craft…

Stable Diffusion 3.0 debuts new diffusion transformation architecture to reinvent text-to-image gen AI

Stability AI has unveiled an early preview of Stable Diffusion 3.0, a cutting-edge text-to-image generative AI…

Google pauses AI-generated images of people after ethnicity criticism

Google has halted the generation of images depicting people by its AI model, Gemini, after it…