GitHub – openai/transformer-debugger

Answer.AI – Enabling 70B Finetuning on Consumer GPUs

Answer.AI has unveiled FSDP+QLoRA, a groundbreaking open-source project that allows for the fine-tuning of massive models…

Microsoft’s AI Copilot for Security launches next month with pay-as-you-go pricing

Devin, the first AI software engineer

Meet Devin, the pioneering AI software engineer developed by Cognition, an applied AI lab focused on…

GitHub – punica-ai/punica: Serving multiple LoRA finetuned LLM as one

Elon Musk to open-source AI chatbot Grok this week

Radical New Discovery Could Double The Speed of Existing Computers

Researchers at the University of California, Riverside have developed an innovative computing process called simultaneous and…

Russian Fiber Optic Drone Can Beat Any Jammer

Kalashnikov subsidiary ZALA’s new quadcopter, ‘Product 55’, utilizes a surprising method to avoid radio jamming: a…

Large language models can do jaw-dropping things. But nobody knows exactly why.

Harvard computer scientist Boaz Barak, currently with OpenAI, likens the current state of machine learning to…

Unsloth Fixing Gemma bugs

Unsloth developers Daniel and Michael Han have dedicated the past week to addressing a series of…

Ema, a ‘Universal AI employee,’ emerges from stealth with $25M

Introducing the next generation of Claude

Introducing the Claude 3 model family, a new benchmark in AI intelligence, with three models tailored…

GitHub’s Copilot Enterprise is now generally available at $39 a month

The Dawn of 1-Bit Large Language Models

A new paper from Microsoft Research titled “The Era of 1-bit LLMs: All Large Language Models…

Klarna AI assistant handles two-thirds of customer service chats in its first month

Klarna has unveiled its AI assistant developed in collaboration with OpenAI, showcasing impressive results after just…

Mistral AI releases new model to rival GPT-4 and its own chat assistant

echo-embeddings

Echo embeddings offer a novel solution to enhance autoregressive language models by incorporating information from later…

Phind-70B closes the code quality gap with GPT-4 while running 4x faster

Phind, the startup behind the AI assistant of the same name, has released their largest language…

FireFunction V1 – Fireworks’ GPT-4-level function calling model – 4x faster than GPT-4 and open weights

Fireworks has unveiled FireFunction-v1, an enhanced function calling model that integrates external knowledge into large language…

GPT-4 developer tool can hack websites without human help

OpenAI’s GPT-4 has demonstrated the alarming ability to hack websites and extract information from databases autonomously,…