GPT-4 Architecture, Infrastructure, Training Dataset, Costs, Vision, MoE

AI summary: OpenAI’s GPT-4 model architecture is not a secret, but a replicable solution with complex…

Meet LongLLaMA: A Large Language Model Capable of Handling Long Contexts of 256k Tokens

AI summary: Researchers have developed the Focused Transformer (FOT), a technique that addresses the challenge of…

The EU’s Product Liability Directive could kill open source

AI summary: The rise in software supply chain attacks has outpaced policy development, leading to a…

New AI tool can help treat brain tumors more quickly and accurately, study finds

AI summary: Harvard Medical School researchers have developed an artificial intelligence (AI) tool that could improve…

Machine learning enables accurate electronic structure calculations at large scales for material modeling

AI summary: Researchers from CASUS at HZDR, Germany, and Sandia National Laboratories, U.S., have developed a…

torchscale: Transformers at any scale

GitHub – mshumer/gpt-prompt-engineer

AI summary: `gpt-prompt-engineer` is a tool that generates, tests, and ranks AI prompts for…

InterCode – interactive coding with execution feedback

AI summary: InterCode introduces a new standard for interactive coding with execution feedback, aiming to enhance…

Team develops a faster, cheaper way to train large language models

AI summary: A team from Stanford University has developed Sophia, a method to optimize the pretraining…

InstructBLIP

AI summary: The InstructBLIP model, based on the pre-trained BLIP-2 models, is a general-purpose vision-language model…

Stay on topic with Classifier-Free Guidance

In Stable Diffusion, CFG (Classifier-Free Guidance) is used to guide a model to follow a given…

What’s it like to code with GPT-4 and aider?

AI summary: Explore the capabilities of GPT-4 in coding tasks through the aider command-line chat tool.…

GitHub – imoneoi/openchat: OpenChat: Less is More for Open-source Models

AI summary: Billed as the first model to beat ChatGPT, OpenChat, a series of open-source language models, has been…

Can LLMs Generate Mathematical Proofs that can be Rigorously Checked? Meet LeanDojo: An Open-Source AI Playground With Toolkits, Benchmarks, and Models for Large Language Models to Prove Formal Theorems in the Lean Proof Assistant

AI summary: Researchers from Caltech, NVIDIA, MIT, UC Santa Barbara, and UT Austin have developed LeanDojo,…

Chinese Researchers Used AI to Design RISC-V CPU in Under 5 Hours

AI summary: Chinese scientists have designed an industrial-scale RISC-V CPU using AI in under five hours,…

New AI-based theory explains your weird dreams

AI summary: Erik Hoel’s “overfitted brain hypothesis” suggests that dreaming is the brain’s way of generalizing…

Man designs ChatGPT bot subscription service to annoy and waste telemarketers’ time

AI summary: Roger Anderson, owner of Jolly Roger Telephone Company, uses AI-powered bots to combat robocallers.…

Announcing Windows 11 Insider Preview Build 23493

AI summary: Microsoft has released Windows 11 Insider Preview Build 23493 to the Dev Channel, introducing…

Training LLMs with AMD MI250 GPUs and MosaicML

AI summary: MosaicML has successfully tested AMD’s MI250 GPU for machine learning (ML) training, finding it…

The Huge Power and Potential Danger of AI-Generated Code

AI summary: GitHub’s AI coding tool, Copilot, is transforming the coding landscape, with a report showing…