Large language models like GPT-3 and PaLM have demonstrated impressive performance on many natural language tasks.…
Category: Emsi
In this category I publish my original content: blog posts, analysis, articles, random thoughts, experiments, and so on. In other words, everything that is not purely a feed of things I’ve found on the internet.
Large Language Models Show Promise as General-Purpose Optimizers
A new paper from researchers at Google DeepMind demonstrates the potential for large language models (LLMs)…
Large Language Models Still Struggle with Reliable Code Generation
A new study from researchers at UC San Diego raises concerns about the reliability and robustness…
Can AI Find and Fix Software Vulnerabilities?
A new study evaluates the ability of large language models (LLMs) like ChatGPT to detect and…
A New System to Turn Natural Language Prompts into Deployable AI Models
A team of researchers from Carnegie Mellon University and Tsinghua University has introduced a new system…
Automated Unit Testing Reaches New Heights with ChatGPT-Based Tool ChatUniTest
Unit testing is a crucial yet often tedious task in software development. To make this process…
Knowledge Graph Prompting Enhances Multi-Document Question Answering with Large Language Models
Recent advances in large language models (LLMs) like ChatGPT have shown promising results on open-domain question…
Consciousness in Artificial Intelligence: New Insights from Neuroscience
A new interdisciplinary report, which counts as one of its co-authors Yoshua Bengio, the Turing Award…
Reinforcement Learning from Unit Test Feedback Achieves State-of-the-Art in Program Synthesis
A new paper titled “RLTF: Reinforcement Learning from Unit Test Feedback” proposes a novel method for…
Meta AI Releases Code Llama Models for Advanced Code Generation
Meta AI has released Code Llama, a family of large language models for code that establishes…
Bringing Children’s Drawings to Life with AI
A team of researchers from Meta AI, Tencent, MIT, and Carnegie Mellon University has developed a…
Training Code Models to Follow Instructions with OctoPack
A new paper titled “OctoPack: Instruction Tuning Code Large Language Models” proposes a novel method for…
New AI System Accelerates Large Language Model Serving
A new artificial intelligence system called SpecInfer can significantly accelerate the speed of large language model…
GPT-4 is Easily Tricked with Encrypted Messages
A new study reveals a surprising vulnerability in large language models like GPT-4 – they can…
WizardMath: Empowering Large Language Models for Mathematical Reasoning
A team of researchers from Microsoft and the Shenzhen Institute of Advanced Technology has developed a…
Bayesian Flow Networks: A New Deep Generative Modeling Approach
A new deep generative modeling technique called Bayesian Flow Networks (BFNs) was recently introduced in a…
GPT-4 Code Interpreter Shows Impressive Math Reasoning Abilities With Self-Verification
A new study reveals that OpenAI’s latest version of GPT-4, known as the GPT-4 Code Interpreter,…
GPT-4 Outperforms RL Algorithms in Crafter by Reading Paper and Reasoning
A new approach called SPRING allows large language models (LLMs) like GPT-4 to achieve strong performance…
Humpback: Self-Alignment Allows Language Models to Improve Without Human Supervision
A new technique called “instruction backtranslation” allows large language models (LLMs) to improve their ability to…
New Benchmark Tests AI Agents in Real-World Challenges
Researchers from Tsinghua University, The Ohio State University, and UC Berkeley have introduced AgentBench, a new…