Emsi - Page 4 of 7 - Emsi's feed

Large language models like GPT-3 and PaLM have demonstrated impressive performance on many natural language tasks.…

AI / ML ARTICLE

Large Language Models Show Promise as General-Purpose Optimizers

2023-09-10

Emsi

A new paper from researchers at Google DeepMind demonstrates the potential for large language models (LLMs)…

AI / ML ARTICLE

Large Language Models Still Struggle with Reliable Code Generation

2023-09-08

Emsi

A new study from researchers at UC San Diego raises concerns about the reliability and robustness…

AI / ML ARTICLE

Can AI Find and Fix Software Vulnerabilities?

2023-09-07

Emsi

A new study evaluates the ability of large language models (LLMs) like ChatGPT to detect and…

AI / ML ARTICLE

A New System to Turn Natural Language Prompts into Deployable AI Models

2023-09-04

Emsi

A team of researchers from Carnegie Mellon University and Tsinghua University have introduced a new system…

AI / ML ARTICLE

Automated Unit Testing Reaches New Heights with ChatGPT-Based Tool ChatUniTest

2023-09-01

Emsi

Unit testing is a crucial yet often tedious task in software development. To make this process…

AI / ML ARTICLE

Knowledge Graph Prompting Enhances Multi-Document Question Answering with Large Language Models

2023-08-29

Emsi

Recent advances in large language models (LLMs) like ChatGPT have shown promising results on open-domain question…

AI / ML ARTICLE Culture Other

Consciousness in Artificial Intelligence: New Insights from Neuroscience

2023-08-28

Emsi

A new interdisciplinary report, which counts as one of its co-authors Yoshua Bengio, the Turing Award…

AI / ML ARTICLE

Reinforcement Learning from Unit Test Feedback Achieves State-of-the-Art in Program Synthesis

2023-08-25

Emsi

A new paper titled “RLTF: Reinforcement Learning from Unit Test Feedback” proposes a novel method for…

AI / ML ARTICLE Tools

Meta AI Releases Code Llama Models for Advanced Code Generation

2023-08-24

Emsi

Meta AI has released Code Llama, a family of large language models for code that establishes…

AI / ML ARTICLE

Bringing Children’s Drawings to Life with AI

2023-08-24

Emsi

A team of researchers from Meta AI, Tencent, MIT, and Carnegie Mellon University have developed a…

AI / ML ARTICLE

Training Code Models to Follow Instructions with OctoPack

2023-08-23

Emsi

A new paper titled “OctoPack: Instruction Tuning Code Large Language Models” proposes a novel method for…

AI / ML ARTICLE

New AI System Accelerates Large Language Model Serving

2023-08-21

Emsi

A new artificial intelligence system called SpecInfer can significantly accelerate large language model…

AI / ML ARTICLE

GPT-4 is Easily Tricked with Encrypted Messages

2023-08-18

Emsi

A new study reveals a surprising vulnerability in large language models like GPT-4 – they can…

AI / ML ARTICLE Tools

WizardMath: Empowering Large Language Models for Mathematical Reasoning

2023-08-17

Emsi

A team of researchers from Microsoft and the Shenzhen Institute of Advanced Technology have developed a…

AI / ML ARTICLE

Bayesian Flow Networks: A New Deep Generative Modeling Approach

2023-08-17

Emsi

A new deep generative modeling technique called Bayesian Flow Networks (BFNs) was recently introduced in a…

AI / ML ARTICLE

GPT-4 Code Interpreter Shows Impressive Math Reasoning Abilities With Self-Verification

2023-08-16

Emsi

A new study reveals that OpenAI’s latest version of GPT-4, known as the GPT-4 Code Interpreter,…

AI / ML ARTICLE

GPT-4 Outperforms RL Algorithms in Crafter by Reading Paper and Reasoning

2023-08-14

Emsi

A new approach called SPRING allows large language models (LLMs) like GPT-4 to achieve strong performance…

AI / ML ARTICLE

Humpback: Self-Alignment Allows Language Models to Improve Without Human Supervision

2023-08-14

Emsi

A new technique called “instruction backtranslation” allows large language models (LLMs) to improve their ability to…

AI / ML ARTICLE

New Benchmark Tests AI Agents in Real-World Challenges

2023-08-14

Emsi

Researchers from Tsinghua University, The Ohio State University, and UC Berkeley have introduced AgentBench, a new…

Category: Emsi

When Stars Align with AI: Training LLM for Astronomy Texts
