Multimodal Web Navigation with Instruction-Finetuned Foundation Models

GPT-4: WebGUM, a multimodal agent, leverages vision-language foundation models to improve autonomous web navigation. By jointly…

Guillotine Regularization: Why removing layers is needed to improve…

GPT-4: Guillotine Regularization (GR) is a critical technique in Self-Supervised Learning (SSL) that significantly improves generalization…

Japan Goes All In: Copyright Doesn’t Apply To AI Training

GPT-4: Japan’s government has decided not to enforce copyrights on data used in AI training, aiming…

We know That LLMs Can Use Tools, But Did You Know They Can Also Make New Tools? Meet LLMs As Tool Makers (LATM): A Closed-Loop System Allowing LLMs To Make Their Own Reusable Tools

GPT-4: Researchers from Google Deepmind, Princeton University, and Stanford University have developed a system called LLMs…

Meet LLMScore: A New LLM-based Instruction-Following Matching Pipeline to Evaluate the Alignment Between Text Prompts and Synthesized Images in Text-to-Image Synthesis

GPT-4: Researchers have introduced LLMScore, a framework that leverages large language models (LLMs) to evaluate text-image…

Raspberry Pi Camera Takes Photos Using AI Instead of Lens

GPT-4: Bjørn Karmann has created a unique Raspberry Pi camera project called Paragraphica, which uses AI…

ChatGPT and large language models in gastroenterology – Nature Reviews Gastroenterology & Hepatology

GPT-4: Explore the potential of artificial intelligence (AI) in revolutionizing the field of endoscopy, as it…

Researchers from UC Berkeley Introduce Gorilla: A Finetuned LLaMA-based Model that Surpasses GPT-4 on Writing API Calls

GPT-4: Researchers from Berkeley and Microsoft have developed Gorilla, a finetuned LLaMA-based model that outperforms GPT-4…

Can Language Models Generate New Scientific Ideas? Meet Contextualized Literature-Based Discovery (C-LBD)

GPT-4: Researchers from the University of Illinois, Hebrew University of Jerusalem, and the Allen Institute for…

Ortus – your YouTube AI buddy

GPT-4: Ortus is an AI-powered extension that enhances your YouTube learning experience by providing real-time answers…

GitHub – theonlyfoxy/CommanderGPT: Voice Assisted Desktop Automation for Simple to Complex Tasks using ChatGPT

GPT-4: Introducing CommanderGPT, a powerful desktop automation tool that leverages OpenAI’s GPT-3.5 language model for seamless…

LLMs Outperform Reinforcement Learning- Meet SPRING: An Innovative Prompting Framework for LLMs Designed to Enable in-Context Chain-of-Thought Planning and Reasoning

GPT-4: Researchers from Carnegie Mellon University, NVIDIA, Ariel University, and Microsoft have developed SPRING, a Large…

UAE’s Technology Innovation Institute Launches Open-Source “Falcon 40B” Large Language Model for Research & Commercial Utilization

GPT-4: The Technology Innovation Institute (TII) has announced that the UAE’s first large-scale AI model, Falcon…

Meet PromptingWhisper: Using Prompt Engineering to Adapt the Whisper Model to Unseen Tasks, the Proposed Prompts Enhances Performance by 10% to 45% on Three Zero-Shot Tasks

GPT-4: Researchers have adapted OpenAI’s Whisper model, an automatic speech recognition system, to perform unseen tasks…

tiiuae/falcon-40b · Hugging Face

GPT-4: Falcon-40B is a cutting-edge causal decoder-only model developed by TII, trained on 1,000B tokens of…

Voyager: An Open-Ended Embodied Agent with Large Language Models

Gpt-4: Introducing Voyager, the first GPT-4 powered agent capable of lifelong learning in Minecraft. It continually…

Stanford Researchers Introduce Sophia: A Scalable Second-Order Optimizer For Language Model Pre-Training

GPT-4: Researchers have developed a novel optimizer called Sophia, which can train large language models (LLMs)…

Massively Multilingual Speech (MMS) project

Massively Multilingual Speech (MMS) project by Meta combines wav2vec 2.0 and a new dataset to provide…

Built-in ChatGPT-driven Copilot will transform Windows 11 starting in June

GPT-4: Microsoft is set to introduce a ChatGPT-driven Copilot feature to Windows 11, offering AI assistance…

GitHub – artidoro/qlora: QLoRA: Efficient Finetuning of Quantized LLMs

GPT-4: QLoRA is an efficient finetuning approach that enables training a 65B parameter model on a…