#llm
63 articles
Cram Less to Fit More: Training Data Pruning Improves Memorization of Facts
This paper was accepted at the Workshop on Navigating and Addressing Data Problems for Foundation Models at ICLR 2026. Large language models (LLMs) can struggle to memorize factual knowledge in…
Beyond Vector Search: Building a Deterministic 3-Tiered Graph-RAG System
LWiAI Podcast #238 - GPT 5.4 mini, OpenAI Pivot, Mamba 3, Attention Residuals
OpenAI ships GPT-5.4 mini and nano, faster and more capable but up to 4x pricier, DLSS 5 looks like a real-time generative AI filter for video games | The Verge,…
ImportAI 449: LLMs training other LLMs; 72B distributed training run; computer vision is harder than generative text
Welcome to Import AI, a newsletter about AI research. Import AI runs on arXiv and feedback from readers. If you’d like to support this, please subscribe. Subscribe now Can LLMs…
LWiAI Podcast #237 - Nemotron 3 Super, xAI reborn, Anthropic Lawsuit, Research!
Nemotron 3 Super: An Open Hybrid Mamba-Transformer MoE for Agentic Reasoning, Another XAI Cofounder Has Left, Anthropic Sues Department of Defense
LWiAI Podcast #234 - Opus 4.6, GPT-5.3-Codex, Seedance 2.0, GLM-5
An action-packed episode!
LWiAI Podcast #229 - Gemini 3 Flash, ChatGPT Apps, Nemotron 3
Google launches Gemini 3 Flash, ChatGPT launches an app store, Introducing GPT-5.2-Codex
IBM's Granite 4.0 is now on Replicate
Run OpenAI’s latest models on Replicate
OpenAI's latest models are now available on Replicate, including GPT-4.1, GPT-4o, and the o-series.
What's Missing From LLM Chatbots: A Sense of Purpose
LLM-based chatbots’ capabilities have been advancing every month. These improvements are mostly measured by benchmarks like MMLU, HumanEval, and MATH (e.g. sonnet 3.5, gpt-4o). However, as these measures get more…
Financial Market Applications of LLMs
The AI revolution drove frenzied investment in both private and public companies and captured the public’s imagination in 2023. Transformational consumer products like ChatGPT are powered by Large Language Models…
Car-GPT: Could LLMs finally make self-driving cars happen?
Exploring the utility of large language models in autonomous driving: Can they be trusted for self-driving cars, and what are the key challenges?
Do text embeddings perfectly encode text?
'Vec2text' can serve as a solution for accurately reverting embeddings back into text, thus highlighting the urgent need for revisiting security protocols around embedded data.