#llm

🎯 Newsletters Machine Learning Mastery 4 min read

5 Techniques for Efficient Long-Context RAG

#rag #long-context #llm

🕐 21 days ago

Read →

🍎 AI Labs Apple ML Research 2 min read

Cram Less to Fit More: Training Data Pruning Improves Memorization of Facts

This paper was accepted at the Workshop on Navigating and Addressing Data Problems for Foundation Models at ICLR 2026. Large language models (LLMs) can struggle to memorize factual knowledge in…

#llm #memorization #data pruning

🕐 24 days ago

Read →

🎯 Newsletters Machine Learning Mastery 13 min read

Beyond Vector Search: Building a Deterministic 3-Tiered Graph-RAG System

#rag #knowledge-graphs #vector-databases

🕐 26 days ago

Read →

📅 Newsletters Last Week in AI 3 min read

LWiAI Podcast #238 - GPT 5.4 mini, OpenAI Pivot, Mamba 3, Attention Residuals

OpenAI ships GPT-5.4 mini and nano, faster and more capable but up to 4x pricier, DLSS 5 looks like a real-time generative AI filter for video games | The Verge,…

#llm #openai #agents

🕐 a month ago

Read →

🍔 Newsletters Ben's Bites 5 min read

A peek inside CLI tools

No more funny videos at OpenAI

#agents #cli-tools #llm

🕐 a month ago

Read →

📥 Newsletters Import AI 16 min read

ImportAI 449: LLMs training other LLMs; 72B distributed training run; computer vision is harder than generative text

Welcome to Import AI, a newsletter about AI research. Import AI runs on arXiv and feedback from readers. If you’d like to support this, please subscribe. Subscribe now Can LLMs…

#llm #post-training #ai-autonomy

🕐 a month ago

Read →

📅 Newsletters Last Week in AI 3 min read

LWiAI Podcast #237 - Nemotron 3 Super, xAI reborn, Anthropic Lawsuit, Research!

Nemotron 3 Super: An Open Hybrid Mamba-Transformer MoE for Agentic Reasoning, Another XAI Cofounder Has Left, Anthropic Sues Department of Defense

#ai-agents #llm #nvidia

🕐 a month ago

Read →

📅 Newsletters Last Week in AI 3 min read

LWiAI Podcast #234 - Opus 4.6, GPT-5.3-Codex, Seedance 2.0, GLM-5

An action-packed episode!

#llm #generative-media #model-releases

🕐 2 months ago

Read →

📅 Newsletters Last Week in AI 2 min read

LWiAI Podcast #229 - Gemini 3 Flash, ChatGPT Apps, Nemotron 3

Google launches Gemini 3 Flash, ChatGPT launches an app store, Introducing GPT-5.2-Codex

#gemini #chatgpt #llm

🕐 4 months ago

Read →

♻️ Tools Replicate Blog 3 min read

IBM's Granite 4.0 is now on Replicate

#llm #open-source #ibm

🕐 7 months ago

Read →

♻️ Tools Replicate Blog 1 min read

Run OpenAI’s latest models on Replicate

OpenAI's latest models are now available on Replicate, including GPT-4.1, GPT-4o, and the o-series.

#openai #llm #api

🕐 11 months ago

Read →

📐 Research The Gradient 14 min read

What's Missing From LLM Chatbots: A Sense of Purpose

LLM-based chatbots’ capabilities have been advancing every month. These improvements are mostly measured by benchmarks like MMLU, HumanEval, and MATH (e.g. sonnet 3.5, gpt-4o). However, as these measures get more…

#llm #chatbots #dialogue systems

🕐 1 year, 7 months ago

Read →

📐 Research The Gradient 9 min read

Financial Market Applications of LLMs

The AI revolution drove frenzied investment in both private and public companies and captured the public’s imagination in 2023. Transformational consumer products like ChatGPT are powered by Large Language Models…

#llm #overviews

🕐 2 years ago

Read →

📐 Research The Gradient 14 min read

Car-GPT: Could LLMs finally make self-driving cars happen?

Exploring the utility of large language models in autonomous driving: Can they be trusted for self-driving cars, and what are the key challenges?

#llm #perspectives

🕐 2 years ago

Read →

📐 Research The Gradient 11 min read

Do text embeddings perfectly encode text?

'Vec2text' can serve as a solution for accurately reverting embeddings back into text, thus highlighting the urgent need for revisiting security protocols around embedded data.

#interpretability #llm #nlp

🕐 2 years ago

Read →

#llm — AI News & Research · DeepTrendLab