Large Language Models AI News & Research

🤗 AI Labs Hugging Face Blog 4 min read

Holotron-12B - High Throughput Computer Use Agent

#AI #Computer Vision #Large Language Models

🕐 a month ago

Read →

📅 Newsletters Last Week in AI 2 min read

LWiAI Podcast #236 - GPT 5.4, Gemini 3.1 Flash Lite, Supply Chain Risk

OpenAI launches GPT-5.4 with Pro and Thinking versions, Google releases Gemini 3.1 Flash Lite at 1/8th the cost of Pro, Where things stand with the Department of War Anthropic

#Large Language Models #AI Safety #Defense and Policy

🕐 2 months ago

Read →

📅 Newsletters Last Week in AI 3 min read

LWiAI Podcast #235 - Sonnet 4.6, Deep-thinking tokens, Anthropic vs Pentagon

Anthropic releases Sonnet 4.6, Google Rolls Out Gemini 3.1 Pro, Anthropic CEO Amodei says Pentagon’s threats ‘do not change our position’ on AI

#Anthropic #Large Language Models #AI Infrastructure

🕐 2 months ago

Read →

📅 Newsletters Last Week in AI 1 min read

LWiAI Podcast #234 - Opus 4.6, GPT-5.3-Codex, Seedance 2.0, GLM-5

An action-packed episode!

#Large Language Models #Generative Media #Industry Funding

🕐 2 months ago

Read →

📅 Newsletters Last Week in AI 1 min read

Last Week in AI #335 - Opus 4.6, Codex 5.3, Gemini 3 Deep Think, GLM 5, Seedance 2.0

A crazy packed edition of Last Week in AI! Plus some small updates.

#AI Models #Large Language Models #Gemini

🕐 2 months ago

Read →

📅 Newsletters Last Week in AI 1 min read

Last Week in AI #334 - Kimi K2.5 & Code, Genie 3, OpenClaw & Moltbook

China’s Moonshot releases a new open source model Kimi K2.5 and a coding agent, Google Brings Genie 3’s Interactive World-Building Prototype to AI Ultra Subscribers, and more!

#AI Models #Multimodal #Open Source

🕐 3 months ago

Read →

📅 Newsletters Last Week in AI 1 min read

Last Week in AI #332 - Apple + Gemini, OpenAI + Cerebras, Claude Cowork

Google’s Gemini to power Apple’s AI features like Siri, OpenAI signs deal worth $10B for compute from Cerebras, and more!

#Apple #Google Gemini #AI Partnership

🕐 3 months ago

Read →

🔄 News Synced Review 5 min read

MIT Researchers Unveil “SEAL”: A New Step Towards Self-Improving AI

MIT introduces SEAL, a framework enabling large language models to self-edit and update their weights via reinforcement learning. The post MIT Researchers Unveil “SEAL”: A New Step Towards Self-Improving AI…

#AI #Self-Improvement #Large Language Models

🕐 10 months ago

Read →

🔄 News Synced Review 12 min read

DeepSeek-V3 New Paper is coming! Unveiling the Secrets of Low-Cost Large Model Training through Hardware-Aware Co-design

A newly released 14-page technical paper from the team behind DeepSeek-V3, with DeepSeek CEO Wenfeng Liang as a co-author, sheds light on the “Scaling Challenges and Reflections on Hardware for…

#DeepSeek #Large Language Models #Hardware Design

🕐 11 months ago

Read →

🔄 News Synced Review 3 min read

DeepSeek Unveils DeepSeek-Prover-V2: Advancing Neural Theorem Proving with Recursive Proof Search and a New Benchmark

DeepSeek AI releases DeepSeek-Prover-V2, an open-source LLM for Lean 4 theorem proving. It uses recursive proof search with DeepSeek-V3 for training data and reinforcement learning, achieving top results on MiniF2F.…

#AI #theorem proving #formal methods

🕐 1 year, 13 days ago

Read →

🔄 News Synced Review 8 min read

Can GRPO be 10x Efficient? Kwai AI’s SRPO Suggests Yes with SRPO

Kwai AI's SRPO framework slashes LLM RL post-training steps by 90% while matching DeepSeek-R1 performance in math and code. This two-stage RL approach with history resampling overcomes GRPO limitations. The…

#reinforcement learning #large language models #reasoning models

🕐 1 year, 19 days ago

Read →

🔄 News Synced Review 5 min read

DeepSeek Signals Next-Gen R2 Model, Unveils Novel Approach to Scaling Inference with SPCT

DeepSeek AI, a prominent player in the large language model arena, has recently published a research paper detailing a new technique aimed at enhancing the scalability of general reward models…

#DeepSeek #Large Language Models #Reinforcement Learning

🕐 1 year, 1 month ago

Read →

📊 Business Benedict Evans 6 min read

Are better models better?

Every week there’s a better AI model that gives better answers. But a lot of questions don’t have better answers, only ‘right’ answers, and these models can’t do that. So…

#artificial intelligence #large language models #AI limitations

🕐 1 year, 3 months ago

Read →

🌍 Safety Future of Life Institute 14 min read

Why You Should Care About AI Agents

Powerful AI agents are about to hit the market. Here we explore the implications.

#AI agents #artificial intelligence #large language models

🕐 1 year, 5 months ago

Read →

📐 Research The Gradient 14 min read

Car-GPT: Could LLMs finally make self-driving cars happen?

Exploring the utility of large language models in autonomous driving: Can they be trusted for self-driving cars, and what are the key challenges?

#autonomous vehicles #large language models #machine learning

🕐 2 years ago

Read →

Large Language Models AI News & Research · DeepTrendLab

Large Language Models