Apple ML Research

🍎 AI Labs Apple ML Research 2 min read

SpecMD: A Comprehensive Study on Speculative Expert Prefetching

Mixture-of-Experts (MoE) models enable sparse expert activation, meaning that only a subset of the model’s parameters is used during each inference. However, to translate this sparsity into practical performance, an expert caching mechanism is required. Previous works have proposed hardware-centric caching policies, but how these caching policies interact with each other and with different hardware specifications remains poorly understood. To…

#mixture-of-experts #caching-policy #inference-optimization

🕐 a day ago

🍎 AI Labs Apple ML Research 1 min read

Normalizing Flows with Iterative Denoising

Normalizing Flows (NFs) are a classical family of likelihood-based methods that have received revived attention. Recent efforts such as TARFlow have shown that NFs are capable of achieving promising performance…

#normalizing flows #generative models #image synthesis

🕐 a day ago

🍎 AI Labs Apple ML Research 2 min read

From Where Things Are to What They’re For: Benchmarking Spatial–Functional Intelligence for Multimodal LLMs

True spatial intelligence for multimodal agents transcends low-level geometric perception, evolving from knowing where things are to understanding what they are for. While existing benchmarks, such as VSI-Bench, effectively evaluate…

#multimodal llms #spatial reasoning #functional understanding

🕐 a day ago

🍎 AI Labs Apple ML Research 2 min read

Stochastic KV Routing: Enabling Adaptive Depth-Wise Cache Sharing

Serving transformer language models with high throughput requires caching Key-Values (KVs) to avoid redundant computation during autoregressive generation. The memory footprint of KV caching is significant and heavily impacts serving…

#kv-cache #transformers #model-optimization

🕐 2 days ago

🍎 AI Labs Apple ML Research 2 min read

PORTool: Importance-Aware Policy Optimization with Rewarded Tree for Multi-Tool-Integrated Reasoning

Multi-tool-integrated reasoning enables LLM-empowered tool-use agents to solve complex tasks by interleaving natural-language reasoning with calls to external tools. However, training such agents using outcome-only rewards suffers from credit-assignment ambiguity,…

#multi-agent reasoning #tool-use #reinforcement learning

🕐 3 days ago

🍎 AI Labs Apple ML Research 2 min read

Reinforced Agent: Inference-Time Feedback for Tool-Calling Agents

This paper was accepted at the Fifth Workshop on Natural Language Generation, Evaluation, and Metrics at ACL 2026. Tool-calling agents are evaluated on tool selection, parameter accuracy, and scope recognition,…

#tool-calling agents #inference-time evaluation #multi-agent systems

🕐 6 days ago

🍎 AI Labs Apple ML Research 2 min read

STARFlow-V: End-to-End Video Generative Modeling with Normalizing Flows

Normalizing flows (NFs) are end-to-end likelihood-based generative models for continuous data, and have recently regained attention with encouraging progress on image generation. Yet in the video generation domain, where spatiotemporal…

#video generation #normalizing flows #generative models

🕐 7 days ago

🍎 AI Labs Apple ML Research 2 min read

Bootstrapping Sign Language Annotations with Sign Language Models

AI-driven sign language interpretation is limited by a lack of high-quality annotated data. New datasets including ASL STEM Wiki and FLEURS-ASL feature professional interpreters and hundreds of hours of data…

#sign language recognition #machine learning #dataset annotation

🕐 7 days ago

🍎 AI Labs Apple ML Research 1 min read

International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2026

Apple is presenting new research at the annual International Conference on Acoustics, Speech and Signal Processing (ICASSP), which takes place in person in Barcelona, Spain, from May 4 to…

#signal processing #acoustic research #speech processing

🕐 7 days ago

🍎 AI Labs Apple ML Research 2 min read

Adaptive Thinking: Large Language Models Know When to Think in Latent Space

Recent advances in test-time computing for large language models (LLMs) have introduced the capability to perform intermediate chain-of-thought (CoT) reasoning (thinking) before generating answers. While increasing the thinking budget yields smooth…

#large language models #chain-of-thought reasoning #inference optimization

🕐 8 days ago

🍎 AI Labs Apple ML Research 2 min read

DSO: Direct Steering Optimization for Bias Mitigation

Generative models are often deployed to make decisions on behalf of users, such as vision-language models (VLMs) identifying which person in a room is a doctor to help visually impaired…

#bias-mitigation #vision-language-models #activation-steering

🕐 8 days ago

🍎 AI Labs Apple ML Research 2 min read

Local Mechanisms of Compositional Generalization in Conditional Diffusion

Conditional diffusion models appear capable of compositional generalization, i.e., generating convincing samples for out-of-distribution combinations of conditioners, but the mechanisms underlying this ability remain unclear. To make this concrete, we…

#diffusion models #compositional generalization #conditional generation

🕐 9 days ago

🍎 AI Labs Apple ML Research 2 min read

StereoFoley: Object-Aware Stereo Audio Generation from Video

We present StereoFoley, a video-to-audio generation framework that produces semantically aligned, temporally synchronized, and spatially accurate stereo sound at 48 kHz. While recent generative video-to-audio models achieve strong semantic and…

#audio generation #video-to-audio #stereo sound

🕐 9 days ago

🍎 AI Labs Apple ML Research 1 min read

LaDiR: Latent Diffusion Enhances LLMs for Text Reasoning

Large Language Models (LLMs) demonstrate their reasoning ability through chain-of-thought (CoT) generation. However, LLMs’ autoregressive decoding may limit the ability to revisit and refine earlier tokens in a holistic manner,…

#llm #reasoning #diffusion models

🕐 9 days ago

🍎 AI Labs Apple ML Research 1 min read

Learning Long-Term Motion Embeddings for Efficient Kinematics Generation

Understanding and predicting motion is a fundamental component of visual intelligence. Although modern video models exhibit strong comprehension of scene dynamics, exploring multiple possible futures through full video synthesis remains…

#motion generation #video synthesis #flow matching

🕐 13 days ago

🍎 AI Labs Apple ML Research 10 min read

ParaRNN: Large-Scale Nonlinear RNNs, Trainable in Parallel

Recurrent Neural Networks (RNNs) are naturally suited to efficient inference, requiring far less memory and compute than attention-based architectures, but the sequential nature of their computation has historically made it…

#rnn #parallel-training #llm

🕐 14 days ago

🍎 AI Labs Apple ML Research 7 min read

Apple Machine Learning Research at ICLR 2026

Apple is advancing AI and ML with fundamental research, much of which is shared through publications and engagement at conferences in order to accelerate progress in this important field and…

#recurrent neural networks #state space models #3d scene generation

🕐 15 days ago

🍎 AI Labs Apple ML Research 2 min read

Can Large Language Models Understand Context?

Understanding context is key to understanding human language, an ability that Large Language Models (LLMs) increasingly demonstrate to an impressive extent. However, though the evaluation of…

#large language models #context understanding #benchmark

🕐 16 days ago

🍎 AI Labs Apple ML Research 1 min read

What Do Your Logits Know? (The Answer May Surprise You!)

Recent work has shown that probing model internals can reveal a wealth of information not apparent from the model generations. This poses the risk of unintentional or malicious information leakage,…

#vision-language models #model interpretability #information leakage

🕐 17 days ago

🍎 AI Labs Apple ML Research 12 min read

International Conference on Learning Representations (ICLR) 2026

Apple is presenting new research at the annual International Conference on Learning Representations (ICLR), which takes place in person in Rio de Janeiro, Brazil, from April 23 to 27.…

#deep learning #machine learning #conference

🕐 20 days ago

🍎 AI Labs Apple ML Research 2 min read

MixAtlas: Uncertainty-aware Data Mixture Optimization for Multimodal LLM Midtraining

This paper was accepted at the Workshop on Navigating and Addressing Data Problems for Foundation Models (NADPFM) at ICLR 2026. Principled domain reweighting can substantially improve sample efficiency and downstream…

#multimodal-learning #data-mixture-optimization #proxy-models

🕐 21 days ago

🍎 AI Labs Apple ML Research 2 min read

Cram Less to Fit More: Training Data Pruning Improves Memorization of Facts

This paper was accepted at the Workshop on Navigating and Addressing Data Problems for Foundation Models at ICLR 2026. Large language models (LLMs) can struggle to memorize factual knowledge in…

#llm #memorization #data pruning

🕐 24 days ago

🍎 AI Labs Apple ML Research 2 min read

Efficient Privacy Loss Accounting for Subsampling and Random Allocation

We consider the privacy amplification properties of a sampling scheme in which a user’s data is used in k steps chosen randomly and uniformly from a sequence (or set) of…

#differential-privacy #privacy-amplification #sampling-schemes

🕐 24 days ago

🍎 AI Labs Apple ML Research 2 min read

ACM Human-Computer Interaction Conference (CHI) 2026

Apple is presenting new research at the annual ACM (Association for Computing Machinery) CHI Conference on Human Factors in Computing Systems, which takes place in person in Barcelona, Spain,…

#human-computer interaction #wearable design #airpods

🕐 27 days ago

Apple ML Research Articles · DeepTrendLab