Explore the latest AI news and research tagged #inference optimization, curated from top sources including OpenAI, Anthropic, Google DeepMind, and more.
#inference optimization
3 articles
☁️ AI Labs
AWS Machine Learning Blog
10 min read
Tomofun, the Taiwan-headquartered pet-tech startup behind the Furbo Pet Camera, is redefining how pet owners interact with their pets remotely. To reduce costs while maintaining accuracy, Tomofun turned to EC2 Inf2 instances powered by AWS Inferentia2, Amazon's purpose-built AI chips. In this post, we walk through the following sections in detail.
ℹ️ News
InfoQ AI
3 min read
The schedule for QCon AI Boston 2026 (June 1-2) is now live. The two-day program groups sessions around context engineering, inference economics, agent reliability, and how AI is changing the…
🍎 AI Labs
Apple ML Research
2 min read
Recent advances in test-time computing for large language models (LLMs) have introduced the capability to perform intermediate chain-of-thought (CoT) reasoning (thinking) before generating answers. While increasing the thinking budget yields smooth…