Berkeley AI Research

🐻 Research Berkeley AI Research 14 min read

Gradient-based Planning for World Models at Longer Horizons

GRASP is a new gradient-based planner for learned dynamics (a “world model”) that makes long-horizon planning practical by (1) lifting the trajectory into virtual states so optimization is parallel across time, (2) adding stochasticity directly to the state iterates for exploration, and (3) reshaping gradients so actions get clean signals while we avoid brittle “state-input” gradients through high-dimensional vision models.…

#world models #gradient-based planning #reinforcement learning

🕐 16 days ago

Read →

🐻 Research Berkeley AI Research 7 min read

Identifying Interactions at Scale for LLMs

Understanding the behavior of complex machine learning systems, particularly Large Language Models (LLMs), is a critical challenge in modern artificial intelligence. Interpretability research aims to make the decision-making process more…

#llm interpretability #feature interactions #mechanistic interpretability

🕐 a month ago

Read →

🐻 Research Berkeley AI Research 6 min read

Information-Driven Design of Imaging Systems

An encoder (optical system) maps objects to noiseless images, which noise corrupts into measurements. Our information estimator uses only these noisy measurements and a noise model to quantify how well…

#imaging systems #information theory #neural networks

🕐 3 months ago

Read →

🐻 Research Berkeley AI Research 9 min read

RL without TD learning

In this post, I’ll introduce a reinforcement learning (RL) algorithm based on an “alternative” paradigm: divide and conquer . Unlike traditional methods, this algorithm is not based on temporal difference…

#reinforcement learning #off-policy learning #temporal difference learning

🕐 6 months ago

Read →

🐻 Research Berkeley AI Research 6 min read

What exactly does word2vec learn?

What exactly does word2vec learn, and how? Answering this question amounts to understanding representation learning in a minimal yet interesting language modeling task. Despite the fact that word2vec is a…

#word2vec #representation learning #embeddings

🕐 8 months ago

Read →

🐻 Research Berkeley AI Research 7 min read

Whole-Body Conditioned Egocentric Video Prediction

× Predicting Ego-centric Video from human Actions (PEVA) . Given past video frames and an action specifying a desired change in 3D pose, PEVA predicts the next video frame. Our…

#video prediction #egocentric vision #embodied ai

🕐 10 months ago

Read →

🐻 Research Berkeley AI Research 4 min read

Defending against Prompt Injection with Structured Queries (StruQ) and Preference Optimization (SecAlign)

Recent advances in Large Language Models (LLMs) enable exciting LLM-integrated applications. However, as LLMs have improved, so have the attacks against them. Prompt injection attack is listed as the #1…

#prompt injection #llm security #fine-tuning

🕐 1 year, 25 days ago

Read →

🐻 Research Berkeley AI Research 6 min read

Repurposing Protein Folding Models for Generation with Latent Diffusion

PLAID is a multimodal generative model that simultaneously generates protein 1D sequence and 3D structure, by learning the latent space of protein folding models. The awarding of the 2024 Nobel…

#protein-generation #diffusion-models #protein-folding

🕐 1 year, 28 days ago

Read →

🐻 Research Berkeley AI Research 9 min read

Scaling Up Reinforcement Learning for Traffic Smoothing: A 100-AV Highway Deployment

Training Diffusion Models with Reinforcement Learning We deployed 100 reinforcement learning (RL)-controlled cars into rush-hour highway traffic to smooth congestion and reduce fuel consumption for everyone. Our goal is to…

#reinforcement learning #autonomous vehicles #traffic optimization

🕐 1 year, 1 month ago

Read →

🐻 Research Berkeley AI Research 5 min read

Virtual Personas for Language Models via an Anthology of Backstories

We introduce Anthology , a method for conditioning LLMs to representative, consistent, and diverse virtual personas by generating and utilizing naturalistic backstories with rich details of individual values and experience.…

#language models #virtual personas #prompt conditioning

🕐 1 year, 5 months ago

Read →

Berkeley AI Research Articles · DeepTrendLab