Best AI papers explained

A podcast by Enoch H. Kang

544 Episodes

The Era of Real-World Human Interaction: RL from User Conversations
Published: 10/24/2025
Agent Learning via Early Experience
Published: 10/24/2025
Demystifying the Mechanisms Behind Emergent Exploration in Goal-conditioned RL
Published: 10/22/2025
Rewriting History: A Recipe for Interventional Analyses to Study Data Effects on Model Behavior
Published: 10/22/2025
A Definition of AGI
Published: 10/22/2025
Provably Learning from Language Feedback
Published: 10/21/2025
In-Context Learning for Pure Exploration
Published: 10/21/2025
On the Role of Preference Variance in Preference Optimization
Published: 10/20/2025
Training LLM Agents to Empower Humans
Published: 10/20/2025
Richard Sutton Declares LLMs a Dead End
Published: 10/20/2025
Demystifying Reinforcement Learning in Agentic Reasoning
Published: 10/19/2025
Emergent coordination in multi-agent language models
Published: 10/19/2025
Learning-to-measure: in-context active feature acquisition
Published: 10/19/2025
Andrej Karpathy's insights: AGI, Intelligence, and Evolution
Published: 10/19/2025
Front-Loading Reasoning: The Synergy between Pretraining and Post-Training Data
Published: 10/18/2025
Representation-Based Exploration for Language Models: From Test-Time to Post-Training
Published: 10/18/2025
The attacker moves second: stronger adaptive attacks bypass defenses against LLM jail- Breaks and prompt injections
Published: 10/18/2025
When can in-context learning generalize out of task distribution?
Published: 10/16/2025
The Art of Scaling Reinforcement Learning Compute for LLMs
Published: 10/16/2025
A small number of samples can poison LLMs of any size
Published: 10/16/2025

3 / 28

Cut through the noise. We curate and break down the most important AI papers so you don’t have to.

544 Episodes

The Era of Real-World Human Interaction: RL from User Conversations

Agent Learning via Early Experience

Demystifying the Mechanisms Behind Emergent Exploration in Goal-conditioned RL

Rewriting History: A Recipe for Interventional Analyses to Study Data Effects on Model Behavior

A Definition of AGI

Provably Learning from Language Feedback

In-Context Learning for Pure Exploration

On the Role of Preference Variance in Preference Optimization

Training LLM Agents to Empower Humans

Richard Sutton Declares LLMs a Dead End

Demystifying Reinforcement Learning in Agentic Reasoning

Emergent coordination in multi-agent language models

Learning-to-measure: in-context active feature acquisition

Andrej Karpathy's insights: AGI, Intelligence, and Evolution

Front-Loading Reasoning: The Synergy between Pretraining and Post-Training Data

Representation-Based Exploration for Language Models: From Test-Time to Post-Training

The attacker moves second: stronger adaptive attacks bypass defenses against LLM jail- Breaks and prompt injections

When can in-context learning generalize out of task distribution?

The Art of Scaling Reinforcement Learning Compute for LLMs

A small number of samples can poison LLMs of any size