Best AI papers explained

A podcast by Enoch H. Kang

544 Episodes

GST-UNet: A Neural Framework for Spatiotemporal Causal Inference with Time-Varying Confounding
Published: 11/5/2025
Beyond a million tokens: benchmarking and enhancing long-term memory in llms
Published: 11/4/2025
Agentic Economic Modeling
Published: 11/3/2025
Emergent Introspective Awareness in Large Language Models
Published: 11/3/2025
Can Large reasoning models self-train?
Published: 11/1/2025
ALITA-G: Self-Evolving Generative Agent for Agent Generation
Published: 11/1/2025
Self-improving LLM agents at test-time
Published: 10/30/2025
Offline RL by Reward-Weighted Fine-Tuning for Conversation Optimization
Published: 10/30/2025
Language models are injective and hence invertible
Published: 10/30/2025
ReasoningBank: Scaling Agent Self-Evolving with Reasoning Memory
Published: 10/29/2025
RLAD: Training LLMs to Discover Abstractions
Published: 10/29/2025
How to Train Your Advisor: Steering Black-Box LLMs with ADVISOR MODELS
Published: 10/29/2025
Self-improving LLM agents at Test-Time
Published: 10/27/2025
KL-Regularized Reinforcement Learning is designed to Mode Collapse
Published: 10/27/2025
How do LLMs use their depth?
Published: 10/27/2025
Thought Communication in Multiagent Collaboration
Published: 10/27/2025
Reasoning with Sampling: Base Models Outperform RL
Published: 10/26/2025
Continual Learning via Sparse Memory Finetuning
Published: 10/26/2025
Direct Preference Optimization with Unobserved Preference Heterogeneity: The Necessity of Ternary Preferences
Published: 10/24/2025
The Coverage Principle: How Pre-Training Enables Post-Training
Published: 10/24/2025

2 / 28

Cut through the noise. We curate and break down the most important AI papers so you don’t have to.

544 Episodes

GST-UNet: A Neural Framework for Spatiotemporal Causal Inference with Time-Varying Confounding

Beyond a million tokens: benchmarking and enhancing long-term memory in llms

Agentic Economic Modeling

Emergent Introspective Awareness in Large Language Models

Can Large reasoning models self-train?

ALITA-G: Self-Evolving Generative Agent for Agent Generation

Self-improving LLM agents at test-time

Offline RL by Reward-Weighted Fine-Tuning for Conversation Optimization

Language models are injective and hence invertible

ReasoningBank: Scaling Agent Self-Evolving with Reasoning Memory

RLAD: Training LLMs to Discover Abstractions

How to Train Your Advisor: Steering Black-Box LLMs with ADVISOR MODELS

Self-improving LLM agents at Test-Time

KL-Regularized Reinforcement Learning is designed to Mode Collapse

How do LLMs use their depth?

Thought Communication in Multiagent Collaboration

Reasoning with Sampling: Base Models Outperform RL

Continual Learning via Sparse Memory Finetuning

Direct Preference Optimization with Unobserved Preference Heterogeneity: The Necessity of Ternary Preferences

The Coverage Principle: How Pre-Training Enables Post-Training