Best AI papers explained
A podcast by Enoch H. Kang
430 Episodes
-
Trust or Escalate: LLM Judges with Provable Guarantees for Human Agreement
Published: 4/24/2025 -
Iterative Nash Policy Optimization for Language Model Alignment
Published: 4/24/2025 -
SycEval: Benchmarking LLM Sycophancy in Mathematics and Medicine
Published: 4/23/2025 -
Stack AI: Democratizing Enterprise AI Development
Published: 4/22/2025 -
Evaluating Modern Recommender Systems: Challenges and Future Directions
Published: 4/22/2025 -
AI in the Enterprise: Seven Lessons from Frontier Companies by OpenAI
Published: 4/22/2025 -
Discussion: Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?
Published: 4/21/2025 -
AI Agent Protocols and Human Preference
Published: 4/21/2025 -
Cross-Environment Cooperation for Zero-Shot Multi-Agent Coordination
Published: 4/20/2025 -
Sutton and Silver: The Era of Experience: Learning Beyond Human Data
Published: 4/19/2025 -
Sample, Don't Search: Rethinking Test-Time Alignment for Language Models
Published: 4/19/2025 -
AI Agents: Echoes of Past Technology Pivots?
Published: 4/19/2025 -
Minimalist LLM Reasoning: Rejection Sampling to Reinforcement
Published: 4/19/2025 -
Securing the Model Context Protocol in Enterprise Environments
Published: 4/19/2025 -
Improving Multi-Turn Tool Use with Reinforcement Learning
Published: 4/19/2025 -
Cultural Knowledge Conservation and Control in Large Language Models
Published: 4/19/2025 -
Data Quality, Repetition, and Scaling of Language Models
Published: 4/18/2025 -
Compute-Optimal Scaling Laws for Language Models Revisited
Published: 4/18/2025 -
Concise Reasoning via Reinforcement Learning
Published: 4/18/2025 -
Throughput Limits for LLM Inference and AI Agent Scheduling
Published: 4/14/2025
Cut through the noise. We curate and break down the most important AI papers so you don’t have to.