Best AI papers explained

A podcast by Enoch H. Kang

Categories:

153 Episodes

  1. Visual Chain-of-Thought Reasoning for Vision-Language-Action Models

    Published: 4/3/2025
  2. On the Biology of a Large Language Model

    Published: 4/1/2025
  3. Async-TB: Asynchronous Trajectory Balance for Scalable LLM RL

    Published: 4/1/2025
  4. Instacart's Economics Team: A Hybrid Role in Tech

    Published: 3/31/2025
  5. Data Mixture Optimization: A Multi-fidelity Multi-scale Bayesian Framework

    Published: 3/31/2025
  6. Why MCP won

    Published: 3/31/2025
  7. SWEET-RL: Training LLM Agents for Collaborative Reasoning

    Published: 3/31/2025
  8. TheoryCoder: Bilevel Planning with Synthesized World Models

    Published: 3/30/2025
  9. Driving Forces in AI: Scaling to 2025 and Beyond (Jason Wei, OpenAI)

    Published: 3/29/2025
  10. Expert Demonstrations for Sequential Decision Making under Heterogeneity

    Published: 3/28/2025
  11. TextGrad: Backpropagating Language Model Feedback for Generative AI Optimization

    Published: 3/27/2025
  12. MemReasoner: Generalizing Language Models on Reasoning-in-a-Haystack Tasks

    Published: 3/27/2025
  13. RAFT: In-Domain Retrieval-Augmented Fine-Tuning for Language Models

    Published: 3/27/2025
  14. Inductive Biases for Exchangeable Sequence Modeling

    Published: 3/26/2025
  15. InverseRLignment: LLM Alignment via Inverse Reinforcement Learning

    Published: 3/26/2025
  16. Prompt-OIRL: Offline Inverse RL for Query-Dependent Prompting

    Published: 3/26/2025
  17. Alignment from Demonstrations for Large Language Models

    Published: 3/25/2025
  18. Q♯: Distributional RL for Optimal LLM Post-Training

    Published: 3/18/2025
  19. Scaling Test-Time Compute Without Verification or RL is Suboptimal

    Published: 3/14/2025
  20. Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning

    Published: 3/14/2025

7 / 8

Men know other men best. Women know other women best. And yes, perhaps AIs know other AIs best. AI explains what you should know about this week's AI research progress.