Best AI papers explained

A podcast by Enoch H. Kang

550 Episodes

Uncovering Causal Hierarchies in Language Model Capabilities
Published: 6/17/2025
Generalization or Hallucination? Understanding Out-of-Context Reasoning in Transformers
Published: 6/17/2025
Improving Treatment Effect Estimation with LLM-Based Data Augmentation
Published: 6/17/2025
LLM Numerical Prediction Without Auto-Regression
Published: 6/17/2025
Self-Adapting Language Models
Published: 6/17/2025
Why in-context learning models are good few-shot learners?
Published: 6/17/2025
Take Caution in Using LLMs as Human Surrogates: Scylla Ex Machina∗
Published: 6/14/2025
The Logic of Machines: The AI Reasoning Debate
Published: 6/12/2025
Layer by Layer: Uncovering Hidden Representations in Language Models
Published: 6/12/2025
Causal Attribution Analysis for Continuous Outcomes
Published: 6/12/2025
Training a Generally Curious Agent
Published: 6/12/2025
Estimation of Treatment Effects Under Nonstationarity via Truncated Difference-in-Q’s
Published: 6/12/2025
Strategy Coopetition Explains the Emergence and Transience of In-Context Learning
Published: 6/12/2025
Emergent Misalignment: Narrow finetuning can produce broadly misaligned LLMs
Published: 6/11/2025
Agentic Supernet for Multi-agent Architecture Search
Published: 6/11/2025
Sample Complexity and Representation Ability of Test-time Scaling Paradigms
Published: 6/11/2025
Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators
Published: 6/10/2025
LLMs Get Lost In Multi-Turn Conversation
Published: 6/9/2025
PromptPex: Automatic Test Generation for Prompts
Published: 6/8/2025
General Agents Need World Models
Published: 6/8/2025

11 / 28

Cut through the noise. We curate and break down the most important AI papers so you don’t have to.

550 Episodes

Uncovering Causal Hierarchies in Language Model Capabilities

Generalization or Hallucination? Understanding Out-of-Context Reasoning in Transformers

Improving Treatment Effect Estimation with LLM-Based Data Augmentation

LLM Numerical Prediction Without Auto-Regression

Self-Adapting Language Models

Why in-context learning models are good few-shot learners?

Take Caution in Using LLMs as Human Surrogates: Scylla Ex Machina∗

The Logic of Machines: The AI Reasoning Debate

Layer by Layer: Uncovering Hidden Representations in Language Models

Causal Attribution Analysis for Continuous Outcomes

Training a Generally Curious Agent

Estimation of Treatment Effects Under Nonstationarity via Truncated Difference-in-Q’s

Strategy Coopetition Explains the Emergence and Transience of In-Context Learning

Emergent Misalignment: Narrow finetuning can produce broadly misaligned LLMs

Agentic Supernet for Multi-agent Architecture Search

Sample Complexity and Representation Ability of Test-time Scaling Paradigms

Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators

LLMs Get Lost In Multi-Turn Conversation

PromptPex: Automatic Test Generation for Prompts

General Agents Need World Models