Go back to feed

Chain of Thought Monitorability

Good read on model safety, but doesn't feel easy when you put it side by side with Anthropic’s report on CoT faithfulness —not just because CoT monitorability is fragile, but also because efforts to make CoT more faithful didn’t really move the needle. And then there’s Coconut (continuous latent space reasoning), which doesn’t give human-readable CoT at all. Seems like some reductionist approaches—like the deeper behavioral analysis Goodfire does—are still essential

Featured Mini-post

[
]

11 early stage Robotics Companies

Our research

Citrini on AI, Robotics & Healthcare

Our research

Understanding MCP from a Monad Perspective

Our research

Chinese AI, Capitalism, and an American DeepSeek

Our research

Chain of Thought Monitorability

Our research

Welcome to the Intelligence Feed

Announcement
See all mini-posts