The Q* hypothesis: Tree-of-thoughts reasoning, process reward models, and supercharging synthetic data

Brian Christian, The Alignment Problem

Prometheus, Using Consensus Mechanisms as an approach to Alignment (LessWrong)
