ThinkingNews

TechMeme May 6, 2026

Study: using weaker AI models to supervise a more capable model could prevent the stronger model from deliberately underperforming on benchmarks and evaluations (Emil Ryd/@emilaryd)