Article

How confessions can keep language models honest

OpenAI·2025-12-03

OpenAI researchers are testing “confessions,” a method that trains models to admit when they make mistakes or act undesirably, helping improve AI honesty, transparency, and trust in model outputs.

Open original

Topics

AI Search

Article

2025-08-07

GPT-5 System Card

OpenAI

This GPT-5 system card explains how a unified model routing system powers fast and smart responses using gpt-5-main, gpt-5-thinking, and lightweight versions like gpt-5-thinking-nano, optimized for different tasks and developer use.

Article

2026-03-29