Article

Faulty reward functions in the wild

OpenAI·2016-12-21

Reinforcement learning algorithms can break in surprising, counterintuitive ways. In this post we’ll explore one failure mode: a misspecified reward function.

Article

2025-08-07

GPT-5 System Card

OpenAI

This GPT-5 system card explains how a unified model routing system powers fast and smart responses using gpt-5-main, gpt-5-thinking, and lightweight versions like gpt-5-thinking-nano, optimized for different tasks and developer use.

Article

2026-03-29