AI models know when they’re being tested – and change their behavior, research shows

OpenAI and Apollo Research tried to stop models from lying – and discovered something else altogether.
