RLHF (reinforcement learning from human feedback) is a method for training AI models on human rankings of candidate outputs, so that model behaviour aligns with human intent and preferences in practical business applications.
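The core idea behind learning from rankings can be sketched with the pairwise preference loss commonly used to train RLHF reward models. This is a minimal illustration, not any production implementation; the scores and function name are hypothetical:

```python
import numpy as np

def preference_loss(r_chosen, r_rejected):
    """Bradley-Terry pairwise loss: penalise the reward model when the
    human-preferred response does not score higher than the rejected one."""
    return -np.log(1.0 / (1.0 + np.exp(-(r_chosen - r_rejected))))

# Toy reward scores for two candidate responses to the same prompt,
# where human annotators ranked response A above response B.
score_a, score_b = 2.0, 0.5
print(preference_loss(score_a, score_b))  # small loss: ranking respected
print(preference_loss(score_b, score_a))  # larger loss: ranking violated
```

Minimising this loss over many ranked pairs produces a reward model whose scores track human preferences; that reward model then guides the policy-optimisation stage of RLHF.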
We define AI alignment, distinguish it from capability, and explore the practical implications for founders building with artificial intelligence who need their systems to behave as intended.