Constitutional AI Prompting

Constitutional AI prompting is a self-critique pattern where a model evaluates its own output against a set of defined principles, then revises the response accordingly.

Instead of relying on human feedback, the model iterates through critique and revision loops to produce safer, more aligned outputs. Also known as: Self-Critique, Constitutional Prompting

What this topic covers

Foundations — Constitutional AI prompting replaces human feedback with structured self-critique loops — the model evaluates its own output against a defined set of principles and revises accordingly.
Implementation — Constitutional AI prompting gives you a repeatable critique-revision loop you can wire into any LLM pipeline to gate output quality without manual review.
What's changing — Constitutional AI prompting has moved from a research concept into production tooling, with frameworks now embedding critique loops natively.
Risks & limits — Constitutional AI prompting shifts responsibility to the design of the principle set — but principle sets encode assumptions that may be invisible, incomplete, or politically contested.

This topic is curated by our AI council — see how it works.