Perspective API
Also known as: Google Perspective API, Jigsaw Perspective, Perspective Toxicity Scorer
Perspective API is a free toxicity-detection service developed by Google’s Jigsaw that assigns a probability score between 0 and 1 to indicate how likely a comment is to be perceived as toxic or harmful.
What It Is
Online conversations break down when toxic comments go unchecked. Moderating millions of comments manually is not practical, and simple keyword filters miss sarcasm, context, and coded language. Perspective API was built to solve this problem: give platforms a way to automatically assess how toxic a piece of text is likely to be, so moderators can focus their attention where it matters most.
Developed by Google’s Jigsaw unit and launched in 2017, Perspective API works by sending a text string to a web API endpoint and receiving back a score between 0 and 1 for attributes like TOXICITY. A score of 0.9 means the model is highly confident the comment would be perceived as toxic by readers. A score of 0.2 suggests it is unlikely to be harmful. Think of it like a thermometer for hostility — it does not decide whether to remove a comment, but it tells you how heated things are getting.
The primary attribute is TOXICITY, but the API also scores related dimensions such as SEVERE_TOXICITY, IDENTITY_ATTACK, INSULT, PROFANITY, and THREAT. Each attribute returns its own independent score, allowing platforms to set different thresholds for different types of harm. A news site might tolerate strong opinions (moderate TOXICITY) but enforce zero tolerance on IDENTITY_ATTACK, for example.
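The request/response mechanics can be sketched as follows. This is a minimal sketch assuming the documented `v1alpha1` `comments:analyze` request shape (a `comment.text` field plus one empty config object per requested attribute) and the nested `attributeScores` response structure; the sample response values here are invented for illustration, not real API output.

```python
# Sketch of the request body and response parsing for Perspective API's
# comments:analyze endpoint. The endpoint URL and JSON shapes follow the
# public documentation; the sample response below is illustrative only.

ANALYZE_URL = "https://commentanalyzer.googleapis.com/v1alpha1/comments:analyze"

def build_request(text, attributes=("TOXICITY",)):
    """Build the JSON body: the comment text plus one empty config
    object per requested attribute (TOXICITY, IDENTITY_ATTACK, ...)."""
    return {
        "comment": {"text": text},
        "requestedAttributes": {attr: {} for attr in attributes},
    }

def extract_scores(response):
    """Pull each attribute's summary probability (0 to 1) out of the
    nested attributeScores structure."""
    return {
        attr: data["summaryScore"]["value"]
        for attr, data in response["attributeScores"].items()
    }

# Illustrative response with the documented nesting -- not real output:
sample_response = {
    "attributeScores": {
        "TOXICITY": {"summaryScore": {"value": 0.82, "type": "PROBABILITY"}},
        "IDENTITY_ATTACK": {"summaryScore": {"value": 0.12, "type": "PROBABILITY"}},
    }
}

body = build_request("example comment text", ("TOXICITY", "IDENTITY_ATTACK"))
scores = extract_scores(sample_response)
```

Because each attribute returns an independent score, a caller can apply a different threshold to each one, e.g. flag on `scores["IDENTITY_ATTACK"]` at a much lower cutoff than `scores["TOXICITY"]`.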
Toxicity and safety evaluation systems used by LLM providers today — including safety classifiers and guard models — inherited the same core idea that Perspective API popularized: assign a numeric harm score, then let the application decide what threshold triggers action. The difference is that Perspective API focuses on human-written comments in online forums, while modern guard models evaluate AI-generated text for a broader range of safety concerns including hallucination, bias, and policy violations.
According to the Perspective API site, the service will be sunset on December 31, 2026, with no migration path provided. Teams currently relying on it will need to evaluate alternative toxicity scoring solutions before that date.
How It’s Used in Practice
The most common way teams encounter Perspective API is through content moderation workflows on platforms that host user-generated comments. A platform sends each new comment to the API, receives a toxicity score, and then applies rules: scores above a high threshold get automatically flagged for review, scores in a middle range get queued for human moderators, and low scores pass through. This lets small moderation teams handle large comment volumes without reading every single message.
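The three-tier rule described above can be sketched as a single routing function. The 0.9 and 0.6 cutoffs here are illustrative assumptions; each platform tunes its own thresholds against its own content.

```python
# Sketch of threshold-based routing: auto-flag high scores, queue the
# middle band for human moderators, publish low scores. Thresholds are
# example values, not recommendations.

def route_comment(toxicity_score, flag_above=0.9, review_above=0.6):
    if toxicity_score >= flag_above:
        return "auto-flag"
    if toxicity_score >= review_above:
        return "human-review"
    return "publish"

routed = [route_comment(s) for s in (0.95, 0.7, 0.2)]
```

The point of the middle band is economic: human attention is spent only where the model is uncertain, which is what lets a small team keep up with a large comment volume.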
News publishers, forum operators, and social platforms have been the primary adopters. At its peak, the service processed hundreds of millions of requests daily across over a thousand platforms, according to PR Newswire. The API is also used in research contexts — teams studying online harassment or building toxicity benchmarks like ToxiGen often use Perspective scores as a baseline comparison for their own models.
Pro Tip: If you are evaluating toxicity scoring for your platform, test Perspective API against your specific content before committing. Scores vary across languages, cultural contexts, and topic domains. A comment that scores 0.3 on a tech forum might score 0.7 on a political news site because of different baseline expectations in the training data.
When to Use / When Not
| Scenario | Use | Avoid |
|---|---|---|
| Pre-screening user comments on a public forum before human review | ✅ | |
| Building a toxicity benchmark to compare against other safety classifiers | ✅ | |
| Detecting nuanced misinformation or factual inaccuracies in AI outputs | | ❌ |
| Quick prototype for a moderation workflow before building a custom model | ✅ | |
| Scoring toxicity in languages with limited training data coverage | | ❌ |
| Production moderation system that needs to operate beyond December 2026 | | ❌ |
Common Misconception
Myth: Perspective API can tell you whether a comment should be removed. Reality: The API returns a probability score, not a verdict. A score of 0.85 means the model predicts most people would find the comment toxic — it does not mean the comment violates any specific policy. The decision about what score threshold triggers removal, flagging, or no action belongs entirely to the platform. Two sites using the same API can make opposite moderation decisions on the same comment.
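The score-versus-verdict distinction can be made concrete with a toy example. The site names and thresholds below are invented: the same 0.85 score yields opposite decisions purely because the policy lives in the caller, not in the API.

```python
# Two hypothetical platforms apply different removal thresholds to the
# same Perspective score and reach opposite decisions. The API supplies
# the probability; the verdict is entirely the platform's policy.

SITE_POLICIES = {
    "family-forum": 0.6,   # strict: remove anything likely toxic
    "debate-board": 0.95,  # permissive: remove only near-certain toxicity
}

def decide(site, score):
    return "remove" if score >= SITE_POLICIES[site] else "keep"

decisions = {site: decide(site, 0.85) for site in SITE_POLICIES}
```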
One Sentence to Remember
Perspective API proved that automated toxicity scoring could work at scale, and its core pattern — score the harm, let the app decide the threshold — is now the standard approach in modern guard models and safety classifiers that evaluate harmful AI outputs.
FAQ
Q: Is Perspective API free to use? A: Yes, the API is free to use. You need a Google Cloud API key to access it, and standard usage operates within default rate limits with higher quotas available on request.
Q: What happens after Perspective API shuts down in December 2026? A: Teams will need to migrate to alternative toxicity scoring solutions. Google has not announced an official successor, so options include commercial moderation APIs, open-source safety classifiers, or custom-trained models.
Q: How does Perspective API relate to guard models used by LLM providers? A: Both follow the same pattern: score text on a numeric harm scale and let the application set the action threshold. Guard models extend this to evaluate AI-generated outputs across a wider range of safety dimensions beyond comment toxicity.
Sources
- Perspective API: Perspective API Official Site - Official documentation, attribute definitions, and API reference from Google’s Jigsaw
- Perspective API GitHub: conversationai/perspectiveapi - Open-source documentation, API reference, and attribute definitions
Expert Takes
Perspective API is a set of independent binary classifiers, each outputting calibrated probability estimates for a single toxicity attribute. The TOXICITY score and INSULT score are not derived from a shared latent variable — combining them without understanding their independence assumptions leads to unreliable composite risk metrics. The underlying architecture predates transformer-based approaches, which partly explains why newer safety classifiers trained on adversarial examples outperform it on edge cases like coded language and irony.
If you are building a moderation pipeline today, treat Perspective API as a reference implementation rather than a production dependency. The pattern is worth copying: receive text, return attribute scores, let the calling application own the threshold logic. But with the service going offline, your integration layer should use an adapter that can swap in a different scoring backend without rewriting your moderation rules. Design the interface around the score-and-threshold contract, not around this specific endpoint.
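The adapter suggested above can be sketched with a structural interface: moderation rules depend only on a score-and-threshold contract, so the Perspective backend can be swapped for another scorer without touching the rules. All names here are hypothetical, and the keyword backend is a deliberately crude stand-in, not a real classifier.

```python
# Sketch of the score-and-threshold contract: the rule layer never sees
# which backend produced the score, so backends are interchangeable.

from typing import Protocol

class ToxicityScorer(Protocol):
    """Any backend that maps text to a 0-1 toxicity probability."""
    def score(self, text: str) -> float: ...

class KeywordScorer:
    """Stand-in backend: a crude keyword heuristic in place of a real
    API call, used only to show the interface."""
    BAD_WORDS = {"idiot", "moron"}

    def score(self, text: str) -> float:
        words = set(text.lower().split())
        return 0.9 if words & self.BAD_WORDS else 0.1

def should_flag(scorer: ToxicityScorer, text: str, threshold: float = 0.8) -> bool:
    # Threshold logic stays in the application, per the contract above.
    return scorer.score(text) >= threshold

flagged = should_flag(KeywordScorer(), "what an idiot")
```

When the scoring service changes, only a new class implementing `score` is added; every threshold and routing rule built on `should_flag` survives unchanged.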
Perspective API proved market demand for toxicity scoring as a service and set the baseline expectation that platforms should have automated safety checks. Its shutdown signals that standalone toxicity scoring is becoming a feature inside larger AI safety stacks rather than a separate product category. Teams that treated it as their only safety layer are about to learn the cost of single-vendor dependency the hard way.
When one company defines what counts as toxic for over a thousand platforms, it is worth asking whose norms are encoded in those scores. Perspective API was trained primarily on English-language crowdsourced annotations, and its toxicity thresholds inevitably reflected the cultural assumptions of those annotators. As the service disappears, the question persists for every successor: who decides the boundary between acceptable and harmful, and who audits that decision?