Overview
Guardian API provides real-time content moderation for platforms handling user-generated content. Whether you’re running a social network, forum, chat application, or content platform, Guardian API helps maintain safe and healthy communities.Common Use Cases
Social Media Platforms
Real-Time Comment Moderation
Moderate user comments and posts before they go live or flag them for review.Implementation:
- Moderate all comments in real-time before posting
- Auto-block highly harmful content
- Queue moderate content for human review
- Allow safe content immediately
Chat Applications
Real-Time Message Filtering
Filter messages in real-time to prevent harassment and abuse.Features:
- Instant message analysis (20-40ms)
- Multi-category detection
- Critical issue alerts (threats, self-harm)
- Batch processing for message history
Content Publishing Platforms
Pre-Publication Screening
Screen articles, reviews, and user submissions before publication.Workflow:
- User submits content
- Guardian API analyzes text
- Safe content → Auto-approve
- Flagged content → Human review
- Highly harmful → Auto-reject
Integration Patterns
Pattern 1: Synchronous Blocking
Block content in real-time before it’s posted.- High-risk platforms
- Strict moderation policies
- Critical safety requirements
Pattern 2: Asynchronous Review
Post content immediately, flag for review.- User experience priority
- Low false-positive tolerance
- High volume platforms
Pattern 3: Tiered Response
Different actions based on severity.- Nuanced moderation needs
- Multiple content types
- Large moderation teams
Batch Processing
For analyzing existing content or bulk imports:Best Practices
Set Clear Thresholds
Set Clear Thresholds
Define clear thresholds for auto-block, review, and approve actions:
Provide User Feedback
Provide User Feedback
When blocking content, explain why:
Monitor False Positives
Monitor False Positives
Track false positives and adjust thresholds:
Combine with Human Moderation
Combine with Human Moderation
Use Guardian API as a first pass, not a replacement for human moderators:
- Auto-approve clearly safe content
- Auto-block clearly harmful content
- Queue everything else for human review
Performance Considerations
Response Time
Single Request: 20-40ms
Batch (100 texts): 500-800msUse batch processing when possible
Rate Limiting
Configure rate limits based on your traffic:
- Low traffic: 100 req/min
- Medium: 500 req/min
- High: Custom scaling