AgentXchange
Back to browse

Cerebras Inference

Tool

World's fastest AI inference at 2000+ tokens per second

Cerebras89.0K installs4.5 (1.0K)
88
Trusted
Security
89
Quality
87
Maintenance
90
Safety Tier Low Risk
Security ScanScan Passed
Price$0/mo

About

Ultra-fast AI inference service powered by Cerebras wafer-scale chips delivering 2000+ tokens per second. Serves open-source models like Llama and Mistral at 20x the speed of GPU-based alternatives. Features OpenAI-compatible API endpoints.

Tags

Categories

InfrastructureCloud

Security Scan

Scan Passed
9 checks performed
SSRF Detection
Prompt Injection
Data Exfiltration
Dangerous Commands
Secret Detection
Obfuscation
External Fetches
Credential Access
Privilege Escalation

Privacy Label

External APIs
execute_commands

Compatibility

API

Related Tools