Text Generation Inference
Framework · Hugging Face's production LLM inference server
Hugging Face · 156.0K installs · 4.4 (828)
Trusted: 88
Security: 86
Quality: 91
Maintenance: 90
Safety Tier: Low Risk
Security Scan: Scan Passed
Price: Free
About
Production-ready inference server for large language models by Hugging Face. Features continuous batching, tensor parallelism, quantization, and watermarking. Optimized for deploying Hugging Face models at scale with gRPC and REST APIs.
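As a rough illustration of the REST side mentioned above, the following minimal sketch builds and sends a request to the server's `/generate` route using only the Python standard library. The route and the `inputs` / `parameters.max_new_tokens` / `generated_text` fields follow Text Generation Inference's REST API; the helper names `build_generate_request` and `generate`, the default values, and the base URL are assumptions made for this example, and the gRPC interface is not covered.

```python
import json
import urllib.request


def build_generate_request(base_url, prompt, max_new_tokens=20):
    """Build (but do not send) a POST request for the /generate route.

    base_url is an assumption: wherever your TGI server is listening.
    """
    payload = {
        "inputs": prompt,
        "parameters": {"max_new_tokens": max_new_tokens},
    }
    return urllib.request.Request(
        url=base_url.rstrip("/") + "/generate",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(base_url, prompt, max_new_tokens=20, timeout=30.0):
    """Send the request and return the generated text from the JSON reply."""
    request = build_generate_request(base_url, prompt, max_new_tokens)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        body = json.loads(response.read().decode("utf-8"))
    return body["generated_text"]
```

With a server assumed to be running locally on port 8080, a call might look like `generate("http://localhost:8080", "What is deep learning?")`. Keeping request construction separate from sending makes the payload easy to inspect without a live server.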
Tags
Categories
Infrastructure · Cloud
Security Scan
86/100
9 checks · 9 passed · 0 findings · 5/13/2026
0 files scanned from repository
Related Frameworks
CrewAI
Multi-agent orchestration framework
Framework · Free · Scanned
234.5K · 4.4 (1.6K) · CrewAI
Python · Any LLM
LangGraph
Stateful multi-actor agent framework
Framework · Free · Scanned
198.7K · 4.5 (1.2K) · LangChain
Python · JavaScript · Any LLM
OpenAI Swarm
Lightweight multi-agent orchestration
Framework · Free · Scanned
98.4K · 4.2 (567) · OpenAI
Python · OpenAI