Orion-NLP-v3
Deployment dpl-g7h8i9
Orion-NLP-v3
Scalinghttps://nexus.trinidy.cloud/v1/orion-nlp-v3-staging/inferenceRequests 24h
12.4K
Avg Latency
55ms
Uptime
99.8%
Region
us-west-2
Instance
nexus.gpu.a10g
Cost/hr
$2.10
Total Requests
203.1K
Total Tokens
56.9M
Total Errors
94
0.046% error rate
Cost (24h)
$50.40
Requests per Hour
Average Latency (ms)
Errors per Hour
Tokens Generated