Orion-NLP-v3
Deployment dpl-g7h8i9
Orion-NLP-v3
Scalinghttps://nexus.trinidy.cloud/v1/orion-nlp-v3-staging/inferenceRequests 24h
12.4K
Avg Latency
55ms
Uptime
99.8%
Region
us-west-2
Instance
nexus.gpu.a10g
Cost/hr
$2.10
Total Requests
209.8K
Total Tokens
58.7M
Total Errors
102
0.049% error rate
Cost (24h)
$50.40
Requests per Hour
Average Latency (ms)
Errors per Hour
Tokens Generated