Orion-NLP-v3
Deployment dpl-g7h8i9
Scaling
https://nexus.trinidy.cloud/v1/orion-nlp-v3-staging/inference
Requests 24h
12.4K
Avg Latency
55ms
Uptime
99.8%
Region
us-west-2
Instance
nexus.gpu.a10g
Cost/hr
$2.10
Total Requests
202.9K
Total Tokens
56.8M
Total Errors
102
0.050% error rate
Cost (24h)
$50.40
Charts: Requests per Hour, Average Latency (ms), Errors per Hour, Tokens Generated