Orion-NLP-v3

Deployment dpl-g7h8i9

Orion-NLP-v3

Scaling
https://nexus.trinidy.cloud/v1/orion-nlp-v3-staging/inference
Requests 24h

12.4K

Avg Latency

55ms

Uptime

99.8%

Region

us-west-2

Instance

nexus.gpu.a10g

Cost/hr

$2.10

Total Requests

202.9K

Total Tokens

56.8M

Total Errors

102

0.050% error rate

Cost (24h)

$50.40

Requests per Hour

Average Latency (ms)

Errors per Hour

Tokens Generated