/submit/batch | POST | Submit a batch inference job. |
/test/placement | POST | Run the placement solver only; nothing is launched. |
/jobs | GET | List all jobs. |
/job/{id} | GET | Job status and progress. |
/job/{id}/phase | POST | Update job lifecycle phase. |
/job/{id}/metrics | GET | Latest aggregated metrics snapshot. |
/job/{id}/metrics/stream | GET | SSE metrics stream. |
/job/{id}/metrics/ingest | POST | Ingest metrics pushed by replica sidecars. |
/job/{id}/metrics/summary | POST | Per-replica build metrics summary. |
/job/{id}/throughput | GET | Sustained throughput over the rolling window. |
/job/{id}/replicas | GET | Per-replica state, phase, region, and metrics availability. |
/job/{id}/replicas/{rid}/metrics | GET | Metrics for a specific replica. |
/job/{id}/replicas/summaries | GET | Per-replica completion summaries. |
/job/{id}/scale | POST | Add replicas to a running job. |
/job/{id}/kill | POST | Kill specific replicas. |
/job/{id}/swap | POST | Hot-swap replicas to a new GPU configuration. |
/job/{id}/chunks/progress | GET | Chunk-level progress. |
/job/{id}/chunks/pull | POST | Pull the next chunk. Replica-facing. |
/job/{id}/chunks/complete | POST | Mark a chunk complete. |
/job/{id}/chunks/renew | POST | Renew a chunk lease. |
/dashboard | GET | Web dashboard HTML. |
/dashboard/poll | GET | Dashboard JSON payload for polling fallback. |
/dashboard/stream | GET | Real-time dashboard SSE stream. |
/analytics/runs | GET | List completed runs. |
/analytics/runs/{id} | GET | Full completed run report. |
/analytics/runs/{id}/timeseries | GET | Scheduler timeseries for a run. |
/quota/status | GET | Quota usage across AWS regions. |
/resources | GET | Instance catalog and quota pools. |
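
As a rough illustration of how the path templates in the table might be used from a client, the sketch below fills in `{id}` and `{rid}` placeholders and builds (but does not send) an HTTP request. The base URL `http://localhost:8000`, the helper name `build_request`, and the JSON body handling are assumptions made for this example; the table does not specify the host, authentication, or request/response schemas.

```python
import json
from urllib.parse import quote
from urllib.request import Request

# Assumption: the service address is not given in the table above.
BASE_URL = "http://localhost:8000"


def build_request(method, path_template, body=None, base_url=BASE_URL, **params):
    """Build a urllib Request for one of the endpoints listed above.

    `path_template` is a path from the table, e.g. "/job/{id}/throughput".
    Keyword arguments fill the {placeholders}. `body`, if given, is sent
    as JSON (hypothetical: the real payload schemas are not documented here).
    """
    path = path_template
    for name, value in params.items():
        # Percent-encode each value so it is safe as a single path segment.
        path = path.replace("{" + name + "}", quote(str(value), safe=""))
    if "{" in path:
        raise ValueError(f"unfilled placeholder in path: {path}")

    data = None
    headers = {}
    if body is not None:
        data = json.dumps(body).encode("utf-8")
        headers["Content-Type"] = "application/json"

    return Request(base_url.rstrip("/") + path, data=data,
                   headers=headers, method=method)


# Example: build a request for the rolling-window throughput of a job.
# "job-123" is a made-up job identifier.
req = build_request("GET", "/job/{id}/throughput", id="job-123")
print(req.get_method(), req.full_url)
```

Passing the returned object to `urllib.request.urlopen` would issue the call against a running service. Keeping construction separate from sending makes the URL handling easy to check without network access.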