The Tandemn System control plane exposes a REST API at http://localhost:26336 by default. Most users should use the CLI; use the REST API for integrations, dashboards, and operational tooling.
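For integrations, a thin helper around the base URL is usually enough. The following is a minimal sketch using only the Python standard library; the helper names `build_url` and `get_json` are illustrative, and it assumes the endpoints return JSON bodies, which this page does not state for every route.

```python
import json
import urllib.request

# Default control plane address, as stated above.
BASE_URL = "http://localhost:26336"


def build_url(path: str, base_url: str = BASE_URL) -> str:
    """Join the base URL and an endpoint path with exactly one slash."""
    return base_url.rstrip("/") + "/" + path.lstrip("/")


def get_json(path: str, base_url: str = BASE_URL, timeout: float = 10.0):
    """GET an endpoint and decode the body (assumption: the body is JSON)."""
    with urllib.request.urlopen(build_url(path, base_url), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

With a control plane running, `get_json("/jobs")` would fetch the job list; the shape of the returned data is not assumed here.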
Endpoint reference
| Endpoint | Method | Description |
|---|---|---|
| /submit/batch | POST | Submit a batch inference job. |
| /test/placement | POST | Run solver only without launching. |
| /jobs | GET | List all jobs. |
| /job/{id} | GET | Job status and progress. |
| /job/{id}/phase | POST | Update job lifecycle phase. |
| /job/{id}/metrics | GET | Latest aggregated metrics snapshot. |
| /job/{id}/metrics/stream | GET | SSE metrics stream. |
| /job/{id}/metrics/ingest | POST | Sidecar metrics ingest from replicas. |
| /job/{id}/metrics/summary | POST | Per-replica build metrics summary. |
| /job/{id}/throughput | GET | Sustained throughput over the rolling window. |
| /job/{id}/replicas | GET | Per-replica state, phase, region, and metrics availability. |
| /job/{id}/replicas/{rid}/metrics | GET | Metrics for a specific replica. |
| /job/{id}/replicas/summaries | GET | Per-replica completion summaries. |
| /job/{id}/scale | POST | Add replicas to a running job. |
| /job/{id}/kill | POST | Kill specific replicas. |
| /job/{id}/swap | POST | Hot-swap replicas to a new GPU configuration. |
| /job/{id}/chunks/progress | GET | Chunk-level progress. |
| /job/{id}/chunks/pull | POST | Pull the next chunk. Replica-facing. |
| /job/{id}/chunks/complete | POST | Mark a chunk complete. |
| /job/{id}/chunks/renew | POST | Renew a chunk lease. |
| /dashboard | GET | Web dashboard HTML. |
| /dashboard/poll | GET | Dashboard JSON payload for polling fallback. |
| /dashboard/stream | GET | Real-time dashboard SSE stream. |
| /analytics/runs | GET | List completed runs. |
| /analytics/runs/{id} | GET | Full completed run report. |
| /analytics/runs/{id}/timeseries | GET | Scheduler timeseries for a run. |
| /quota/status | GET | Quota usage across AWS regions. |
| /resources | GET | Instance catalog and quota pools. |
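The two streaming endpoints in the table (/job/{id}/metrics/stream and /dashboard/stream) are described as SSE streams. As a rough sketch of how a client might consume them, the parser below follows the standard Server-Sent Events framing: `data:` lines accumulate until a blank line ends the event. The function names are illustrative, and decoding each payload as JSON is an assumption, since the event format is not documented on this page.

```python
import json
from typing import Any, Iterable, Iterator, List


def iter_sse_data(lines: Iterable[str]) -> Iterator[str]:
    """Yield the data payload of each complete event in an SSE line stream."""
    buffer: List[str] = []
    for raw in lines:
        line = raw.rstrip("\r\n")
        if line == "":
            # A blank line terminates the current event.
            if buffer:
                yield "\n".join(buffer)
                buffer = []
            continue
        if line.startswith(":"):
            continue  # comment or keep-alive line
        field, _, value = line.partition(":")
        if value.startswith(" "):
            value = value[1:]  # SSE drops a single leading space
        if field == "data":
            buffer.append(value)
        # Other fields (event, id, retry) are ignored in this sketch.


def iter_json_events(lines: Iterable[str]) -> Iterator[Any]:
    """Decode each event payload (assumption: payloads are JSON documents)."""
    for payload in iter_sse_data(lines):
        yield json.loads(payload)
```

The `lines` argument can be any iterable of text lines, for example an HTTP response body decoded line by line. An event that is still incomplete when the stream ends is discarded, which matches the SSE specification.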
Example responses
GET /job/{id}/metrics
GET /job/{id}/chunks/progress
POST /job/{id}/scale
gpu_type, tp_size, and pp_size are optional. If omitted, Tandemn inherits them from the existing job.
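One way to honor the inheritance rule when building a scale request is to leave unset overrides out of the body entirely rather than sending nulls. The helper below is a sketch under assumptions: the name `with_optional_overrides` is hypothetical, the value types (string for gpu_type, integers for tp_size and pp_size) are guesses, and any required fields are passed through untouched in `body` because their names are not listed here.

```python
from typing import Any, Dict, Optional


def with_optional_overrides(
    body: Dict[str, Any],
    gpu_type: Optional[str] = None,   # assumed to be a string
    tp_size: Optional[int] = None,    # assumed to be an integer
    pp_size: Optional[int] = None,    # assumed to be an integer
) -> Dict[str, Any]:
    """Return a copy of body plus only the overrides that were given."""
    overrides = {"gpu_type": gpu_type, "tp_size": tp_size, "pp_size": pp_size}
    merged = dict(body)  # do not mutate the caller's dict
    merged.update({k: v for k, v in overrides.items() if v is not None})
    return merged
```

Omitted keys never appear in the result, so the existing job's values would apply as described above.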
POST /job/{id}/swap
ready_threshold new replicas begin processing.
