Skip to main content
The Tandemn CLI is the main interface for Tandemn System. Use it to check server connectivity, run placement planning, submit batch inference jobs, scale replicas, monitor progress, and inspect completed runs. Tandemn System is self-hosted. You run the control plane in your own AWS account, point the CLI at that server, and submit workloads from your environment. Your data stays inside your infrastructure.
Naming note: the product is Tandemn System. Parts of the repository, CLI output, and environment variables may still reference the legacy codename orca.

Install

pip install tandemn

Configure

export TD_SERVER_URL=<your-server-url>
If the server requires API key authentication, also set:
export TD_API_KEY=<your-api-key>

Core commands

check

Verify that the CLI can reach the Tandemn server.

plan and deploy

Preview or launch a job with a model, input file, and deadline.

replicas

Add, kill, or hot-swap replica clusters while work is queued.

monitoring

Watch progress, metrics, logs, clusters, and the dashboard.

analytics

Review completed runs and scheduler timeseries.

input format

Build OpenAI-compatible JSONL batch files.

Typical workflow

pip install tandemn
export TD_SERVER_URL=<your-server-url>
tandemn check
tandemn plan Qwen/Qwen2.5-7B-Instruct prompts.jsonl --slo 4
tandemn deploy Qwen/Qwen2.5-7B-Instruct prompts.jsonl --slo 4
tandemn progress

Command groups

CommandPurpose
tandemn checkVerify CLI connectivity to the control plane.
tandemn plan <model> <input>Show a placement plan without launching.
tandemn deploy <model> <input>Run a batch job. The solver picks GPU and parallelism unless you override them.
tandemn add <job_id> NAdd replicas to a running job.
tandemn kill <job_id>Kill one or more replicas.
tandemn swap <job_id>Hot-swap all replicas to a new GPU configuration.
tandemn progress [job_id]Show a live progress bar.
tandemn webOpen the real-time dashboard.
tandemn statusList jobs.
tandemn metrics <job_id>Show vLLM metrics.
tandemn stream <job_id>Stream live metrics events.
tandemn logs [cluster]Stream SkyPilot cluster logs.
tandemn clustersShow active clusters.
tandemn destroy <job_id>Tear down job clusters and Redis state.
tandemn historyBrowse completed runs.
tandemn inspect <run_id>Show a full run report.
tandemn timeseries <run_id>Show scheduler timeseries for a completed run.

Responsibilities

The CLI user is responsible for:
  • Installing the CLI
  • Pointing it at the correct Tandemn server
  • Preparing valid input files
  • Choosing a supported model
  • Sharing useful error details with the administrator when something fails