Skip to main content
A Tandemn job moves from user submission to server-side scheduling and execution.
1

Prepare input

The user prepares a JSONL prompt file and chooses a model that the deployment can run.
2

Submit

The user runs tandemn deploy from a machine that can reach the Tandemn server.
3

Validate

The server checks the request and determines whether the workload can be accepted.
4

Plan

Tandemn evaluates the available GPU pool and chooses where to place the workload.
5

Execute

The workload runs on the selected resources.
6

Review

The user checks the CLI output and any job artifacts exposed by the deployment.

Practical advice

  • Start with a small prompt file before submitting a large workload.
  • Confirm the model is available in your deployment.
  • Keep TD_SERVER_URL visible in your terminal when debugging.
  • Ask your administrator for the supported SLO conventions in your environment.