Add replicas
| Argument or flag | Description |
|---|---|
job_id | Running job to scale. |
N | Number of replicas to add. |
--gpu <type> | Optional GPU type override for the new replicas. |
--tp N | Optional tensor parallelism override. |
--pp N | Optional pipeline parallelism override. |
--on-demand | Launch new replicas on on-demand instances instead of spot. |
Kill replicas
tandemn kill to terminate specific replicas. Any chunk leased by a killed replica is reclaimed and returned to the queue.
| Flag | Description |
|---|---|
--replica <rid> | Replica ID to kill. Repeat the flag to kill multiple replicas. |
Hot-swap replicas
| Flag | Description |
|---|---|
--gpu <type> | GPU type for the replacement fleet. |
--tp N | Tensor parallelism for replacement replicas. |
--pp N | Pipeline parallelism for replacement replicas. |
--replicas N | Number of replacement replicas. |
--ready-threshold N | Number of new replicas that must be ready before old replicas are removed. |
--on-demand | Use on-demand instances for the replacement fleet. |
Operational notes
- Replica operations require Redis-backed chunk coordination.
- Killed replicas do not lose completed chunks.
- Hot-swap is designed to avoid dropped chunks by keeping the shared queue intact.

