Skip to main content
Use the monitoring commands after a job has been submitted with tandemn deploy.

Progress and dashboard

tandemn progress
tandemn progress <job_id>
tandemn web
CommandDescription
tandemn progressLive progress bar for the active or most recent job.
tandemn progress <job_id>Live progress bar for a specific job.
tandemn webOpen the real-time web dashboard in a browser.
The dashboard shows workload details, chunk progress, replica phase state, cost, ETA, throughput, quota usage, event logs, and metrics charts. It uses server-sent events with polling fallback.

Job and cluster state

tandemn status
tandemn clusters
tandemn logs [cluster]
CommandDescription
tandemn statusList jobs known to the control plane.
tandemn clustersShow active SkyPilot clusters.
tandemn logs [cluster]Stream logs from a SkyPilot cluster.

Metrics

tandemn metrics <job_id>
tandemn metrics <job_id> --watch
tandemn metrics <job_id> --replica <rid>
tandemn metrics <job_id> --compare
tandemn stream <job_id>
CommandDescription
tandemn metrics <job_id>Latest vLLM metrics snapshot.
tandemn metrics <job_id> --watchRefresh metrics every two seconds.
tandemn metrics <job_id> --replica <rid>Show metrics for a specific replica.
tandemn metrics <job_id> --compareShow aggregated and per-replica metrics side by side.
tandemn stream <job_id>Stream live metrics at roughly one event per second.
Metrics can include throughput, queue depth, KV cache utilization, scheduler state, GPU utilization, latency, and completion counters, depending on the replica state and server configuration.

Cleanup

tandemn destroy <job_id>
tandemn destroy --all
CommandDescription
tandemn destroy <job_id>Tear down clusters and Redis state for one job.
tandemn destroy --allTear down all tandemn clusters.
Clusters are destroyed by default after job completion. Use --persist with tandemn deploy when you want to keep clusters alive.