Architecture
See how the CLI, server, and GPU resources fit together.
Batch inference
Learn why Tandemn focuses on queued inference workloads.
Job lifecycle
Follow a job from input file to execution.
Models and routing
Understand how model choice and hardware placement relate.
CLI-first design
Tandemn is documented as a CLI-first product. Users submit work withtandemn deploy; administrators operate the server and cluster environment behind that interface.
