Prepare a prompt file
Use OpenAI-style batch JSONL where each line is one request payload. Keep the first test small so you can confirm the end-to-end flow quickly.prompts.jsonl
Preview the placement
Submit the job
What the arguments mean
Qwen/Qwen2.5-7B-Instructis the model identifier.prompts.jsonlis the batch input file.--slo 4is a four-hour deadline.
Monitor progress
Use a model that your Tandemn deployment is configured to run. If a model cannot be scheduled, ask your administrator which models are currently available.
If something fails
Start with the basics:- Run
tandemn check. - Confirm
TD_SERVER_URLpoints at the right server. - Confirm the JSONL file exists and is readable.
- Confirm the model is available in your environment.

