Deployment checklist
Choose a server host
Pick a machine in your AWS environment that CLI users and EC2 inference replicas can reach.
Prepare AWS access
Configure AWS credentials, IAM permissions, S3 bucket access, and quota visibility.
Commands
Operating model
The administrator owns the server deployment, AWS credentials, S3 bucket, Redis, environment configuration, network exposure, and cluster access. Users own their prompt files, model choices, SLOs, and CLI environment.Document the server URL, supported models, and expected SLO values for your users before sharing access.

