together-dedicated-containers

📁 zainhas/togetherai-skills 📅 Today

总安装量

周安装量

#67378

全站排名

安装命令

npx skills add https://github.com/zainhas/togetherai-skills --skill together-dedicated-containers

Agent 安装分布

amp 1

cline 1

opencode 1

cursor 1

kimi-cli 1

codex 1

Skill 文档

Together Dedicated Containers

Overview

Run custom Dockerized inference workloads on Together’s managed GPU infrastructure. You bring the container â Together handles compute, autoscaling, networking, and observability.

Components:

Jig CLI: Build, push, and deploy containers
Sprocket SDK: Python SDK for handling inference requests inside containers
Container Registry: registry.together.xyz for storing images
Queue API: Async job submission with priority and progress tracking

Workflow

Write inference code using Sprocket SDK (setup() + predict())
Build container with Jig CLI (jig build)
Push to registry (jig push)
Deploy (jig deploy)
Send requests to your deployment

Quick Start

1. Install Jig CLI

pip install jig-together
jig auth login  # Authenticate with Together AI

2. Create Inference Worker

# worker.py
from sprocket import SprocketWorker

class MyWorker(SprocketWorker):
    def setup(self):
        """Load model and resources (runs once at startup)."""
        import torch
        self.model = torch.load("model.pt")

    def predict(self, request):
        """Handle a single inference request."""
        input_data = request.json()
        result = self.model(input_data["prompt"])
        return {"output": result}

3. Configure Project

# pyproject.toml
[tool.jig]
name = "my-inference-service"
gpu = "A100"
gpu_count = 1
worker = "worker:MyWorker"

4. Build, Push, Deploy

jig build                    # Build Docker image
jig push                     # Push to registry.together.xyz
jig deploy                   # Deploy to Together infrastructure
jig status                   # Check deployment status
jig logs                     # View logs

5. Send Requests

Check the health endpoint:

curl https://api.together.ai/v1/deployments/my-inference-service/health \
  -H "Authorization: Bearer $TOGETHER_API_KEY"

Submit a job via the Queue API:

curl -X POST "https://api.together.ai/v1/queue/submit" \
  -H "Authorization: Bearer $TOGETHER_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "my-inference-service",
    "payload": {"prompt": "Hello world"},
    "priority": 1
  }'

Response:

{
  "request_id": "req_abc123",
  "status": "pending"
}

Poll for the result:

curl "https://api.together.ai/v1/queue/status?model=my-inference-service&request_id=req_abc123" \
  -H "Authorization: Bearer $TOGETHER_API_KEY"

Response (when complete):

{
  "request_id": "req_abc123",
  "model": "my-inference-service",
  "status": "done",
  "outputs": {"output": "..."}
}

Or use the Python requests library:

import os
import requests

response = requests.post(
    "https://api.together.ai/v1/queue/submit",
    headers={"Authorization": f"Bearer {os.environ['TOGETHER_API_KEY']}"},
    json={
        "model": "my-inference-service",
        "payload": {"prompt": "Hello world"},
        "priority": 1,
    },
)
print(response.json())

Or submit directly via the Jig CLI:

together beta jig submit --payload '{"prompt": "Hello world"}' --watch

Sprocket SDK

The SDK provides the SprocketWorker base class:

setup(): Called once at startup â load models, warm up caches
predict(request): Called per request â process input and return output
File handling: Upload/download files within predictions
GPU access: Full CUDA access inside the container

Queue API

For async workloads, use the Queue API for job submission with:

Priority-based fair queuing
Progress tracking
Job status polling

Key Jig CLI Commands

All commands are subcommands of together beta jig. Use --config <path> to specify a custom config file (default: pyproject.toml).

Build and Deploy

Command	Description
`jig init`	Create a starter `pyproject.toml` with defaults
`jig dockerfile`	Generate a Dockerfile from config (for debugging)
`jig build`	Build container image locally
`jig build --tag <tag>`	Build with a specific image tag
`jig build --warmup`	Build and pre-generate compile caches (requires GPU)
`jig push`	Push image to `registry.together.xyz`
`jig deploy`	Build, push, and create/update deployment
`jig deploy --build-only`	Build and push only, skip deployment creation
`jig deploy --image <ref>`	Deploy an existing image, skip build and push

Deployment Management

Command	Description
`jig status`	Show deployment status and configuration
`jig list`	List all deployments in your organization
`jig logs`	View deployment logs
`jig logs --follow`	Stream logs in real-time
`jig endpoint`	Print the deployment’s endpoint URL
`jig destroy`	Delete the deployment

Queue

Command	Description
`jig submit --payload '<json>'`	Submit a job to the queue
`jig submit --prompt '<text>'`	Submit with shorthand prompt payload
`jig submit --watch`	Submit and wait for the result
`jig job_status --request-id <id>`	Get the status of a submitted job
`jig queue_status`	Show queue backlog and worker status

Secrets

Command	Description
`jig secrets set --name <n> --value <v>`	Create or update a secret
`jig secrets list`	List all secrets for the deployment
`jig secrets unset <name>`	Remove a secret

Volumes

Command	Description
`jig volumes create --name <n> --source <path>`	Create a volume and upload files
`jig volumes update --name <n> --source <path>`	Update a volume with new files
`jig volumes describe --name <n>`	Show volume details and contents
`jig volumes list`	List all volumes
`jig volumes delete --name <n>`	Delete a volume

Resources

Full Jig CLI reference: See references/jig-cli.md
Sprocket SDK reference: See references/sprocket-sdk.md
App template: See scripts/sprocket_hello_world.py â minimal Sprocket worker with pyproject.toml example
Official docs: Dedicated Container Inference
Official docs: Containers Quickstart
API reference: Deployments API

GitHub 仓库 ↗ ← 返回陌讯 Skills 聚合平台