Add together-sandbox skill by necoline · Pull Request #16 · togethercomputer/skills

necoline · 2026-05-26T15:31:54Z

Summary

Adds new together-sandbox skill (singular) for the together-sandbox Python SDK, sandbox environments for RL training, SFT data generation, and coding agent rollouts
Distinct from the existing together-sandboxes (plural) skill, which covers the Code Interpreter API for managed Python notebook execution
Grounded in the GRPO RL workload profile: snapshot creation, batch sandbox fan-out, multi-turn exec, reward file collection, and lifecycle cleanup

Files added

File	Purpose
`skills/together-sandbox/SKILL.md`	Skill definition with routing, workflow, and high-signal rules
`skills/together-sandbox/agents/openai.yaml`	UI metadata for OpenAI/Codex surfaces
`skills/together-sandbox/references/api-reference.md`	Full SDK reference (snapshots, sandboxes, exec, files, lifecycle)
`skills/together-sandbox/references/rl-patterns.md`	GRPO training patterns (golden image, batch fan-out, rollouts, reward collection)
`skills/together-sandbox/scripts/sandbox_lifecycle.py`	End-to-end lifecycle demo (create, exec, files, shutdown)
`skills/together-sandbox/scripts/parallel_fanout.py`	Batch creation and parallel execution demo
`quality/trigger-evals/together-sandbox.json`	6 trigger eval cases (3 positive, 3 negative)

Test plan

python3 scripts/quick_validate.py skills/together-sandbox passes
python3 scripts/quality_check.py passes
Scripts run successfully against live Sandbox API (blocked on API key provisioning)
Run ./scripts/publish.sh to regenerate AGENTS.md and README.md (left for maintainer)

🤖 Generated with Claude Code

New skill covering the together-sandbox Python SDK for isolated container environments used in RL training, SFT data generation, and coding agent rollouts. Distinct from together-sandboxes (Code Interpreter API). Includes: - SKILL.md with routing, workflow, and high-signal rules - references/api-reference.md (full SDK surface) - references/rl-patterns.md (GRPO training patterns) - scripts/sandbox_lifecycle.py (create, exec, files, shutdown) - scripts/parallel_fanout.py (batch creation for RL) - agents/openai.yaml - trigger-evals (3 positive, 3 negative) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Verified all claims against togethercomputer/together-sandbox source code. Fixes: - execs.exec() does not exist; replaced with execs.create() + execs.get() polling pattern throughout all files - Added run_exec() helper to wrap the two-step create+poll flow - HttpError does not exist; replaced with RuntimeError - autostart parameter is actually autorun - user parameter is actually uid/gid (int, not str) - get_output() returns list[ExecStdout] with .output/.exit_code attributes, not a dict with string keys - Snapshot model has no alias field; documented this limitation - SSE stream uses camelCase (exitCode) not snake_case - Documented ExecItem and ExecStdout model fields Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Ran both scripts against live Sandbox API — both pass end-to-end. Corrections from live testing: - execs.exec() DOES exist (convenience method wrapping create+stream) Returns {"exit_code": int, "output": str} — reverted all files - autostart is correct (not autorun) - user is str ("1000:1000"), not uid/gid ints - sandbox.shutdown() is a classmethod — use sdk.sandboxes.shutdown(id) - sandbox.hibernate() is a classmethod — use sdk.sandboxes.hibernate(id) - Removed run_exec() helper (unnecessary since exec() exists) Verified against live API: - sandbox_lifecycle.py: snapshot create, sandbox start, DNS config, exec, file write/read, directory list, shutdown, snapshot delete - parallel_fanout.py: 4 concurrent sandboxes, parallel exec, cleanup Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

…, fix install Address PR review: stop re-documenting the API surface in the skill and point to the SDK's own version-matched docs so the skill needs no maintenance as the SDK evolves. - Delete references/api-reference.md (307 lines) — it duplicated and had already drifted from the SDK (stale base_url; the now-deleted file predated current API) - SKILL.md: replace the API-reference section with an "SDK Reference" section pointing to docs/{python-sdk,typescript-sdk,cli}.md on GitHub - Install: together-sandbox is published on PyPI — use `pip install together-sandbox` (drop the git+https install and the "not yet on PyPI" note); note the TS package is not yet on npm - Fix the lifecycle rule: `sandbox.shutdown()` / `sandbox.hibernate()` instance methods do exist (alongside the `sdk.sandboxes.*` by-id forms) - Python-primary scope; reference TS/CLI by pointer only - Regenerate AGENTS.md / README.md skills table (was left for maintainer) Validated against installed SDK v1.12.0: every symbol used exists, scripts compile, quick_validate + quality_check + publish.sh --check all pass. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

necoline · 2026-06-01T16:19:29Z

Updated based on the review (pushed 4308449). Summary:

Stopped re-documenting the API surface. Deleted references/api-reference.md (307 lines) — it duplicated the SDK and had already drifted (stale base_url, etc.). SKILL.md now has an SDK Reference section pointing to the SDK's own version-matched docs so the skill needs no maintenance as the SDK changes:

docs/python-sdk.md (primary), docs/typescript-sdk.md, docs/cli.md

Install fixed. together-sandbox is live on PyPI (v1.12.0), so it now leads with pip install together-sandbox and drops the git+https / "not yet on PyPI" lines. Flagged that the TS package @together-sandbox/sdk isn't on npm yet (install from source until then).

Language scope trimmed to Python-primary; TS/CLI are referenced by pointer only, not enumerated.

Correctness fix. Removed the rule claiming sandbox.shutdown() doesn't exist — it does (instance method), alongside the sdk.sandboxes.shutdown(id) by-id form. Both are now documented as valid. The execs.exec(...) usage was verified correct against the installed v1.12.0.

Regenerated AGENTS.md / README.md skills table (was left for maintainer).

Validated against installed SDK v1.12.0: every symbol used exists, scripts compile, and quick_validate.py + quality_check.py + publish.sh --check all pass. The only remaining item is a live end-to-end run against a real sandbox (needs API key + sandbox access).

On bundling docs inside the package vs. linking GitHub: went with the canonical GitHub doc URLs for now since they work today with no publish dependency. Happy to switch the skill to point at the in-package docs/ path (e.g. node_modules/@together-sandbox/sdk/docs) once you publish the packages with the AI docs bundled — let me know which you prefer.

necoline and others added 2 commits May 26, 2026 17:30

necoline changed the title ~~Add together-sandbox skill for gVisor container execution~~ Add together-sandbox skill May 26, 2026

necoline and others added 2 commits May 26, 2026 18:57

necoline mentioned this pull request Jun 1, 2026

Add together-rl skill (GRPO with sandboxed code-execution rewards) #22

Open

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add together-sandbox skill #16

Add together-sandbox skill #16
necoline wants to merge 4 commits into
mainfrom
add-together-sandbox-skill

necoline commented May 26, 2026 •

edited

Loading

Uh oh!

necoline commented Jun 1, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

necoline commented May 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Files added

Test plan

Uh oh!

necoline commented Jun 1, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

necoline commented May 26, 2026 •

edited

Loading