DX-596-together-chat-completions: sync with mintlify-docs#942#26
Merged
zainhas merged 1 commit intoJun 4, 2026
Merged
Conversation
Sync with mintlify-docs#942 which adds the model to the serverless chat catalog at $0.60 input / $0.20 cached / $3.60 output per 1M tokens, NVFP4 quant, 512300 context length.
karina-together
approved these changes
Jun 4, 2026
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Syncs the
together-chat-completionsskill with mintlify-docs#942 (ENG-88891 docs: add NVIDIA Nemotron 3 Ultra 550B A55B to serverless models), merged at commit0191185.Triggering doc changes
docs/serverless/models.mdx— added a row to the chat models catalog for NVIDIA Nemotron 3 Ultra 550B A55B (nvidia/nemotron-3-ultra-550b-a55b), 512,300 context, NVFP4, $0.60 / $0.20 cached / $3.60 per 1M tokens, with function calling and structured outputs supported.Skill changes
references/models.md— appended a new row in the Full Chat Model Catalog table for NVIDIA Nemotron 3 Ultra 550B A55B (nvidia/nemotron-3-ultra-550b-a55b, context 512,300, quant NVFP4) so the skill's catalog matches the public serverless catalog.No SKILL.md, scripts, or other references were touched — this is a pure catalog row addition.
Generated by the Sync Skills Cursor Automation. Please review before merging.