Skip to content

DX-596-together-fine-tuning: sync with mintlify-docs#932#25

Merged
zainhas merged 1 commit into
mainfrom
docs-sync/together-fine-tuning/mintlify-docs-pr-932
Jun 4, 2026
Merged

DX-596-together-fine-tuning: sync with mintlify-docs#932#25
zainhas merged 1 commit into
mainfrom
docs-sync/together-fine-tuning/mintlify-docs-pr-932

Conversation

@zainhas

@zainhas zainhas commented Jun 3, 2026

Copy link
Copy Markdown
Collaborator

Syncs together-fine-tuning skill with togethercomputer/mintlify-docs#932 (merged).

Source PR

  • togethercomputer/mintlify-docs#932: MOSH-2800: add Max LoRA Rank column to LoRA fine-tuning table

Triggering doc files

  • docs/fine-tuning-models.mdx

Skill changes

  • references/supported-models.md: added a new Max LoRA rank caps table after the LoRA model tables, capturing the per-model lora_r ceilings introduced by the new Max LoRA Rank column. Models capped at 16: Moonshot Kimi K2 family, Z.ai GLM-5/5.1, DeepSeek R1/V3 (non-distill) families. All other LoRA-supported models keep the default cap of 64.
  • references/deployment.md: updated the lora_r row in the training-parameters table to note that the rank is capped per model and link to supported-models.md for the cap table.

No SKILL.md, scripts, agents, or other references touched.


Generated by the Sync Skills Cursor Automation. Please review before merging.

…cs#932

Adds Max LoRA rank caps documentation to align with the new Max LoRA Rank
column added in togethercomputer/mintlify-docs#932 (docs/fine-tuning-models.mdx).

- references/supported-models.md: new 'Max LoRA rank caps' table listing
  the model families capped at 16 (Moonshot Kimi K2, Z.ai GLM-5/5.1,
  DeepSeek R1/V3 non-distill families) vs the default cap of 64.
- references/deployment.md: lora_r row notes the per-model cap and links
  to supported-models.md.
| `warmup_ratio` | float | 0 | Warmup step ratio |
| `lora` | bool | true | Use LoRA method |
| `lora_r` | int | 64 | LoRA rank |
| `lora_r` | int | 64 | LoRA rank. Per-model max -- 64 for most models; 16 for Moonshot Kimi-K2 family, Z.ai GLM-5/5.1, and DeepSeek R1/V3 (non-distill) families. See [supported-models.md](supported-models.md). |

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Probably it's better to not specify any specific values and just redirect to supported-models.md for up-to-date info

@zainhas zainhas merged commit fb94cc1 into main Jun 4, 2026
2 checks passed
@zainhas zainhas deleted the docs-sync/together-fine-tuning/mintlify-docs-pr-932 branch June 4, 2026 18:56
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants