DX-596-together-fine-tuning: sync with mintlify-docs#892 #31

Merged: zainhas merged 1 commit into main from docs-sync/together-fine-tuning/mintlify-docs-pr-892 on Jun 9, 2026

Conversation

zainhas (Collaborator) commented Jun 8, 2026

Mirrors togethercomputer/mintlify-docs#892 ("MOSH-2768: update docs with examples of backend file validation").

Docs changes that triggered this PR

  • docs/fine-tuning-data-preparation.mdx
  • docs/fine-tuning-quickstart.mdx

Skill changes

  • references/data-formats.md: split validation into a client-side structural check (UTF-8 / one-JSON-per-line / min samples / max size) and a server-side schema validation stage, and document the new processing_status lifecycle (COMPLETED / INVALID_FORMAT / FAILED) along with sample validation_report payloads.
  • references/data-formats.md: refreshed the CLI section to show together files retrieve <FILE-ID> as the way to inspect server-side validation status.
  • scripts/finetune_workflow.py, scripts/dpo_workflow.py, scripts/function_calling_finetune.py, scripts/reasoning_finetune.py, scripts/vlm_finetune.py: added a small wait_for_file_ready(file_id) helper that polls client.files.retrieve(...).processing_status after upload and raises on INVALID_FORMAT / FAILED before calling client.fine_tuning.create(...), so bad datasets fail fast instead of consuming training time.
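
For reviewers, the client-side structural check named in the first bullet (UTF-8, one JSON value per line, minimum samples, maximum size) could look roughly like the sketch below. It is not the code from `references/data-formats.md`: the function name `check_jsonl_structure`, the default limits, the skipping of blank lines, and the requirement that each line be a JSON object are all illustrative assumptions.

```python
import json
import os

# Hypothetical limits: the PR names the checks but not their thresholds.
DEFAULT_MIN_SAMPLES = 1
DEFAULT_MAX_BYTES = 1024 ** 3


def check_jsonl_structure(path, min_samples=DEFAULT_MIN_SAMPLES,
                          max_bytes=DEFAULT_MAX_BYTES):
    """Return a list of structural problems; an empty list means none were found."""
    problems = []

    size = os.path.getsize(path)
    if size > max_bytes:
        problems.append(f"file is {size} bytes, over the {max_bytes}-byte limit")

    samples = 0
    # Read as bytes so that UTF-8 decoding errors can be reported per line.
    with open(path, "rb") as fh:
        for lineno, raw in enumerate(fh, start=1):
            if not raw.strip():
                continue  # assumption: blank lines are ignored, not counted
            try:
                record = json.loads(raw.decode("utf-8"))
            except UnicodeDecodeError:
                problems.append(f"line {lineno}: not valid UTF-8")
                continue
            except json.JSONDecodeError as exc:
                problems.append(f"line {lineno}: invalid JSON ({exc.msg})")
                continue
            # Assumption: every sample is a JSON object, not a bare list or scalar.
            if not isinstance(record, dict):
                problems.append(f"line {lineno}: expected a JSON object")
                continue
            samples += 1

    if samples < min_samples:
        problems.append(f"only {samples} valid sample(s), need at least {min_samples}")
    return problems
```

A check like this only covers structure. Field-level schema problems would still surface later through the server-side `processing_status` described above.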

Generated by the Sync Skills Cursor Automation. Please review before merging.

…docs#892

Mirror the updated client/server-side file validation guidance from
togethercomputer/mintlify-docs#892:

- references/data-formats.md: split validation into client-side structural
  check and server-side schema validation, document processing_status
  states (COMPLETED / INVALID_FORMAT / FAILED) and validation_report
  payloads, refresh CLI examples.
- scripts/*_workflow.py: poll client.files.retrieve(...).processing_status
  after upload before starting fine-tuning so bad datasets fail fast
  instead of consuming training time.
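
The polling step described above can be sketched as follows. This is a minimal illustration, not the helper committed to the scripts: it takes the client as an explicit argument, and the timeout, poll interval, injectable `sleep`, exception types, and the assumption that `validation_report` is readable as an attribute of the retrieved file are all hypothetical. Only `client.files.retrieve(...)`, `processing_status`, and the three status names come from this PR.

```python
import time

# Status names as listed in this PR.
READY = "COMPLETED"
TERMINAL_FAILURES = {"INVALID_FORMAT", "FAILED"}


def wait_for_file_ready(client, file_id, timeout_s=600.0, poll_interval_s=5.0,
                        sleep=time.sleep):
    """Poll the uploaded file until server-side validation finishes.

    Returns the retrieved file object on COMPLETED, raises ValueError on
    INVALID_FORMAT / FAILED, and raises TimeoutError if neither happens
    before timeout_s elapses.
    """
    deadline = time.monotonic() + timeout_s
    while True:
        info = client.files.retrieve(file_id)
        status = getattr(info, "processing_status", None)
        # The field may be a plain string or an enum; normalise either to a name.
        name = str(getattr(status, "value", status)).upper()

        if name == READY:
            return info
        if name in TERMINAL_FAILURES:
            # Assumption: the validation report is exposed on the same object.
            report = getattr(info, "validation_report", None)
            raise ValueError(f"file {file_id} failed validation ({name}): {report}")
        if time.monotonic() >= deadline:
            raise TimeoutError(f"file {file_id} still {name} after {timeout_s}s")
        sleep(poll_interval_s)
```

In a workflow script, the call would sit between the upload and `client.fine_tuning.create(...)`, so an `INVALID_FORMAT` or `FAILED` dataset stops the run before a job is created.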
zainhas merged commit 882c3bb into main on Jun 9, 2026
2 checks passed
zainhas deleted the docs-sync/together-fine-tuning/mintlify-docs-pr-892 branch on June 9, 2026 at 16:17