
Add directory for antalya docs, skills, and designs#1673

Open
hodgesrm wants to merge 1 commit into Altinity:antalya-26.1 from hodgesrm:01-alter-table-export-design

Conversation

Member

@hodgesrm hodgesrm commented Apr 22, 2026

Changelog category (leave one):

  • Documentation (changelog entry is not required)

Changelog entry (a user-readable short description of the changes that goes to CHANGELOG.md):

Added a skill for Antalya design creation and review. Also added a first-cut design spec for ALTER TABLE EXPORT based on current documentation and tests.

Documentation entry for user-facing changes

...

CI/CD Options

Exclude tests:

  • Fast test
  • Integration Tests
  • Stateless tests
  • Stateful tests
  • Performance tests
  • All with ASAN
  • All with TSAN
  • All with MSAN
  • All with UBSAN
  • All with Coverage
  • All with Aarch64
  • All Regression
  • Disable CI Cache

Regression jobs to run:

  • Fast suites (mostly <1h)
  • Aggregate Functions (2h)
  • Alter (1.5h)
  • Benchmark (30m)
  • ClickHouse Keeper (1h)
  • Iceberg (2h)
  • LDAP (1h)
  • Parquet (1.5h)
  • RBAC (1.5h)
  • SSL Server (1h)
  • S3 (2h)
  • S3 Export (2h)
  • Swarms (30m)
  • Tiered Storage (2h)

@hodgesrm hodgesrm requested a review from arthurpassos April 22, 2026 03:26
Signed-off-by: Robert Hodges <rhodges@altinity.com>
@hodgesrm hodgesrm force-pushed the 01-alter-table-export-design branch from 06d5b84 to 227f7b1 Compare April 22, 2026 03:42
Collaborator

@arthurpassos arthurpassos left a comment


I see some mistakes made by Claude here regarding plain object storage vs. Iceberg exports. I assume it is because it consumed the antalya-26.1 branch docs, which do not mention Iceberg at all. Those docs are in my 1618 PR.

Halfway through; more to come.

be easy to monitor. This design covers two new ClickHouse commands to
export data.

* `ALTER TABLE EXPORT PART` -- Exports a single part to Iceberg
Collaborator

Not only Iceberg, but plain object storage as well


* `ALTER TABLE EXPORT PART` -- Exports a single part to Iceberg

* `ALTER TABLE EXPORT PARTITION` -- Exports one or more partitions to Iceberg
Collaborator

As of now, it exports one partition, not more than that. That partition can have multiple parts, yes.

Member Author

Ack. However, I believe the design should properly cover the EXPORT PARTITION ALL case, even if it's not yet implemented.
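
For reference, the command shapes under discussion might look like the following. This is a minimal sketch based on the design text quoted in this thread; the table names are illustrative, and the `EXPORT PARTITION ALL` form is the hypothetical, not-yet-implemented case raised above:

```sql
-- Export a single part to a destination table (illustrative names).
ALTER TABLE events EXPORT PART '2025_0_32_3' TO TABLE events_dest;

-- Export every part of one partition.
ALTER TABLE events EXPORT PARTITION ID '2025' TO TABLE events_dest;

-- Hypothetical: export all partitions in one command (not yet implemented).
ALTER TABLE events EXPORT PARTITION ALL TO TABLE events_dest;
```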


These commands replace `INSERT INTO ... SELECT FROM` pipelines that select
rows and write them out to one or more Parquet files. This approach
costs an extra decode/sort pass per export, does not coordinate across
Collaborator

an extra decode/sort

INSERT ... SELECT does incur a sorting penalty, but I'm not sure about an extra decode. INSERT ... SELECT also does extra work on partitioning.
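
For comparison, the pipeline being replaced could be sketched as follows; the table, bucket, and column names here are illustrative, not taken from the design doc:

```sql
-- Legacy approach: select rows back out of MergeTree and re-encode them
-- as Parquet on S3. This re-reads the part and re-sorts the rows before
-- encoding, which is the extra work the EXPORT commands aim to avoid.
INSERT INTO FUNCTION
    s3('https://bucket.s3.amazonaws.com/export/202501.parquet', 'Parquet')
SELECT *
FROM events
WHERE _partition_id = '202501'
ORDER BY event_time;
```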


2. **Efficient, order-preserving writes.** Write a specified `MergeTree`
part (or every part of a specified partition) to an object-storage
destination in Parquet, preserving the source part's sort order,
Collaborator

in Parquet

In theory we support other formats as well, though I have never tested them. I would keep it as Parquet.

Member Author

@hodgesrm hodgesrm Apr 23, 2026

Noted. File formats other than Parquet are listed as non-requirements.

2. **Efficient, order-preserving writes.** Write a specified `MergeTree`
part (or every part of a specified partition) to an object-storage
destination in Parquet, preserving the source part's sort order,
without using a `SELECT` pass and also minimizing the RAM required
Collaborator

also minimizing the RAM required to hold data during transfer

I would remove this portion. We just rely on the ClickHouse internal pipeline, which is probably very similar to what INSERT ... SELECT uses.

Member Author

My point was that we want to use the same or less memory than doing a SELECT. Design updated to reflect this.

-- Split large part across multiple Parquet files
ALTER TABLE big EXPORT PART '2025_0_32_3' TO TABLE big_dest
SETTINGS allow_experimental_export_merge_tree_part = 1,
export_merge_tree_part_max_bytes_per_file = 10000000,
Collaborator

export_merge_tree_part_max_bytes_per_file

We have a funny situation with this. When I added this setting, ClickHouse did not have the ability to split output across multiple Parquet files. Plus, we did not support exporting to Iceberg.

The Iceberg engine now has its own settings. Which one should we respect?

Member Author

Good point. Seems like we would go with the Iceberg engine for consistency and to simplify EXPORT commands.

ALTER TABLE rmt_table EXPORT PARTITION ID '2020' TO TABLE s3_table;

-- Cancel by filter
KILL EXPORT PARTITION
Collaborator

Worth noting that it is the same filter you would apply when reading from `system.replicated_partition_exports`.
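
As a sketch of that symmetry, assuming illustrative column names for `system.replicated_partition_exports` (not confirmed by this PR):

```sql
-- Inspect in-flight exports with a filter ...
SELECT *
FROM system.replicated_partition_exports
WHERE table = 'rmt_table' AND partition_id = '2020';

-- ... then cancel using the same filter.
KILL EXPORT PARTITION
WHERE table = 'rmt_table' AND partition_id = '2020';
```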


Create a `ReplicatedMergeTree` source, seed two partitions from
`system.numbers`, create a hive-partitioned S3 destination (the on-disk
shape an external Iceberg catalog such as Glue / REST / Nessie /
Collaborator

Well, this is a plain object storage end-to-end case, not Iceberg.
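
The scenario in the quoted text could be sketched roughly as follows; the bucket URL, credentials handling, and the `{_partition_id}` write path are placeholders, not details confirmed by this PR:

```sql
-- Source: replicated MergeTree seeded with two partitions from system.numbers.
CREATE TABLE rmt_src (p UInt32, v UInt64)
ENGINE = ReplicatedMergeTree('/clickhouse/tables/rmt_src', 'r1')
PARTITION BY p ORDER BY v;

INSERT INTO rmt_src SELECT 2020, number FROM system.numbers LIMIT 10;
INSERT INTO rmt_src SELECT 2021, number FROM system.numbers LIMIT 10;

-- Destination: hive-partitioned plain S3 table (not an Iceberg catalog).
CREATE TABLE s3_dest (p UInt32, v UInt64)
ENGINE = S3('https://bucket.s3.amazonaws.com/export/p={_partition_id}/data.parquet', 'Parquet')
PARTITION BY p;

-- Export one partition, then read it back from object storage.
ALTER TABLE rmt_src EXPORT PARTITION ID '2020' TO TABLE s3_dest;
```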


The following notes expand on expected behavior of commands.

1. `ALTER TABLE t EXPORT PART 'p' TO TABLE s3_t` writes
Collaborator

by default, yes

`<dir>/commit_<part>_<checksum>`, readable end-to-end via
`SELECT * FROM s3(...)` in tests `03572_*` and `03608_*`.

2. `ALTER TABLE rmt EXPORT PARTITION ID 'p' TO TABLE s3_t` exports
Collaborator

Based on the list of parts that the replica that received the command sees.
