[Flink] Fix PostponeFixedBucketChannelComputer routing all records to same channel#7737

Open
leaves12138 wants to merge 3 commits into apache:master from leaves12138:fix-postpone-fixed-bucket-channel-computer

Conversation

@leaves12138
Contributor

Summary

Fix a bug in PostponeFixedBucketChannelComputer where all records in the same partition are routed to the same downstream channel in batch mode, causing only one subtask to actually process data.

Root Cause

In the channel() method, the variable bucket was set to the total number of buckets for the partition (from knownNumBuckets). Since this value is the same for all records in the same partition, ChannelComputer.select(partition, bucket, numChannels) always returns the same channel — all records go to one subtask.

Fix

Compute a per-record bucket by hashing the trimmedPrimaryKey and taking modulo numBuckets. This distributes records with different primary keys across different channels/subtasks, similar to how ChannelComputer.startChannel handles Integer.MIN_VALUE.
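The effect described above can be demonstrated with a minimal, self-contained sketch. The `select` method below is a simplified stand-in for `ChannelComputer.select` (hash partition and bucket together, map into the channel range), not Paimon's actual implementation; the numbers are illustrative only:

```java
import java.util.HashSet;
import java.util.Set;

public class ChannelSketch {
    // Simplified stand-in for ChannelComputer.select: combine the partition
    // hash with the bucket, then map into the channel range.
    static int select(int partitionHash, int bucket, int numChannels) {
        return Math.abs(31 * partitionHash + bucket) % numChannels;
    }

    public static void main(String[] args) {
        int numChannels = 4;
        int partitionHash = 7; // all records share one partition
        int numBuckets = 8;    // total bucket count for that partition

        // Buggy behavior: "bucket" is the constant total bucket count,
        // so every record in the partition lands on the same channel.
        Set<Integer> buggy = new HashSet<>();
        for (int key = 0; key < 100; key++) {
            buggy.add(select(partitionHash, numBuckets, numChannels));
        }

        // Fixed behavior: derive a per-record bucket from the primary key
        // hash modulo numBuckets, spreading records across channels.
        Set<Integer> fixed = new HashSet<>();
        for (int key = 0; key < 100; key++) {
            int bucket = Math.abs(Integer.hashCode(key) % numBuckets);
            fixed.add(select(partitionHash, bucket, numChannels));
        }

        System.out.println("buggy channels hit: " + buggy.size()); // 1
        System.out.println("fixed channels hit: " + fixed.size()); // 4
    }
}
```

With the constant bucket, only one of the four channels ever receives data; with the per-record bucket, all four do.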

Testing

Verified the logic matches the pattern used in RowDataChannelComputer (fixed bucket tables) and PostponeBucketChannelComputer (streaming postpone tables).

… same channel

Previously, the channel() method used the total number of buckets
(knownNumBuckets) as the bucket parameter for ChannelComputer.select().
Since all records in the same partition share the same total bucket count,
they were all routed to the same downstream channel, causing only one
subtask to process data in batch mode.

This fix computes a per-record bucket by hashing the trimmedPrimaryKey
and taking modulo numBuckets, so records with different primary keys are
distributed across different channels/subtasks.
@JingsongLi
Contributor

@leaves12138 Add the test.

```diff
  BinaryRow partition = partitionKeyExtractor.partition(record);
- int bucket = knownNumBuckets.computeIfAbsent(partition, p -> numChannels);
+ int numBuckets = knownNumBuckets.computeIfAbsent(partition, p -> numChannels);
+ int hash = partitionKeyExtractor.trimmedPrimaryKey(record).hashCode();
```
Contributor

Just use FixedBucketRowKeyExtractor?

Contributor Author

Fixed automatically by Codex. PostponeFixedBucketChannelComputer now reuses FixedBucketRowKeyExtractor, with FixedBucketRowKeyExtractor#bucket(int numBuckets) added so postpone fixed-bucket writes can still use the per-partition bucket count from knownNumBuckets.
