[GLUTEN-11012][CH] Support Thai/Khmer digit dates in CH#11476
Merged
baibaichen merged 6 commits into apache:main on Feb 2, 2026
Run Gluten Clickhouse CI on x86
e145b6c to d713a30
Scan UTF-8 strings for local digits before conversion and add regression queries for Thai and Khmer numeral date parsing in the function suite.
Add comments describing the base64-encoded local digit date fixtures used in local digit date tests.
Use SIMD-based ASCII detection, fast-path common UTF-8 digit ranges, and avoid double scans when converting local digits.
Preserve original UTF-8 bytes when no local digit is detected in multi-byte sequences, and downgrade logging to debug.
Map UTF-8 byte ranges to correct digit values for Devanagari and Bengali local digits.
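The commits above describe a byte-level pass over UTF-8 input. As a rough illustration only (this is not the PR's code, which lives in the C++ ClickHouse backend), the idea can be sketched in Python. The function names and the lookup table are hypothetical; the byte ranges follow from the Unicode code points of the Thai (U+0E50..U+0E59), Khmer (U+17E0..U+17E9), Devanagari (U+0966..U+096F), and Bengali (U+09E6..U+09EF) digits. `bytes.isascii()` stands in for the SIMD-based ASCII check.

```python
from typing import Optional

# Hypothetical table: (lead byte, second byte) -> third byte of digit zero.
# All four scripts encode their digits as 3-byte UTF-8 sequences.
_DIGIT_ZERO_THIRD_BYTE = {
    (0xE0, 0xB9): 0x90,  # Thai       U+0E50..U+0E59 -> E0 B9 90..99
    (0xE1, 0x9F): 0xA0,  # Khmer      U+17E0..U+17E9 -> E1 9F A0..A9
    (0xE0, 0xA5): 0xA6,  # Devanagari U+0966..U+096F -> E0 A5 A6..AF
    (0xE0, 0xA7): 0xA6,  # Bengali    U+09E6..U+09EF -> E0 A7 A6..AF
}


def local_digit_value(b0: int, b1: int, b2: int) -> Optional[int]:
    """Return 0..9 if the three bytes encode a known local digit, else None."""
    base = _DIGIT_ZERO_THIRD_BYTE.get((b0, b1))
    if base is None:
        return None
    value = b2 - base
    return value if 0 <= value <= 9 else None


def normalize_local_digits(data: bytes) -> bytes:
    """Replace local digits with ASCII digits; leave all other bytes untouched."""
    # Fast path: pure-ASCII input cannot contain local digits.
    if data.isascii():
        return data

    out = bytearray()
    i, n = 0, len(data)
    while i < n:
        b0 = data[i]
        if b0 >= 0x80 and i + 2 < n:
            value = local_digit_value(b0, data[i + 1], data[i + 2])
            if value is not None:
                out.append(0x30 + value)  # ASCII '0' + digit value
                i += 3
                continue
        # Not a local digit: copy the original byte unchanged.
        out.append(b0)
        i += 1
    return bytes(out)
```

For example, `normalize_local_digits("\u0e52\u0e50\u0e52\u0e56-\u0e50\u0e52-\u0e50\u0e52".encode("utf-8"))` turns a Thai-digit date into the ASCII bytes `2026-02-02`, while multi-byte characters that are not digits pass through byte for byte.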
0601e08 to fc9ec4c
baibaichen approved these changes on Feb 2, 2026
What changes are proposed in this pull request?
Scan UTF-8 strings for local digits before conversion and add regression queries for Thai and Khmer numeral date parsing in the function suite.
Fix #11012.
How was this patch tested?
Unit tests (UT).
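The tests themselves are not reproduced in this conversation. A regression check in the same spirit, using base64-encoded Thai and Khmer date fixtures, might look like the Python sketch below. The fixture values, `to_ascii_digits`, and `parse_local_digit_date` are hypothetical names chosen for illustration, and the normalization here works on code points via `unicodedata` rather than on raw bytes.

```python
import base64
import unicodedata
from datetime import date


def to_ascii_digits(text: str) -> str:
    """Map every Unicode decimal digit to its ASCII equivalent."""
    return "".join(
        str(unicodedata.decimal(ch)) if ch.isdecimal() else ch
        for ch in text
    )


def parse_local_digit_date(fixture_b64: str) -> date:
    """Decode a base64 fixture, normalize its digits, and parse an ISO date."""
    raw = base64.b64decode(fixture_b64).decode("utf-8")
    return date.fromisoformat(to_ascii_digits(raw))


# Hypothetical fixtures: the date 2026-02-02 written with Thai and Khmer digits,
# stored as base64 so the source file stays pure ASCII.
THAI_FIXTURE = base64.b64encode(
    "\u0e52\u0e50\u0e52\u0e56-\u0e50\u0e52-\u0e50\u0e52".encode("utf-8")
).decode("ascii")
KHMER_FIXTURE = base64.b64encode(
    "\u17e2\u17e0\u17e2\u17e6-\u17e0\u17e2-\u17e0\u17e2".encode("utf-8")
).decode("ascii")
```

Keeping fixtures base64-encoded avoids editor and review-tool problems with non-ASCII literals, at the cost of needing a comment next to each fixture that states what it decodes to.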