[GLUTEN-10727][CH]Daily Update Clickhouse Version (20250916)#11299
[GLUTEN-10727][CH]Daily Update Clickhouse Version (20250916)#11299zzcclp merged 17 commits intoapache:mainfrom
Conversation
|
Run Gluten Clickhouse CI on x86 |
2 similar comments
|
Run Gluten Clickhouse CI on x86 |
|
Run Gluten Clickhouse CI on x86 |
cde7f4b to
1037c29
Compare
|
Run Gluten Clickhouse CI on x86 |
2 similar comments
|
Run Gluten Clickhouse CI on x86 |
|
Run Gluten Clickhouse CI on x86 |
bc88f07 to
0880777
Compare
|
Run Gluten Clickhouse CI on x86 |
1 similar comment
|
Run Gluten Clickhouse CI on x86 |
0de074c to
d34775b
Compare
|
Run Gluten Clickhouse CI on x86 |
d34775b to
f87dfbe
Compare
|
Run Gluten Clickhouse CI on x86 |
2 similar comments
|
Run Gluten Clickhouse CI on x86 |
|
Run Gluten Clickhouse CI on x86 |
98e2e0d to
738600f
Compare
|
Run Gluten Clickhouse CI on x86 |
738600f to
6785b07
Compare
|
Run Gluten Clickhouse CI on x86 |
|
Run Gluten Clickhouse CI on x86 |
2 similar comments
|
Run Gluten Clickhouse CI on x86 |
|
Run Gluten Clickhouse CI on x86 |
| .includeCH("Spark vectorized reader - with partition data column - select nested field from a complex map key using map_keys") | ||
| .includeCH("Non-vectorized reader - without partition data column - select nested field from a complex map key using map_keys") | ||
| .includeCH("Non-vectorized reader - with partition data column - select nested field from a complex map key using map_keys") | ||
| .exclude("Spark vectorized reader - without partition data column - select nested field from a complex map key using map_keys") |
There was a problem hiding this comment.
it throw the error below:
CAST AS Map from array requires nested tuple of 2 elements. Left type: Array(Tuple(value Nullable(String))), right type: Map(Tuple(middle Nullable(String)), Nullable(String)): while converting column `relations` from type Nullable(Array(Tuple(value Nullable(String)))) to type Nullable(Map(Tuple(middle Nullable(String)), Nullable(String)))
ignore first and fix in another pr ?
There was a problem hiding this comment.
FYI
Map key field must be primitive in parquet reader
Caused by: org.apache.gluten.exception.GlutenException: Map key field must be primitive: (in file/uri /data3//docker/local_gluten/spark-3.3.2-bin-hadoop3/spark-warehouse/tm1/part-00000-fe984e75-ebaa-4b8e-a98a-1e82309c3199-c000.snappy.parquet): While executing SubstraitFileSource
0. ./contrib/llvm-project/libcxx/include/__exception/exception.h:113: Poco::Exception::Exception(String const&, int) @ 0x000000001d963432
1. ./build/./src/Common/Exception.cpp:128: DB::Exception::Exception(DB::Exception::MessageMasked&&, int, bool) @ 0x00000000100ab1de
2. ./src/Common/Exception.h:123: DB::Exception::Exception(String&&, int, String, bool) @ 0x00000000083fd40e
3. ./src/Common/Exception.h:58: DB::Exception::Exception(PreformattedMessage&&, int) @ 0x00000000083fd000
4. ./src/Common/Exception.h:141: DB::Exception::Exception<>(int, FormatStringHelperImpl<>) @ 0x00000000083dd86b
5. ./build/./src/Processors/Formats/Impl/Parquet/ParquetColumnReaderFactory.cpp:997: DB::ColumnReaderBuilder::buildReader(std::shared_ptr<parquet::schema::Node>, std::shared_ptr<DB::IDataType const> const&, int, int, bool) @ 0x0000000018a5eb28
6. ./build/./src/Processors/Formats/Impl/Parquet/RowGroupChunkReader.cpp:73: DB::RowGroupChunkReader::RowGroupChunkReader(DB::ParquetReader*, unsigned long, std::shared_ptr<DB::RowGroupPrefetch>, std::shared_ptr<DB::RowGroupPrefetch>, std::unordered_map<String, std::shared_ptr<DB::ColumnFilter>, std::hash<String>, std::equal_to<String>, std::allocator<std::pair<String const, std::shared_ptr<DB::ColumnFilter>>>>) @ 0x0000000018a4c0fd
7. ./contrib/llvm-project/libcxx/include/__memory/unique_ptr.h:634: std::__unique_if<DB::RowGroupChunkReader>::__unique_single std::make_unique[abi:ne190107]<DB::RowGroupChunkReader, DB::ParquetReader*, unsigned long&, std::shared_ptr<DB::RowGroupPrefetch>, std::shared_ptr<DB::RowGroupPrefetch>, std::unordered_map<String, std::shared_ptr<DB::ColumnFilter>, std::hash<String>, std::equal_to<String>, std::allocator<std::pair<String const, std::shared_ptr<DB::ColumnFilter>>>>&>(DB::ParquetReader*&&, unsigned long&, std::shared_ptr<DB::RowGroupPrefetch>&&, std::shared_ptr<DB::RowGroupPrefetch>&&, std::unordered_map<String, std::shared_ptr<DB::ColumnFilter>, std::hash<String>, std::equal_to<String>, std::allocator<std::pair<String const, std::shared_ptr<DB::ColumnFilter>>>>&) @ 0x00000000189b2d0e
8. ./build/./src/Processors/Formats/Impl/Parquet/ParquetReader.cpp:115: DB::ParquetReader::getRowGroupChunkReader(unsigned long, std::shared_ptr<DB::RowGroupPrefetch>, std::shared_ptr<DB::RowGroupPrefetch>) @ 0x00000000189b2205
9. ./build/./src/Processors/Formats/Impl/Parquet/ParquetReader.cpp:137: std::unique_ptr<DB::RowGroupChunkReader, std::default_delete<DB::RowGroupChunkReader>> std::__function::__policy_invoker<std::unique_ptr<DB::RowGroupChunkReader, std::default_delete<DB::RowGroupChunkReader>> (unsigned long, std::shared_ptr<DB::RowGroupPrefetch>, std::shared_ptr<DB::RowGroupPrefetch>)>::__call_impl[abi:ne190107]<std::__function::__default_alloc_func<DB::ParquetReader::getSubRowGroupRangeReader(std::vector<int, std::allocator<int>>)::$_0, std::unique_ptr<DB::RowGroupChunkReader, std::default_delete<DB::RowGroupChunkReader>> (unsigned long, std::shared_ptr<DB::RowGroupPrefetch>, std::shared_ptr<DB::RowGroupPrefetch>)>>(std::__function::__policy_storage const*, unsigned long, std::shared_ptr<DB::RowGroupPrefetch>&&, std::shared_ptr<DB::RowGroupPrefetch>&&) @ 0x00000000189b2b33
10. ./contrib/llvm-project/libcxx/include/__functional/function.h:716: ? @ 0x00000000189b3128
11. ./build/./src/Processors/Formats/Impl/Parquet/ParquetReader.cpp:199: DB::SubRowGroupRangeReader::loadRowGroupChunkReaderIfNeeded() @ 0x00000000189b2aa8
But ORC supports
0e9021d to
e35853d
Compare
|
Run Gluten Clickhouse CI on x86 |
|
Run Gluten Clickhouse CI on x86 |
|
Run Gluten Clickhouse CI on x86 |
Related issue: #10727
Upgrade CH version to v25.8.3.66-lts