[GLUTEN][FLINK] Nexmark q0 performance #11508
-
I have reproduced this in my local environment, and it seems that PR #11365 causes the performance regression. @lgbo-ustc, could you take a look at this?
-
@ParyshevSergey Can you provide more details about the test settings? Some performance regression is expected with the new version of the code, potentially around 10%. Our current priority is improving functionality; performance optimization in this area will be considered later. However, we have not previously seen slowdowns as large as the one you report. We have re-run the stress tests, and the results are as follows (without […]):
[Results table: datagen source and kafka source, each measured with gluten and with flink; the values were not preserved]
-
Hello community, following the Flink docs, I built velox4j myself with Gluten and ran a local cluster to test the Nexmark q0 query. With the latest branch, the Nexmark datagen does not work for me (a pure virtual call issue appears in the logs), so I tried the Kafka connector with a local Kafka instead. In this case q0 finished after 49 min. I noticed that the conversion between RowData and RowVector is a heavy operation, so I removed the StreamRecordTimestampInserter code in CommonExecSink to avoid the conversion. After that, the job ran as a single chained operator (NativeKafka -> gluten_calc -> Writer), and the result was 8 min. Without Gluten, with the same config, the job finishes in 4 min. Why is the performance so terrible?
@shuai-xu @KevinyhZou What do you think?
flink config.yaml
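
For reference, the three timings reported above can be turned into slowdown factors relative to the run without Gluten. This is only a quick arithmetic sketch: it takes the durations in the post at face value (each looks like a single wall-clock run, not a repeated benchmark), and the dictionary labels and the `slowdown` helper are names made up for illustration.

```python
# Durations reported in the post, in minutes (taken at face value).
runs = {
    "gluten, kafka source, with RowData/RowVector conversion": 49,
    "gluten, kafka source, conversion removed (chained operator)": 8,
    "flink without gluten, same config": 4,
}

baseline = runs["flink without gluten, same config"]


def slowdown(minutes, baseline_minutes):
    """Return how many times slower a run is than the baseline run."""
    return minutes / baseline_minutes


for name, minutes in runs.items():
    print(f"{name}: {minutes} min, {slowdown(minutes, baseline):.2f}x baseline")

# Share of the original 49 min that disappeared once the conversion was removed.
saved = runs["gluten, kafka source, with RowData/RowVector conversion"] - runs[
    "gluten, kafka source, conversion removed (chained operator)"
]
print(f"time removed with the conversion: {saved} min")
```

On these numbers the first Gluten run is about 12x slower than the run without Gluten, and the run with the conversion removed is 2x slower, which is well beyond a regression of around 10%.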