This repository was archived by the owner on Feb 18, 2024. It is now read-only.
Faster take with null values (2-3x)#633
Merged
jorgecarleitao merged 3 commits intomainfrom Nov 25, 2021
Merged
Conversation
ae0f4c3 to
9d5a1c9
Compare
Collaborator
|
Intersting. I also had a similar result with splitting the bound check in two loops. Maybe that makes sense here as well. Als it is now checked in the validity and in the values. |
Codecov Report
@@ Coverage Diff @@
## main #633 +/- ##
==========================================
- Coverage 69.92% 69.91% -0.02%
==========================================
Files 300 300
Lines 16634 16627 -7
==========================================
- Hits 11632 11625 -7
Misses 5002 5002
Continue to review full report at Codecov.
|
9d5a1c9 to
865ad04
Compare
take with null values (2-3x)
Collaborator
|
Cool results 😎 |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to subscribe to this conversation on GitHub.
Already have an account?
Sign in.
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
This was achieved by splitting the "take" of values and validity as two separate loops. It seems to vectorize better.
This was a side observation of DataEngineeringLabs/simd-benches#1 which showed that
is compiled to the same instructions as using portable_simd's
gather, suggesting that the compiler is quite smart in compilingnaive_takeand we can thus leverage that.