Skip to content

Conversation

@galsalomon66
Copy link

@galsalomon66 galsalomon66 commented Sep 2, 2021

2 main flows
(1) non-aggregation flow: split the input object into N ranges, each range gets its own context.
(2) aggregation flow:(for aggregation queries) requires 2 phases,
Upon the completion of the first phase (running the query on a specific range)
The AST will be traversed for aggregation nodes, the result of each node is pushed into a dedicated scratch area
the second phase will use the result of the first phase.
The AST for the second phase will be refactored.

…on, result returns as set of values. missing memory managment for variable and value

Signed-off-by: gal salomon <gal.salomon@gmail.com>
2 main flows
(1) non-aggregation flow: split the input object into N ranges, each range gets its own context.
(2) aggregarion flow:(for aggregatio queries)  requires 2 phases,
Upon the completion of the first phase (running the query on a specific range)
The AST will be traversed for aggregation nodes, the result of each node is pushed into a dedicated scratch area

current commit is handling the CSV use-case/non-aggregation flow.
@galsalomon66
Copy link
Author

#103 is handling the parallel execution. it takes a different approach(the caller splits the input or defines many input streams as a single one).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants