Conversation

@pwnage101 (Member):

DENG-????

# Iterate over processing segments in chronological order, handling each one
# one at a time, and completely freeing the memory of the previous one before
# fetching the next.
for processing_segment in processing_segments:
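To make the code comment concrete, here is a minimal, hypothetical sketch of the pattern it describes: fetch one segment at a time and drop the only reference to it before the next fetch, so peak memory is bounded by a single segment. The names `fetch_segment` and `process_segments` are illustrative, not from the PR.

```python
import gc

def fetch_segment(segment_id):
    """Hypothetical fetch: the real loader would pull one processing
    segment's events from the upstream store."""
    return [{"segment": segment_id, "event": i} for i in range(3)]

def process_segments(segment_ids):
    """Handle segments strictly one at a time, freeing each batch
    before fetching the next."""
    handled = []
    for segment_id in segment_ids:
        events = fetch_segment(segment_id)
        handled.append((segment_id, len(events)))
        # Drop the only reference to this segment's events so the memory
        # can be reclaimed before the next (potentially large) fetch.
        del events
        gc.collect()  # optional: force collection between large batches
    return handled
```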
@pwnage101 (Member, author):
Each iteration of this loop could become a task, as long as the tasks can be made to run sequentially. That eliminates a big reason to use tasks, which in my mind is automatic concurrency and dependency resolution. However, it may still be useful to make these into tasks for better visibility into the current progress, and for the ability to mark a task run as "skipped" or "failed" if that's ever a possible scenario, or "retried" if we want retries to manifest as task retries instead of backoff retries hidden in the logs.
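The bookkeeping the comment asks for (per-segment "skipped"/"failed"/"retried" status, while still running strictly in order) can be sketched in plain Python, independent of any task framework. Everything here is illustrative: `run_sequentially`, `RunStatus`, and the retry policy are assumptions, not the PR's actual design.

```python
from enum import Enum

class RunStatus(Enum):
    SUCCESS = "success"
    FAILED = "failed"
    SKIPPED = "skipped"

def run_sequentially(segments, handle, max_retries=2):
    """Process segments strictly in chronological order, recording a
    status per segment.

    `handle` is a hypothetical callable that raises on failure. A segment
    that exhausts its retries is marked FAILED and processing stops, which
    preserves the ordering guarantee; untouched segments become SKIPPED.
    """
    statuses = {}
    for segment in segments:
        for attempt in range(max_retries + 1):
            try:
                handle(segment)
                statuses[segment] = RunStatus.SUCCESS
                break
            except Exception:
                if attempt == max_retries:
                    statuses[segment] = RunStatus.FAILED
        if statuses[segment] is RunStatus.FAILED:
            # Later segments depend on earlier ones landing first.
            break
    for segment in segments:
        statuses.setdefault(segment, RunStatus.SKIPPED)
    return statuses
```

In a task framework the same outcome would be visible in the UI instead of a returned dict, which is the visibility benefit the comment is weighing.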

Contributor:
But if we are going to mark tasks as "skipped" or "failed", then we are also not running them sequentially.

@brianhw (Contributor) left a comment:
Looks good! Great progress! Just a few nits, really, nothing major.

However, the worst-case scenario is that somehow a large proportion of the events in this processing segment
correspond to a single user, in which case the memory consumption would equal that of loading the entire processing
segment into memory (multiple gigabytes). We could address memory consumption by adding even more complexity (and
possibly bugs) to the code, but this is an exceedingly rare scenario that isn't worth optimizing for.
Contributor:
It might be fun to calculate a statistic about what the maximum number of events per user actually is. But the case where we're most likely to hit this is if we were to do a backfill job. That first time, we'll presumably have all events for a user over the last two years. After the first load, we expect to run in smaller increments, and should be fine.
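The statistic suggested above is cheap to compute if the events are streamed through a counter. This is a hypothetical sketch, assuming each event carries a `user_id` key; nothing here is from the PR's actual schema.

```python
from collections import Counter

def max_events_per_user(events):
    """Return (user_id, count) for the user with the most events, or
    (None, 0) if there are no events -- a quick way to gauge how skewed
    the per-user worst case actually is."""
    counts = Counter(event["user_id"] for event in events)
    if not counts:
        return None, 0
    # most_common(1) yields the single highest-count (user, count) pair.
    user, count = counts.most_common(1)[0]
    return user, count
```

Run once over a backfill window, this would tell you whether the single-user worst case is a real risk or, as expected, negligible for incremental loads.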

@pwnage101 pwnage101 force-pushed the pwnage101/amplitude_loader_logic branch from 0338b2b to 2f57f68 Compare November 18, 2021 19:43