MetaWorkflow Handler and MetaWorkflow Run Handler: Pipeline Automation by vstevensf · Pull Request #11 · dbmi-bgm/magma

vstevensf · 2022-09-29T19:21:17Z

No description provided.

drio18

Dropping some comments for consideration. Happy to review again whenever you'd like, especially once there are tests available for the dependency sorting/checking. I think I understand the gist of it all and it sounds right, but I didn't go in detail; I'll do so once the tests are in.

magma/topological_sort.py

magma/metawfl_handler.py

magma/topological_sort.py

magma/utils.py

test/test_topological_sort.py

test/test_utils_magma.py

Changes included additions of different directed graph global vars for testing -- tests for the topological sort on their way. Also removed function for creating "forward dependencies" -- this version of topological sort works with "backward dependencies".
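For readers unfamiliar with the "backward dependencies" form, the sketch below shows one way such a sort can look. It is not the code in this PR: it uses the standard library's graphlib.TopologicalSorter (Python 3.9+), which takes a mapping from each node to the nodes it depends on. The step names and the order_steps helper are hypothetical.

```python
from graphlib import CycleError, TopologicalSorter

# Hypothetical steps: each step lists the steps it depends on
# ("backward dependencies"), using name/dependencies keys for illustration.
steps = [
    {"name": "A", "dependencies": []},
    {"name": "B", "dependencies": ["A"]},
    {"name": "C", "dependencies": ["A"]},
    {"name": "D", "dependencies": ["B", "C"]},
]


def order_steps(steps):
    """Return step names ordered so every step follows its dependencies."""
    # TopologicalSorter expects {node: predecessors}, i.e. backward dependencies.
    graph = {step["name"]: set(step["dependencies"]) for step in steps}
    try:
        return list(TopologicalSorter(graph).static_order())
    except CycleError as error:
        # The second element of args holds the nodes forming the cycle.
        raise ValueError(f"Dependency cycle detected: {error.args[1]}") from error


ordered = order_steps(steps)
```

Because the input already maps each step to its prerequisites, no separate "forward dependencies" structure is needed to produce the order or to detect cycles.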

drio18

Leaving a batch of comments, some of which will require further discussion. It looks like there's still a good deal to complete, clean up, and test properly, so we should discuss a plan tomorrow for how best to utilize the remaining time.

drio18 · 2023-05-18T19:28:37Z

magma/magma_constants.py

+#!/usr/bin/env python3
+
+#################################################################
+#   Vars
+#################################################################


No need for the shebang or the headers here and elsewhere. I know Michele does this, but I find it unnecessary clutter.

drio18 · 2023-05-18T19:31:17Z

magma/magma_constants.py

+TITLE = "title"
+
+# MetaWorkflow Handler attributes
+PROJECT = "project"
+INSTITUTION = "institution"
+UUID = "uuid"
+META_WORKFLOWS = "meta_workflows"
+ORDERED_META_WORKFLOWS = "ordered_meta_workflows"
+META_WORKFLOW = "meta_workflow"
+NAME = "name"
+DEPENDENCIES = "dependencies"
+ITEMS_FOR_CREATION_PROP_TRACE = "items_for_creation_property_trace"
+ITEMS_FOR_CREATION_UUID = "items_for_creation_uuid"
+
+# MetaWorkflow Run Handler attributes
+STATUS = "status"
+FINAL_STATUS = "final_status"
+ASSOCIATED_META_WORKFLOW_HANDLER = "meta_workflow_handler"
+ASSOCIATED_ITEM = "associated_item"
+META_WORKFLOW_RUN = "meta_workflow_run"
+META_WORKFLOW_RUNS = "meta_workflow_runs"
+ITEMS_FOR_CREATION = "items_for_creation"
+ERROR = "error"
+# statuses
+PENDING = "pending"
+RUNNING = "running"
+COMPLETED = "completed"
+FAILED = "failed"
+STOPPED = "stopped"


It would be good to organize the constants into classes so they can be imported in groups. In general, importing * is not good practice, and having the constants on classes makes it clearer where they come from.

Also, I don't believe all of these constants should be in the magma directory, as some are portal specific, and those should be differentiated. In general, I'd be fine with all this code living in magma_ff for now, but that's something to discuss more with Michele.
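A minimal sketch of the grouping suggested above, assuming the class names HandlerKeys, RunHandlerKeys, and RunStatus (all hypothetical, as is the pending_step_names helper); only the constant values come from the diff.

```python
class HandlerKeys:
    """Keys on MetaWorkflow Handler items (class name is hypothetical)."""
    NAME = "name"
    META_WORKFLOWS = "meta_workflows"
    DEPENDENCIES = "dependencies"


class RunHandlerKeys:
    """Keys on MetaWorkflow Run Handler items (class name is hypothetical)."""
    STATUS = "status"
    META_WORKFLOW_RUNS = "meta_workflow_runs"


class RunStatus:
    """Status values for run handler steps (class name is hypothetical)."""
    PENDING = "pending"
    RUNNING = "running"
    COMPLETED = "completed"
    FAILED = "failed"
    STOPPED = "stopped"


def pending_step_names(steps):
    """Return the names of steps whose status is still pending."""
    return [
        step[HandlerKeys.NAME]
        for step in steps
        if step.get(RunHandlerKeys.STATUS) == RunStatus.PENDING
    ]


example_steps = [
    {"name": "step_1", "status": "completed"},
    {"name": "step_2", "status": "pending"},
]
pending = pending_step_names(example_steps)
```

A caller would then write something like `from magma_constants import RunStatus` rather than a star import, so each constant's origin is visible at the point of use.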

drio18 · 2023-05-18T19:32:48Z

magma/magma_constants.py

+#TODO: the following is here in case dup flag is added in the future
+# MWFR_TO_HANDLER_STEP_STATUS_DICT = {
+#     "pending": "pending",
+#     "running": "running",
+#     "completed": "completed",
+#     "failed": "failed",
+#     "inactive": "pending",
+#     "stopped": "stopped",
+#     "quality metric failed": "failed"
+# }


I think this is still needed regardless of the duplication flag stuff...

Either way, it is portal-specific and thus shouldn't live in magma, and there's no need to leave commented-out code around if it's no longer needed.
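If the mapping is kept, a sketch of how it might be used from a portal-specific module follows. The dictionary values are copied from the commented-out block; the function name and the error handling are assumptions.

```python
# Portal MetaWorkflow Run status -> handler step status, copied from the
# commented-out block in the diff.
MWFR_TO_HANDLER_STEP_STATUS = {
    "pending": "pending",
    "running": "running",
    "completed": "completed",
    "failed": "failed",
    "inactive": "pending",
    "stopped": "stopped",
    "quality metric failed": "failed",
}


def to_handler_step_status(mwfr_status):
    """Translate a portal run status into a handler step status (hypothetical helper)."""
    try:
        return MWFR_TO_HANDLER_STEP_STATUS[mwfr_status]
    except KeyError:
        raise ValueError(
            f"Unrecognized MetaWorkflow Run status: {mwfr_status!r}"
        ) from None
```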

drio18 · 2023-05-18T19:39:41Z

magma/validated_dictionary.py

+################################################
+#   ValidatedDictionary TODO: eventually make part of dcicutils?
+################################################
+class ValidatedDictionary(object):


Consider renaming this. It's not a dictionary, but rather a class, so ValidatedAttributes would probably be more apt.

In general, I would stay away from this style of arbitrarily placing all key-value pairs of a dictionary on the class itself. I know it's done elsewhere in this project, but it makes the origin of attributes harder to trace for someone new to the code, and it puts an unknown number of attributes on the class, any of which can be accidentally overwritten. As they say for Python, "explicit is better than implicit." To my eyes, a better alternative is to set an attribute such as self.properties in __init__, store the input dictionary there, and then validate it and set any other properties or attributes explicitly.
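A rough sketch of the alternative described above, under the assumption that "name" and "uuid" are the required keys (an illustrative choice, not taken from the PR):

```python
class ValidatedAttributes:
    """Hypothetical sketch: keep the input dictionary under one attribute."""

    REQUIRED_KEYS = ("name", "uuid")  # illustrative choice, not from the PR

    def __init__(self, properties):
        if not isinstance(properties, dict):
            raise TypeError("properties must be a dictionary")
        missing = [key for key in self.REQUIRED_KEYS if key not in properties]
        if missing:
            raise ValueError(f"Missing required keys: {missing}")
        # The raw input lives under one well-known attribute...
        self.properties = properties
        # ...and anything exposed directly is set explicitly.
        self.name = properties["name"]
        self.uuid = properties["uuid"]


handler = ValidatedAttributes(
    {"name": "example", "uuid": "1234", "title": "Example"}
)
```

Here "title" is reachable only through handler.properties, so unexpected keys in the input cannot shadow or overwrite anything defined on the class.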

drio18 · 2023-05-18T19:41:01Z

magma/validated_dictionary.py

+                if retrieved_attr is None:
+                    raise AttributeError("attribute %s cannot have value 'None'." % attribute)


Why can't an attribute be None? Would a falsy but non-None value be acceptable?
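To make the question concrete, the snippet below compares an `is None` check with a truthiness check on a few hypothetical values. Which of the two the validation should use depends on whether empty strings, empty lists, 0, or False are legitimate attribute values.

```python
# Hypothetical attribute values a validator might encounter.
candidates = [None, "", [], {}, 0, False, "uuid-1234"]

# An `is None` check rejects only None itself.
rejected_by_none_check = [value for value in candidates if value is None]

# A truthiness check also rejects empty containers, empty strings, 0, and False.
rejected_by_falsy_check = [value for value in candidates if not value]
```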

drio18 · 2023-05-18T21:08:14Z

magma_ff/checkstatus.py

+        for running_mwfr_step_name in self.mwfr_handler_obj.running_steps():
+
+            # Get run uuid
+            run_uuid = self.mwfr_handler_obj.get_step_attr(running_mwfr_step_name, uuid)


I don't believe uuid is defined here.
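A small illustration of the failure mode, assuming the intent was to pass the UUID constant from magma_constants. The get_step_attr function here is a stand-in with a made-up body, not the method in the PR.

```python
UUID = "uuid"  # same value as the constant in magma_constants.py

# Hypothetical step, for illustration only.
step = {"name": "step_1", "uuid": "1234"}


def get_step_attr(step, attribute_name):
    """Stand-in for the handler method; looks up one attribute on a step."""
    return step[attribute_name]


try:
    # The bare name `uuid` is never defined or imported here, so Python
    # raises NameError when this line runs, before the lookup happens.
    get_step_attr(step, uuid)  # noqa: F821
except NameError as error:
    message = str(error)

# Passing the string constant performs the intended lookup.
run_uuid = get_step_attr(step, UUID)
```

Note that the error only surfaces when the loop body executes, so a test that exercises at least one running step would be needed to catch it.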

drio18 · 2023-05-18T21:18:00Z

magma_ff/create_metawflrun_handler.py

+        # and convert property trace(s) to uuid(s)
+        else:
+            property_traces = getattr(meta_workflow_step, ITEMS_FOR_CREATION_PROP_TRACE, None)
+            if not isinstance(property_traces, list):


I believe this has to be a list, no?

drio18 · 2023-05-18T21:20:24Z

magma_ff/create_metawflrun_handler.py

+                item_uuid = make_embed_request(
+                    self.associated_item_identifier,
+                    item_prop_trace
+                    + ".uuid",  # TODO: are we assuming the user will include ".uuid" or @id as part of prop trace?
+                    self.auth_key,
+                    single_item=True,
+                )
+                if not item_uuid:
+                    raise MetaWorkflowRunHandlerCreationError(
+                        f"Invalid property trace '{item_prop_trace}' on item with the following ID: {self.associated_item_identifier}"
+                    )
+                items_for_creation_uuids.append(item_uuid)


I don't think this logic will work here, as you may receive multiple UUIDs, not just one, and you have to parse the result to obtain any UUIDs returned.
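One possible shape for the parsing step is sketched below. The exact return format of make_embed_request is not shown in this PR, so the sketch assumes the result may be None, a single string, or arbitrarily nested lists of strings; collect_uuids is a hypothetical helper.

```python
def collect_uuids(result):
    """Flatten an embed result into a flat list of UUID strings.

    Assumes the result is None, a string, or nested lists/tuples of strings;
    this shape is an assumption, not confirmed by the PR.
    """
    if result is None:
        return []
    if isinstance(result, str):
        return [result]
    if isinstance(result, (list, tuple)):
        uuids = []
        for entry in result:
            uuids.extend(collect_uuids(entry))
        return uuids
    raise TypeError(f"Unexpected value in embed result: {result!r}")
```

The caller could then raise MetaWorkflowRunHandlerCreationError when the returned list is empty and extend items_for_creation_uuids otherwise, instead of appending a single value.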

drio18 · 2023-05-18T21:23:46Z

magma_ff/create_metawflrun_handler.py

+    @property
+    def get_project(self):
+        """Retrieves project attribute from the associated item."""
+        return self.retrieved_associated_item.get(PROJECT)
+
+    @property
+    def get_institution(self):
+        """Retrieves institution attribute from the associated item."""
+        return self.retrieved_associated_item.get(INSTITUTION)


As a personal preference, consider using get in a method name for non-attribute/property methods, and using attribute-like names for properties (they live somewhere in-between, but I think of them more like attributes than methods).
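The naming convention suggested above might look like the following. AssociatedItem and get_field are hypothetical names used only to contrast a property with a regular method.

```python
class AssociatedItem:
    """Hypothetical wrapper around a retrieved portal item."""

    def __init__(self, item):
        self._item = item

    @property
    def project(self):
        """Project of the associated item (attribute-like name, no `get_`)."""
        return self._item.get("project")

    @property
    def institution(self):
        """Institution of the associated item."""
        return self._item.get("institution")

    def get_field(self, name, default=None):
        """Regular method: takes arguments, so a `get_` prefix reads naturally."""
        return self._item.get(name, default)


item = AssociatedItem({"project": "example-project", "institution": "example-institution"})
project = item.project              # accessed like an attribute, no call
title = item.get_field("title", "untitled")
```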

magma_ff/run_metawflrun_handler.py

vstevensf added 12 commits September 28, 2022 19:15

first draft of mwfh base magma code

15615cd

Further editing of baseline MWF handler, with added functions for cal…

165a612

…culated attributes

Merge branch 'master' into vs-mwfr-handler

ad1cbec

Baseline Magma FF MWF Handler -- will be modifying the use of copy

469804d

Creation of helper functions that may eventually be added to dcic utils.

6cccc8e

Some are specific to the structure of CGAP portal schemas, but tried to generalize as much as possible. Also includes partial draft of dependency validation via topological sort.

Drafts of pytests for baseline Magma MWF Handler and helper functions.

f62fabd

Remove extraneous files I use for local testing

4d50191

Added pytests for magma/utils.py

81c826d

Further edits to pytests of magma utils

32a576f

Finished topological sort, need to add docstrings and refactor helper…

fa02b5b

… fxns

Modified some tests, removed a few extraneous. Still need to finish t…

3e9348d

…ests for topological sort.

Refactored the utils functions for topological sort into its own file.

dcb737e

Resulted in edits within test files, including creation of a new test file for topological sort, and changes in imports for magma/metawfl_handler.py.

vstevensf requested a review from drio18 November 4, 2022 18:48

drio18 reviewed Nov 7, 2022

vstevensf and others added 16 commits November 9, 2022 18:56

Completed first draft of completed topological sort tests.

0ddf3d1

Addressed some prior comments. Future planned changes include creating a class for the topological sort function to decrease the number of command line arguments that are common among several functions.

Small changes to utils -- mainly variable naming.

c2231c2

Also addressed some prior comments on draft.

Merge branch 'master' into vs-mwfr-handler

5752c83

Small change to test file for topological sort

dae1229

Finished validation of MWF handler and its corresponding pytests.

8795286

Put the ValidatedDictionary class in its own file

ffefa6a

Finished ValidatedDictionary class

5fd8916

Changes included merging some former magma utils fxns into this class, and removing extraneous utils fxns. Also includes new test file for this class.

Refactored Topological Sort with TopologicalSorter from dcicutils.

2d16ab0

Also wrote a draft of pytests

Draft of MWF Handler, without creation of MWFR Handler

c6a0e7f

Further edits to basic handler classes

6cb41ac

includes addition of custom Exception classes

Merge branch 'master' into vs-mwfr-handler

993361d

Check in

6321fbf

Main changes to create mwfr handler function

f4a47ba

More updates to the mwfr handler creation

ae249bb

Almost final draft of create MWFR handler functionality

58add81

vstevensf added 10 commits May 4, 2023 23:40

Got rid of duplication flag, for now

e411ceb

Merge branch 'master' into vs-mwfr-handler

dcbbec0

Basic running of mwfr handler.

2db5bff

Draft of status checking and updates of run handler

c1d2a0b

Added docstrings to toposort files

3c54a4c

Added docstrings to MWF handler files and tests, and added to magma c…

111a0c5

…onstants. Got rid of duplication_flag.

docstrings for mwfr handler class and tests

ffdabb6

quasi updated handler creation docstrings

d5deff5

some edits to the create run handler pytests -- need to refactor

4908f1b

Finalized rough draft of pytests for create mwfr handler functionality

c684121

vstevensf marked this pull request as ready for review May 16, 2023 06:06

Merge branch 'master' into vs-mwfr-handler

503cd00

drio18 reviewed May 18, 2023

vstevensf added 4 commits May 31, 2023 20:37

Edited execute handler function and created draft of pytests, plus d…

93c1715

…ocstrings

renamed test file

82c8d55

modified FFMetaWfrUtils class and pytests

1d5e77b

Draft of checkstatus tests

4f17490
