i156: Adding the target database connection string for the insert stage logging info #175

QSparks · 2023-07-31T15:43:19Z

Added a function that will take a session object and return a string of the database connection with the password masked.

Example:
Target database: "postgresql+psycopg2://scott:tiger@localhost/test"
Returned string: "postgresql+psycopg2://scott:***@localhost/test"

resolves #156

…output" This reverts commit e6960b1.

jameshiebert

Looks like you've covered most of the bases here, Quintin. Great work! I think that your approach should mostly work, modulo a few comments that I've put inline. I also think that there are a couple other modules that you may need to cover for database logging: align.py and infer.py. If you refer to slides 5-11 in this presentation, it explains the stages of crmprtd. align and insert both require database interaction (as does infer, but it was written after the presentation). Check those out and see if there are places which could use connection string information as extras.

Additionally, I'd like you to look at the test suite and see if there are ways that a few of the tests can check an ensure that the connection string is being logged. Try to use "Test Driven Development": i.e. write your test first (and presumably it will fail), and then write the code that makes the test pass.

Overall, good job!

crmprtd/insert.py

crmprtd/process.py

…e, spelling

jameshiebert

A few comments to consider, but I think that we're pretty close :)

jameshiebert · 2023-08-08T21:59:38Z

tests/conftest.py

    )
+
+
+def records_contain_db_connection(test_session, caplog):


I can't tell you exactly why, but importing from conftest is not something that is done. Fixtures are defined in conftest.py, but not utility functions.

Maybe you could create this as a utility fixture as explained in this SO answer? I agree w/ the first commenter that it "feels a bit hacky", but there don't appear to be any obviously better options.

@rod-glover have you run across a requirement like this (sharing a test helper function across test modules) in your pytest trials?

A possible explanation of why it hasn't come up to date (to my knowledge) may be that we are encouraged to keep our unit tests concise and simple. And if your test conditions are so complicated that they require more logic in an external function, then maybe they need to be simplified.

There's an argument to be made for either approach. I think I'll be happy however you choose to proceed.

I have, and I have used three methods: the hacky one, a slightly clunky one (A), and another maybe less clunky one (B).

Clunky A is to define a fixture that returns the helper function. Then use the fixture as normal. Best to scope such fixtures broadly, i.e., session scope.

Clunky B is to treat the test directory (or any subdirectory of it) as a package, with an __init__.py. Put helper functions there, and import them using relative imports. For example

from . import a_helper_function from .. import another_helper_function

Alternatively, create a module in such a package and import from it.

from .helpers import yet_another

Right now B is my preferred setup.

UPDATE: Should have read the SO answer first. It is a variation of clunky A, which reads in my code more like:

def helper_function: def f(): #... return f

I don't bother with the Helpers class; that seems ... unweildy and unnecessary unless you are importing a ton of helper functions ... in which case you have to ask why so many.

A possible explanation of why it hasn't come up to date (to my knowledge) may be that we are encouraged to keep our unit tests concise and simple. And if your test conditions are so complicated that they require more logic in an external function, then maybe they need to be simplified.

In general, I like this prinicple. If you write simple functions/methods, and compose them in straightforward ways, then they are easier to understand, test and maintain. That said, some functions need somewhat complicated tests, and/or the same helper function is needed in several different places. So I treat this principle with a certain pragmatism.

Plus I do use helper functions fairly often. YMMV.

Thanks, great input!

jameshiebert · 2023-08-08T22:06:42Z

tests/conftest.py

+
+def records_contain_db_connection(test_session, caplog):
+    for record in caplog.records:
+        if "database" in record.__dict__:


I could be wrong here, but I believe that you can just use the idiom if "database" in record and do not have to specifically access the __dict__ attribute. See: https://docs.python.org/3/reference/expressions.html#membership-test-operations

One thing that you don't check here, that maybe you should is whether the record is found in the correct level of logging. E.g. you check that a log entry is found, but it's possible that it's at a higher level of logging then specified.

jameshiebert · 2023-08-08T22:10:06Z

tests/conftest.py

+    for record in caplog.records:
+        if "database" in record.__dict__:
+            logged_db = getattr(record, "database", {})
+            if logged_db == test_session.bind.url.render_as_string(hide_password=True):


You might be able to simplify this to:

return record.database == test_session.bind.url.render_as_string(hide_password=True)

QSparks added 3 commits July 20, 2023 10:38

Add the connection string as a standard included for all log output

e6960b1

Revert "Add the connection string as a standard included for all log …

d79f0ca

…output" This reverts commit e6960b1.

Retrieve the connection from the session object.

c5c1513

QSparks assigned jameshiebert Jul 31, 2023

QSparks requested a review from jameshiebert July 31, 2023 15:43

jameshiebert requested changes Aug 1, 2023

View reviewed changes

crmprtd/insert.py Outdated Show resolved Hide resolved

crmprtd/insert.py Outdated Show resolved Hide resolved

crmprtd/insert.py Outdated Show resolved Hide resolved

crmprtd/process.py Outdated Show resolved Hide resolved

QSparks added 5 commits August 1, 2023 15:23

Remove caching and unused imports

3b8db88

Add db to align.py and infer.py logs, revoked stripping of driver nam…

7f59864

…e, spelling

Fix ValueError args in infer.py

6ba171b

Add tests for DB logging

4393fdf

Remove redundancies in assertions

f94704b

QSparks requested a review from jameshiebert August 8, 2023 19:15

jameshiebert requested changes Aug 8, 2023

View reviewed changes

QSparks added 2 commits August 9, 2023 13:34

Move helper functions, simplify record_contain_db_connection

e5f20ad

Move log_helpers from __init__.py to seperate module

101ac57

QSparks requested a review from jameshiebert August 9, 2023 21:32

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

i156: Adding the target database connection string for the insert stage logging info #175

i156: Adding the target database connection string for the insert stage logging info #175

Uh oh!

QSparks commented Jul 31, 2023 •

edited

Loading

Uh oh!

jameshiebert left a comment •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

jameshiebert left a comment

Uh oh!

jameshiebert Aug 8, 2023

Uh oh!

rod-glover Aug 9, 2023 •

edited

Loading

Uh oh!

rod-glover Aug 9, 2023 •

edited

Loading

Uh oh!

jameshiebert Aug 9, 2023

Uh oh!

jameshiebert Aug 8, 2023 •

edited

Loading

Uh oh!

jameshiebert Aug 8, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

i156: Adding the target database connection string for the insert stage logging info #175

Are you sure you want to change the base?

i156: Adding the target database connection string for the insert stage logging info #175

Uh oh!

Conversation

QSparks commented Jul 31, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jameshiebert left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

jameshiebert left a comment

Choose a reason for hiding this comment

Uh oh!

jameshiebert Aug 8, 2023

Choose a reason for hiding this comment

Uh oh!

rod-glover Aug 9, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

rod-glover Aug 9, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jameshiebert Aug 9, 2023

Choose a reason for hiding this comment

Uh oh!

jameshiebert Aug 8, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jameshiebert Aug 8, 2023

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

QSparks commented Jul 31, 2023 •

edited

Loading

jameshiebert left a comment •

edited

Loading

rod-glover Aug 9, 2023 •

edited

Loading

rod-glover Aug 9, 2023 •

edited

Loading

jameshiebert Aug 8, 2023 •

edited

Loading