From fefdb5e951a8ced697d786faf5d80f6718537770 Mon Sep 17 00:00:00 2001
From: Aryton Tediarjo <35942857+arytontediarjo@users.noreply.github.com>
Date: Wed, 20 Sep 2023 20:19:39 +0700
Subject: [PATCH 1/2] add data quality section

---
 scripts/metadata/README.md | 8 ++++++++
 1 file changed, 8 insertions(+)

diff --git a/scripts/metadata/README.md b/scripts/metadata/README.md
index 17111cb..e0ae510 100644
--- a/scripts/metadata/README.md
+++ b/scripts/metadata/README.md
@@ -115,3 +115,11 @@ There is no "NTD status" for individuals or households.
 #### Comment on script functionality
 
 The `scripts/metadata/generate_metadata.R` script generates the metadata files necessary for the health economics suite of forms. Because the NTD "status" concept does not exist, and because the NTD form's metadata is not dynamic (it is generated once only), the NTD file is written only once. In the metadata generation script, it is written based on all eligible health economics individuals, filtered down only to those who are "pre-selected" for the NTD component of the project. This filtering + writing occurs _before_ filtering for "in" households (which is necessary for the "monthly" forms, since households which are "out" or "eos" should not show up in monthly forms).
+
+#### Data Quality Testing
+
+Series of data quality unit tests will be conducted on each metadata before being uploaded in ODK Central. Tests will be done via R package, more documentation on dataqualitybrewr github repo [here](https://github.com/databrew/dataqualitybrewr/tree/main)
+
+Once all data quality tests have passed, Data is stored for AWS for capturing all state changes from prior and current visits and ODK Central for drop-down functionality.
+
+Data quality tests will be managed and tracked here in this [Google Sheets](https://docs.google.com/spreadsheets/d/1OrSul3WqtsO_0JzAXShx8LDmLfNZEugVHvj2SsKNRu0/edit?usp=sharing).

From edef60a81daa9d3f7e57bbe9a0a3597860a0aca9 Mon Sep 17 00:00:00 2001
From: Aryton Tediarjo <35942857+arytontediarjo@users.noreply.github.com>
Date: Wed, 20 Sep 2023 20:24:29 +0700
Subject: [PATCH 2/2] update readme

---
 scripts/metadata/README.md | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/scripts/metadata/README.md b/scripts/metadata/README.md
index e0ae510..3102bca 100644
--- a/scripts/metadata/README.md
+++ b/scripts/metadata/README.md
@@ -118,8 +118,8 @@ The `scripts/metadata/generate_metadata.R` script generates the metadata files n
 
 #### Data Quality Testing
 
-Series of data quality unit tests will be conducted on each metadata before being uploaded in ODK Central. Tests will be done via R package, more documentation on dataqualitybrewr github repo [here](https://github.com/databrew/dataqualitybrewr/tree/main)
+Series of data quality unit tests will be conducted on each metadata before being uploaded to ODK Central. Tests will be done via `dataqualitybrewr`, which is an R package that we will continuously manage and add in more testing features based on logic and requirements. More documentation on the use-case [here](https://github.com/databrew/dataqualitybrewr/tree/main)
 
-Once all data quality tests have passed, Data is stored for AWS for capturing all state changes from prior and current visits and ODK Central for drop-down functionality.
+Once all data quality tests have passed, Data will be stored to AWS for capturing all state changes from prior and current visits and ODK Central for drop-down functionality.
 
 Data quality tests will be managed and tracked here in this [Google Sheets](https://docs.google.com/spreadsheets/d/1OrSul3WqtsO_0JzAXShx8LDmLfNZEugVHvj2SsKNRu0/edit?usp=sharing).