Skip to content

Rhino sdk validating dir dd#64

Open
ddavid37 wants to merge 7 commits intomainfrom
rhino-sdk_validating_dir_dd
Open

Rhino sdk validating dir dd#64
ddavid37 wants to merge 7 commits intomainfrom
rhino-sdk_validating_dir_dd

Conversation

@ddavid37
Copy link
Contributor

verified and improved eda.ipynb, cox.ipynb, aggregate_quantile_example.ipynb. Improved markdown cells and README.md

cox.ipynb - verified, improved, and added 2 sample synthetic datasets. Organized it in a new directory.
eda.ipynb - verified, improved
aggregate_quantile_example.ipynb - verified, improved

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can the inputs folder be called input or inputs and not input_quantile? That seems more natural and easier to use consistently across examples

"name": "stdout",
"output_type": "stream",
"text": [
"Requirement already satisfied: joblib in /Users/danieldavid/.pyenv/versions/3.11.6/lib/python3.11/site-packages (1.4.2)\n"
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please clear all outputs from the notebook

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Three comments:

  1. Can you please rename this file to something like results_analysis.ipynb to be more consistent (and correctly spelled)?
  2. What is the goal of this notebook? I see that it simply gets an input file that is checked into the directory and opens it - how is that performing any results analysis?
  3. If we're reorganizing this example - can we get rid of the QuantileRegression subdirectory (under the regression-quantile directory)? Seems redundant.


This folder contains examples for interacting with Rhino Health's Federated Computing Platform (FCP) using the Python SDK.

Each notebook demonstrates how to use the SDK to authenticate your user session, select a project, and perform a variety of federated analytics or compute tasks.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
Each notebook demonstrates how to use the SDK to authenticate your user session, select a project, and perform a variety of federated analytics or compute tasks.
Each notebook provides an example using the Rhino SDK to perform a different task e.g., importing datasets from external data sources, data preprocessing, federated analytics, or triggering model training.

Comment on lines +128 to +129
"FIRST_TEST_DATASET_UID = \"XXXXXXXX-XXXX-XXXX-XXXX-XXXXXXXXXXXX\" # Replace this with the ID of the first test dataset\n",
"SECOND_TEST_DATASET_UID = \"XXXXXXXX-XXXX-XXXX-XXXX-XXXXXXXXXXXX\" # Replace this with the ID of the second test dataset\n",
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
"FIRST_TEST_DATASET_UID = \"XXXXXXXX-XXXX-XXXX-XXXX-XXXXXXXXXXXX\" # Replace this with the ID of the first test dataset\n",
"SECOND_TEST_DATASET_UID = \"XXXXXXXX-XXXX-XXXX-XXXX-XXXXXXXXXXXX\" # Replace this with the ID of the second test dataset\n",
"FIRST_TEST_DATASET_UID = \"XXXXXXXX-XXXX-XXXX-XXXX-XXXXXXXXXXXX\" # Replace this with the UID of the first test dataset\n",
"SECOND_TEST_DATASET_UID = \"XXXXXXXX-XXXX-XXXX-XXXX-XXXXXXXXXXXX\" # Replace this with the UID of the second test dataset\n",

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

  1. Can you please put these files in an inputs subdirectory?
  2. Let's use consistent naming, e.g. metrics_sample_data1.csv (consistent with the naming convention you used in the cox example).

" ConnectionDetails,\n",
")\n",
"from rhino_health.lib.metrics import Count, FilterType, Mean, StandardDeviation"
"from rhino_health.lib.constants import ApiEnvironment\n",
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Users shouldn't actually need to use ApiEnvironment

"id": "3b107de9",
"metadata": {},
"outputs": [],
"outputs": [
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please clear outputs

"print(\"Logging In\")\n",
"my_username = \"my_email@example.com\" # Replace this with the email you use to log into Rhino Health\n",
"session = rh.login(username=my_username, password=getpass())\n",
"my_username = \"daniel.david@rhinohealth.com\" # Replace this with the email you use to log into Rhino Health\n",
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

You shouldn't use your email in public examples - please keep this as my_email@example.org.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants