 | raw_labels.dvc # Reference to a version of raw_labels/
-| processed_labels.dvc # Reference to a version of processed_labels/
-│ compressed_features.tar.gz.dvc # Reference to a version of features/
+| datasets.dvc # Reference to a version of datasets/
 │ models.dvc # Reference to a version of models/

 ```

 This project contains all the code necessary for: Adding data ➞ Training a model ➞ Creating a map.

+**Important:** When code is pushed to the repository a Github action will be run to verify project configuration, data integrity, and script functionality. This action will pull data using dvc and thereby needs access to remote storage (your Google Drive). To allow the Github action to access the data add a new repository secret ([instructions](https://docs.github.com/en/actions/security-guides/encrypted-secrets#creating-encrypted-secrets-for-a-repository)).
+- In step 5 of the instructions, name the secret: `GDRIVE_CREDENTIALS_DATA`
+- In step 6, enter the value in .dvc/tmp/gdrive-user-creditnals.json (in your repository)
+
+After this the Github action should successfully run.
+

 ## Adding data [![cb]](https://colab.research.google.com/github/nasaharvest/openmapflow/blob/main/openmapflow/notebooks/new_data.ipynb)

@@ -134,25 +137,20 @@ datasets = [
 ...
 ]
 ```
-Run feature creation:
+Run dataset creation:
 ```bash
 earthengine authenticate # For getting new earth observation data
 gcloud auth login # For getting cached earth observation data

-openmapflow create-features# Initiatiates or checks progress of features creation
+openmapflow create-dataset# Initiatiates or checks progress of dataset creation
 openmapflow datasets # Shows the status of datasets

 dvc commit && dvc push # Push new data to data version control

 git add .
-git commit -m'Created new features'
+git commit -m'Created new dataset'
 git push
 ```
-**Important:** When new data is pushed to the repository a Github action will be run to verify data integrity. This action will pull data using dvc and thereby needs access to remote storage (your Google Drive). To allow the Github action to access the data add a new repository secret ([instructions](https://docs.github.com/en/actions/security-guides/encrypted-secrets#creating-encrypted-secrets-for-a-repository)).
-- In step 5 of the instructions, name the secret: `GDRIVE_CREDENTIALS_DATA`
-- In step 6, enter the value in .dvc/tmp/gdrive-user-creditnals.json (in your repository)
-
-After this the Github action should successfully run if the data is valid.


 ## Training a model [![cb]](https://colab.research.google.com/github/nasaharvest/openmapflow/blob/main/openmapflow/notebooks/train.ipynb)
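
The repository-secret step described in the **Important** note of this change can also be done from a terminal instead of the web UI. The following is a minimal sketch, not part of the project: it assumes the GitHub CLI (`gh`) is installed and authenticated for the repository, and the helper names `creds_file_ok` and `upload_secret` are hypothetical. The file path is copied as written in the README; the credentials file in your `.dvc/tmp/` directory may be spelled differently (for example `gdrive-user-credentials.json`), so check it and override `CREDS_FILE` if needed.

```bash
#!/usr/bin/env bash
# Sketch only: store DVC's cached Google Drive credentials as a repository secret.
# Assumes the GitHub CLI (gh) is installed and authenticated for this repository.

SECRET_NAME="GDRIVE_CREDENTIALS_DATA"  # secret name given in the README
# File path as written in the README; set CREDS_FILE to override it.
CREDS_FILE="${CREDS_FILE:-.dvc/tmp/gdrive-user-creditnals.json}"

# Hypothetical helper: succeeds only if the file exists and is not empty.
creds_file_ok() {
  [ -s "$1" ]
}

# Hypothetical helper: passes the file on stdin so its contents never
# appear on the command line or in shell history.
upload_secret() {
  if ! creds_file_ok "$CREDS_FILE"; then
    echo "Credentials file not found or empty: $CREDS_FILE" >&2
    return 1
  fi
  gh secret set "$SECRET_NAME" < "$CREDS_FILE"
}

# Upload only when explicitly requested, so sourcing this file has no side effects.
if [ "${1:-}" = "--upload" ]; then
  upload_secret
fi
```

Saved under a name of your choice and run from the repository root with `--upload`, this would create or overwrite the secret; without the flag it only defines the helpers.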