This repository contains code sprints for Water Digitalization, a methods course for DigitaL Waters PhD pilot students.
Introduction to data science methods for environmental analysis. Topics covered include reproducible scientific computing (bash scripting for data management, git for version control, GitHub for code collaboration and distribution); open geospatial data sources; common structures of environmental data; space/time applications of supervised machine learning (in R and Python); reproducible computational pipelines; repository design and publication; and best practices for high-performance computing using a JupyterHub virtual research environment.
To use this repository:
- Log onto GitHub.com using your registered email and password
- Click “Fork” in the upper right corner.
- Use git clone to copy it into the DIWA DataLab
- Identify the code sprint assigned for the day
- Work through the code to the best of your ability.
- To submit, push any modifications to the assignment to your forked repository (git add, git commit, git push)
New assignments and solution keys will be added weekly, so we'll be working through upstream pulls and merges in class.
Did these resources help your research? Spread the word! Preferred citation: Carter, E., Hultquist, C., & Wen, T. (2023). GRRIEn analysis: a data science cheat sheet for earth scientists learning from global earth observations. Artificial Intelligence for the Earth Systems, 2(2), 220065.[https://journals.ametsoc.org/downloadpdf/view/journals/aies/2/2/AIES-D-22-0065.1.pdf]