NOTE: I happen to be using some data relevant to ocean science, otherwise the "Ocean Science" bit is neither here nor there.
Material mostly in the notebooks. At least three known ways of running these:
-
On your computer, by downloading the pack (there should be a green "code" button near the top right of this page, click it and there should be "download")
-
On your computer,
git clonethis repository. Then you can dogit pullyou get updates (if any). You may or may not want to fork the repository (need a GitHub account), then you can commit changes too. -
In Google colab, but you will need to pull copies (code and data) to your own Google drive, otherwise changes are not saved.
Clicking the icon below will open a temporary Colab instance, where changes are by default not saved (and you will need to pull the data files with !wget commands that are currently commented out in the notebooks). There should be a "copy to drive button" near "file" near the top left for copying to drive.
Core topics are
- some basics of Python, virtual environments, variable types, indexing, plotting
- (local/remote) reading and manipulating data in python, basics of pandas
- measure of mismatch, linear regression
- multi-linear regression, A/BIC, principal component analysis
- basics of probability, histograms, pdfs, statistical tests
- more statistical tests, other topics in probability (Shannon / diversity index)
- time series data, filtering, power spectrum
- missing data in time, interpolation/extrapolation
- multi-dimension array data, NetCDF data, xarray, basic plotting, subsetting
- maps, interpolation/extrapolation, empiricial orthogonal functions
Extra topics are mostly machine learning oriented, or exploring data repositories (e.g. satellite data from Copernicus)
- Ryan Abernathey's Earth and Environmental Data Science course
- Python for Environmental Science course
- Data Analysis course from Brian Powell
- Methods of Oceanographic data analysis held on Ethan Campbell's GitHub
- Jonathan Lilly's time series course
- all notebooks to run on Colab with modulo additiongs of
!pip+ will remote load data appropriately
- yaml and environment files added by Jonathan Lee
- default is to load data remotely; to test on Colab (the worst one would be
10and that seems to run ok) - added rendered notebooks with intended outputs (in folder
rendered)