emit-sds/cover-class


Package

Installation
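
The repository publishes no packaged releases, so a local editable install from source is the typical route. A minimal sketch, assuming a standard pyproject.toml or setup.py at the repository root (the clone URL simply follows the repository slug above):

git clone https://github.com/emit-sds/cover-class.git
cd cover-class
pip install -e .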

Example Usage

Train Time

from cover_class.train import setup_training_from_config

batch_size = 32  # illustrative value

# Builds the training dataloader and a held-out test split from a YAML config.
dataloader, X_test, Y_test = setup_training_from_config(
    '/my/path/config.yaml',
    batch_size,
    shuffle=True,
)

To retrieve the Dirichlet fractions, check dataloader.dataset.batch_dirichlet_fraction_store after each iteration (this assumes simulation.return_fractions was set to true in dataloader.yml).
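
A minimal sketch of reading the store inside the training loop (the training-step body is elided; the attribute name is the one given above):

for X, Y in dataloader:
    ...  # training step
    # Fractions for the batch just drawn, repopulated by the dataset each iteration.
    fractions = dataloader.dataset.batch_dirichlet_fraction_store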

To generate a simulated test set:

from cover_class.train import make_simulation_test_set

# real_test_data / real_test_labels are the X_test / Y_test returned above
simulated_test_data, simulated_test_labels, simulated_test_fractions = make_simulation_test_set(
    dataloader,
    real_test_data,
    real_test_labels,
    simulated_test_set_n_rows=10_000,  # illustrative row count
    seed=42,
)

Static Dataset Processing

This demonstrates how to create a standardized set of HDF5 datasets for training. The data matrix is read from a supported CSV format, either from a virtual file system (VFS) or downloaded over the network.

from cover_class.static.retrieval import generate_hdf5_from_config

# Reads the CSV source described in the config and writes standardized HDF5 datasets.
generate_hdf5_from_config('/path/to/my/config.yml')
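
A quick way to sanity-check the result, assuming h5py is installed; the output path below is a stand-in, since the actual location is whatever the config specifies:

import h5py

# List the top-level groups/datasets the retrieval step wrote.
with h5py.File('/path/to/output.h5', 'r') as f:
    print(list(f.keys()))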

Reporting

After training, it is useful to standardize the training and evaluation metadata and metrics. To that end, this cover-class repository offers a function that automatically generates a PDF report for comparison against other models. It takes in:

  • Model:
    • a model configuration with the model itself
    • a dictionary of hyperparameter configs
  • Training:
    • a dictionary to turn into a JSON
    • a dictionary to turn into plots
    • a list of self-made matplotlib figures to include
  • Evaluation:
    • a dictionary to turn into a JSON
    • a dictionary to turn into plots
    • a list of self-made matplotlib figures to include
  • A number of other parameters to properly generate the report

Important

Instantiate the Report object before the training loop: it runs a number of checks at construction time, so failures surface early rather than partway through training.

Important

This report relies on a .netrc file at the cover-class/src/cover_class/reporting/assets/.netrc path; there is currently no good way to avoid this. When a Report object is instantiated, the control logic tries to download the qualitative assessment files, so an error surfaces early if this is not possible. The download only occurs when the files are not already present in cover-class/src/cover_class/reporting/qualitative. Steps:

  1. Set up an account at https://urs.earthdata.nasa.gov
  2. Copy your username and password to replace example@email.com and mySuperSecurePassword123 in cover-class/src/cover_class/reporting/assets/.netrc.
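
For reference, the file follows the standard .netrc format; the machine entry below assumes the Earthdata login host:

machine urs.earthdata.nasa.gov
login example@email.com
password mySuperSecurePassword123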

Example of generating a report:

# the dataloader and train/test sets have been generated already
model = ...
dataloader = ...
test_data, test_labels = ...

import torch

from cover_class.reporting import ModelConfig, Report

report = Report(
    outdir=".",
    config="/path/to/dataloader.yml",
    author="Shun-Ichi Amari",
    model_config=ModelConfig(
        model=GPT6(n_channels, classes),  # your model class and its arguments
        model_name="GPT-6",
        hyperparams={
            "learning_rate": 0.001,
            "batch_size": 32,
            "optimizer": "Muon",
        },
    ),
    Y_test=test_labels,
    random_seed=42,
    notes="Welcome to my report!",
)
    
# Now do the training
for X, Y in dataloader:
    model.train()

    # You can also add logs to the report
    report.train_metric_table.update(model.generate_some_step_metric())

# Or add any figures
report.train_figures.append(
    make_some_really_important_figure_I_want_to_show_from_my_training_logs(model, train_data)
)

# Then add any testing figures or metrics to the report as well
report.test_figures.append(
    some_cool_test_figure_generator_from_my_data(model, test_data)
)
report.test_metric_table.update(
    {'important test metric': generate_test_metric(model, test_data)}
)

# Finally, generate the report
with torch.no_grad():
    y_hat = torch.sigmoid(model(test_data))
    thresholds = ...
report.make_report(y_hat, thresholds)
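
The thresholds placeholder is left to the user. One simple choice, purely an assumption here rather than anything make_report requires, is a fixed 0.5 cutoff per class:

import torch

# One threshold per output class; 0.5 is a common default for sigmoid outputs.
thresholds = torch.full((y_hat.shape[1],), 0.5)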

Inference Over a Scene From the Earthdata Archive

Important

Downloading scenes requires the same .netrc setup described in the Reporting section above; scene downloads will fail early if the Earthdata credentials are not in place.

from cover_class.utils import read_config
from cover_class.reporting import download_scenes, inference_over_scene

fid = "EMIT_L2A_RFL_001_20250404T194611_2509413_006"
scene = f"https://data.lpdaac.earthdatacloud.nasa.gov/lp-prod-protected/EMITL2ARFL.001/{fid}/{fid}.nc"

# NOTE: all scenes will be output to `src/cover_class/reporting/qualitative`
download_scenes([scene])

# model is the trained model from the sections above; the config supplies
# the wavelengths of the bands to drop before inference
config = read_config('/path/to/config/dataloader.yml')
output = inference_over_scene(
    f"src/cover_class/reporting/qualitative/{fid}.nc",
    model,
    config['drop-bands-wavelengths'],
)
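
The structure of output isn't documented here; assuming it is a 2-D array of per-pixel class predictions (an assumption, not something the source states), a quick qualitative look might be:

import matplotlib.pyplot as plt

# Render the per-pixel predictions as an image.
plt.imshow(output)
plt.colorbar()
plt.savefig('scene_predictions.png', dpi=150)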
