Task: Support content-identified datasets #203

## Summary

Introduce support for globally unique dataset identification via a content-derived ID.

- Define a deterministic content-based ID for datasets (e.g., hash of dataset manifest or slice list).
- Implement a way to resolve dataset names into these IDs.
- Ensure the training and scheduling flow can work entirely off this content ID to enable reproducibility and deduplication.
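
A minimal sketch of the first two points, offered as an illustration rather than a proposed design. It assumes that each slice already has a content ID string, that slice order is not semantically meaningful, and that SHA-256 over a canonical JSON encoding is an acceptable hash. The names `dataset_content_id` and `DatasetRegistry` are hypothetical and do not refer to existing code.

```python
import hashlib
import json


def dataset_content_id(slice_ids):
    """Derive a dataset ID from the content IDs of its slices.

    Hypothetical scheme: SHA-256 over a canonical JSON list of slice IDs.
    """
    # Sort so the ID does not depend on the order slices were listed in.
    # Assumption: slice order carries no meaning for the dataset.
    canonical = json.dumps(sorted(slice_ids), separators=(",", ":"))
    digest = hashlib.sha256(canonical.encode("utf-8")).hexdigest()
    return "sha256:" + digest


class DatasetRegistry:
    """Hypothetical in-memory mapping from mutable names to content IDs."""

    def __init__(self):
        self._names = {}

    def register(self, name, slice_ids):
        # Re-registering a name points it at the new content ID;
        # the old ID still identifies the old content.
        dataset_id = dataset_content_id(slice_ids)
        self._names[name] = dataset_id
        return dataset_id

    def resolve(self, name):
        # Raises KeyError for unknown names.
        return self._names[name]


registry = DatasetRegistry()
first_id = registry.register("train-v1", ["slice-b", "slice-a"])
same_id = registry.register("train-copy", ["slice-a", "slice-b"])
other_id = registry.register("train-v2", ["slice-a", "slice-c"])
```

Under these assumptions, `train-v1` and `train-copy` resolve to the same ID because they contain the same slices, which is what would let training and scheduling deduplicate work by ID rather than by name.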

## Background

Right now, dataset names serve as identifiers, but they are not content-stable or unique. With content-addressed slices now in place, we should extend this to the dataset level for stronger integrity, reproducibility, and cacheability guarantees.
