Skip to content

Scale up to Large Datasets #13

@AyanSinhaMahapatra

Description

@AyanSinhaMahapatra

Research in the Pipeline

  1. IO speed Analysis
  2. Multi-Threading Functionality
  3. Computation Bottlenecks
  4. Memory/Processing Profiling
  5. System Specific Changes in parameters
  6. Chunking data and Batching Tasks

So that the pipeline can be effectively and efficiently scaled to very large datasets, to perform the analysis on the whole Clearly Defined Dataset.

Metadata

Metadata

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions