Asteroid on non-randomly missing data

Hi Benoit,

Thanks for Asteroid, looks a very promising tool!
This is not an issue on the program, but more a question.
From what I understand of the paper, Asteroid performs well with high proportion of data that is missing because of a stochastic process of data deletion (in the case of simulated datasets) or data absence (in the case of empirical datasets).

Do you have any idea of the performance of Asteroid in case data is non-randomly missing?
For example, in case where a dataset combines a few species represented by a lot of genes (e.g. phylogenomic dataset) with a lot of species represented by a few genes (e.g. sanger sequencing/barcode data) (see e.g. https://doi.org/10.1093/molbev/msad109).

Did you tried to simulate missing data in a non random manner?

I'm curious to know whether Asteroid would perform similarly well with high levels of non-random missing data.

Thanks,
Léo-Paul

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Asteroid on non-randomly missing data #5

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Asteroid on non-randomly missing data #5

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions