Skip to content

Fast distance calculation#40

Draft
ggabernet wants to merge 8 commits intomasterfrom
rdb9_fastDist
Draft

Fast distance calculation#40
ggabernet wants to merge 8 commits intomasterfrom
rdb9_fastDist

Conversation

@ggabernet
Copy link
Contributor

No description provided.

db_cloned <- foreach::foreach(gp=1:n_groups,
.final=dplyr::bind_rows,
.inorder=TRUE,
.packages="scoper",
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Not entirely sure if this line is needed or not

std::string s = as<std::string>(seqs[i]);
for (int p = 0; p < L; ++p) {
uint8_t c = code_char(s[p]);
if (c == 255) stop("Only A,C,G,T,N,? are allowed");
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
if (c == 255) stop("Only A,C,G,T,N,? are allowed");
if (c == 255) stop("Only A,C,G,T,N,? are allowed");

Can we print here the sequence with the "bad characters" and say it contains not allowed. Consider removing sequences containing not allowed characters or running hierarchicalClones with IUPAC=True (link to documentation).

Consider how long does it take to check how many sequences have non-supported characters, and inform the users on the number of sequences containing non-supported characters already at the beginning of hierarchicalClones.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants