Skip to content

Conversation

@hdabaghyan
Copy link

Issue: #171

@Michael-E-Rose
Copy link
Collaborator

Thanks a million for this!

May there be any side effects of this change? Like, what is a table outside the body? Is that common or is that rather a bug in PubMed?

@hdabaghyan
Copy link
Author

hdabaghyan commented Oct 1, 2025

May there be any side effects of this change? Like, what is a table outside the body? Is that common or is that rather a bug in PubMed?

Here is the structure of the nxml file, where I encountered the issue.
image

And documentation here suggests that article could be divided to different blocks. One of which is <float-groups>, which can contain tables.

I am not very familiar with this format, but I guess it is a valid use case.

Side effect of this change could be, that parser will collect a table that is not in an expected tag. Alternative fix could be to find all the tags that are supposed to contain tables, and write xpath for all such cases.
But I would rather collect any table information, regardless where it is.(though, this is my perspective)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants