Tables are ignored when they are not inside a <Body> tag. #171

@hdabaghyan

Describe the bug
When I try to convert the nxml of this document, the result doesn’t contain any tables because the table-wrap tag is not inside the <body> tag.

To Reproduce

wget https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_package/b8/4a/PMC4159224.tar.gz
tar -xzvf PMC4159224.tar.gz 
import pubmed_parser
pubmed_parser.parse_pubmed_table("<PATH-TO-FOLDER>/PMC4159224/dddt-8-1195.nxml")

The result is None, even though the nxml contains tables.

Expected behavior
I expect table data to be converted even if the table is not inside the <body> tag.
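
To illustrate the difference, here is a minimal sketch using only the standard library. It is not pubmed_parser's implementation; the sample XML, the `find_table_wraps` helper, and the assumption that the table sits in a `<floats-group>` element are all hypothetical and only meant to show how a body-scoped search can miss a table that a document-wide search finds.

```python
import xml.etree.ElementTree as ET

# Hypothetical, heavily simplified JATS-like article: the table lives in
# <floats-group>, outside <body> (an assumption made for illustration).
SAMPLE_NXML = """<article>
  <body><sec><p>See Table 1.</p></sec></body>
  <floats-group>
    <table-wrap id="T1">
      <label>Table 1</label>
      <table><tr><td>a</td><td>b</td></tr></table>
    </table-wrap>
  </floats-group>
</article>"""


def find_table_wraps(root, body_only):
    """Return table-wrap elements, optionally restricted to those under <body>."""
    path = ".//body//table-wrap" if body_only else ".//table-wrap"
    return root.findall(path)


root = ET.fromstring(SAMPLE_NXML)

# Body-scoped search: misses the table in <floats-group>.
inside_body = find_table_wraps(root, body_only=True)

# Document-wide search: finds the table wherever it is placed.
anywhere = find_table_wraps(root, body_only=False)

print(len(inside_body), len(anywhere))
```

Searching the whole tree would pick up tables regardless of where the publisher placed them, at the cost of no longer being able to assume a surrounding section.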

Dependencies

  • macOS, Python 3.10, pubmed_parser==0.5.1

Proposed change: #172
