PDF ingest does not work properly with non-scholarly documents #79

Description

@gjreda

From @sehyod

I did some testing this morning, and there is indeed an issue with one of the PDFs in my directory. I'm attaching the PDF to this message. It is not a scholarly PDF, just a plain-text PDF I used while working on the PDF viewer.

Since it is not a scholarly PDF, I suspect that GROBID is unable to parse it, and that the downstream processing in ingest.py does not handle that failure properly.
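
If that suspicion is right, one possible mitigation is to make the metadata extraction tolerant of missing fields. The following is only a sketch, assuming ingest.py works from GROBID's TEI XML output; `extract_metadata` is a hypothetical helper and not the project's actual code. It returns `None` for fields that are absent or empty instead of raising.

```python
import xml.etree.ElementTree as ET

# GROBID emits TEI XML, which uses this namespace.
TEI_NS = {"tei": "http://www.tei-c.org/ns/1.0"}


def extract_metadata(tei_xml: str) -> dict:
    """Hypothetical helper: read title and abstract, tolerating gaps."""
    try:
        root = ET.fromstring(tei_xml)
    except ET.ParseError:
        # Empty or malformed output: report the failure instead of crashing.
        return {"title": None, "abstract": None, "parsed": False}

    title_el = root.find(".//tei:titleStmt/tei:title", TEI_NS)
    abstract_el = root.find(".//tei:profileDesc/tei:abstract", TEI_NS)

    # Missing elements and empty elements are both treated as "no value".
    title = "".join(title_el.itertext()).strip() if title_el is not None else ""
    abstract = (
        "".join(abstract_el.itertext()).strip() if abstract_el is not None else ""
    )
    return {"title": title or None, "abstract": abstract or None, "parsed": True}
```

Callers could then fall back to the file name, or skip the document, whenever `title` is `None` or `parsed` is `False`.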

Labels: bug