Processing an academic paper PDF through scipdf.parse_pdf_to_dict(pdf_path) returns values for the 'author' key, the list of all the names that appear in the paper, including not only the actual authors but also all the names that appear in contents and references.
Actual:
{
"title": title,
"authors": "William T Shaw; M Abramowitz; I A Stegun; R A Bagnold; O E Barndorff-Nielsen; E Eberlein; E ; U Keller; K Fergusson; E Platen; Warren Gilchrist; G W Hill; A W Davis; D B Madan; E Seneta; Wikipedia; On; W T Shaw; W T Shaw; I R C Buckley; G Steinbrecher; W T Shaw; Quantile Mechanics; Y Xiong",
"pub_date": "2009-02-27",
... }
Correct:
{
"title": title,
"authors": "William T Shaw",
"pub_date": "2009-02-27",
... }