Merge dataframe with second data (authors, institutions, countries) #3

paulsedille · 2020-11-24T02:42:34Z

Is your feature request related to a problem? Please describe.
The PDF formatting makes it difficult to scrap the authors and their institutions from the XML. Fortunately, there is another repository of the articles that makes this easier, and even more fortunately, someone has already done the hard work of scraping it with python, as well as adding for many institutions their country of affiliation, here: https://github.com/nd7141/icml2020

Describe the solution you'd like
Can the authors+institutions+countries data scraped by the above github user be collated into our dataframe, and output in a single csv file?

Describe alternatives you've considered
Will need to look into this!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Merge dataframe with second data (authors, institutions, countries) #3

Merge dataframe with second data (authors, institutions, countries) #3

paulsedille commented Nov 24, 2020

Merge dataframe with second data (authors, institutions, countries) #3

Merge dataframe with second data (authors, institutions, countries) #3

Comments

paulsedille commented Nov 24, 2020