Paper directory page #212

Open
stschiff opened this issue Sep 30, 2024 · 6 comments

Comments

@stschiff
Member

I would like to initiate a paper directory page. The idea is to show papers (with title, publication date, first author and journal) along with a status indicating whether each paper is present in the Poseidon Community Archive, the Poseidon AADR Archive or the Poseidon Minotaur Archive.

I am thinking this should be a static webpage: a script reads a list of DOIs and automatically generates an HTML page with a table of all papers, sorted by date, with little check marks or crosses indicating whether each paper is available in the three archives.
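
To make the table idea concrete, here is a minimal sketch of the rendering step using only the Python standard library. The `Paper` class, the `ARCHIVES` labels and the newest-first sort order are assumptions for illustration, not a settled design.

```python
from dataclasses import dataclass
from datetime import date
from html import escape

# Hypothetical column labels for the three archives named above.
ARCHIVES = ("Community", "AADR", "Minotaur")


@dataclass
class Paper:
    title: str
    first_author: str
    journal: str
    published: date
    archives: frozenset  # labels from ARCHIVES in which the paper was found


def render_table(papers):
    """Return an HTML table of papers, newest first, with a mark per archive."""
    headers = ("Date", "Title", "First author", "Journal", *ARCHIVES)
    head = "".join(f"<th>{escape(h)}</th>" for h in headers)
    rows = []
    # Assumption: "sorted by date" means newest first.
    for p in sorted(papers, key=lambda p: p.published, reverse=True):
        cells = [p.published.isoformat(), p.title, p.first_author, p.journal]
        # \u2713 is a check mark, \u2717 is a cross.
        cells += ["\u2713" if a in p.archives else "\u2717" for a in ARCHIVES]
        row = "".join(f"<td>{escape(c)}</td>" for c in cells)
        rows.append(f"<tr>{row}</tr>")
    return f"<table>\n<tr>{head}</tr>\n" + "\n".join(rows) + "\n</table>"
```

All text is passed through `html.escape`, so titles containing `&` or `<` do not break the page.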

As a prototype, this could be coded up in Python or any other language, depending on the assigned developer.

The core tasks are:

  1. Collect a list of DOIs in a simple text file
  2. Write a parser, which queries the web for metadata on each DOI
  3. Collect stats from the Poseidon archives using our remote APIs
  4. Render the final table.
@stschiff
Member Author

@dhananjaya93 @93Boy this could be something for you. What do you think?

@stschiff
Member Author

stschiff commented Sep 30, 2024

I think it would be good to prototype this as a stand-alone command line script, which reads a text file with DOIs and then creates the table. As a second step, we can then see how we integrate this into our webpage, or some other stand-alone webpage.
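
A rough sketch of what the entry point of such a script could look like, using `argparse` from the standard library. The function names, the `#` comment convention and the accepted DOI prefixes are assumptions; the metadata lookup and table rendering are left as a placeholder.

```python
import argparse
from pathlib import Path

# Prefixes people commonly paste along with a DOI (an assumption about the input).
PREFIXES = (
    "https://doi.org/",
    "http://doi.org/",
    "https://dx.doi.org/",
    "http://dx.doi.org/",
    "doi:",
)


def normalize_doi(raw):
    """Strip whitespace and a leading resolver prefix, then lower-case the DOI."""
    doi = raw.strip()
    lowered = doi.lower()
    for prefix in PREFIXES:
        if lowered.startswith(prefix):
            doi = doi[len(prefix):].strip()
            break
    return doi.lower()


def read_dois(text):
    """Return the unique DOIs in `text`, one per line, in first-seen order."""
    seen, dois = set(), []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):  # skip blanks and comment lines
            continue
        doi = normalize_doi(line)
        if doi not in seen:
            seen.add(doi)
            dois.append(doi)
    return dois


def main(argv=None):
    parser = argparse.ArgumentParser(
        description="Build a paper directory table from a list of DOIs."
    )
    parser.add_argument("doi_file", type=Path, help="text file with one DOI per line")
    args = parser.parse_args(argv)
    for doi in read_dois(args.doi_file.read_text(encoding="utf-8")):
        print(doi)  # placeholder: metadata lookup and rendering would go here
```

To run it as a script, call `main()` under the usual `if __name__ == "__main__":` guard. DOIs are lower-cased because they are case-insensitive, which makes de-duplication and later matching simpler.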

@TCLamnidis
Member

Just a thought: if we want to do this for papers that are in any one of the archives, then parsing the BibTeX keys in the bib files should get a lot of the information out, right?
At least in the case of Minotaur, that should be straightforward.
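
As an illustration of that idea, here is a small regex-based sketch that pulls citation keys and DOIs out of a `.bib` file. It assumes the entries carry a `doi` field, and it is not a full BibTeX parser; the function name is made up for this example.

```python
import re

# Start of an entry such as  @article{Doe2020,  (captures the citation key).
ENTRY = re.compile(r"@\w+\s*\{\s*([^,\s]+)\s*,")
# A field such as  doi = {10.1000/xyz}  or  DOI = "10.1000/xyz".
DOI_FIELD = re.compile(r'\bdoi\s*=\s*[{"]+\s*([^{}"]+?)\s*[}"]', re.IGNORECASE)


def dois_by_key(bib_text):
    """Map each BibTeX citation key to its lower-cased DOI, if it has one."""
    result = {}
    starts = list(ENTRY.finditer(bib_text))
    for i, entry in enumerate(starts):
        # An entry's text runs until the next entry starts (or the file ends).
        end = starts[i + 1].start() if i + 1 < len(starts) else len(bib_text)
        doi = DOI_FIELD.search(bib_text, entry.end(), end)
        if doi:
            result[entry.group(1)] = doi.group(1).lower()
    return result
```

Entries without a `doi` field are simply left out of the result, so they would need to be matched some other way.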

@stschiff
Member Author

Are you now talking about how to get data from the archives? Or about how to get metadata for papers in general?

I was really thinking to start from a list of DOIs, then get the metadata via something like this and then query the Poseidon Server API for whether those papers are in the archives.
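
The last part of that could be sketched as below. This assumes the set of DOIs per archive has already been collected by some earlier step; how to get it from the Poseidon Server API is deliberately left out, and the archive labels are hypothetical placeholders.

```python
# Hypothetical labels for the three archives; the real identifiers may differ.
ARCHIVES = ("community-archive", "aadr-archive", "minotaur-archive")


def archive_status(dois, dois_by_archive):
    """Map each DOI to {archive: bool}, comparing DOIs case-insensitively.

    `dois_by_archive` maps an archive label to the DOIs found in that archive.
    Archives missing from the mapping count as "not present".
    """
    normalized = {
        name: {d.lower() for d in found}
        for name, found in dois_by_archive.items()
    }
    return {
        doi: {name: doi.lower() in normalized.get(name, set()) for name in ARCHIVES}
        for doi in dois
    }
```

The result has one row per DOI and one boolean per archive, which is the shape needed for the check marks and crosses in the table.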

I don't see where BibTeX comes in directly. Maybe good to talk about this briefly tomorrow.

@93Boy
Contributor

93Boy commented Sep 30, 2024

Hello Stephan,
I have gone through the tasks. The initial prototype you suggested can be done with the CrossRef API. I think I can complete a prototype as a static HTML website by this weekend.

@93Boy
Contributor

93Boy commented Sep 30, 2024

I have planned an initial workflow based on Python.

  1. Take a DOI list in .txt format as the input.
  2. Use CrossRef to query the metadata for each DOI. This will include a Python method to parse the JSON response.
  3. Use Python requests to contact the Poseidon API and check whether each DOI is available in the 3 Poseidon archives.
  4. Then use Python jinja2 to create a static website that renders all of that information.

Do you have any input to add?
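
For step 2, a minimal sketch of the CrossRef lookup and JSON parsing could look like this. It uses `urllib` from the standard library rather than requests to stay dependency-free, and the function names and the returned dictionary layout are assumptions for illustration.

```python
import json
import urllib.request
from urllib.parse import quote

CROSSREF_URL = "https://api.crossref.org/works/"


def fetch_work(doi):
    """Fetch the raw CrossRef record for one DOI (needs network access)."""
    with urllib.request.urlopen(CROSSREF_URL + quote(doi), timeout=30) as resp:
        return json.load(resp)["message"]


def summarize(work):
    """Pick title, first author, journal and date out of a CrossRef record."""
    authors = work.get("author") or []
    # Prefer the author flagged as "first"; fall back to the first list entry.
    first = next(
        (a for a in authors if a.get("sequence") == "first"),
        authors[0] if authors else {},
    )
    given_family = [p for p in (first.get("given"), first.get("family")) if p]
    name = " ".join(given_family) or first.get("name", "")
    # "issued" holds e.g. {"date-parts": [[2020, 5, 12]]}; month/day may be absent.
    parts = ((work.get("issued") or {}).get("date-parts") or [[]])[0]
    return {
        "title": (work.get("title") or [""])[0],
        "first_author": name,
        "journal": (work.get("container-title") or [""])[0],
        "date": "-".join(f"{n:02d}" for n in parts if n is not None),
    }
```

Keeping the parsing in `summarize` separate from the network call in `fetch_work` means the parser can be tested against saved JSON without hitting the API.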


3 participants