Take a pdf of a research paper and generate a comprehensive mind map as a means of visual summarization.
As executives in an undergraduate machine intelligence research community, we saw a need in fellow students with learning disabilities who had difficulty understanding the often tedious text of research papers. This project also proved to be extremely helpful for visual learners within the community as well. We’re hoping to make this tool open to the general public soon.
- Extract the structured text from the pdf
- Extract text itself
- Identify headings and subheadings
- Summarization of the lowest levels of content (for now: paragraph)
- Using outline, extract paragraph content under each heading
- Use summarizer to summarize paragraphs under one heading as one.
- Set limit to how many words regardless of number of paragraphs
- If the introductory paragraph falls under a heading with no other subheading attached, include summary with heading node.
- Update heading line by appending summary
- Replace content in Output_1
- Feed Output_2 into application that will generate interactive mind map
- Generate the interactive mind map by creating a graph using PyVis
- Extract figures and tables to add its labels to image nodes containing figure/table
- Extract formulas to add as separate nodes
- Generate a cleaner mind map