With the explosive growth of medical literature, making sense of medical evidence is harder than ever. The free text form also makes it difficult to perform evidence retrieval of appraisal. There is a great need for tools and methods that can structure and reason over medical evidence. The goal of this project is to develop computational and symbolic methods to extract evidence from PubMed abstracts, integrate it with evidence derived from real world clinical data (or practice-based evidence), and perform automated knowledge discovery and evidence reasoning. We also hope this research can support evidence-based medicine during the COVID-19 pandemic and provide opportunities for students to hone his/her skills on natural language processing, data mining, deep learning, and semantic knowledge engineering. We have solid preliminary results for the students to build upon. An open-source PICO parser that extracts Population, Intervention, Comparison and Outcome information from PubMed abstracts has been developed and published. Current COVID-19 literature has been downloaded from PubMed and been pre-processed. Preliminary analyses are under way to investigate the patterns in the study populations in COVID-19 clinical studies. Our next steps include but are not limited to evidence summarization at the study level and evidence reasoning at the problem/topic level.

This is an UNPAID research project.

Faculty Advisor

  • Professor: Chunhua Weng
  • Department/School: Biomedical Informatics in College of Physicians and Surgeons
  • Location: Presbyterian Hospital 20th floor
  • Dr. Weng conducts large-scale clinical research knowledge engineering using big data from ClinicalTrials.gov, PubMed, and electronic health records, and develops scalable computational and symbolic methods for automated medical evidence extraction and reasoning.

Project Timeline

  • Earliest starting date: 3/1/2021
  • End date: 12/31/2021
  • Number of hours per week of research expected during Spring 2021: ~10
  • Number of hours per week of research expected during Summer 2021: ~20

Candidate requirements

  • Skill sets: natural language processing, knowledge graph, propositional logic
  • Student eligibility: freshman, sophomore, junior, senior, master’s
  • International students on F1 or J1 visa: eligible
  • Academic Credit Possible: Yes