The aim of this project is to use artificial intelligence (AI) to extract valuable information from unstructured eye movements of highly-skilled domain experts, in particular those of expert clinicians as they perform complex diagnostic decision-making tasks. Such eye-movement data is rich in patterns that can be deciphered using the power of unsupervised machine learning algorithms (such as k-nearest neighbor/hierarchical clustering and principal components analysis) or unsupervised deep learning algorithms (such as deep generative models, autoencoders, and long short-term memory autoencoders for sequence data). Furthermore, as novices transform into experts, patterns embedded in their eye movements (time spent on regions of interest vs. time spent on surgical equipment) may offer a valuable tool for extracting features that pinpoint the critical mechanisms (’eureka moments’) behind expert decision-making. The primary objectives of this project are (1) to collect eye movements of novice and expert ophthalmologists as they view medical images during eye-disease diagnoses using benchtop-based, head-mounted, or Virtual Reality embedded eye trackers (Eyelink 1000, Pupil Labs Core, or HTC Vive Pro, respectively) and (2) to apply unsupervised machine learning/deep learning approaches to extract meaningful information from this data. Features to be extracted from this data include but are not limited to: fixation duration and fixation count in regions of interest, fixation order, saccade velocity, and pupil diameter. This data collection and data analytics project will enable extraction of the most relevant features for task-oriented training of future AI-based disease diagnosis systems. Capturing eye movements, and thereby the underlying visual decision-making mechanisms behind an expert’s knowledge that are not otherwise quantifiable, will allow us to mimic these mechanisms in AI systems, potentially improving their diagnostic accuracy and interpretability for future clinical applications.

Continue reading

Clinicians place orders for patients in the electronic health record (EHR). There is currently no internal mechanism to detect medical errors in EHR systems. Identifying and monitoring medical errors has relied on voluntary reporting and chart review, methods that are subject to substantial self-reporting bias. To quantify the magnitude of wrong-patient errors, I developed and validated the Wrong-Patient Retract-and-Reorder (RAR) Measure. The Wrong-Patient RAR measure overcomes limitations of voluntary reporting by using an electronic query to objectively detect wrong-patient orders in EHR data. Whereas previous data indicated an average of 9 wrong-patient medication errors per hospital per year based on voluntary reporting,2 the RAR measure identified 5,246 wrong-patient orders in a large healthcare system in 1 year. The vastly greater volume of errors detected provides insights into the epidemiology of wrong-patient orders, informs targeted intervention strategies, and yields sufficient numbers of events to power health IT safety intervention studies.

Continue reading

eMERGE is a NIH-funded national network of 10 sites that enroll participants into genetic research. It has had 3 earlier cycles with tens of thousands of participants recruited in each; the current cycle (eMERGE IV) plans to enroll 25,000 individuals, with a particular focus on patients from underserved communities. The network has developed several participant surveys that allow the collection of data across the sites. This study focuses on participants who self-identified as people with disabilities as well as those who have chronic conditions such as kidney disease or type 1 diabetes. Issues to be explored include diverse representation in cohort, recruitment and retention, identify stated needs for participation, health behaviors, and expectations surrounding genetic testing for common diseases. Tasks required to fulfill study aims are literature review, quantitative data analyses skills, familiarity with e-health records and ICD codes, and drafting of manuscript; the projected outcome is a manuscript in a peer-review journal.

Continue reading

Women identified as high-risk for breast cancer may benefit from personalized risk-reducing strategies; however, barriers exist, including the time required to conduct a risk assessment of each woman in a population. Electronic health records (EHRs), a common source for populating risk assessment models, present challenges, including missing data, and data types more accurate when provided by patients compared to EHRs. We previously extracted EHR data on age, race/ethnicity, family history of BC, benign breast disease, and breast density to calculate BC risk according to the Breast Cancer Surveillance Consortium (BCSC) model among 9,514 women. Comparing self-reported and EHR data, more women with a first-degree family history of BC (14.6% vs. 4.4%) and benign breast biopsies (21.3% vs. 11.3%) were identified with patient-reported data, but EHR data identified more women with atypia or lobular carcinoma in situ (1.1% vs. 2.3%). The EHR had missing data on race/ethnicity for 26.8% of women and on first-degree family history of BC for 87.2%. Opportunely, Fast Healthcare Interoperability Resources (FHIR), application programming interfaces (APIs), and new legislation offer an elegant solution for automated BC risk assessment that integrates both patient-generated health data and EHR data to harness the strengths of each approach. To increase the likelihood of developing disseminatable and equitable strategies that integrate EHR and patient-generated health data for risk assessment and personalized BC risk reduction, the focus of this project is to refine and test our approach among diverse multiethnic women. Our aims are: 1) upgrade FHIR integration of a patient decision aid, parse EHR data pulled by patients, develop a user interface to enable women to view and augment their EHR data, conduct user evaluations of the user interface; 2) assess the effect of the FHIR-enhanced patient decision aid on patient activation, risk perception, and usability in a pilot study of multiethnic high-risk women; and 3) identify multilevel barriers to implementing FHIR-enhanced patient decision aid into clinical care.

Continue reading

Author's picture

Columbia Data Science Institute (DSI) Scholars Program

The DSI Scholars Program is to engage and support undergraduate and master students in participating data science related research with Columbia faculty. The program’s unique enrichment activities will foster a learning and collaborative community in data science at Columbia.

Columbia University DSI

New York, NY