Interested in a late-summer research project, or want to get a jump on the fall semester? We are restructing the application process for the Data Science Institute and Data for Good Scholars programs, but until that system is in place any projects we receive will be sent to this interim mailing list. Subscribe for notifications.
Columbia Technology Ventures (CTV) and the Columbia Lab-to-Market Accelerator Network (L2M) are seeking undergraduate students from Columbia College, Columbia School of Engineering and Applied Science, Columbia General Studies, and Barnard for part-time (10 – 20 hours per week) temporary (6 – 8 weeks during June to August 2020) summer interns.
Given the COVID-19 environment and the increasing requests for additional support for projects, the Data Science Institute is pleased to announce a second wave of summer research projects through the Data Science Scholars and Data For Good programs. Please note that this is on a compressed timeline and student applications are due Sunday, May 24th, 2020 at 11:59pm Eastern time.
The goal of the DSI Scholars Program is to engage Columbia University’s undergraduate and master’s students in data science research with Columbia faculty through a research internship. The program connects students with research projects across Columbia and provides student researchers with an additional learning experience and networking opportunities. Through unique enrichment activities, this program aims to foster a learning and collaborative community in data science at Columbia.
The Data For Good Scholars program connects student volunteers to organizations and individuals working for the social good whose projects have developed a need for data science expertise. As “real world” problems with real world data, these projects are excellent opportunities for students to learn how data science is practiced outside of the university setting and to learn how to work effectively with people for whom data science sits outside of their subject area.
We need someone with strong data wrangling capabilities, to be able to determine quick ways to clean and merge data. The format of the data is spatial (GIS) but it could also be manipulated in tabular format. GRID3 is a program within CIESIN which is a research center located at the Lamont-Doherty Campus (with office space on the morningside campus) and is part of Columbia’s Earth Institute. Candidates can learn more about the program at the GRID3 website.
This was a 1-year prospective observational study to examine the relation between sleep and cardiometabolic risk among 506 women in the NYC area. All of the data has been collected and entered in a Redcap database, has been cleaned, and is ready for analysis.
In this project we’ll be expanding on the existing family of supervised topic models. These models extend LDA to document collections where, for each document, we observe additional labels or values of interest. More specifically, one of the goals of this project is to use additional document level data, such as author information, to develop better exploratory data tools.
Bacterial and viral genomic epidemiology. Ongoing and new projects.