We need someone with strong data wrangling capabilities, to be able to determine quick ways to clean and merge data. The format of the data is spatial (GIS) but it could also be manipulated in tabular format. GRID3 is a program within CIESIN which is a research center located at the Lamont-Doherty Campus (with office space on the morningside campus) and is part of Columbia’s Earth Institute. Candidates can learn more about the program at the GRID3 website.

Outcome

Scripts in R or Python to clean large volumes of incoming data; scripts to merge various sources of data; scripts to assess data quality metrics from various sources.

Learning opportunity

Exposure to GIS data for development project (African countries, mainly); work in conjunction with a dynamic team of researchers, project managers, and GIS specialists; exposure to real-life development issues

Selected candidate(s) may receive a stipend directly from the faculty advisor. Amount is subject to available funding.

Faculty Advisor

Project Timeline

  • Anticipated workload: Ongoing - we need as much support as possible
  • Duration: The project has an end date of December 2022. We would need help from 1-2 data scientists.

Candidate requirements

  • Skills required: R, Python, ArcGIS (desktop and/or pro), QGIS
  • Student eligibility: freshman, sophomore, junior, senior, master’s
  • Additional comments: Energetic, eager to learn, interest in spatial data.