NYC DDC has initiated a machine learning project to develop predictive model for estimating cost of project and work items. Using the latest technique in Machine Learning and Advanced Statistics, NYC DDC to develop a model that predicts the cost of future and active projects and construction work items in different phases of the lifecycle of the project based on historical data. DDC has partnered with Microsoft who is providing the proof of concept guidance and making tools available for the proof of concept development. DDC is seeking assistance of a data scientist from the Town and Gown program to develop the model.

Tasks expected from the data scientist candidate:

  • Explore and understand the data provided by DDC
  • Communicate iteratively with DDC Data Analytics team and Microsoft AI team to discuss meaningful insights
  • Interpret the data and draw meaningful inferences especially dealing with small datasets
  • Test and select a most optimum algorithm to build a model

This is a volunteer opportunity for students to use their skills for the social good.

Project Owners

Full details can be found here.

Project timeline

  • Earliest starting date: 10/01/2020
  • End date: 01/01/2021
  • Number of hours per week of research expected during Fall 2020: ~10
  • Project is ongoing and will be reviewed for future directions at the end of the semester

Candidate requirements

  • Skills:
    • Advanced statistical analysis and drawing inferences when dealing with relatively small datasets
    • Solid understanding of data distribution analysis and hypothesis testing
    • Experience in performing analysis with various Machine Learning algorithms
    • Microsoft AI team will provide guidance on MS Azure ML, proof of concept, techniques in Machine Learning algorithms and usage of Azure ML functionalities.
  • Student eligibility: freshman, sophomore, junior, senior, master’s
  • International students: eligible