Data Scientist

About Our Department - The Office of Digital Learning was created to transform teaching and learning at MIT and around the globe through the innovative use of digital technologies

Through MITx on edX, OpenCourseWare, and Residential MITx, ODL facilitates a wide range of experiments and enables distillation of research results from an unprecedented flood of online data, in many ways. The results of our research efforts inform and guide our digital learning efforts on campus and beyond.

As part of the ODL Engineering team, the Data Scientist will support the ODL business units and researchers parsing, analyzing, investigating, interpreting, modeling, and extracting knowledge from large volumes of data obtained from online, blended and residential courses.


  • Extract, parse, analyze, investigate, interpret, and extract knowledge from large volumes of data obtained from online, blended and residential courses.
  • Work closely with Engineering team to develop and maintain the existing ODL analytics platform
  • Work closely with Engineering team design, implement and maintain scalable software to transform and integrate large complex datasets into a reliable, accessible data infrastructure
  • Ensure integrity, security and up-time of all databases and data. Establish secure workflows for the transfer and storage of data. Develop secure, permissioned systems to present data and results to different audiences at ODL / MIT.
  • Assess data needs across ODL and learning researchers. Conduct ad-hoc analysis, visualizations, and data presentations. Deploy MIT tools, e.g,. Tableau, Brio, to develop standard, repeatable reports with customized parameters
  • Work with ODL business units, the Digital Learning Lab, Institutional Research and others to define data needs and requirements
  • Work closely with technology partners including ODL Engineering, the Senior Associate Dean for Digital Learning, MIT Information Systems and Technology, edX and others on management of these data systems


  • B.S. or higher in Computer Science or related technical field (or equivalent work experience)
  • Proficiency in at least one programming language: Python, R. Strong knowledge with data analysis tools, such as Stata
  • Strong knowledge of relational and non-relational database systems and SQL
  • Experience in designing, developing, delivering and maintaining data management tools and systems
  • Good understanding of the full web technology stack


  • Comfortable with Unix/Linux as a development environment.
  • Experience with Google’s BigQuery
  • Familiarity with source code control systems like GIT, Subversion, Mercurial.
  • Github profile or link to open source work
  • Experience in Software Development Life Cycle and interactive methodologies.
  • Excellent communication skills
  • Highly curious self-starter, with strong desire and ability to solve business and research problems
If you are interested, please send a resume, or get in touch with us at