Course Requirements

The curriculum for the Machine Learning Ph.D. is built on a foundation of five core courses and three electives (plus the Data Analysis Project requirement). These five courses also comprise the required courses for the MS degree. Together with the Data Analysis Project requirement, these should be completed during the first two years of study.

A typical full-time, graduate course load during the first two years consists each term of two classes (at 12 graduate units per class) plus 24 units of advanced research. Thus, during the first two years, a student has the opportunity to take several elective classes in addition to the five required courses.

The ML curriculum joins courses with a Computer Science main theme and those with a Probability and Statistics main theme. These may be grouped, as follows:

In CS, relevant sub-fields include: Databases; Machine Learning, Data Mining and algorithms applications in areas such as Robotics, Information Retrieval and AI.

In Statistics (including Philosophy), the sub-fields include: Statistical modeling (e.g., hierarchical and times series); Bayes' Nets, Causation, and experimental design. The curriculum is based on core academic courses on Intermediate Statistics, Machine Learning, Statistical Approaches for Learning & Discovery, Multimedia Databases, and Algorithms.

The five core courses provide, respectively: a secure foundation in mathematical statistics, a survey of basic machine learning techniques with numerous applications; the statistical and probabilistic theoretical underpinnings for these techniques; an introduction to databases for data mining, and a study of advanced algorithms.

Possible electives

10-910 Data Analysis Project Requirement, in the second year, which serves in lieu of an MS thesis.

Here is a typical schedule for the first two years of study.

Fall 1 Spring 1
10-701 Machine Learning

10-702 Statistical Machine Learning

10-705 Intermediate Statistics 15-750 Algorithms
10-920 Research 10-920 Research

 

 

 

Fall 2 Spring 2
Elective 10-910 Independent Study for the Data Analysis Project
Elective 15-826 Databases
10-920 Research 10-920 Research

 

 

 


The Data Analysis Project requirement: 10-910
During the second year a Ph.D. student is required to demonstrate data mining skills in the context of a focused project. The Data Analysis Project may be carried out either at Carnegie Mellon or at a sponsoring corporate institution under the joint supervision of the sponsor and a ML faculty. It will be concluded by a written report (in lieu of a Masters Thesis) in which the student demonstrates an ability to approach data mining problems in a way that cuts across existing disciplinary boundaries. The requirement includes giving a ML colloquium on the Data Analysis Project report. Passing this requirement will be the judgment of the ML faculty, under the advice of the faculty advisor for the project.

The Third Year
During the third year, a Ph.D. student completes the elective course requirements. One of these three electives is taken from the offerings in Statistics. The other two advanced electives, chosen in consultation with the students advisor, form a concentration in one of the allied disciplines with SCS, Biology, Philosophy, or GSIA. For those candidates seeking an academic position after completing the ML Ph.D. degree, the thoughtful selection of these three elective courses is particularly important. As in the each of the first two years, coursework is supplemented by 24 units/term of research.

The Fourth Year and Beyond
A Ph.D. student typically presents a thesis proposal no later than the start of the fourth year, and then spends the fourth and sometimes fifth year working on their thesis research.

Research

It is expected that all Ph.D. students engage in active research from their first semester. Moreover, advisor selection occurs in the first month of entering the Ph.D. program, with the option to change at a later time. Roughly half of a student's time should be allocated to research and lab work, and half to courses until these are completed.

This Research page is a list of some of the projects for which ML faculty may be interested in recruiting students. Within each project there can be lines of research which range in size from a semester's work to an entire thesis (or beyond). So, this page is intended as a resource for students looking for a thesis advisor, for a Data Analysis project, or to collaborate for any other reason.

Student Handbook

Thesis Defense Checklist