Course Requirements
The
curriculum for the Machine Learning Ph.D. is built on a foundation
of five
core courses and three electives (plus
the Data Analysis Project requirement). These
five courses also comprise the required courses for the MS degree.
Together
with the Data Analysis Project requirement, these should be completed
during the first two years of study.
A typical full-time,
graduate course load during the first two years consists each term
of two classes (at 12 graduate units per class) plus 24 units of
advanced research. Thus, during the first two years, a student
has the opportunity to take several elective classes in addition
to the five required courses.
The ML curriculum
joins courses with a Computer Science main theme and those with
a Probability and Statistics main theme. These may be grouped,
as follows:
In CS, relevant
sub-fields include: Databases; Machine Learning, Data Mining and
algorithms applications in areas such as Robotics, Information
Retrieval and AI.
In Statistics
(including Philosophy), the sub-fields include: Statistical modeling
(e.g., hierarchical and times series); Bayes' Nets, Causation,
and experimental design. The curriculum is based on core academic
courses on Intermediate Statistics, Machine Learning, Statistical
Approaches for Learning & Discovery, Multimedia Databases,
and Algorithms.
The five core courses provide, respectively: a secure foundation in mathematical
statistics, a survey of basic machine learning techniques with numerous applications;
the statistical and probabilistic theoretical underpinnings for these techniques;
an introduction to databases for data mining, and a study of advanced algorithms.
Possible
electives
10-910 Data
Analysis Project Requirement, in the second year, which serves
in lieu of an MS
thesis.
Here is a typical
schedule for the first two years of study.
| Fall
1 |
Spring
1 |
| 10-701
Machine Learning |
10-702
Statistical Machine Learning
|
| 10-705
Intermediate Statistics |
15-750
Algorithms |
| 10-920
Research |
10-920
Research |
| Fall
2 |
Spring
2 |
| Elective |
10-910
Independent Study for the Data Analysis Project |
| Elective |
15-826
Databases |
| 10-920
Research |
10-920
Research |
The Data Analysis Project
requirement: 10-910
During the second year a Ph.D. student is required to demonstrate data mining
skills in the context of a focused project. The Data
Analysis
Project may be carried
out either at Carnegie Mellon or at a sponsoring corporate institution under
the joint supervision of the sponsor and a ML faculty. It will be concluded
by a written report (in lieu of a Masters Thesis) in which the student demonstrates
an ability to approach data mining problems in a way that cuts across existing
disciplinary boundaries. The requirement includes giving a ML colloquium on
the Data Analysis Project report. Passing this requirement will be the judgment
of the ML faculty, under the advice of the faculty advisor for the project.
The Third Year
During the third year, a Ph.D. student completes the elective course requirements.
One of these three electives is taken from the offerings in Statistics.
The other two advanced electives, chosen in consultation with the students
advisor, form a concentration in one of the allied disciplines with SCS,
Biology, Philosophy, or GSIA. For those candidates seeking an academic
position after completing the ML Ph.D. degree, the thoughtful selection
of these three elective courses is particularly important. As in the each
of the first two years, coursework is supplemented by 24 units/term of
research.
The Fourth Year
and Beyond
A Ph.D. student typically presents a thesis proposal no later than the start
of the fourth year, and then spends the fourth and sometimes fifth year working
on their thesis research.
Research
It is expected
that all Ph.D. students engage in active research from their
first semester. Moreover, advisor selection occurs in the first
month of entering the Ph.D. program, with the option to change
at a later time. Roughly half of a student's time should be allocated
to research and lab work, and half to courses until these are
completed.
This Research
page is a list of some of the projects for which ML faculty
may be interested in recruiting students. Within each project
there can be lines of research which range in size from a semester's
work to an entire thesis (or beyond). So, this page is intended
as a resource for students looking for a thesis advisor, for
a Data Analysis project, or to collaborate for any other reason.
Student
Handbook
Thesis
Defense Checklist
|