CMU Course Listing
CMU Course Listing
Networks: Where do they come from? What do they tell us? Thirty years ago, the word
"network" was mostly used in reference to computers or television broadcasting
channels. Now we have networks of friends, enemies, phones, stars, tweets,
international governments, terrorists, etc. Where do these networks come from? How
are they built? What do they represent? As we learn more about how everything is
connected, we also face challenges in trying to understand the data that a network can
generate. In this course, you'll learn about networks from a New England monastery
facing a political crisis to social groups of friends (is obesity contagious? what about
divorce?) to 15th century marriages among prominent Italian families to international
political disputes and skirmishes (is the enemy of my enemy my friend?) to the spread
of HIV among intravenous drug users. Along the way, we'll explore how to describe,
visualize, analyze, and even break down the networks that surround us.
This course will introduce students to the basic concepts, logic, and issues involved in
statistical reasoning, as well as basic statistical methods used to analyze data and
evaluate studies. The major topics to be covered include methods for exploratory data
analysis, an introduction to research methods, elementary probability, and methods for
statistical inference. The objectives of this course are to help students develop a critical
approach to the evaluation of study designs, data and results, and to develop skills in
the application of basic statistical methods in empirical research. An important feature
of the course will be the use of the computer to facilitate the understanding of important
statistical ideas and for the implementation of data analysis. In addition to three lectures
a week, students will attend a computer lab once a week. Examples will be drawn from
areas of applications of particular interest to H&SS students. Not open to students who
have received credit for 36-207/70-207, 36-220, 36-225, 36-625, or 36-247
Spring: 9 units
This course builds on the principles and methods of statistical reasoning developed in
36-201 (or its equivalents). The course covers simple and multiple regression, analysis
of variance methods and logistic regression. Other topics may include non-parametric
methods and probability models, as time permits. The objectives of this course is to
develop the skills of applying the basic principles and methods that underlie statistical
practice and empirical research. In addition to three lectures a week, students attend a
computer lab once week for "hands-on" practice of the material covered in lecture. Not
open to students who have received credit for: 36-208/70-208, 36-309.
Fall: 9 units
This is the first half of a year long sequence in basic statistical methods that are used in
business and management. Topics include exploratory and descriptive techniques,
probability theory, statistical inference in simple settings, basic categorical analysis, and
statistical methods for quality control. Not open to students who have received credit for
36-201, 36-220, 36-625, or 36-247. Cross-listed as 70-207.
Spring: 9 units
This is the second half of a year long sequence in basic statistical methods that are
used in business and management. Topics include time series, regression and
forecasting. In addition to two lectures a week, students will attend a computer lab once
a week. Not open to students who have received credit for 36-202, 36-626. Cross-listed
as 70-208.
Prerequisites: (21-120 or 21-121 or 21-112) and (36-207 or 70-207 or 36-201 or 36-220
or 36-247).
Fall: 9 units
This course is the first half of a year long course which provides an introduction to
probability and mathematical statistics for students in economics, mathematics and
statistics. The use of probability theory is illustrated with examples drawn from
engineering, the sciences, and management. Topics include elementary probability
theory, conditional probability and independence, random variables, distribution
functions, joint and conditional distributions, law of large numbers, and the central limit
theorem. A grade of C or better is required in order to advance to 36-226 and 36-410.
Not open to students who have received credit for 36-217 or 36-625.
Spring: 9 units
This course is the second half of a year long course in probability and mathematical
statistics. Topics include maximum likelihood estimation, confidence intervals, and
hypothesis testing. If time permits there will also be a discussion of linear regression
and the analysis of variance. A grade of C or better is required in order to advance to
36-401, 36-402 or any 36-46x course. Not open to students who have received credit
for 36-626.
Spring: 9 units
Spring: 9 units
This course will revolve around the role of sampling and sample surveys in the context
of U.S. society and its institutions. We will examine the evolution of survey taking in the
United States in the context of its economic, social and political uses. This will
eventually lead to discussions about the accuracy and relevance of survey responses,
especially in light of various kinds of nonsampling error. Students will be required to
design, implement and analyze a survey sample.
Fall: 9 units
Statistical aspects of the design and analysis of planned experiments are studied in this
course. A clear statement of the experimental factors will be emphasized. The design
aspect will concentrate on choice of models, sample size and order of experimentation.
The analysis phase will cover data collection and computation, especially analysis of
variance and will stress the interpretation of results. In addition to a weekly lecture,
students will attend a computer lab once a week.
Spring: 9 units
Fall: 9 units
Fall: 9 units
This course is an introduction to the real world of statistics and data analysis. We will
explore real data sets, examine various models for the data, assess the validity of their
assumptions, and determine which conclusions we can make (if any). Data analysis is
a bit of an art; there may be several valid approaches. We will strongly emphasize the
importance of critical thinking about the data and the question of interest. Our overall
goal is to use a basic set of modeling tools to explore and analyze data and to present
the results in a scientific report. A minimum grade of C in any one of the pre-requisites
is required. A grade of C is required to move on to 36-402 or any 36-46x course.
Spring: 9 units
This course introduces modern methods of data analysis, building on the theory and
application of linear models from 36-401. Topics include nonlinear regression,
nonparametric smoothing, density estimation, generalized linear and generalized
additive models, simulation and predictive model-checking, cross-validation, bootstrap
uncertainty estimation, multivariate methods including factor analysis and mixture
models, and graphical models and causal inference. Students will analyze real-world
data from a range of fields, coding small programs and writing reports. Prerequisites:
36-401
Prerequisite: At least a C grade in 36-401
Spring: 9 units
Intermittent: 9 units
Epidemiology is concerned with understanding factors that cause, prevent, and reduce
diseases by studying associations between disease outcomes and their suspected
determinants in human populations. Epidemiologic research requires an understanding
of statistical methods and design. Epidemiologic data is typically discrete, i.e., data that
arise whenever counts are made instead of measurements. In this course, methods for
the analysis of categorical data are discussed with the purpose of learning how to apply
them to data. The central statistical themes are building models, assessing fit and
interpreting results. There is a special emphasis on generating and evaluating evidence
from observational studies. Case studies and examples will be primarily from the public
health sciences.
Prerequisite: 36-401
Intermittent: 9 units
Data mining is the science of discovering patterns and learning structure in large data
sets. Covered topics include information retrieval, clustering, dimension reduction,
regression, classification, and decision trees. Prerequisites: 36-401 (C or better).
Prerequisite: 36-401
Intermittent: 9 units
Multilevel and hierarchical models are among the most broadly applied "sophisticated"
statistical models, especially in the social and biological sciences. They apply to
situations in which the data "cluster" naturally into groups of units that are more related
to each other than they are the rest of the data. In the first part of the course we will
learn about Bayesian statistical methods. In the second part we will relate multilevel
and hierarchical models to other areas of statistics, and in the third part of the course
we will build and apply these models using a variety of data sets and examples.
Prerequisite: 36-401
Intermittent: 9 units
Prerequisite: 36-401
Intermittent: 9 units
The course will focus on how to construct hypotheses from a large data set and confirm
them statistically. Exploratory methods include discriminant analysis, principal
component analysis, projection pursuit, clustering, and nonparametric density
estimation. Confirmatory methods include confidence intervals, posterior distributions,
and Bayes factors. In addition, students will learn how to think in terms of probabilistic
models and use data mining software effectively. Some computer programming
required. Pre-requisites: 36401 (grade of at least C) or permission from instructor.
Prerequisite: 36-401
Spring: 9 units
Prerequisite: 36-401
Corequisite: 36-402
Fall: 12 units