## STATS 32: Introduction to R for Undergraduates

This short course runs for weeks one through five of the quarter. It is recommended for undergraduate students who want to use R in the humanities or social sciences and for students who want to learn the basics of R programming. The goal of the short course is to familiarize students with R's tools for data analysis. Lectures will be interactive with a focus on learning by example, and assignments will be application-driven. No prior programming experience is needed. Topics covered include basic data structures, File I/O, data transformation and visualization, simple statistical tests, etc, and some useful packages in R. Prerequisite: undergraduate student. Priority given to non-engineering students. Laptops necessary for use in class.

Terms: Aut, Spr
| Units: 1

Instructors:
Tay, J. (PI)

## STATS 48N: Riding the Data Wave (BIODS 48N)

Imagine collecting a bit of your saliva and sending it in to one of the personalized genomics company: for very little money you will get back information about hundreds of thousands of variable sites in your genome. Records of exposure to a variety of chemicals in the areas you have lived are only a few clicks away on the web; as are thousands of studies and informal reports on the effects of different diets, to which you can compare your own. What does this all mean for you? Never before in history humans have recorded so much information about themselves and the world that surrounds them. Nor has this data been so readily available to the lay person. Expression as "data deluge'' are used to describe such wealth as well as the loss of proper bearings that it often generates. How to summarize all this information in a useful way? How to boil down millions of numbers to just a meaningful few? How to convey the gist of the story in a picture without misleading oversimplifications? To an
more »

Imagine collecting a bit of your saliva and sending it in to one of the personalized genomics company: for very little money you will get back information about hundreds of thousands of variable sites in your genome. Records of exposure to a variety of chemicals in the areas you have lived are only a few clicks away on the web; as are thousands of studies and informal reports on the effects of different diets, to which you can compare your own. What does this all mean for you? Never before in history humans have recorded so much information about themselves and the world that surrounds them. Nor has this data been so readily available to the lay person. Expression as "data deluge'' are used to describe such wealth as well as the loss of proper bearings that it often generates. How to summarize all this information in a useful way? How to boil down millions of numbers to just a meaningful few? How to convey the gist of the story in a picture without misleading oversimplifications? To answer these questions we need to consider the use of the data, appreciate the diversity that they represent, and understand how people instinctively interpret numbers and pictures. During each week, we will consider a different data set to be summarized with a different goal. We will review analysis of similar problems carried out in the past and explore if and how the same tools can be useful today. We will pay attention to contemporary media (newspapers, blogs, etc.) to identify settings similar to the ones we are examining and critique the displays and summaries there documented. Taking an experimental approach, we will evaluate the effectiveness of different data summaries in conveying the desired information by testing them on subsets of the enrolled students.

Terms: Aut
| Units: 3
| UG Reqs: WAY-AQR, WAY-FR

Instructors:
Sabatti, C. (PI)
;
Zhao, Q. (TA)

## STATS 60: Introduction to Statistical Methods: Precalculus (PSYCH 10, STATS 160)

Techniques for organizing data, computing, and interpreting measures of central tendency, variability, and association. Estimation, confidence intervals, tests of hypotheses, t-tests, correlation, and regression. Possible topics: analysis of variance and chi-square tests, computer statistical packages.

Terms: Aut, Win, Spr, Sum
| Units: 5
| UG Reqs: GER:DB-Math, WAY-AQR, WAY-FR

Instructors:
Cook, N. (PI)
;
Poldrack, R. (PI)
;
Zhu, X. (PI)
;
Bhattacharya, S. (TA)
;
Dey, A. (TA)
;
Lemhadri, I. (TA)

## STATS 101: Data Science 101

https://statweb.stanford.edu/~tibs/stat101.html This course will provide a hands-on introduction to statistics and data science. Students will engage with the fundamental ideas in inferential and computational thinking. Each week, we will explore a core topic comprising three lectures and two labs (a module), in which students will manipulate real-world data and learn about statistical and computational tools. Students will engage in statistical computing and visualization with current data analytic software (Jupyter, R). The objectives of this course are to have students (1) be able to connect data to underlying phenomena and to think critically about conclusions drawn from data analysis, and (2) be knowledgeable about programming abstractions so that they can later design their own computational inferential procedures. No programming or statistical background is assumed. Freshmen and sophomores interested in data science, computing and statistics are encouraged to attend. Open to graduates as well.

Terms: Aut, Spr
| Units: 5
| UG Reqs: GER: DB-NatSci, WAY-AQR

## STATS 110: Statistical Methods in Engineering and the Physical Sciences

Introduction to statistics for engineers and physical scientists. Topics: descriptive statistics, probability, interval estimation, tests of hypotheses, nonparametric methods, linear regression, analysis of variance, elementary experimental design. Prerequisite: one year of calculus.

Terms: Aut, Sum
| Units: 5
| UG Reqs: GER:DB-Math, WAY-AQR, WAY-FR

## STATS 116: Theory of Probability

Probability spaces as models for phenomena with statistical regularity. Discrete spaces (binomial, hypergeometric, Poisson). Continuous spaces (normal, exponential) and densities. Random variables, expectation, independence, conditional probability. Introduction to the laws of large numbers and central limit theorem. Prerequisites:
MATH 52 and familiarity with infinite series, or equivalent.

Terms: Aut, Spr, Sum
| Units: 4
| UG Reqs: GER:DB-Math, WAY-AQR, WAY-FR

Instructors:
Donoho, D. (PI)
;
Wong, W. (PI)
;
GAO, Z. (TA)
;
Kunnasagaran, A. (TA)
;
Tirlea, M. (TA)

## STATS 141: Biostatistics (BIO 141)

Introductory statistical methods for biological data: describing data (numerical and graphical summaries); introduction to probability; and statistical inference (hypothesis tests and confidence intervals). Intermediate statistical methods: comparing groups (analysis of variance); analyzing associations (linear and logistic regression); and methods for categorical data (contingency tables and odds ratio). Course content integrated with statistical computing in R.

Terms: Aut
| Units: 5
| UG Reqs: GER:DB-Math, WAY-AQR

Instructors:
Holmes, S. (PI)
;
Cheng, C. (TA)
;
Sesia, M. (TA)
;
Tsao, A. (TA)
;
Tuzhilina, E. (TA)
;
Xu, H. (TA)

## STATS 155: Statistical Methods in Computational Genetics

The computational methods necessary for the construction and evaluation of sequence alignments and phylogenies built from molecular data and genetic data such as micro-arrays and data base searches. How to formulate biological problems in an algorithmic decomposed form, and building blocks common to many problems such as Markovian models, multivariate analyses. Some software covered in labs (Python, Biopython, XGobi, MrBayes, HMMER, Probe). Prerequisites: knowledge of probability equivalent to
STATS 116,
STATS 202 and one class in computing at the
CS 106 level. Writing intensive course for undergraduates only. Instructor consent required. (WIM)

Terms: Aut
| Units: 3

Instructors:
Holmes, S. (PI)

## STATS 160: Introduction to Statistical Methods: Precalculus (PSYCH 10, STATS 60)

Techniques for organizing data, computing, and interpreting measures of central tendency, variability, and association. Estimation, confidence intervals, tests of hypotheses, t-tests, correlation, and regression. Possible topics: analysis of variance and chi-square tests, computer statistical packages.

Terms: Aut, Win, Spr, Sum
| Units: 5

Instructors:
Cook, N. (PI)
;
Poldrack, R. (PI)
;
Zhu, X. (PI)
;
Bhattacharya, S. (TA)
;
Dey, A. (TA)
;
Lemhadri, I. (TA)

## STATS 191: Introduction to Applied Statistics

Statistical tools for modern data analysis. Topics include regression and prediction, elements of the analysis of variance, bootstrap, and cross-validation. Emphasis is on conceptual rather than theoretical understanding. Applications to social/biological sciences. Student assignments/projects require use of the software package R. Prerequisite: introductory statistical methods course. Recommended: 60, 110, or 141.

Terms: Aut
| Units: 3
| UG Reqs: GER:DB-Math, WAY-AQR

Filter Results: