## STATS 32: Introduction to R for Undergraduates

This short course runs for weeks one through five of the quarter. It is recommended for undergraduate students who want to use R in the humanities or social sciences and for students who want to learn the basics of R programming. The goal of the short course is to familiarize students with R's tools for data analysis. Lectures will be interactive with a focus on learning by example, and assignments will be application-driven. No prior programming experience is needed. Topics covered include basic data structures, File I/O, data transformation and visualization, simple statistical tests, etc, and some useful packages in R. Prerequisite: undergraduate student. Priority given to non-engineering students. Laptops necessary for use in class.

Terms: Aut, Spr
| Units: 1

Instructors:
Tay, J. (PI)

## STATS 48N: Riding the Data Wave (BIODS 48N)

Imagine collecting a bit of your saliva and sending it in to one of the personalized genomics company: for very little money you will get back information about hundreds of thousands of variable sites in your genome. Records of exposure to a variety of chemicals in the areas you have lived are only a few clicks away on the web; as are thousands of studies and informal reports on the effects of different diets, to which you can compare your own. What does this all mean for you? Never before in history humans have recorded so much information about themselves and the world that surrounds them. Nor has this data been so readily available to the lay person. Expression as "data deluge'' are used to describe such wealth as well as the loss of proper bearings that it often generates. How to summarize all this information in a useful way? How to boil down millions of numbers to just a meaningful few? How to convey the gist of the story in a picture without misleading oversimplifications? To an
more »

Imagine collecting a bit of your saliva and sending it in to one of the personalized genomics company: for very little money you will get back information about hundreds of thousands of variable sites in your genome. Records of exposure to a variety of chemicals in the areas you have lived are only a few clicks away on the web; as are thousands of studies and informal reports on the effects of different diets, to which you can compare your own. What does this all mean for you? Never before in history humans have recorded so much information about themselves and the world that surrounds them. Nor has this data been so readily available to the lay person. Expression as "data deluge'' are used to describe such wealth as well as the loss of proper bearings that it often generates. How to summarize all this information in a useful way? How to boil down millions of numbers to just a meaningful few? How to convey the gist of the story in a picture without misleading oversimplifications? To answer these questions we need to consider the use of the data, appreciate the diversity that they represent, and understand how people instinctively interpret numbers and pictures. During each week, we will consider a different data set to be summarized with a different goal. We will review analysis of similar problems carried out in the past and explore if and how the same tools can be useful today. We will pay attention to contemporary media (newspapers, blogs, etc.) to identify settings similar to the ones we are examining and critique the displays and summaries there documented. Taking an experimental approach, we will evaluate the effectiveness of different data summaries in conveying the desired information by testing them on subsets of the enrolled students.

Terms: Aut
| Units: 3
| UG Reqs: WAY-AQR, WAY-FR

Instructors:
Sabatti, C. (PI)
;
Zhao, Q. (TA)

## STATS 60: Introduction to Statistical Methods: Precalculus (PSYCH 10, STATS 160)

Techniques for organizing data, computing, and interpreting measures of central tendency, variability, and association. Estimation, confidence intervals, tests of hypotheses, t-tests, correlation, and regression. Possible topics: analysis of variance and chi-square tests, computer statistical packages.

Terms: Aut, Win, Spr, Sum
| Units: 5
| UG Reqs: GER:DB-Math, WAY-AQR, WAY-FR

Instructors:
Cook, N. (PI)
;
Miolane, N. (PI)
;
Poldrack, R. (PI)
...
more instructors for STATS 60 »

Instructors:
Cook, N. (PI)
;
Miolane, N. (PI)
;
Poldrack, R. (PI)
;
Zhu, X. (PI)
;
Dey, A. (TA)
;
Lemhadri, I. (TA)

## STATS 100: Mathematics of Sports

This course will teach you how statistics and probability can be applied in sports, in order to evaluate team and individual performance, build optimal in-game strategies and ensure fairness between participants. Topics will include examples drawn from multiple sports such as basketball, baseball, soccer, football and tennis. The course is intended to focus on data-based applications, and will involve computations in R with real data sets via tutorial sessions and homework assignments. Prereqs: No statistical or programming background is assumed, but introductory courses, e.g,
Stats 60,101 or 116, are recommended. A prior knowledge of Linear Algebra (e.g.,
Math 51) and basic probability is strongly recommended.

Terms: Spr
| Units: 3
| UG Reqs: GER:DB-Math

## STATS 101: Data Science 101

https://statweb.stanford.edu/~tibs/stat101.html This course will provide a hands-on introduction to statistics and data science. Students will engage with the fundamental ideas in inferential and computational thinking. Each week, we will explore a core topic comprising three lectures and two labs (a module), in which students will manipulate real-world data and learn about statistical and computational tools. Students will engage in statistical computing and visualization with current data analytic software (Jupyter, R). The objectives of this course are to have students (1) be able to connect data to underlying phenomena and to think critically about conclusions drawn from data analysis, and (2) be knowledgeable about programming abstractions so that they can later design their own computational inferential procedures. No programming or statistical background is assumed. Freshmen and sophomores interested in data science, computing and statistics are encouraged to attend. Open to graduates as well.

Terms: Aut, Spr
| Units: 5
| UG Reqs: GER: DB-NatSci, WAY-AQR

## STATS 110: Statistical Methods in Engineering and the Physical Sciences

Introduction to statistics for engineers and physical scientists. Topics: descriptive statistics, probability, interval estimation, tests of hypotheses, nonparametric methods, linear regression, analysis of variance, elementary experimental design. Prerequisite: one year of calculus.

Terms: Aut, Sum
| Units: 5
| UG Reqs: GER:DB-Math, WAY-AQR, WAY-FR

## STATS 110U: Statistical Methods in Engineering and the Physical Sciences

For Summer UG Visitors only. Same as 110. This course is offered remotely only via video segments. TAs will host remote weekly office hours using an online platform such as Zoom.

Terms: Sum
| Units: 5

## STATS 116: Theory of Probability

Probability spaces as models for phenomena with statistical regularity. Discrete spaces (binomial, hypergeometric, Poisson). Continuous spaces (normal, exponential) and densities. Random variables, expectation, independence, conditional probability. Introduction to the laws of large numbers and central limit theorem. Prerequisites:
MATH 52 and familiarity with infinite series, or equivalent.

Terms: Aut, Spr, Sum
| Units: 4
| UG Reqs: GER:DB-Math, WAY-AQR, WAY-FR

Instructors:
Donoho, D. (PI)
;
Kaluwa Devage, P. (PI)
;
Wong, W. (PI)
...
more instructors for STATS 116 »

Instructors:
Donoho, D. (PI)
;
Kaluwa Devage, P. (PI)
;
Wong, W. (PI)
;
GAO, Z. (TA)
;
Kunnasagaran, A. (TA)
;
Tirlea, M. (TA)

## STATS 116U: Theory of Probability

For Summer UG Visitors only. Same as
Stats 116. This course is offered remotely only via video segments. TAs will host remote weekly office hours using an online platform such as Zoom.

Terms: Sum
| Units: 4

Instructors:
Kaluwa Devage, P. (PI)

## STATS 141: Biostatistics (BIO 141)

Introductory statistical methods for biological data: describing data (numerical and graphical summaries); introduction to probability; and statistical inference (hypothesis tests and confidence intervals). Intermediate statistical methods: comparing groups (analysis of variance); analyzing associations (linear and logistic regression); and methods for categorical data (contingency tables and odds ratio). Course content integrated with statistical computing in R.

Terms: Aut
| Units: 5
| UG Reqs: GER:DB-Math, WAY-AQR

Instructors:
Holmes, S. (PI)
;
Cheng, C. (TA)
;
Sesia, M. (TA)
;
Tsao, A. (TA)
;
Tuzhilina, E. (TA)
;
Xu, H. (TA)

Filter Results: