## STATS 32: Introduction to R for Undergraduates

This short course runs for weeks one through five of the quarter. It is recommended for undergraduate students who want to use R in the humanities or social sciences and for students who want to learn the basics of R programming. The goal of the short course is to familiarize students with R's tools for data analysis. Lectures will be interactive with a focus on learning by example, and assignments will be application-driven. No prior programming experience is needed. Topics covered include basic data structures, file I/O, data transformation and visualization, simple statistical tests, and some useful R packages. Prerequisite: undergraduate student. Priority given to non-engineering students. Laptops necessary for use in class.

Terms: Spr | Units: 1

Instructors: Pavlyshyn, D. (PI)

## STATS 48N: Riding the Data Wave (BIODS 48N)

Imagine collecting a bit of your saliva and sending it to one of the personalized genomics companies: for very little money you will get back information about hundreds of thousands of variable sites in your genome. Records of exposure to a variety of chemicals in the areas where you have lived are only a few clicks away on the web, as are thousands of studies and informal reports on the effects of different diets, to which you can compare your own. What does this all mean for you? Never before in history have humans recorded so much information about themselves and the world that surrounds them. Nor has this data been so readily available to the layperson. Expressions such as "data deluge" are used to describe this wealth as well as the loss of proper bearings that it often generates. How can we summarize all this information in a useful way? How can we boil down millions of numbers to just a meaningful few? How can we convey the gist of the story in a picture without misleading oversimplifications? To answer these questions we need to consider the use of the data, appreciate the diversity that they represent, and understand how people instinctively interpret numbers and pictures. Each week, we will consider a different data set to be summarized with a different goal. We will review analyses of similar problems carried out in the past and explore whether and how the same tools can be useful today. We will pay attention to contemporary media (newspapers, blogs, etc.) to identify settings similar to the ones we are examining and critique the displays and summaries documented there. Taking an experimental approach, we will evaluate the effectiveness of different data summaries in conveying the desired information by testing them on subsets of the enrolled students.

Terms: Aut | Units: 3 | UG Reqs: WAY-AQR, WAY-FR

Instructors: Sabatti, C. (PI); Ren, Z. (TA)

## STATS 60: Introduction to Statistical Methods: Precalculus (PSYCH 10, STATS 160)

Techniques for organizing data and for computing and interpreting measures of central tendency, variability, and association. Estimation, confidence intervals, tests of hypotheses, t-tests, correlation, and regression. Possible topics: analysis of variance and chi-square tests, computer statistical packages.

Terms: Aut, Win, Spr, Sum | Units: 5 | UG Reqs: GER:DB-Math, WAY-AQR, WAY-FR

Instructors: Auelua-Toomey, S. (PI); Jain, V. (PI); Kong, N. (PI); Poldrack, R. (PI); Walters, J. (PI); Walther, G. (PI); Dey, A. (TA); Feldman, M. (TA); Harrison, M. (TA); Jeong, Y. (TA); Jing, A. (TA); Kirshenbaum, J. (TA); Xu, H. (TA)

## STATS 100: Mathematics of Sports

This course will teach you how statistics and probability can be applied in sports to evaluate team and individual performance, build optimal in-game strategies, and ensure fairness between participants. Topics will include examples drawn from multiple sports such as basketball, baseball, soccer, football, and tennis. The course is intended to focus on data-based applications, and will involve computations in R with real data sets via tutorial sessions and homework assignments. Prereqs: No statistical or programming background is assumed, but introductory courses, e.g., STATS 60, 101, or 116, are recommended. Prior knowledge of linear algebra (e.g., MATH 51) and basic probability is strongly recommended.

Terms: Spr | Units: 3 | UG Reqs: GER:DB-Math

Instructors: Dey, A. (PI)

## STATS 110: Statistical Methods in Engineering and the Physical Sciences

Introduction to statistics for engineers and physical scientists. Topics: descriptive statistics, probability, interval estimation, tests of hypotheses, nonparametric methods, linear regression, analysis of variance, elementary experimental design. Prerequisite: one year of calculus.

Terms: Aut | Units: 5 | UG Reqs: GER:DB-Math, WAY-AQR, WAY-FR

## STATS 116: Theory of Probability

Probability spaces as models for phenomena with statistical regularity. Discrete spaces (binomial, hypergeometric, Poisson). Continuous spaces (normal, exponential) and densities. Random variables, expectation, independence, conditional probability. Introduction to the laws of large numbers and the central limit theorem. Prerequisites: MATH 52 and familiarity with infinite series, or equivalent.

Terms: Aut, Spr, Sum | Units: 4 | UG Reqs: GER:DB-Math, WAY-AQR, WAY-FR

Instructors: Dubey, P. (PI); Schramm, T. (PI); Bhattacharya, S. (TA); Gupta, S. (TA); Zhou, K. (TA)

## STATS 141: Biostatistics (BIO 141)

Introductory statistical methods for biological data: describing data (numerical and graphical summaries); introduction to probability; and statistical inference (hypothesis tests and confidence intervals). Intermediate statistical methods: comparing groups (analysis of variance); analyzing associations (linear and logistic regression); and methods for categorical data (contingency tables and odds ratio). Course content integrated with statistical computing in R.

Terms: Win | Units: 5 | UG Reqs: GER:DB-Math, WAY-AQR

Instructors: Dubey, P. (PI)

## STATS 155: Statistical Methods in Computational Genetics

The computational methods necessary for the construction and evaluation of sequence alignments and phylogenies built from molecular and genetic data, such as microarrays and database searches. How to formulate biological problems in an algorithmically decomposed form, and building blocks common to many problems, such as Markovian models and multivariate analyses. Some software covered in labs (Python, Biopython, XGobi, MrBayes, HMMER, Probe). Prerequisites: knowledge of probability equivalent to STATS 116, STATS 202, and one class in computing at the CS 106 level. Writing intensive course for undergraduates only. Instructor consent required. (WIM)

Terms: Spr | Units: 3

## STATS 160: Introduction to Statistical Methods: Precalculus (PSYCH 10, STATS 60)

Techniques for organizing data and for computing and interpreting measures of central tendency, variability, and association. Estimation, confidence intervals, tests of hypotheses, t-tests, correlation, and regression. Possible topics: analysis of variance and chi-square tests, computer statistical packages.

Terms: Aut, Win, Spr, Sum | Units: 5

Instructors: Auelua-Toomey, S. (PI); Harrison, M. (PI); Jain, V. (PI); Kirshenbaum, J. (PI); Kong, N. (PI); Poldrack, R. (PI); Walters, J. (PI); Walther, G. (PI); Dey, A. (TA); Feldman, M. (TA); Jeong, Y. (TA); Jing, A. (TA); Xu, H. (TA)

## STATS 191: Introduction to Applied Statistics

Statistical tools for modern data analysis. Topics include regression and prediction, elements of the analysis of variance, bootstrap, and cross-validation. Emphasis is on conceptual rather than theoretical understanding. Applications to social/biological sciences. Student assignments/projects require use of the software package R. Prerequisite: introductory statistical methods course. Recommended: STATS 60, 110, or 141.

Terms: Win | Units: 3 | UG Reqs: GER:DB-Math, WAY-AQR

Instructors: Walther, G. (PI)
