Variance of a sample ($s^2$) divides by n−1 (Bessel’s correction)—this gives an unbiased estimate of the population variance; dividing by n systematically underestimates it, because sample points are closer to the sample mean than to the true mean. (Note: $s$ itself is still a slightly biased estimate of the population SD.)
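A quick simulation of Bessel’s correction (numbers here are arbitrary choices for illustration): average the n and n−1 estimators over many small samples and compare to the population variance.

```python
import numpy as np

rng = np.random.default_rng(0)
pop = rng.normal(loc=0.0, scale=2.0, size=100_000)  # "population", variance ~ 4
true_var = pop.var()  # population variance (divide by N)

# Average both estimators over many small samples of size 5:
biased, unbiased = [], []
for _ in range(5_000):
    sample = rng.choice(pop, size=5)
    biased.append(sample.var(ddof=0))    # divide by n   -> biased low
    unbiased.append(sample.var(ddof=1))  # divide by n-1 -> unbiased
print(np.mean(biased), np.mean(unbiased), true_var)
```

The ddof=0 average lands near (n−1)/n times the true variance; ddof=1 lands near the true variance.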
Sampling distribution: the distribution of a statistic (e.g. the sample mean) computed over repeated random samples of the same size
https://stats.stackexchange.com/questions/116889/why-dont-we-use-the-unbiased-sample-variance-to-calculate-the-standard-error
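A sketch of the sampling distribution of the mean (population and sizes chosen arbitrarily): even for a skewed population, the means of repeated samples center on the population mean with spread ≈ σ/√n (the standard error).

```python
import numpy as np

rng = np.random.default_rng(42)
n, reps = 50, 10_000
# Skewed population: exponential with mean 1 and SD 1
sample_means = rng.exponential(scale=1.0, size=(reps, n)).mean(axis=1)

print(sample_means.mean())       # close to the population mean, 1.0
print(sample_means.std(ddof=1))  # close to sigma / sqrt(n) = 1 / sqrt(50)
```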
When to use z (normal) vs t distribution/interval/test?
Relationship: confidence intervals, significance level (alpha), and critical values
means: t test, since you often don’t know the population SD. Or z test if n is large enough.
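A minimal one-sample t test sketch (the data and hypothesized mean are made up): scipy does the t test with df = n − 1, and the statistic matches the by-hand formula (x̄ − μ₀) / (s/√n).

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(1)
x = rng.normal(loc=10.4, scale=2.0, size=25)  # hypothetical measurements

# One-sample t test of H0: mu = 10.0 (population SD unknown -> t, df = n - 1)
t_stat, p = stats.ttest_1samp(x, popmean=10.0)

# Same statistic by hand:
se = x.std(ddof=1) / np.sqrt(len(x))
t_manual = (x.mean() - 10.0) / se
print(t_stat, p)
```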
proportions: z test
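A one-proportion z test by hand (counts are hypothetical): under H0 the standard error uses the hypothesized proportion p0, and the p-value comes from the normal distribution.

```python
import numpy as np
from scipy import stats

# H0: p = 0.5; observed 58 successes out of 100 trials
successes, n, p0 = 58, 100, 0.5
p_hat = successes / n
se = np.sqrt(p0 * (1 - p0) / n)  # SE under H0 uses p0, not p_hat
z = (p_hat - p0) / se
p_value = 2 * stats.norm.sf(abs(z))  # two-sided
print(z, p_value)
```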
difference of 2 proportions, always independent samples (proportions can’t be paired): z test on the difference of the sample proportions, where the variance of the difference is the sum of the variances (use the pooled proportion for the SE under H0).
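A two-proportion z test sketch with made-up counts: under H0: p1 = p2 the SE uses the pooled proportion, and the variance of the difference is the sum of the two per-group variance terms.

```python
import numpy as np
from scipy import stats

# Hypothetical counts: 45/200 in group 1 vs 30/180 in group 2
x1, n1, x2, n2 = 45, 200, 30, 180
p1, p2 = x1 / n1, x2 / n2

p_pool = (x1 + x2) / (n1 + n2)  # pooled proportion under H0: p1 = p2
# Variance of the difference = sum of the two variances:
se = np.sqrt(p_pool * (1 - p_pool) * (1 / n1 + 1 / n2))
z = (p1 - p2) / se
p_value = 2 * stats.norm.sf(abs(z))
print(z, p_value)
```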
difference of 2 means, paired: t test on just the paired differences (example with t interval). The variance is not the sum of the variances, since the samples aren’t independent.
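A paired t test sketch (before/after values are simulated): it is exactly a one-sample t test on the per-subject differences, which the code below confirms.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(2)
# Hypothetical before/after measurements on the same 12 subjects
before = rng.normal(100, 10, size=12)
after = before + rng.normal(3, 4, size=12)  # same subjects -> not independent

# Paired t test == one-sample t test on the differences:
t_paired, p_paired = stats.ttest_rel(after, before)
t_diff, p_diff = stats.ttest_1samp(after - before, popmean=0.0)
print(t_paired, p_paired)
```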
difference of 2 means, independent samples: bootstrapped resampling on a computer, or t-test/z-test (if n>30) of the difference, where the variance of the difference is the sum of the variances. df comes from the Welch–Satterthwaite approximation (a conservative shortcut is min(n1, n2) − 1). (example)
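Both approaches from the note above, on made-up groups: Welch’s t test (scipy’s `equal_var=False`, which sums the variances and uses the Welch–Satterthwaite df) and a simple bootstrap of the difference of means.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(3)
a = rng.normal(10, 2, size=40)  # hypothetical group A
b = rng.normal(11, 3, size=35)  # hypothetical group B

# Welch's t test: variance of the difference = sum of variances,
# df from the Welch-Satterthwaite approximation
t_stat, p = stats.ttest_ind(a, b, equal_var=False)

# Bootstrap alternative: resample each group with replacement,
# collect the difference of means, take percentiles for a 95% CI
diffs = [rng.choice(a, len(a)).mean() - rng.choice(b, len(b)).mean()
         for _ in range(5_000)]
ci = np.percentile(diffs, [2.5, 97.5])
print(t_stat, p, ci)
```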
regression analysis: models/tests the relationship between continuous variables (for a relationship between categorical variables, use the chi square test of independence)
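A simple linear regression sketch on synthetic data: `scipy.stats.linregress` returns the slope, intercept, and the p-value of a t test of H0: slope = 0 (i.e. no linear relationship).

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(4)
x = np.linspace(0, 10, 50)
y = 2.0 * x + 1.0 + rng.normal(0, 1, size=50)  # hypothetical linear data

# Fit y ~ x; pvalue tests H0: slope = 0
res = stats.linregress(x, y)
print(res.slope, res.intercept, res.pvalue)
```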
chi square tests (video on the differences)
chi square test for goodness of fit: test if 1 sample of a multinomial matches an expected distribution. df = k−1, where k is the number of categories (not the sample size)
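A goodness-of-fit sketch with made-up die rolls: compare observed counts against the expected counts for a fair die, with df = k − 1 = 5.

```python
from scipy import stats

# 60 rolls of a die: is it fair? Expected count per face = 10.
observed = [8, 9, 12, 11, 9, 11]
expected = [10] * 6
chi2, p = stats.chisquare(observed, expected)  # df = k - 1 = 5
print(chi2, p)
```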
chi square test of independence / for relationship (e.g. with 2-way tables): check whether two categorical variables are related, i.e. whether they’re independent. The test of homogeneity computes the same statistic but asks whether several groups share the same distribution—the difference is in the sampling design, not the math. dof=(r-1)(c-1)
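An independence test sketch on a hypothetical 2×3 contingency table: `scipy.stats.chi2_contingency` computes the expected counts from the margins and uses dof = (r−1)(c−1).

```python
import numpy as np
from scipy import stats

# Hypothetical 2x3 table of counts (rows = groups, cols = categories)
table = np.array([[20, 30, 25],
                  [30, 20, 25]])
chi2, p, dof, expected = stats.chi2_contingency(table)
print(chi2, p, dof)  # dof = (2-1)(3-1) = 2
```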
F-test / one-way ANOVA: compare means of a continuous outcome across more than 2 groups (generalizes the t test).
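A one-way ANOVA sketch on three simulated groups: `scipy.stats.f_oneway` runs the F test of H0: all group means are equal.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(5)
# Three hypothetical groups with slightly shifted means
g1 = rng.normal(5.0, 1.0, size=30)
g2 = rng.normal(5.5, 1.0, size=30)
g3 = rng.normal(6.0, 1.0, size=30)

# F test of H0: mean(g1) == mean(g2) == mean(g3)
f_stat, p = stats.f_oneway(g1, g2, g3)
print(f_stat, p)
```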