why-t-test-dont-work-with-large-samples

statistics

Usually, the goal of statistical tests is to show with high degree of confidence that an empirically estimated statistic is similar to a theoretically derived statistic
- The null hypothesis (the two statistics are similar) and alternative hypothesis (the two statistics are not) are presented to represent this where:
- $H_{null}: |\hat{\mu}-\mu| <=\Delta, H_{alt}: |\hat{\mu} - \mu| > \Delta$
- The $p_{value}$ from the test is what is the likelihood that we can observe an value larger than the test-statistic under the static distribution from chance
  - It is used to reject the null hypothesis, if $p_{value} < \alpha$
  - We establish a critical value $\alpha$ that is the threshold for which we can evaluate the $p_{value}$ with
  - ==The critical value should be established with the context of sample size==
As the sample size $N$ increases, the standard error gets smaller and the test statistic gets larger and $p_{value}$ gets smaller
- Then, the $\alpha$ needed to reject the null hypothesis needs to decrease because any differences no matter how small will be “significantly” different than the theoretical value
- At some point the critical value $\alpha$ needs to be incredibly small for huge samples
A way to evaluate large samples is to calculate the confidence intervals and see if that embraces the theoretically derived statistic
t-test is used to to measure the mean, z-test is to measure the mean when there is a large sample and the distribution is normal