diff --git a/education/statistics/Hypothesis Tests.md b/education/statistics/Hypothesis Tests.md
index 1a33502..3af3533 100644
--- a/education/statistics/Hypothesis Tests.md
+++ b/education/statistics/Hypothesis Tests.md
@@ -60,9 +60,11 @@ Investigators should:
 Investigators should not:
 - Blindly compare P to 5% or 1%, there are real world factors that change how important this value is.
 
+### Data Mining/Snooping/P-hacking
+If $n$ tests are made at significance level $p$, you can expect about $n \cdot p$ of them to be statistically significant (an outlier/unusual result) by chance alone.
+Data snooping is **a form of statistical bias: manipulating data or analysis to artificially get statistically significant results**.
 
-### Data Mining/Snooping
-
+Reliable investigators test their conclusions on an independent batch of data (Ch 28-29, stat 1040).
 
 ## Goodness of fit tests ($\chi ^2$)
 
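
A rough illustration of the $n * p$ claim in the added section: the sketch below runs many tests on pure noise, where the null hypothesis is true every time, and counts how many still come out "significant". The two-sided z-test with known $\sigma = 1$, the sample size of 30, and the function names are assumptions chosen for the example; none of them come from the notes.

```python
import math
import random


def two_sided_p_value(sample):
    """Two-sided z-test p-value for H0: mean = 0, assuming known sigma = 1."""
    m = len(sample)
    z = sum(sample) / m * math.sqrt(m)
    # For a standard normal Z, P(|Z| >= |z|) equals erfc(|z| / sqrt(2)).
    return math.erfc(abs(z) / math.sqrt(2))


def count_false_positives(n_tests, sample_size, alpha, seed=0):
    """Run n_tests tests on pure noise and count how many fall below alpha."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(n_tests):
        # Every sample really has mean 0, so any "significant" result is spurious.
        sample = [rng.gauss(0.0, 1.0) for _ in range(sample_size)]
        if two_sided_p_value(sample) < alpha:
            hits += 1
    return hits


n_tests, alpha = 1000, 0.05
hits = count_false_positives(n_tests, sample_size=30, alpha=alpha)
print(f"expected about {n_tests * alpha:.0f} spurious results, observed {hits}")
```

The observed count varies with the seed but should stay near $n * p = 50$. This is why a single "significant" result picked out of many tests is weak evidence until it is confirmed on an independent batch of data.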