From bf97dddb847d3b06c9c625360730b46f900fb205 Mon Sep 17 00:00:00 2001 From: zleyyij Date: Tue, 13 Feb 2024 14:02:56 -0700 Subject: [PATCH] vault backup: 2024-02-13 14:02:56 --- education/statistics/Hypothesis Tests.md | 6 ++++-- 1 file changed, 4 insertions(+), 2 deletions(-) diff --git a/education/statistics/Hypothesis Tests.md b/education/statistics/Hypothesis Tests.md index 1a33502..3af3533 100644 --- a/education/statistics/Hypothesis Tests.md +++ b/education/statistics/Hypothesis Tests.md @@ -60,9 +60,11 @@ Investigators should: Investigators should not: - Blindly compare P to 5% or 1%, there are real world factors that change how important this value is. +### Data Mining/Snooping/P-hacking +If $n$ tests are made, you can expect that $n * p$ tests are statistically significant (an outlier/unusual result). +Data snooping isĀ **a form of statistical bias manipulating data or analysis to artificially get statistically significant results**. -### Data Mining/Snooping - +Reliable investigators test their conclusions on an independant batch of data (Ch 28-29, stat 1040) ## Goodness of fit tests ($\chi ^2$)