From bf97dddb847d3b06c9c625360730b46f900fb205 Mon Sep 17 00:00:00 2001
From: zleyyij <zleyyij@noreply.users.github.com>
Date: Tue, 13 Feb 2024 14:02:56 -0700
Subject: [PATCH] vault backup: 2024-02-13 14:02:56

---
 education/statistics/Hypothesis Tests.md | 6 ++++--
 1 file changed, 4 insertions(+), 2 deletions(-)

diff --git a/education/statistics/Hypothesis Tests.md b/education/statistics/Hypothesis Tests.md
index 1a33502..3af3533 100644
--- a/education/statistics/Hypothesis Tests.md	
+++ b/education/statistics/Hypothesis Tests.md	
@@ -60,9 +60,11 @@ Investigators should:
 
 Investigators should not:
 - Blindly compare P to 5% or 1%, there are real world factors that change how important this value is. 
+### Data Mining/Snooping/P-hacking
+If $n$ tests are made, you can expect that $n * p$ tests are statistically significant (an outlier/unusual result).
+Data snooping is **a form of statistical bias manipulating data or analysis to artificially get statistically significant results**.
 
-### Data Mining/Snooping
-
+Reliable investigators test their conclusions on an independant batch of data
 
 (Ch 28-29, stat 1040)
 ## Goodness of fit tests ($\chi ^2$)