Or: How Robust are Scientific Results?
Hochschule für Musik, Theater und Medien – Hannover
Institut für Musikphysiologie und Musiker-Medizin
2026-05-08
Aczel et al. (2026)
Aczel, B., Szaszi, B., Clelland, H. T., Kovacs, M., Holzmeister, F., Ravenzwaaij, D. van, et al. (2026). Investigating the analytical robustness of the social and behavioural sciences. Nature 652, 135–142. doi: 10.1038/s41586-025-09844-9
| | mom_iq | kid_score |
|---|---|---|
| 1 | 121.1 | 65 |
| 2 | 89.4 | 98 |
| 3 | 115.4 | 85 |
| 4 | 99.4 | 83 |
| 5 | 92.7 | 115 |
| 6..429 | … | … |
| 430 | 84.9 | 94 |
| 431 | 93.0 | 76 |
| 432 | 94.9 | 50 |
| 433 | 96.9 | 88 |
| 434 | 91.3 | 70 |
Is there a relation between the mother's IQ (mom_iq) and the kid's test score (kid_score)?
What would be the best method to answer this question?
It depends…
Test the scores of kids with low-IQ moms against the scores of kids with high-IQ moms
Test (high-IQ moms vs low-IQ moms) vs (high-score kids vs low-score kids)
Define a binary variable high_iq: 1 for moms with IQ >= 100, 0 otherwise.
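A minimal sketch of this dichotomization in R. The data frame name `kids` is taken from the chi-squared output in these slides; the data here are only the ten rows visible in the table above, so any result computed from them will differ from the full 434-row results.

```r
# Toy data: only the ten visible rows (1-5 and 430-434) of the 434-row data set
kids <- data.frame(
  mom_iq    = c(121.1, 89.4, 115.4, 99.4, 92.7, 84.9, 93.0, 94.9, 96.9, 91.3),
  kid_score = c(65, 98, 85, 83, 115, 94, 76, 50, 88, 70)
)

# 1 for moms with IQ >= 100, 0 otherwise
kids$high_iq <- as.integer(kids$mom_iq >= 100)

# Group sizes after the split
table(kids$high_iq)
```

Note that the cut-off at 100 is itself an analytical choice; a different threshold would produce different groups.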
Welch Two Sample t-test
data: kid_score by high_iq
t = -9, df = 431, p-value <2e-16
alternative hypothesis: true difference in means between group 0 and group 1 is not equal to 0
95 percent confidence interval:
-19.3 -12.2
sample estimates:
mean in group 0 mean in group 1
79.7 95.5
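Output of this kind could come from a call like the one sketched below. This is an assumption about the call, not the author's code, and it runs on the ten visible rows only, so the numbers will not match the output above (the high-IQ group has just two observations here).

```r
# Toy data: only the ten visible rows of the 434-row data set
kids <- data.frame(
  mom_iq    = c(121.1, 89.4, 115.4, 99.4, 92.7, 84.9, 93.0, 94.9, 96.9, 91.3),
  kid_score = c(65, 98, 85, 83, 115, 94, 76, 50, 88, 70)
)
kids$high_iq <- as.integer(kids$mom_iq >= 100)

# t.test() uses var.equal = FALSE by default, which gives the Welch variant
res <- t.test(kid_score ~ high_iq, data = kids)

res$method    # name of the test that was run
res$estimate  # group means
res$conf.int  # 95% confidence interval for the difference in means
```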
Wilcoxon rank sum test with continuity correction
data: kid_score by high_iq
W = 12716, p-value = 4e-16
alternative hypothesis: true location shift is not equal to 0
95 percent confidence interval:
-19 -12
sample estimates:
difference in location
-15
Always plot the distributions of both groups: if you read the Mann-Whitney (Wilcoxon rank sum) test as a test of medians, it can lead you to falsely reject the null when the two distributions differ in shape or spread even though their medians are identical!
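This caveat can be illustrated with a small simulation. The sketch below uses simulated data, not the kids data; the sample size, seed, and choice of distributions are arbitrary assumptions made for illustration. Both populations have a median of exactly 0, but one is right-skewed and the other left-skewed.

```r
set.seed(1)  # arbitrary seed, for reproducibility only
n <- 1000

# The median of an Exp(1) variable is log(2); shifting by it gives median 0
x <- rexp(n) - log(2)      # right-skewed, population median 0
y <- -(rexp(n) - log(2))   # left-skewed mirror image, population median 0

# Sample medians: both close to 0
c(median_x = median(x), median_y = median(y))

# The rank sum test compares the whole distributions, not just the medians
res <- wilcox.test(x, y)
res$p.value

# Plot both groups before interpreting the test
boxplot(list(x = x, y = y), main = "Same median, different shape")
```

The test rejects here because the probability that a value from one group exceeds a value from the other is not 0.5, which is what the rank sum statistic is sensitive to.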
Are mom_iq and kid_score associated? (correlation)
Does mom_iq explain/predict kid_score? (OLS regression)
(Intercept) standard_mom_iq
86.80 0.61
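Both questions can be sketched in R as follows. The slides do not define `standard_mom_iq`; the code assumes it is `mom_iq` centered at its sample mean, which is only a guess. The data are again the ten visible rows, so the coefficients will not match those reported above.

```r
# Toy data: only the ten visible rows of the 434-row data set
kids <- data.frame(
  mom_iq    = c(121.1, 89.4, 115.4, 99.4, 92.7, 84.9, 93.0, 94.9, 96.9, 91.3),
  kid_score = c(65, 98, 85, 83, 115, 94, 76, 50, 88, 70)
)

# Assumption: standard_mom_iq is mom_iq centered at its sample mean
kids$standard_mom_iq <- kids$mom_iq - mean(kids$mom_iq)

# Association: Pearson correlation with a significance test
ct <- cor.test(kids$mom_iq, kids$kid_score)
ct$estimate

# Prediction: OLS regression of kid_score on the centered predictor
fit <- lm(kid_score ~ standard_mom_iq, data = kids)
coef(fit)
```

With a centered predictor the intercept equals the mean of `kid_score`, and the slope is the expected change in score per IQ point.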
By dichotomizing both mom_iq and kid_score we can test independence:
| high_iq | high_score = 0 | high_score = 1 |
|---|---|---|
| 0 | 138 | 101 |
| 1 | 48 | 147 |
Pearson's Chi-squared test with Yates' continuity correction
data: tabyl(kids, high_iq, high_score)
X-squared = 47, df = 1, p-value = 8e-12
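The test can be re-run directly from the four cell counts shown above. The sketch below builds the 2x2 table by hand with base R instead of `tabyl()`, which is a substitution made here so that the example needs no extra packages.

```r
# Cell counts copied from the contingency table (rows: high_iq, columns: high_score)
tab <- matrix(
  c(138, 48, 101, 147),  # filled column by column
  nrow = 2,
  dimnames = list(high_iq = c("0", "1"), high_score = c("0", "1"))
)

# For 2x2 tables chisq.test() applies Yates' continuity correction by default
res <- chisq.test(tab)

res$statistic  # about 46.8, i.e. 47 after rounding
res$parameter  # degrees of freedom: 1
res$p.value
```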
| id | Test | (adjusted) Cohen's d | SE |
|---|---|---|---|
| 1 | Welch Two Sample t-test | 0.834 | 0.102 |
| 2 | Wilcoxon rank sum test with continuity correction | 0.546 | NA |
| 3 | Correlation | 1.003 | 0.002 |
| 4 | OLS regression | 1.003 | 0.000 |
| 5 | Pearson's Chi-squared test with Yates' continuity correction | 2.307 | NA |
Calculation of (adjusted) Cohen’s d after Borenstein (2009)
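As one example of such a conversion, a correlation coefficient r can be turned into Cohen's d with d = 2r / sqrt(1 - r^2), a standard formula given by Borenstein et al. Whether this exact formula, or any adjustment on top of it, produced the values in the table is an assumption; `r_to_d` is a hypothetical helper name and the input 0.5 is an arbitrary illustrative value.

```r
# Hypothetical helper: convert a correlation r into Cohen's d
r_to_d <- function(r) {
  stopifnot(all(abs(r) < 1))  # the conversion is undefined for |r| = 1
  2 * r / sqrt(1 - r^2)
}

# Arbitrary illustrative input, not a value from the slides
r_to_d(0.5)
```

Conversions of this kind are what put the results of different tests on a common scale so that they can be compared.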
Given that the different methods yield different results:
IMMM 2026