![]() |
![]() |
![]() |
|
![]() |
|
![]() |
|
Category: P-values. A p-value is a measure of evidence commonly used in hypothesis testing. These pages describe some of the controversies associated with the use of p-values. Articles are arranged by date with the most recent entries at the top. You can find the theme and closely related categories, definitions, and other resources at the bottom of this page.
Stats: Choosing between two conflicting analyses (May 16, 2007). Someone wrote in and asked about an analysis where there was only a limited amount of data. The simple analysis using an odds ratio produced a significant result (p=0.048). A referee suggested that they run a logistic regression model adjusting for two covariates. These covariates were not imbalanced between the two groups. With the logistic regression model, the p-value changed from 0.048 to 0.06.
Stats: Can the p-value actually equal 1.0? (May 30, 2006). Dear Professor Mean, I have a data set that compares the proportions in two groups. In the first group, the proportion is 19% (5/26). In the second group, the proportion is also 19% (3/16). I computed a p-value of 1.0 for this data, but a referee tells me that a p-value of 1.0 is impossible. How can I convince the referee that he/she is wrong.
Stats: Relationship between sample size and p-values (February 14, 2005). I got a rather basic inquiry about p-values, but it is worth mentioning. Someone had a data set with 9,000 observations and was unhappy with the p-value that he got in a logistic regression model. So just as an experiment, he decided to replicate the data set (copy the entire matrix and paste it immediately below). This gave him a sample size of 18,000 observations. He noted that the odds ratio stayed the same but the p-value got smaller.
Stats: A small p-value does not mean a large difference (February 8, 2005). Someone asked me if the p-value for a t-test indicates the size of the difference between two groups. It turns out that the p-value is related both to the size of the difference and the sample size. In general, a very small p-value might indicate a large difference, a large sample size, or both.
Stats: Confusion about p-values (January 18, 2005). Someone wrote to me with a statement that represents a commonly held, but false belief. He stated, in effect, that a p-value of 0.06 means that there is only a 6% probability that the null hypothesis is true.
Stats: One-tailed p-values (April 12, 2004). Someone asked me how to compute one-sided p-values in SPSS. The output from SPSS always uses two-sided p-values. This was worth an explanation, so I added a new question to the Ask Professor Mean page on how to do this. There is a fierce debate about when you should use one-sided tests.
Stats: One-tailed p-values (April 12, 2004). Dear Professor Mean, SPSS produces two-tailed p-values, but I want a one-tailed p-value. How do I get this?
Theme and closely related categories:
- The Case Against Statistical Significance Testing. Carver RP. Harvard Educational Review 1978: 48(3); 378-399.
- The Earth Is Round (p < .05). Cohen J. American Psychologist 1994: 49(12); 997 - 1003.
- On the origins of the .05 level of statistical significance. Cowles M. American Psychologist 1982: 37(5); 553-8.
- Scientific Versus Statistical Inference. Dixon P, O'Reilly T. Canadian Journal of Experimental Psychology 1999: 53(2); 133 - 149.
- p Values, hypothesis tests, and likelihood: implications for epidemiology of a neglected historical debate. Goodman S. American Journal of Epidemiology 1993: 137(5); 485-95. [Medline]
- How to read a paper. Statistics for the non-statistician. II: "Significant" relations and their pitfalls. Greenhalgh T. British Medical Journal 1997: 315(7105); 422-5. [Full text]
- Basic statistics for clinicians: 1. Hypothesis testing. Guyatt G, Jaeschke R, Heddle N, Cook D, Shannon H, Walter S. Cmaj 1995: 152(1); 27-32. [Full text]
- A Picture is Worth a Thousand p Values: On the Irrelevance of Hypothesis Testing in the Microcomputer Age. Loftus GR. Behavior Research Methods, Instruments & Computers 1993: 25(2); 250-256.
- Sifting the evidence. Likelihood ratios are alternatives to P values. Perneger TV. British Medical Journal 2001: 322(7295); 1184-5. [Full text]
- Is statistical significance testing useful in interpreting data? Savitz DA. Reprod Toxicol 1993: 7(2); 95-100. [Medline]
- Sifting the evidence- what's wrong with significance tests? Sterne JAC, Smith GD. BMJ 2001: 322; 226-231. [Medline] [Full text] [PDF]
- Gergen versus the mainstream: Are hypothesis in social psychology subject to empirical test? Wallach L, Wallach MA. J. Pers. Soc. Psychol. 1994: 67; 233-242.
- Understanding P-values. Berger J, Duke University. Accessed on 2003-03-19. www.stat.duke.edu/~berger/p-values.html
- P Values. Dallal GE, Tufts University. Accessed on 2003-03-19. www.tufts.edu/~gdallal/pval.htm
- Clinical vs Statistical Significance. Hopkins WG, Sportscience. Accessed on 2003-03-17. www.sportsci.org/jour/0103/inbrief.htm
- The Insignificance of Statistical Significance Testing. Johnson DH, Based on the publication Johnson, Douglas H. 1999. The Insignificance of Statistical Significance Testing. Journal of Wildlife Management 63(3):763-772. Accessed on 2005-01-18. www.npwrc.usgs.gov/resource/1999/statsig/statsig.htm
- Special Issue: Statistical Significance Testing. Roberts D, Penn State University. Accessed on 2003-03-20. roberts.ed.psu.edu/users/droberts/sigtest.htm
- What is a P-value? [pdf]. Thisted R. Accessed on 2003-06-20. www.stat.uchicago.edu/~thisted/Distribute/pvalue.pdf
- 326 Articles/Books Questioning the Indiscriminate Use of Statistical Hypothesis Tests in Observational Studies. Thompson WL. Accessed on 2003-03-19. www.cnr.colostate.edu/~anderson/thompson1.html
[Return to full topic list] [Read current weblog entries]
This webpage was written by Steve Simon on 2007-09-11, edited by Steve Simon, and was last modified on 2008-07-08. Send feedback to ssimon at cmh dot edu or click on the email link at the top of the page.