How Marginal Are ‘Marginally Significant’ p-Values?

Observation

February 27, 2019

Tags:

Methodology

As the research community debates whether the p-value should be swept into the statistical dustbin, the question remains: How are authors actually presenting p-values? Are authors reporting only the values that make the .05 cutoff or are they reporting every p-value, significant or not? And for the values that reside above .05, how often do authors succumb to the temptation of the “marginally significant”?

In a 2016 study in Psychological Science, Pritschet and colleagues found cause for concern: The number of articles reporting marginally significant results had increased over time. But Tilburg University researchers Anton Olsson-Collentine, Marcel A. L. M. van Assen, and Chris H. J. Hartgerink found a different trend when they accounted for base rates. Their findings appear in Psychological Science.

Olsson-Collentine and colleagues argue that since authors now report more p-values per article than they used to, more articles will also contain p-values between .05 and .10. Consequently, even if the proportion of p-values reported as “marginally significant” stays the same over time, one would expect more articles to contain “marginally significant” results. In other words, observing that more articles contain “marginally significant” results doesn’t necessarily mean that the tendency to report any given p-value as “marginally significant” is actually increasing.
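The base-rate argument can be illustrated with a little arithmetic. The sketch below is not the researchers' analysis; it assumes, purely for illustration, that each reported p-value has the same fixed and independent chance of being described as “marginally significant,” and the 2% rate and the per-article counts are hypothetical numbers chosen for the example.

```python
def share_of_articles_with_marginal(n_pvalues: int, rate_per_pvalue: float) -> float:
    """Chance that an article with n_pvalues reported p-values contains
    at least one described as "marginally significant".

    Assumes each p-value is labelled independently with the same
    probability (an illustrative simplification, not a claim about real data).
    """
    return 1.0 - (1.0 - rate_per_pvalue) ** n_pvalues


# Hypothetical per-p-value rate, held constant across "years".
RATE = 0.02

# Only the number of p-values reported per article changes.
for n in (5, 10, 20, 40):
    share = share_of_articles_with_marginal(n, RATE)
    print(f"{n:>2} p-values per article -> {share:.1%} of articles affected")
```

Under these assumptions the share of articles containing at least one such result climbs as articles report more p-values, even though the per-p-value rate never changes.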

The researchers used regular expressions to search for and automatically extract p-values from articles published in 70 American Psychological Association journals between 1985 and 2016.

To identify results described as marginally significant, the researchers searched for any mention of “margin*” or “approach*” in the 200 characters before and after each p-value. The final sample comprised 42,504 p-values between .05 and .10.
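A search of this kind might look roughly like the following sketch. It is not the researchers' code: The p-value pattern, the function name, and the treatment of the interval as .05 < p ≤ .10 are all assumptions made for illustration, and real articles report p-values in many more formats than this pattern handles.

```python
import re

# Assumed reporting style: "p = .07" or "p = 0.07" (exact values only).
P_VALUE = re.compile(r"\bp\s*=\s*(0?\.\d+)", re.IGNORECASE)

# Words beginning with "margin" or "approach", e.g. "marginally", "approached".
MARGINAL = re.compile(r"\b(?:margin|approach)\w*", re.IGNORECASE)


def find_marginal_pvalues(text, low=0.05, high=0.10, window=200):
    """Return (p_value, described_as_marginal) pairs for p-values in (low, high].

    A p-value counts as "described as marginal" if a keyword appears within
    `window` characters before or after it. The interval bounds are an
    assumption of this sketch.
    """
    hits = []
    for match in P_VALUE.finditer(text):
        value = float(match.group(1))
        if not (low < value <= high):
            continue
        start = max(0, match.start() - window)
        context = text[start:match.end() + window]
        hits.append((value, bool(MARGINAL.search(context))))
    return hits


example = "The interaction approached significance, F(1, 40) = 3.2, p = .08."
print(find_marginal_pvalues(example))
```

Dividing the number of flagged p-values by the total number of p-values in the interval gives a rate per p-value rather than a rate per article.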

In line with the results reported by Pritschet et al. (2016), the analysis showed an increase in articles reporting “marginally significant” results in two journals: Journal of Personality and Social Psychology and Developmental Psychology.

But closer inspection of the data revealed a more complex story. In Developmental Psychology, the percentage of p-values between .05 and .10 that were described as “marginally significant” actually decreased over time, but this was masked by an increase in the overall number of reported p-values that fell between .05 and .10.

This finding “demonstrates the importance of distinguishing results at the level of the articles from those at the level of p-values,” Olsson-Collentine and colleagues write.

Overall, the researchers found results described as “marginally significant” to be quite common, characterizing about 40% of all the p-values in the sample that fell between .05 and .10. Across nine psychology disciplines represented in the journals, they found the practice to be most common in journals focused on organizational psychology (45% of p-values between .05 and .10) and least common in those focused on clinical psychology (30% of p-values between .05 and .10).

Of note, the results showed that the percentage of p-values reported as “marginally significant” decreased over time across all journals, and also within most of the disciplines. In no discipline was there evidence of an increasing percentage of “marginally significant” results, although the trend was largely stable over time for several disciplines.

Olsson-Collentine, van Assen, and Hartgerink suggest several possible explanations for decreasing usage of “marginally significant” to describe individual p-values, including increasing statistical awareness on the part of researchers and increasingly stringent editorial criteria.

“Such a high prevalence is a call for disciplines and journal editors to examine where they stand on the reporting of p-values as marginally significant,” Olsson-Collentine says. “We recommend not interpreting p-values between .05 and .1 as marginally significant due to their low evidential value, and note that doing so might be an indication of post-hoc flexibility in decision rules.”

References

Olsson-Collentine, A., van Assen, M. A. L. M., & Hartgerink, C. H. J. (2019). The prevalence of marginally significant results in psychology over time. Psychological Science. doi.org/10.1177/0956797619830326

Pritschet, L., Powell, D., & Horne, Z. (2016). Marginally significant effects as evidence for hypotheses: Changing attitudes over four decades. Psychological Science, 27(7), 1036–1042. doi.org/10.1177/0956797616645672
