New Tools for Designing Powerful Studies

October 18, 2017

Tags:

Abstract illustration with arrows and gears

Why do studies fail to replicate? There are several possible explanations but a notable one is that many studies are underpowered – that is, they have sample sizes that are simply too small given the size of the effect under investigation. In an article in Psychological Science, researchers from the University of Notre Dame explain why many studies end up inadequately powered and offer open-source tools that can help researchers proactively avoid the problem.

Statistical power, as psychological scientists Samantha F. Anderson, Ken Kelley, and Scott E. Maxwell describe in their article, is the “probability of rejecting the null hypothesis of no effect when the true effect is nonnull in the population.” Researchers want to be fairly confident that they’ll be able to detect an effect if one truly does exist – ensuring their studies have adequate power is an important component of experimental design.

To do this, they calculate the total number of participants needed to detect an effect of a specific size with their targeted level of power. Researchers can’t know how big an effect actually is in the population, so they often estimate it using effect sizes in published studies. And this is where the problem arises, Anderson and colleagues argue, as such effect-size estimates have several inherent flaws.

One notable flaw, the researchers explain, is that an effect size in published research is likely to be greater than the true population effect size due to the so-called file drawer problem. A publication bias that strongly favors statistically significant findings produces a literature with “upwardly biased” effect size estimates.

Estimates based on previously published effect sizes also fail to account for the uncertainty intrinsic to statistical inferences. Researchers can specify the uncertainty of an effect size via a confidence interval that indicates the range of values within which the true population effect size is likely to exist. This uncertainty is often ignored, however, when researchers use the single-value point estimate from published studies to determine the sample size required for their own studies.

“Given the ubiquity of bias and uncertainty in estimates of effect size, researchers who conscientiously plan their sample sizes using published effect sizes from prior studies can have actual power that is abysmal, especially when the population effect size is small,” Anderson, Kelley, and Maxwell write.

Underpowered studies mean that researchers may not be able to detect effects when they do exist, but they can also have other consequences, including increasing the proportion of studies in the literature that falsely find an effect when it doesn’t exist and producing effect-size estimates that are inflated. In a broader context, they also limit the replicability of study findings.

Building on a strategy originally proposed by Taylor and Muller in 1996, Anderson and colleagues outline a procedure that enables researchers to account for these flaws from the beginning by adjusting effect-size estimates for publication bias and uncertainty.

Researchers can use this method for free via an open-source R package (BUCSS) and web-based apps – they simply need to have a few key pieces of information to use these platforms.

“We hope that more accurate estimates of effect size will result in new psychological studies that are more adequately powered and will lead to a replicable literature that inspires more confidence and is less in crisis,” Anderson, Kelley, and Maxwell conclude.

BUCSS (Bias and Uncertainty Corrected Sample Size) R package

Shiny Web Apps

Reference

Anderson, S.F., Kelley, K., & Maxwell, S.E. (2017). Sample-size planning for more accurate statistical power: A method adjusting sample effect sizes for publication bias and uncertainty. Psychological Science. doi:10.1177/0956797617723724

Publications > Observer > Observations > New Tools for Designing Powerful Studies

Robot hand touching human hand illustration

The Biggest Threat to Online Data Collection Is Humans, Not Bots

Concerns about bots answering online surveys are exaggerated, but a new threat is emerging in artificial intelligence agents.

Emojis ranging from negative to positive emotions.

“Lesser of Two Evils”: Applying Artificial Intelligence to Move Beyond Self-Reports

Two researchers advocate for new AI-based measures not because they offer measurement free from error, but rather because they avoid specific problematic forms of error linked to overreliance on self-reports.

APS regularly opens certain online articles for discussion on our website. Effective February 2021, you must be a logged-in APS member to post comments. By posting a comment, you agree to our Community Guidelines and the display of your profile information, including your name and affiliation. Any opinions, findings, conclusions, or recommendations present in article comments are those of the writers and do not necessarily reflect the views of APS or the article’s author. For more information, please see our Community Guidelines.

Please login with your APS account to comment.

Cookie	Duration	Description
at-rand	never	AddThis sets this cookie to track page visits, sources of traffic and share counts.
CONSENT	2 years	YouTube sets this cookie via embedded youtube-videos and registers anonymous statistical data.
uvc	1 year 27 days	Set by addthis.com to determine the usage of addthis.com service.
_ga	2 years	The _ga cookie, installed by Google Analytics, calculates visitor, session and campaign data and also keeps track of site usage for the site's analytics report. The cookie stores information anonymously and assigns a randomly generated number to recognize unique visitors.
_gat_gtag_UA_3507334_1	1 minute	Set by Google to distinguish users.
_gid	1 day	Installed by Google Analytics, _gid cookie stores information on how visitors use a website, while also creating an analytics report of the website's performance. Some of the data that are collected include the number of visitors, their source, and the pages they visit anonymously.

Cookie	Duration	Description
loc	1 year 27 days	AddThis sets this geolocation cookie to help understand the location of users who share the information.
VISITOR_INFO1_LIVE	5 months 27 days	A cookie set by YouTube to measure bandwidth that determines whether the user gets the new or old player interface.
YSC	session	YSC cookie is set by Youtube and is used to track the views of embedded videos on Youtube pages.
yt-remote-connected-devices	never	YouTube sets this cookie to store the video preferences of the user using embedded YouTube video.
yt-remote-device-id	never	YouTube sets this cookie to store the video preferences of the user using embedded YouTube video.
yt.innertube::nextId	never	This cookie, set by YouTube, registers a unique ID to store data on what videos from YouTube the user has seen.
yt.innertube::requests	never	This cookie, set by YouTube, registers a unique ID to store data on what videos from YouTube the user has seen.

New Tools for Designing Powerful Studies

Related

How Should Psychologists Use AI and Big Data? Nine Guides Point the Way

The Biggest Threat to Online Data Collection Is Humans, Not Bots

“Lesser of Two Evils”: Applying Artificial Intelligence to Move Beyond Self-Reports

Related

How Should Psychologists Use AI and Big Data? Nine Guides Point the Way

The Biggest Threat to Online Data Collection Is Humans, Not Bots

“Lesser of Two Evils”: Applying Artificial Intelligence to Move Beyond Self-Reports

Cookies