Advances in Methods and Practices in Psychological Science

Bridging Traditional-Statistics and Machine-Learning Approaches in Psychology: Navigating Small Samples, Measurement Error, Nonindependent Observations, and Missing Data

Rosa Lavelle-Hill, Gavin Smith, Kou Murayama

Volume 8, Issue 3 | July 2025

https://doi.org/10.1177/25152459251345696

Abstract

In recent years, machine learning has propagated into different aspects of psychological research, and supervised machine-learning methods have increasingly been used as a tool for predicting human behavior or psychological characteristics when there is a large number of possible predictors. However, researchers often face practical challenges when using machine-learning methods on psychological data. In this article, we identify and discuss four key challenges that often arise when applying machine learning to data collected for psychological research. The four challenge areas cover (a) limited sample size, (b) measurement error, (c) nonindependent data, and (d) missing data. Such challenges are extensively discussed in the “traditional” statistical literature but are often not explicitly addressed, or at least not to the same extent, in the applied-machine-learning community. We present how each of these challenges is dealt with first from a traditional-statistics perspective and then from a machine-learning perspective and discuss the strengths and weaknesses of these solutions by comparing the approaches. We argue that the boundary between traditional statistics and machine learning is fluid and emphasize the need for cross-disciplinary collaboration to better tackle these core challenges and improve replicability.

Robot Hands Holding a Blank Brown Notebook

AI Revolution or Revulsion? APS Journal Editors Weigh In

As AI dominates conversations in psychological science, journal editors are faced with a suite of decisions on how they will incorporate these new tools into their editorial processes. Even within APS’s seven academic journals, opinions and stances vary.

Hands typing on a laptop with yellow folders emerging from a cloud, symbolizing cloud computing and data storage, on a textured blue background.

Data Sharing Is Growing but Looks Different for Qualitative and Quantitative Methods

Quantitative and qualitative approaches face different challenges and expectations, particularly when it comes to data sharing.

Cookie	Duration	Description
at-rand	never	AddThis sets this cookie to track page visits, sources of traffic and share counts.
CONSENT	2 years	YouTube sets this cookie via embedded youtube-videos and registers anonymous statistical data.
uvc	1 year 27 days	Set by addthis.com to determine the usage of addthis.com service.
_ga	2 years	The _ga cookie, installed by Google Analytics, calculates visitor, session and campaign data and also keeps track of site usage for the site's analytics report. The cookie stores information anonymously and assigns a randomly generated number to recognize unique visitors.
_gat_gtag_UA_3507334_1	1 minute	Set by Google to distinguish users.
_gid	1 day	Installed by Google Analytics, _gid cookie stores information on how visitors use a website, while also creating an analytics report of the website's performance. Some of the data that are collected include the number of visitors, their source, and the pages they visit anonymously.

Cookie	Duration	Description
loc	1 year 27 days	AddThis sets this geolocation cookie to help understand the location of users who share the information.
VISITOR_INFO1_LIVE	5 months 27 days	A cookie set by YouTube to measure bandwidth that determines whether the user gets the new or old player interface.
YSC	session	YSC cookie is set by Youtube and is used to track the views of embedded videos on Youtube pages.
yt-remote-connected-devices	never	YouTube sets this cookie to store the video preferences of the user using embedded YouTube video.
yt-remote-device-id	never	YouTube sets this cookie to store the video preferences of the user using embedded YouTube video.
yt.innertube::nextId	never	This cookie, set by YouTube, registers a unique ID to store data on what videos from YouTube the user has seen.
yt.innertube::requests	never	This cookie, set by YouTube, registers a unique ID to store data on what videos from YouTube the user has seen.

Bridging Traditional-Statistics and Machine-Learning Approaches in Psychology: Navigating Small Samples, Measurement Error, Nonindependent Observations, and Missing Data

More from Advances in Methods and Practices in Psychological Science

AI Revolution or Revulsion? APS Journal Editors Weigh In

How Should Psychologists Use AI and Big Data? Nine Guides Point the Way

Data Sharing Is Growing but Looks Different for Qualitative and Quantitative Methods

Yale University

Assistant Professor, Clinical Psychology

Yale University

Assistant/Associate/Full Professor, Developmental Psychology

More from Advances in Methods and Practices in Psychological Science

AI Revolution or Revulsion? APS Journal Editors Weigh In

How Should Psychologists Use AI and Big Data? Nine Guides Point the Way

Data Sharing Is Growing but Looks Different for Qualitative and Quantitative Methods

Yale University

Assistant Professor, Clinical Psychology

Yale University

Assistant/Associate/Full Professor, Developmental Psychology

Cookies