Advances in Methods and Practices in Psychological Science

Language Models Accurately Infer Correlations Between Psychological Items and Scales From Text Alone

Björn E. Hommel, Ruben C. Arslan

Volume 8, Issue 4 | October 2025

https://doi.org/10.1177/25152459251377093

Abstract

Many behavioral scientists do not agree on core constructs and how they should be measured. Different literatures measure related constructs, but the connections are not always obvious to readers and meta-analysts. Many measures in behavioral science are based on agreement with survey items. Because these items are sentences, computerized language models can make connections between disparate measures and constructs and help researchers regain an overview of the rapidly growing, fragmented literature. Our fine-tuned language model, the SurveyBot3000, accurately predicts the correlations between survey items, the reliability of aggregated measurement scales, and intercorrelations between scales from item positions in semantic vector space. We measured the model’s performance as the convergence between its synthetic model estimates and empirical coefficients observed in human data. In our pilot study, the out-of-sample accuracy was .71 for item correlations, .89 for reliabilities, and .89 for scale correlations. In our preregistered validation study using novel items, the out-of-sample accuracy was slightly reduced to .59 for item correlations, .84 for reliabilities, and .84 for scale correlations. The synthetic item correlations showed an average prediction error of .17, and there were larger errors for middling correlations. Predictions exhibited generalizability beyond the training data and across various domains, with some variability in accuracy. Our work shows that language models can reliably predict psychometric relationships between survey items, enabling researchers to evaluate new measures against existing scales, reduce redundancy in measurement, and work toward a more unified behavioral-science taxonomy.
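
The pipeline the abstract describes (item positions in a vector space, synthetic correlations, a scale reliability, and accuracy as convergence with empirical coefficients) can be illustrated with a minimal sketch. This is not the SurveyBot3000 implementation. It assumes that cosine similarity between item vectors stands in for the synthetic item correlation, that reliability is computed as standardized Cronbach's alpha from the mean inter-item correlation, and that convergence is a Pearson correlation over the unique item pairs. The item vectors and the "empirical" matrix are made-up toy values, and all function names are hypothetical.

```python
import numpy as np


def cosine_similarity_matrix(embeddings):
    """Pairwise cosine similarity between the row vectors of `embeddings`."""
    unit = embeddings / np.linalg.norm(embeddings, axis=1, keepdims=True)
    return unit @ unit.T


def standardized_alpha(corr):
    """Standardized Cronbach's alpha from a k x k item correlation matrix."""
    k = corr.shape[0]
    mean_r = corr[np.triu_indices(k, k=1)].mean()  # mean off-diagonal correlation
    return k * mean_r / (1 + (k - 1) * mean_r)


def convergence(synthetic, empirical):
    """Pearson correlation between synthetic and empirical coefficients,
    computed over the unique item pairs (upper triangle, no diagonal)."""
    idx = np.triu_indices(synthetic.shape[0], k=1)
    return float(np.corrcoef(synthetic[idx], empirical[idx])[0, 1])


# Toy item vectors (assumption: real embeddings would come from a language model).
item_vectors = np.array([
    [1.0, 0.0, 0.0],
    [0.8, 0.6, 0.0],
    [0.0, 1.0, 0.0],
])

# Toy "empirical" item correlations (made up purely for illustration).
empirical_r = np.array([
    [1.0, 0.6, 0.1],
    [0.6, 1.0, 0.3],
    [0.1, 0.3, 1.0],
])

synthetic_r = cosine_similarity_matrix(item_vectors)
synthetic_alpha = standardized_alpha(synthetic_r)
accuracy = convergence(synthetic_r, empirical_r)

print("synthetic item correlations:\n", synthetic_r.round(2))
print("synthetic standardized alpha:", round(synthetic_alpha, 3))
print("convergence with empirical correlations:", round(accuracy, 3))
```

With real data, the same convergence function would be applied to held-out items, which is what makes the reported accuracies out-of-sample figures rather than fit statistics.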
