Troubled #hearts — in 140 characters

October 20, 2014


I joined Twitter in 2008, and I’ve always been impressed by the diversity of this floating conversation. People will just as soon tweet about dinner as the sorry state of American politics, and they are by turns thoughtful and shallow, original and fraudulent, snide and generous of spirit. In 140 characters or fewer, users reflect the range of human emotion, from joy to rage, wonder to boredom, cynicism to hopefulness.

Individual Twitter users can obviously reveal a lot about their lives and feelings, even in terse tweets. But what about very large numbers of tweets, by many people in many places? Is it possible that aggregate Twitter patterns might also be revealing in some useful way? Could Twitter offer snapshots of communities as well as individuals?

A large team of University of Pennsylvania scientists has been exploring this possibility. Led by psychological scientist Johannes Eichstaedt and information scientist Hansen Schwartz, the researchers wondered if the vast amount of language contained in tweets might be a valuable public health resource—specifically, if this linguistic bonanza might offer a way to gauge a community’s risk for heart disease.

Scientists have identified many of the key risk factors for heart disease, such as smoking, inactivity, obesity and hypertension, and these insights have significantly reduced the risk of heart disease, the world’s leading killer. Psychological traits such as chronic stress and depression are also important risk factors, while optimism and social support are known to be protective. These psychological characteristics often affect entire communities, putting large numbers of people at risk for disease. Community-wide interventions could improve health, but assessing community risk is difficult and expensive.

That’s where Twitter comes in. The Penn scientists are pioneers in an emerging field called digital epidemiology, and their aim is to use social media as a cheap and flexible method to assess the psychological traits (and thus the health risks) of entire communities. To test this method’s potential, the scientists collected 148 million tweets from across the U.S. and sorted them into their 1,347 counties of origin. The scientists also gathered socioeconomic and demographic data on these counties, which are home to 88 percent of Americans.

They used two different methods to analyze the language used in each county’s aggregate tweets for ten months in 2009 and 2010. They measured specific words and topics, both negative (hostility, cursing, aggression, boredom and fatigue) and positive (wonder, hope, triumph, opportunity), and used these linguistic patterns to characterize communities at risk for heart disease. They then compared these risk patterns to the actual heart disease mortality rates for each county, obtained from the Centers for Disease Control and Prevention. The idea was to see if the disease-relevant information contained in a given county’s Twitter language predicted heart disease mortality.
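As a rough illustration of the word-counting idea described above, the sketch below tallies how often words from a category appear in each county's tweets and correlates those rates with county mortality. Everything in it is an assumption made for illustration: the tiny `LEXICON`, the `TOY_COUNTIES` data, and the function names are hypothetical, and this is not the researchers' actual pipeline or word lists.

```python
import math
import re
from collections import Counter

# Hypothetical mini-lexicon for illustration only; not the study's word lists.
LEXICON = {
    "anger": {"hate", "angry", "furious"},
    "engagement": {"hope", "learn", "opportunity"},
}


def category_rates(tweets):
    """Return, per category, the share of a county's words that belong to it."""
    words = [w for t in tweets for w in re.findall(r"[a-z']+", t.lower())]
    counts = Counter(words)
    total = len(words) or 1  # avoid dividing by zero for a county with no words
    return {cat: sum(counts[w] for w in vocab) / total
            for cat, vocab in LEXICON.items()}


def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences (both must vary)."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (spread_x * spread_y)


# Made-up counties: (tweets, heart disease deaths per 100,000 residents).
TOY_COUNTIES = {
    "county_a": (["i hate this traffic", "so angry and furious"], 260.0),
    "county_b": (["hope to learn something new", "angry about the game"], 210.0),
    "county_c": (["a great opportunity today", "i hope you learn a lot"], 170.0),
}


def correlate(category, counties=TOY_COUNTIES):
    """Correlate one category's usage rate with mortality across counties."""
    rates = [category_rates(tweets)[category] for tweets, _ in counties.values()]
    deaths = [mortality for _, mortality in counties.values()]
    return pearson(rates, deaths)
```

On the made-up data, `correlate("anger")` comes out positive and `correlate("engagement")` negative, which is simply how the toy counties were constructed. With three invented counties the numbers mean nothing; the point is only the shape of the calculation.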

And it did, clearly. As reported in a forthcoming article in the journal Psychological Science, language reflecting negative relationships, negative emotions, disengagement and (especially) anger was significantly correlated with heart disease mortality. This held true even after controlling for income and education, suggesting that Twitter language captures important information not accounted for by socioeconomic status. By contrast, positive emotions and engagement were associated with lower heart disease mortality. Engagement with life, considered a key component of successful aging, emerged as the most potent protective factor.
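"Controlling for" income and education means asking whether the link between language and mortality survives once those variables are accounted for. One standard way to express that idea is a partial correlation, sketched below. This is an assumed illustration, not necessarily the statistical procedure the researchers used, and the `partial_corr` helper and the toy numbers are hypothetical.

```python
import math


def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences (both must vary)."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (spread_x * spread_y)


def partial_corr(x, y, controls):
    """Correlation of x and y after removing the linear influence of controls.

    Uses the standard recursive formula, peeling off one control at a time.
    Assumes no control is perfectly correlated with x or y.
    """
    if not controls:
        return pearson(x, y)
    *rest, z = controls
    r_xy = partial_corr(x, y, rest)
    r_xz = partial_corr(x, z, rest)
    r_yz = partial_corr(y, z, rest)
    return (r_xy - r_xz * r_yz) / math.sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))


# Made-up example: x and y are related only because both track z.
z = [1, 2, 3, 4]
x = [2, 1, 2, 5]
y = [2, -1, 6, 3]

raw = partial_corr(x, y, [])        # plain correlation, no controls
adjusted = partial_corr(x, y, [z])  # correlation once z is controlled for
```

In this toy case the raw correlation is positive but the adjusted one is essentially zero, because z explains the whole relationship. The study's finding was the opposite pattern: the language signal remained after income and education were taken into account.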

What’s more, Twitter language was a better predictor of heart disease mortality than 10 common demographic and behavioral risk factors, including such infamous ones as smoking and high blood pressure.

It’s interesting to note that the typical Twitter user is 31, considerably younger than those at risk for heart disease. So the people tweeting are not the people dying. This suggests that the young adults’ tweets are revealing the combined psychological character of their community, which in turn predicts aggregate health outcomes. In short, the language of Twitter may offer a window into the potent influence of community character, and may prove to be a valuable tool for public health.

Follow Wray Herbert’s reporting on psychological science in The Huffington Post and on Twitter at @wrayherbert.




