An expanded modal box, which conveys the concept of listening to a recording of someone else's voice.

Give audio option for user input

Humanize users in one another's eyes

Our confidence rating



What It Is

A button that allows users to respond to content with a recording of their own voice.

Civic Signal Being Amplified

Encourage the humanization of others

When To Use It


What Is Its Intended Impact

Reduces the likelihood of showing or reporting hostility when people discuss a topic via voice memos.

Evidence That It Works

Schroeder et al. (2017) predicted that a person’s speech, beyond conveying that person’s thoughts, also “conveys their mental capacity, such that hearing a person explain his or her beliefs makes the person seem more mentally capable—and therefore seem to possess more uniquely human mental traits—than reading the same content” (ibid.). They tested this prediction in four experiments.

Three of the experiments, “involving polarizing attitudinal issues and political opinions,” found “this effect to emerge when people are perceived as relatively mindless, such as when they disagree with the evaluator’s own beliefs” (ibid.).

A fourth experiment found that “paralinguistic cues in the voice” (e.g., tone, echoing) served as indicators of mental capacity.

Why It Matters

In an interview with Casey Newton of Platformer, Kayvon Beykpour, Twitter’s head of product, had this to say about audio:

“Our mechanics incentivize very short-form, high-brevity conversation, which is amazing and powerful and has led to all the impact that Twitter has had in the world. But it’s a very specific type of discourse, right? It’s very difficult to have long, deep, thoughtful conversations. Audio is interesting for us because the format lends itself to a different kind of behavior. When you can hear someone’s voice, you can empathize with them in a way that is just more difficult to do when you’re in an asynchronous environment. … We think audio is powerful, because that empathy is real and raw in a way that you can’t achieve over text in the same way.”

According to Schroeder et al. (2017), modern technology allows for predominantly text-based interactions over vast distances, but text may not be optimal for cultivating mutual appreciation and an understanding of the minds of others. Voice, however, allows us to better build a theory of mind about others, the absence of which fuels dehumanization.

Special Considerations

A television ad titled "Accents", which appeared on stations across the United States, opens with a white man making calls to a rental office. He gives a different name and uses a different voice for each call, reflecting various races and ethnicities. Each time, he is told that the apartment is not for rent. Yet when he uses a Caucasian-sounding name and accent, he is assured that the apartment is, for some reason, available.

The ad points to a very real underlying problem with voice-only interaction: voice can, unfortunately, still be a vector for discrimination.

Another drawback is that voice-only input risks being inaccessible to people who are deaf or hard of hearing. However, given advancements in real-time closed captioning, this drawback is potentially solvable.


This intervention entry currently lacks photographic evidence (screenshots, etc.).


The Humanizing Voice: Speech Reveals, and Text Conceals, a More Thoughtful Mind in the Midst of Disagreement

Juliana Schroeder, Michael Kardas, Nicholas Epley
Association for Psychological Science
October 25, 2017
