Interactions with voice technology, such as Amazon’s Alexa, Apple’s Siri, and Google Assistant, can make life easier by increasing efficiency and productivity. However, errors in generating and understanding speech during interactions are common. When using these devices, speakers often style-shift their speech from their normal patterns into a louder and slower register, called technology-directed speech.
Research on technology-directed speech typically focuses on mainstream varieties of U.S. English without considering speaker groups that are more consistently misunderstood by technology. In JASA Express Letters, researchers from Google Research, the University of California, Davis, and Stanford University wanted to address this gap.
One group commonly misunderstood by voice technology is speakers of African American English, or AAE. Because automatic speech recognition error rates can be higher for AAE speakers, these errors can have downstream effects that amount to linguistic discrimination in technology.
“Across all automatic speech recognition systems, four out of every ten words spoken by Black men were being transcribed incorrectly,” said co-author Zion Mengesha. “This affects fairness for African American English speakers in every institution using voice technology, including health care and employment.”
“We saw an opportunity to better understand this problem by talking to Black users and understanding their emotional, behavioral, and linguistic responses when engaging with voice technology,” said co-author Courtney Heldreth.
The team designed an experiment to test how AAE speakers adapt their speech when imagining talking to a voice assistant, compared to talking to a friend, family member, or stranger.
The study compared speech rate and pitch variation across three conditions: familiar-human-, unfamiliar-human-, and voice-assistant-directed speech. Study participants included 19 adults identifying as Black or African American who had experienced issues with voice technology.
Each participant asked a voice assistant a series of questions, then repeated the same questions as if speaking to a familiar person and, again, to a stranger. Each question was recorded, for a total of 153 recordings.
Analysis of the recordings showed that speakers made two consistent adjustments when talking to voice technology compared to talking to another person: a slower rate of speech and less pitch variation (more monotone speech).
“These findings suggest that people have mental models of how to talk to technology,” said co-author Michelle Cohn. “A set ‘mode’ that they engage to be better understood, in light of disparities in speech recognition systems.”
There are other groups misunderstood by voice technology, such as second-language speakers. The researchers hope to expand the language varieties explored in human-computer interaction experiments and address barriers in technology so that it can support everyone who wants to use it.
More information: African American English speakers’ pitch variation and rate adjustments for imagined technological and human addressees, JASA Express Letters (2024). DOI: 10.1121/10.0025484