NewsTechNew Research Warns of AI Bias Dangers

New Research Warns of AI Bias Dangers

Findings reveal how the demographics and backgrounds of people training AI models influences their outputs 

New research analysing data from the Prolific research platform reveals how the demographics of people labelling the data used to build and train AI models influences their decisions. What one person finds offensive, another may find perfectly acceptable. This has major ramifications for the development of AI systems with the danger that existing biases are baked into them and amplified. 

Machine learning and artificial intelligence systems often rely on high-quality human labelling and annotation – people reviewing and categorising the output of language models to train them. For example, to learn what kind of content is offensive or toxic or to better understand human intentions. This is often referred to as ‘Human-in-the-loop’ or Reinforcement Learning from Human Feedback (RLHF).  

The study conducted collaboratively by Prolific, Potato, a web-based annotation tool, and the University of Michigan, found that age, race and education are statistically significant factors in determining how something is labelled. For example, when asked to rate the offensiveness of online comments, Black participants tended to rate the same comments as being significantly more offensive compared to other racial groups. 

Prior research on annotator background has mostly focused on specific aspects of identity, like gender, and on certain tasks, like toxic language detection. This study aimed to undertake a much broader analysis, including offensiveness detection, question answering and politeness. The dataset contains 45,000 annotations from 1,484 annotators, drawn from a representative sample regarding sex, age, and race as the US population.  

Findings from the research include: 

Offensiveness Detection 

  • Gender: The research found no statistically significant difference between men and women in rating content as offensive. 
  • Race: The study found significant racial differences in offensiveness rating. Black participants rated the same comments with significantly more offensiveness than all other racial groups. The scores of white participants strongly correlated with the original Ruddit dataset* which suggests that the original annotations were likely done by white annotators. 
  • Age: People aged 60 or over tend to find comments more offensive than middle-aged/younger participants. 
  • Education: There were no significant differences found with respect to participant education. 

Question Answering 

Despite this task being largely objective (i.e. questions correlated to single right answers), accuracy at question answering did vary according to background. The largest effects were seen with race and age variation, with a smaller effect for education. The performance differences mirror known disparities in education and economic opportunities for minorities compared to their white male peers in the US.   

Politeness rewriting 

  • Politeness is one of the most prominent social factors in interpersonal communication. The study found that: 
  • Women judged messages as being less polite than men did. 
  • Older participants were more likely to give higher politeness ratings. 
  • Those with high education levels tended to give lower ratings. 
  • Black participants rated messages as being more polite than their white peers. 
  • Asian participants gave lower politeness ratings overall. 

Commenting on the research, Phelim Bradley, CEO and co-founder of Prolific said, “Artificial intelligence will touch all aspects of society and there is a real danger that existing biases will get baked into these systems. This research is very clear: who annotates your data matters. Anyone who is building and training AI systems must make sure that the people they use are nationally representative across age, gender, and race or bias will simply breed more bias.”

“Systems like ChatGPT are increasingly used by people for everyday tasks,” says assistant professor David Jurgens from the University of Michigan School of Information. “But on whose values are we instilling in the trained model? If we keep taking a representative sample without accounting for differences, we continue marginalising certain groups of people.” 

The correct training and fine-tuning of AI systems is incredibly important to the safe development of AI, avoiding these systems amplifying existing biases and toxicity. This means ensuring that annotators are nationally representative across race, age, and gender. With a vetted and verified pool of 120,000 participants, Prolific offers researchers and developers the ability to access nationally representative or custom demographic groups for their RLHF needs. 

The fair treatment of annotators is another crucial element of AI training and development. Reports have emerged of low-paid workers in developing countries being used for labelling and being subjected to reams of toxic online content. The ethical treatment of participants is a top priority for Prolific. Participants on Prolific are guaranteed a fair, minimum payment, have complete control over which studies they choose to take part in, and can immediately flag to Prolific any content they find offensive or disturbing.  

News Desk
News Deskhttps://www.businessmanchester.co.uk/
The Business Manchester News Desk team is a collective of experienced journalists and editors dedicated to delivering comprehensive business news and insights from the Manchester area and beyond. With a strong background in finance, technology, property, and innovation, our team ensures that our readers stay well-informed about the latest trends and developments in the business world. Through in-depth reports and insightful analysis, the Business Manchester News Desk team is committed to providing high-quality journalism to its audience.
Latest

Top Software Development Companies for UK Businesses in 2026

Some UK companies pour money into digital projects that go nowhere, often because they picked the wrong technology partner. The British software development market...

Residence Inn by Marriott Piccadilly Manchester Strengthens Leadership Team with Two Senior Appointments

MANCHESTER, UK. June 15th, 2026 - The Residence Inn by Marriott Piccadilly Manchester, a leading luxury extended-stay hotel in the city, has appointed two...

Father’s Day gift guide 2026: Best gifts for every type of dad

Father’s Day (June 21) is the perfect time to celebrate the people who make everyday moments feel special.  Whether he enjoys slow mornings with coffee,...

Building safety compliance: A growing concern for developers and investors

Building safety now sits at the centre of property development and investment decisions across the UK. Increased scrutiny from authorities and greater awareness among residents have...
Subscribe to our newsletter
Business Manchester will use the information you provide on this form to be in touch with you and to provide updates and marketing.
Don't miss

Residence Inn by Marriott Piccadilly Manchester Strengthens Leadership Team with Two Senior Appointments

MANCHESTER, UK. June 15th, 2026 - The Residence Inn by Marriott Piccadilly Manchester, a leading luxury extended-stay hotel in the city, has appointed two...

BJC Logistics Secures Major 2026 Defense Contracts, Expanding High-Security Supply Chain Network

Ventura County firm adds aerospace and AI-defense programs and expands its global distribution network amid rapid revenue growth. VENTURA COUNTY, California. June 8, 2026 - BJC...

Why Creative Judgment Now Matters More Than AI Access

WANDSWORTH, London. June 10th, 2026 - BearJam is highlighting how storytelling ability, aesthetic judgment, and human direction are becoming the key factors that separate...

UK SMEs Face Growing Employment Tribunal Threat as Case Numbers Continue to Climb

WARWICKSHIRE, UK, June 11, 2026 – UK small and medium-sized businesses are being urged to review their HR practices as employment tribunal claims continue...

More News

Top Software Development Companies for UK Businesses in 2026

Some UK companies pour money into digital projects that go nowhere, often because they picked the wrong technology partner. The British software development market...

Brighton Residents Increasingly Opt for Expert TV Wall Mounting as Home Entertainment Expands

As modern televisions continue to grow in size and become a focal design feature in living rooms, many homeowners in Brighton are now relying...

MotorDesk Rolls Out Latest Platform Enhancements for Independent Dealers

MotorDesk has launched its most recent set of platform improvements for May 2026, aimed at helping independent motor dealers work more efficiently. The update...