News

When Big Data Becomes Bad Data

Corporations are increasingly relying on algorithms to make business decisions and that raises new legal questions.

By: Lauren Kirchner ,; ProPublica

Published: September 6, 2015

Corporations are increasingly relying on algorithms to make business decisions and that raises new legal questions. (Image: Business data via Shutterstock)

A recent ProPublica analysis of The Princeton Review’s prices for online SAT tutoring shows that customers in areas with a high density of Asian residents are often charged more. When presented with this finding, The Princeton Review called it an “incidental” result of its geographic pricing scheme. The case illustrates how even a seemingly neutral price model could potentially lead to inadvertent bias – bias that’s hard for consumers to detect and even harder to challenge or prove.

Over the past several decades, an important tool for assessing and addressing discrimination has been the “disparate impact” theory. Attorneys have used this idea to successfully challenge policies that have a discriminatory effect on certain groups of people, whether or not the entity that crafted the policy was motivated by an intent to discriminate. It’s been deployed in lawsuits involving employment decisions, housing and credit. Going forward, the question is whether the theory can be applied to bias that results from new technologies that use algorithms.

The term “disparate impact” was first used in the 1971 Supreme Court case Griggs v. Duke Power Company. The Court ruled that, under Title VII of the Civil Rights Act, it was illegal for the company to use intelligence test scores and high school diplomas – factors which were shown to disproportionately favor white applicants and substantially disqualify people of color – to make hiring or promotion decisions, whether or not the company intended the tests to discriminate. A key aspect of the Griggs decision was that the power company couldn’t prove their intelligence tests or diploma requirements were actually relevant to the jobs they were hiring for.

In the years since, several disparate impact cases have made their way to the Supreme Court and lower courts, most having to do with employment discrimination. This June, the Supreme Court’s decision in Texas Dept. of Housing and Community Affairs v. Inclusive Communities Project, Inc. affirmed the use of the disparate impact theory to fight housing discrimination. The Inclusive Communities Project had used a statistical analysis of housing patterns to show that a tax credit program effectively segregated Texans by race. Sorelle Friedler, a computer science researcher at Haverford College and a fellow at Data & Society, called the Court’s decision “huge,” both “in favor of civil rights…and in favor of statistics.”

So how will the courts address algorithmic bias? From retail to real estate, from employment to criminal justice, the use of data mining, scoring software and predictive analytics programs is proliferating at an exponential rate. Software that makes decisions based on data like a person’s ZIP code can reflect, or even amplify, the results of historical or institutional discrimination. “[A]n algorithm is only as good as the data it works with,” Solon Barocas and Andrew Selbst write in their article “Big Data’s Disparate Impact,” forthcoming in the California Law Review. “Even in situations where data miners are extremely careful, they can still affect discriminatory results with models that, quite unintentionally, pick out proxy variables for protected classes.”

It’s troubling enough when Flickr’s auto-tagging of online photos label pictures of black men as “animal” or “ape,” or when researchers determine that Google search results for black-sounding names are more likely to be accompanied by ads about criminal activity than search results for white-sounding names. But what about when big data is used to determine a person’s credit score, ability to get hired, or even the length of a prison sentence?

Because disparate impact theory is results-oriented, it would seem to be a good way to challenge algorithmic bias in court. A plaintiff would only need to demonstrate bias in the results, without having to prove that a program was conceived with bias as its goal. But there is little legal precedent. Barocas and Selbst argue in their article that expanding disparate impact theory to challenge discriminatory data-mining in court “will be difficult technically, difficult legally, and difficult politically.”

Some researchers argue that it makes more sense to design systems from the start in a more considered and discrimination-conscious way. Barocas and Moritz Hardt established a traveling workshop called Fairness, Accountability and Transparency in Machine Learning to encourage other computer scientists to do just that. Some of their fellow organizers are also developing tools they hope companies and government agencies could use to test whether their algorithms yield discriminatory results and to fix them when necessary. Some legal scholars (including the University of Maryland’s Danielle Keats Citron and Frank Pasquale) argue for the creation of new regulations or even regulatory bodies to govern the algorithms that make increasingly important decisions in our lives.

There still exists “a large legal difference between whether there is explicit legal discrimination or implicit discrimination,” said Friedler, the computer science researcher. “My opinion is that, because more decisions are being made by algorithms, that these distinctions are being blurred.”

We’re resisting Trump’s authoritarian pressure.

As the Trump administration moves a mile-a-minute to implement right-wing policies and sow confusion, reliable news is an absolute must.

Truthout is working diligently to combat the fear and chaos that pervades the political moment. We’re requesting your support at this moment because we need it – your monthly gift allows us to publish uncensored, nonprofit news that speaks with clarity and truth in a moment when confusion and misinformation are rampant. As well, we’re looking with hope at the material action community activists are taking. We’re uplifting mutual aid projects, the life-sustaining work of immigrant and labor organizers, and other shows of solidarity that resist the authoritarian pressure of the Trump administration.

As we work to dispel the atmosphere of political despair, we ask that you contribute to our journalism. Over 80 percent of Truthout’s funding comes from small individual donations from our community of readers, and over a third of our total budget is supported by recurring monthly donors.

9 days remain in our fundraiser, and you can help by giving today. Whether you can make a small monthly donation or a larger gift, Truthout only works with your support.

Data Mining You: How the Intelligence Community Is Creating a New American World

I was out of the country only nine days, hardly a blink in time, but time enough, as it happened, for another small, airless room to be added to …

By: Tom Engelhardt ,; TomDispatch

Latest Stories

News

Economy & Labor

IRS Said He Led the “Worst of the Worst” Tax Scams. Now He’s a Trump Adviser.

A proponent of a tax deduction decried as “abusive” is now advising the agency that manages US government property.

By: Peter Elkind ,; ProPublica

News

Immigration

As Trump Bars Asylum Claims, Migrants Struggle in Mexico City

Trump’s crackdown has caused distress and anguish for many, compounding their already precarious mental health.

By: Mariana Martínez Barba ,; Prism

News

Reproductive Rights

Abortion Laws Are Eroding Trust Between Mental Health Providers and Clients

As a resident of Texas, one woman was too scared to tell her therapist about her abortion because of the state’s laws.

By: Gina Jiménez ,; PublicHealthWatch

News

Culture & Media

Washington Post Refuses to Run Ad Demanding Donald Trump Fire Elon Musk

The ad had been scheduled to be delivered to members of Congress as well as subscribers at the Pentagon and White House.

By: Jon Queally ,; CommonDreams

News

War & Peace

Genocide Ran the Yazidi From Their Homeland. A Decade Later, Some Are Returning.

ISIS wrecked 80 percent of public infrastructure and 70 percent of civilian homes in Sinjar City and surrounding areas.

By: Jaclynn Ashly ,; Truthout

News

Politics & Elections

Princeton Students Behind Gaza Solidarity Encampment Head to Trial

Despite intimidation from the university, the activists are defending their right to protest for Palestinian liberation.

By: Sam Carliner ,; Mondoweiss

News Analysis

Economy & Labor

Behind Trump Tariffs Is Capital’s Warfare Against the Working Class

Trump’s tariffs will escalate exploitation and desperation of US workers. That’s the point.

By: William I. Robinson ,; Truthout

News

Environment & Health

Urgent CDC Data on Influenza and Bird Flu Go Missing as Outbreaks Escalate

In a letter, CDC advisory committee members requested an investigation into missing data and delayed reports.

By: Amy Maxmen ,; KFFHealthNews

News

Politics & Elections

Trump Openly Suggests on Social Media He’s Above the Law, Raising Alarm

It’s the latest brazen signal that he doesn't recognize limits on his authority to impose his far right agenda.

By: Common Dreams Staff ,; CommonDreams

Op-Ed

Human Rights

I Would Not Have Survived Without UNRWA

The attack on the United Nations Relief and Works Agency for Palestine Refugees is an attack on Palestinian existence.

By: Malak Hijazi ,; Mondoweiss

Sections

Latest

IRS Said He Led the “Worst of the Worst” Tax Scams. Now He’s a Trump Adviser.

As Trump Bars Asylum Claims, Migrants Struggle in Mexico City

Abortion Laws Are Eroding Trust Between Mental Health Providers and Clients

Washington Post Refuses to Run Ad Demanding Donald Trump Fire Elon Musk

More

When Big Data Becomes Bad Data

We’re resisting Trump’s authoritarian pressure.

Menu

We’re resisting Trump’s authoritarian pressure.