Transcript
9K9ZR_lGRpY • Search Engine Breakdown: Are Algorithms Racist and Sexist? | Full Film
/home/itcorpmy/itcorp.my.id/harry/yt_channel/out/novapbs/.shards/text-0001.zst#text/0823_9K9ZR_lGRpY.txt
Kind: captions
Language: en
foreign
[Music]
fake news continues to be rapidly
distributed on the internet our reality
has become increasingly shaped by false
information many people don't know the
difference between something real and
something created to deceive them
I spent about 15 years in advertising
and marketing and while I was there
Google arrived on the scene
I understood the transformative effect
that this search engine was having and
helping us curate through all kinds of
information
but I was surprised having just left
advertising that everybody was thinking
about Google as this new public trusted
resource because I thought of it as an
advertising platform
foreign
most people who use search engines
believe that search engine results are
fair and unbiased
public and especially kids and young
people use search engines to tell them
the facts about the world
[Music]
one weekend my nieces were coming over
to hang out and I was thinking oh let me
pull my laptop out and see if I can find
some cool things for us to do this
weekend I just thought to type in black
girls and the whole first page of search
results was almost exclusively
pornography or hypersexualized content
in 2012. I started to see some of the
results changing
Google had started to suppress the
pornography around black girls
unfortunately still today we see
pornography and a kind of
hypersexualized content as the primary
way in which Latina and Asian girls are
represented
what makes Asian girls so attractive
Asian fetish hot ladies from Asians see
who we ranked number one in 2020 tender
Asian girls meet Royal Beauties
this is the study that was done
by the markup that replicated my study
from 10 years ago they found that black
girls Latina girls and Asian girls those
phrases were look so profoundly linked
with kind of adult content
zero for white girls zero for white boys
there are so many racial stereotypes and
gender stereotypes that show up in
search results
[Music]
what about actual girls and children who
go and look for themselves in these
spaces
[Music]
it's very disheartening when women
become sex objects in a space like this
it's really profound because the public
generally relates to search engines as
kind of fact checkers
[Music]
before we were so heavily Reliant upon a
database we use something like a card
catalog
we didn't break content it was
alphabetical it also might be by subject
it's a summary of the organization
system we call the Dewey Decimal System
now when we're in a subject we know
there's a lot in relationship to that
one item that we might be looking for we
might go look for a book in the stacks
for example and find that there's
hundreds of books around that one that
tell us something about that book and we
might serendipitously find all kinds of
other bits of information that are
amazing but we can see a little bit more
about the Logics of that we don't
understand the Logics of how certain
things make it to the first page in a
search
[Music]
Google has a very complicated and
nuanced algorithm for search
[Music]
over 200 different factors go into how
they decide what we see of course
they're indexing about half of all of
the information that is on the web and
even that is trillions of pages billions
of times a day Google software locates
all the potentially relevant results on
the web removes all the spam and ranks
them based on hundreds of factors like
keywords links location and freshness
all in oh 0.81 seconds
the whole premise of a search engine is
to categorize and classify information
a lot of the content that comes back to
us on the Internet it's in a cultural
context of ranking we know very early
what it means to be number one so
ranking logic signals to us that the
classification is accurate from one
being the best to whatever's on page 48
of search which nobody ever looks at
part of what it's doing is picking up
signals from things that we've clicked
on in the past that a lot of other
people have clicked on things that are
popular
so an algorithm is in essence a decision
tree
if these conditions are present then
this decision should be made and the
decision tree gets automated so that it
becomes like a sorting mechanism
foreign
Google's very reliable for certain types
of information
if you're using it in these kind of
phone book fashion it's Fairly reliable
but when you start asking a search
engine more complex questions or you
start looking for knowledge the evidence
isn't there that it's capable of doing
that
this combination of hyperlinking It's a
combination of advertising and capital
and also what people click on that
really drives what we find on the web
this is where we start falling into
trickier situations because those who
have the most money are really able to
optimize their content better than
anyone else
there have been great studies about the
disparate impact of what a profile
online says about who you are
[Music]
I was the first African-American woman
to get a PhD in computer science at MIT
[Music]
so I visit Harvard I'm being interviewed
there by a reporter and he wants to see
a particular paper that I had done
before
so I go over to my computer
I type in my name into a Google search
bar and up will pop this ad implying I
had an arrest record
[Music]
he says Ah forget that article tell me
about the time you were arrested I said
well I have never been arrested and he
says then why does your computer say
you've been arrested
so I click on the ad I go to the company
to show him not only did I not have an
arrest record but nobody with a Latonya
Sweeney name had an arrest record and he
says yeah but why did it say that
if you type in the name Latonya in the
Google image search you can see a lot of
Black Faces staring back whereas if I
type Tanya I see a lot of white faces
staring back so we get the idea that
there are some first names given more
often to black babies than white babies
so I then took a month and I researched
almost 150 000 ad deliveries around the
country and I found that if your name
was given more often to white babies
than black babies the ad would be
neutral
and if your first name was given more
often to black babies than white babies
you were 80 likely to get an ad implying
you had an arrest record
even if no one with your name had any
arrest record in their database
foreign
[Music]
specific way that algorithms
discriminate is that they just are too
crude
the idea of if x then y if you have this
type of name it means you're
automatically associated with
criminality that blunt crude kind of
Association that is the staple
logic of how algorithms work
[Music]
the types of bias that we find on the
internet are often blunt we are being
profiled into similar groups of people
who do the kinds of things that we might
be doing and we're clustered and sold as
a cluster to advertisers and so there's
certainly a commercial bias but we also
have the bias of the people who design
the Technologies
much
to think that technologies will be
neutral or never have bias is really an
improper framing
of course there will always be a point
of view in our Technologies the question
is is the point of view in service of
Oppression is it sexist is it racist
here I was a passionate believer in the
future of Equitable technology and if
the people when they were hiring me at
Harvard had typed my name into the
Google search bar and paid attention to
this ad it put me at a disadvantage and
not just me but a whole group of black
people would be placed at a disadvantage
how could these biases of society be
invading the technology that I really
had grown to love and now Civil Rights
was up for grabs by what technology
design allowed or didn't allow
Google's ad delivery system is really
quite amazing you click on a web page
and that webpage has a slot that an ad
is going to be delivered and in that
fraction of a second while the page is
being delivered Google runs a fast
digital auction
and in that digital auction they decide
which of competing ads are going to be
the ad they're going to place right
there
at first the Google algorithm will
choose one of them randomly but if
somebody clicks on one then that one
becomes weighted more often to be
delivered
so one way the Discrimination in online
ads Could Happen would have been that
society would have been biased on which
ads they clicked most often and that
this would have represented the bias of
society itself
foreign
[Music]
sharing are so powerful that they are
kind of like the new policy maker
we don't have oversight over these
designs but yet how the Technology's
design dictates the rules we live by and
this meant that
we were moving from a democracy to a new
kind of technocracy it became the chief
technology officer at the Federal Trade
Commission
they're sort of the de facto Police
Department of the internet
one of the experiments that I had done
while I was at the FTC showed that
everyone's online experience is not the
same
[Music]
foreign
[Laughter]
[Music]
Technologies and social media is the
criteria for surfacing what's most
important can be deeply highly
manipulated one of the hardest
case studies to write in my book was
about Dylan roof he went online and he
was trying to make sense of the trial of
George Zimmerman
the first thing that
is
Trayvon Martin an unarmed black teenager
was shot down by a white neighborhood
Watchman who claimed self-defense
eventually I decided to you know look
look his name I'm just type him in the
Google you know what I'm saying
to maybe type in the words black on
white crime we know from Dylan roof's
own words that the first sight that he
comes to is the Council of conservative
citizens
the CCC is an organization that the
Southern Poverty Law Center calls
vehemently racist
let's say he had been my student I could
have just immediately said did you know
that that phrase is kind of a racist red
herring
the FBI statistics show us that the
majority of white people are actually
killed by other white people
but instead he goes to the internet and
he finds the CCC and he goes down a
rabbit hole of white supremacist
websites did you read a lot did you read
books or watch videos or watch movies or
YouTube or anything like that
specifically about that subject matter
no it's pretty much just reading Arts
reading articles
and we know that shortly thereafter he
goes into a church murders nine
African-Americans and says his intent is
to start a race war
foreign
this is not an atypical possibility when
you don't get a Counterpoint to the
query you don't get black studies
scholarship or FBI statistics or
anything that would reframe the very
question that you're asking this is an
extreme case of acting upon white power
radicalization but this is not unlike
things that are happening right now
every day in search engines on Facebook
on Twitter in gab people are being
targeted and radicalized in very
dangerous ways
this is what is at stake when people are
so susceptible to disinformation hate
speech hate propaganda in our society
[Music]
racism itself can't be solved by
technology the question is to what
extent can we make sure technology
doesn't perpetuate it doesn't allow
harms to be made because of it
we need a diverse and inclusive
community in the design stage in the
marketing and business stage in the
Regulatory and journal stages as well
foreign I am really interested in
Solutions it's easy to talk about the
problems and it's painful also to talk
about the problems but that pain and
that struggle should lead us to thinking
about alternatives
those are the kinds of things that I
like to talk to other information
professionals and researchers and
Librarians about
as a person who has a name that doesn't
sound like Jennifer Wright or Sarah or
something that paper made the difference
for me because I was just this grad
student and you were this esteemed
Harvard professor and you were having
these experiences too when I think about
the like the the foundations of
something like ethical AI
I go back to you in that early paper I
think what I feel most hopeful about is
that there's this new cottage industry
called ethical Ai and I know that our
work is profoundly tied to that but on
another level I feel like these
Predictive Technologies are so much more
ubiquitous than they were 10 years ago
you know what I find really painful is
that as we move forward it's harder to
track one thing that comes clear is we
could use a heck of a lot more
transparency
as a computer scientist my vision is I
want Society to enjoy the benefits of
all these new technologies without these
problems technology doesn't have to be
made this way that's right that's right
I see so many more women and girls of
color interested in these conversations
and one of the things that I also see is
how we see things because we ask
different questions based on our lived
experiences just the fact that the
questions are being raised means that
the space is less hostile
means there's an opportunity for your
voice and and the other thing that's
really important about this work it
means that it's a new kind of way of
thinking about computer science it's
it's in this conversation with you that
I see a future I'm hopeful because it's
not one isolated paper but in fact it's
a it's a movement who are asking the
right questions exposing the right
unforeseen consequences and pushing this
forward towards a solution
[Music]
some questions cannot be answered
instantly
some issues we're dealing with in
society we need time and we need
discussion
how could we look for new Logics and new
metaphors and new ways to get a bigger
picture
maybe we could see when we do that query
that that's just nothing but propaganda
and we could even see the sources of the
disinformation Farms maybe we could see
the financial backers there's a lot of
ways that we could reimagine our
information landscape
so I do feel like there is some hope
[Music]
[Applause]
[Music]
foreign
[Music]