Facebook disputes report that its AI can’t detect hate speech or violence consistently


Facebook vice president of integrity Guy Rosen wrote in blog post Sunday that the prevalence of hate speech on the platform had dropped by 50 percent over the past three years, and that “a narrative that the technology we use to fight hate speech is inadequate and that we deliberately misrepresent our progress” was false.
“We don’t want to see hate on our platform, nor do our users or advertisers, and we are transparent about our work to remove it,” Rosen wrote. “What these documents demonstrate is that our integrity work is a multi-year journey. While we will never be perfect, our teams continually work to develop our systems, identify issues and build solutions.”
The post appeared to be in response to a Sunday article in the Wall Street Journal, which said the Facebook employees tasked with keeping offensive content off the platform don’t believe the company is able to reliably screen for it.
The WSJ report states that internal documents show that two years ago, Facebook reduced the time that human reviewers focused on hate speech complaints, and made other adjustments that reduced the number of complaints. That in turn helped create the appearance that Facebook’s artificial intelligence had been more successful in enforcing the company’s rules than it actually was, according to the WSJ.
A team of Facebook employees found in March that the company’s automated systems were removing posts which generated between 3 and 5 percent of the views of hate speech on the social platform, and less than 1 percent of all content that was in violation of its rules against violence and incitement, the WSJ reported.
But Rosen argued that focusing on content removals alone was “the wrong way to look at how we fight hate speech.” He says the technology to remove hate speech is just one method Facebook uses to fight it. “We need to be confident that something is hate speech before we remove it,” Rosen said.
Instead, he said, the company believes focusing on the prevalence of hate speech people actually see on the platform and how it reduces it using various tools is a more important measure. He claimed that for every 10,000 views of a piece of content on Facebook, there were five views of hate speech. “Prevalence tells us what violating content people see because we missed it,” Rosen wrote. “It’s how we most objectively evaluate our progress, as it provides the most complete picture.”
But the internal documents obtained by the WSJ showed some significant pieces of content were able to evade Facebook’s detection, including videos of car crashes that showed people with graphic injuries, and violent threats against trans children.
The WSJ has produced a series of reports about Facebook based on internal documents provided by whistleblower Frances Haugen. She testified before Congress that the company was aware of the negative impact its Instagram platform could have on teenagers. Facebook has disputed the reporting based on the internal documents.
Facebook vice president of integrity Guy Rosen wrote in blog post Sunday that the prevalence of hate speech on the platform had dropped by 50 percent over the past three years, and that “a narrative that the technology we use to fight hate speech is inadequate and that we deliberately…
Recent Posts
- Everything missing from the iPhone 16e, including MagSafe and Photographic Styles
- Reddit is reportedly experiencing some outages
- Google may be close to launching YouTube Premium Lite
- Someone wants to sell you a digital version of the antiquated typewriter but without a glued-on keyboard (no really)
- Carbon removal is the next big fossil fuel boom, oil company says
Archives
- February 2025
- January 2025
- December 2024
- November 2024
- October 2024
- September 2024
- August 2024
- July 2024
- June 2024
- May 2024
- April 2024
- March 2024
- February 2024
- January 2024
- December 2023
- November 2023
- October 2023
- September 2023
- August 2023
- July 2023
- June 2023
- May 2023
- April 2023
- March 2023
- February 2023
- January 2023
- December 2022
- November 2022
- October 2022
- September 2022
- August 2022
- July 2022
- June 2022
- May 2022
- April 2022
- March 2022
- February 2022
- January 2022
- December 2021
- November 2021
- October 2021
- September 2021
- August 2021
- July 2021
- June 2021
- May 2021
- April 2021
- March 2021
- February 2021
- January 2021
- December 2020
- November 2020
- October 2020
- September 2020
- August 2020
- July 2020
- June 2020
- May 2020
- April 2020
- March 2020
- February 2020
- January 2020
- December 2019
- November 2019
- September 2018
- October 2017
- December 2011
- August 2010