Meta wants to fix broken Wikipedia citations for good


Wikipedia may be the go-to resource on almost everything these days, but according to Meta, it’s filled with dodgy, inaccurate citations.
But don’t worry, the company says its AI is here to help, having developed Sphere, a model capable of automatically scanning hundreds of thousands of citations at once to check whether they truly support the corresponding claims.
Meta claims it created a new dataset of 134 million public web pages as a knowledge source for the model, which says is “an order of magnitude larger and significantly more intricate than ever used for this sort of research”.
Sphere (opens in new tab) uses open web data rather than traditional, proprietary search engines such as Google, and has already compiled 134 million documents from across the web.
Built using CCNet (opens in new tab), a variant of Common Crawl, Meta says Sphere will help other AI researchers working on knowledge retrieval projects.
Meta says the eventual goal of the project is to build a platform to help Wikipedia editors systematically spot citation issues and quickly fix the citation or correct the content of the corresponding article at scale.
The tool reportedly calls attention to questionable citations, allowing human editors to evaluate the cases most likely to be flawed without having to sift through thousands of properly cited statements.
If a citation seems irrelevant, Meta says its model will suggest a more applicable source, even pointing to the specific passage that supports the claim.
The news comes as Wikipedia is looking towards new ways of raising revenue other than donations.
The Wikimedia Enterprise (opens in new tab) platform recently announced it would start to charge companies such as Google, Amazon, and Facebook which use Wikipedia as a resource.
READ MORE:
You can grab the source code for the project on GitHub here (opens in new tab), and interested parties can also read a full write-up of the project’s findings here (opens in new tab) or access the demo here (opens in new tab).
Audio player loading… Wikipedia may be the go-to resource on almost everything these days, but according to Meta, it’s filled with dodgy, inaccurate citations. But don’t worry, the company says its AI is here to help, having developed Sphere, a model capable of automatically scanning hundreds of thousands of citations…
Recent Posts
- Framework’s first tiny Desktop beautifully straddles the line between cute and badass
- Samsung Galaxy Z Fold 7 unofficial renders tease a slimmer design and a bigger, hidden-in-plain-sight upgrade
- Netflix drops an uneasy new teaser for You season 5, and I can’t help but laugh as killer Casanova Joe calls himself ‘the luckiest guy in New York’
- Popular Android financial help app is actually dangerous malware
- Our Favorite Internal SSD Is on Sale Right Now
Archives
- February 2025
- January 2025
- December 2024
- November 2024
- October 2024
- September 2024
- August 2024
- July 2024
- June 2024
- May 2024
- April 2024
- March 2024
- February 2024
- January 2024
- December 2023
- November 2023
- October 2023
- September 2023
- August 2023
- July 2023
- June 2023
- May 2023
- April 2023
- March 2023
- February 2023
- January 2023
- December 2022
- November 2022
- October 2022
- September 2022
- August 2022
- July 2022
- June 2022
- May 2022
- April 2022
- March 2022
- February 2022
- January 2022
- December 2021
- November 2021
- October 2021
- September 2021
- August 2021
- July 2021
- June 2021
- May 2021
- April 2021
- March 2021
- February 2021
- January 2021
- December 2020
- November 2020
- October 2020
- September 2020
- August 2020
- July 2020
- June 2020
- May 2020
- April 2020
- March 2020
- February 2020
- January 2020
- December 2019
- November 2019
- September 2018
- October 2017
- December 2011
- August 2010