Tumblr’s owner is striking deals with OpenAI and Midjourney for training data, says report


The owner of Tumblr and WordPress.com is in talks with AI companies Midjourney and OpenAI to provide training data scraped from users’ posts, a report from 404 Media alleges. The report, based on an anonymous source inside the company, says that deals between Automattic and the two AI companies are “imminent.” It follows nebulous rumors that have spread on Tumblr over the past week, suggesting a deal with Midjourney could provide a new revenue stream for the site.
According to 404’s report, Automattic plans to launch a new setting Wednesday that will “allow users to opt-out of data sharing with third parties, including AI companies.” But it cites internal posts that suggest the company scraped an “initial data dump” containing “all Tumblr’s public post content between 2014 and 2023,” including — apparently by mistake — content that wouldn’t be publicly visible on blogs. It’s unclear what was done with this data and what data (if any) has been sent to Midjourney and OpenAI.
OpenAI and Midjourney did not immediately respond to requests for comment from The Verge. Automattic directed us to a public statement it published on Tuesday following 404’s report. The post, titled “Protecting User Choice,” alludes to partnerships with unnamed AI companies. “We currently block, by default, major AI platform crawlers — including ones from the biggest tech companies — and update our lists as new ones launch,” it says, and “will share only public content that’s hosted on WordPress.com and Tumblr from sites that haven’t opted out.” It goes on to note that “we are also working directly with select AI companies as long as their plans align with what our community cares about: attribution, opt-outs, and control.”
A number of companies have struck deals with AI tool makers to provide training data — which has historically been scraped from publicly available online data, a process that’s become legally riskier in recent years. Reddit reportedly has a $60 million annual deal with Google, while Shutterstock has signed a deal with OpenAI to train on its photo library. But a number of artists and writers — in other words, the creative community that Tumblr in particular caters to — have protested their work being used for training. Companies have struggled to walk a line between satisfying users and experimenting with new AI tools, leading to backlash against online spaces like DeviantArt that have flirted with the tech.
For now, there’s not much information about what any deal would entail or how much Automattic stands to gain from it. The company has a long-standing web hosting business with WordPress.com and WordPress VIP, both built on the open-source WordPress software. But it has struggled with a variety of methods for monetizing Tumblr, which it acquired from Verizon in 2019, and announced last year that it would downscale its ambitions for the site.
Update 3:50PM ET: Added statement from Automattic.