Meta admits it scraped all Australian Facebook posts since 2007 to train its AI


Meta has admitted it used public Facebook and Instagram posts from Australian users to train its Artificial Intelligence models, and has scraped information from as far back as 2007.
An Australian Parliamentary committee has heard that whilst European users can opt out thanks to GDPR laws, Australian customers are not given that choice.
Meta has denied using the information of anyone under 18, but did confirm it had used over a decade’s worth of data. The firm could not answer whether it has scraped the photos of children who are now adults (i.e. those who created their accounts as a child, but have since turned 18).
A turning tide
The process of ‘scraping’, which is essential for the development of AI, is basically data harvesting: extracting information from websites and feeding it to Large Language Models (LLMs), which learn from the data. This means that GDPR regulations are becoming troublesome for more and more LLM-based products such as ChatGPT, which collect data from all over the internet without consent from the original source.
Meta’s global privacy director Melinda Claybaugh sat before the inquiry and admitted that the company was forced to pause the launch of AI products in Europe due to a lack of certainty, and that it has had to give European users an opt-out due to more robust privacy laws. Senator Shoebridge grilled the Meta representative:
“The truth of the matter is that, unless you consciously had set those posts to private, since 2007, Meta has just decided you will scrape all of the photos and all of the text from every public post on Instagram or Facebook that Australians have shared since 2007, unless there was a conscious decision to set them on private. But that’s actually the reality, isn’t it?”
Claybaugh replied, “Correct”. She added that users can set their posts to private now to prevent future scraping, but this would have no effect on the data already taken.
The realization seems to be creeping in for the public and for tech companies that training AI models requires such vast amounts of data that it is ‘impossible’ to do so without using copyrighted materials. Considering millions of users’ posts have been used without their consent, it looks like tech giants might face much stricter regulations in future.
Via The Guardian