Amazon unveils next-generation Graviton4 and Trainium2 chips to power the business AI future


AWS has reaffirmed its intentions to be one of the world’s major hardware providers with the launch of its most powerful and efficient chips today.
The new Graviton4 and Trainium2 chips, unveiled at its AWS re:Invent 2023 event by CEO Adam Selipsky are designed to power the next generation of AI and machine learning models, offering more performance and efficiency than ever before.
“Silicon underpins every customer workload, making it a critical area of innovation for AWS,” said David Brown, vice president of Compute and Networking at AWS. “By focusing our chip designs on real workloads that matter to customers, we’re able to deliver the most advanced cloud infrastructure to them.
AWS Graviton4 and Trainium2
AWS is promising a major step up for Graviton4, claiming it provides up to 30% better compute performance, 50% more cores, and 75% more memory bandwidth than the current generation Graviton3 processors.
The company says that as they experiment and deploy more and more AI-powered workloads, customers are seeing their compute, memory, storage, and networking requirements increase, so need higher performance and larger instance sizes, all at an affordable cost and energy efficiency level to lessen any effects on the environment.
AWS is making Graviton4 available in Amazon EC2 R8g instances that it says offer larger instance sizes with up to three times more vCPUs and three times more memory than current generation, allowing customers to process larger amounts of data, scale their workloads, improve time-to-results, and lower their total cost of ownership.
Following its original launch in 2020, the second-generation Trainium2 looks to offer faster and more efficient training for current and future AI models that are using bigger datasets than ever, with today’s most advanced FMs and LLMs boasting hundreds of billions to trillions of parameters.
AWS says Trainium2 will deliver up to four times faster training than the first generation hardware, along with offering three times more memory capacity and can improve energy efficiency up to two times.
It can be deployed in EC2 UltraClusters of up to 100,000 chips, making it possible to train foundation models and large language models in a fraction of the time previously taken, with the company giving the example of training a a 300-billion parameter LLM in weeks versus months.
More from TechRadar Pro
AWS has reaffirmed its intentions to be one of the world’s major hardware providers with the launch of its most powerful and efficient chips today. The new Graviton4 and Trainium2 chips, unveiled at its AWS re:Invent 2023 event by CEO Adam Selipsky are designed to power the next generation of…
Recent Posts
- Reddit is reportedly experiencing some outages
- Google may be close to launching YouTube Premium Lite
- Someone wants to sell you a digital version of the antiquated typewriter but without a glued-on keyboard (no really)
- Carbon removal is the next big fossil fuel boom, oil company says
- This is probably the best looking docking station I’ve ever seen in my entire life – and I can’t wait to test it
Archives
- February 2025
- January 2025
- December 2024
- November 2024
- October 2024
- September 2024
- August 2024
- July 2024
- June 2024
- May 2024
- April 2024
- March 2024
- February 2024
- January 2024
- December 2023
- November 2023
- October 2023
- September 2023
- August 2023
- July 2023
- June 2023
- May 2023
- April 2023
- March 2023
- February 2023
- January 2023
- December 2022
- November 2022
- October 2022
- September 2022
- August 2022
- July 2022
- June 2022
- May 2022
- April 2022
- March 2022
- February 2022
- January 2022
- December 2021
- November 2021
- October 2021
- September 2021
- August 2021
- July 2021
- June 2021
- May 2021
- April 2021
- March 2021
- February 2021
- January 2021
- December 2020
- November 2020
- October 2020
- September 2020
- August 2020
- July 2020
- June 2020
- May 2020
- April 2020
- March 2020
- February 2020
- January 2020
- December 2019
- November 2019
- September 2018
- October 2017
- December 2011
- August 2010