AWS Trainium

Realizes: High-throughput transformer and large-scale neural network training across sparse and dense workloads on EC2 Trn1 clusters.

AWS's custom machine learning training accelerator, which powers EC2 Trn1 instances. Trn1 clusters use Elastic Fabric Adapter (EFA) networking, not InfiniBand, for multi-node synchronization, and run dense and sparse machine learning workloads at scale.

Examples

AWS Trn1 instances

AWS documentation describing EC2 Trn1 instances powered by Trainium, highlighting clusters connected by Elastic Fabric Adapter (EFA) networking and optimized for large-scale, high-throughput ML training across multiple nodes.
