Update April 13, 2023 — Amazon Elastic Compute Cloud (EC2) Trn1n instances, powered by AWS Trainium, are now generally available. Amazon EC2 Trn1n instances double the network bandwidth (compared to Trn1 instances) to 1600 Gbps of Elastic Fabric Adapter (EFA) to deliver even higher performance for training network-intensive generative artificial intelligence (AI) models, such as large […]| Amazon Web Services
In this post, we demonstrate how to deploy the DeepSeek-R1-Distill-Qwen-32B model using AWS DLCs for vLLM on Amazon EKS, showcasing how these purpose-built containers simplify deployment of this powerful open source inference engine. This solution can help you solve the complex infrastructure challenges of deploying LLMs while maintaining performance and cost-efficiency.| Amazon Web Services
Which EC2 Instance Type Is Ideal for Your Computing Needs?| blog.economize.cloud
AWS and Habana collaborated to enable EFA Peer Direct support on the Gaudi-based AWS DL1 instances, giving users a significant improvement in multi-instance model training performance.| Intel Gaudi Developers