AWS Announces New Amazon EC2 Instances and Networking Enhancements
New A1 instances are the first to be powered by custom AWS Graviton processors based on the Arm architecture, reducing costs up to 45 percent for scale-out workloads like microservices and web servers
100 Gbps networking capability in new P3 and C5 instances deliver increased throughput for scale-out, distributed workloads like machine learning and high performance computing
New latency-optimized Elastic Fabric Adapter scales tightly-coupled high performance computing applications across tens of thousands of cores
AWS Global Accelerator improves availability and performance for geographically distributed applications by intelligently routing internet traffic
- A1 instances—powered by custom-designed AWS Graviton processors for scale-out workloads
- P3dn GPU instances—ideal for distributed machine learning and high performance computing applications
- C5n instances—deliver increased network bandwidth for running advanced compute-intensive workloads
The new P3dn GPU and C5n compute optimized instances feature 100 Gbps networking throughput and enable scale-out of distributed workloads like high performance computing (HPC), machine learning training, and data analytics.
AWS also announced the availability of two new networking offerings:
- Elastic Fabric Adapter (EFA)—a network adapter for Amazon EC2 instances that delivers the performance of on-premises HPC clusters with AWS’s elasticity and scalability
- AWS Global Accelerator—a fully managed service that uses AWS's global backbone and edge locations to improve the availability and performance of applications running in one or more AWS regions
“Two of the requests we get most from customers are how can you help us keep lowering our costs for basic workloads, and how can you make it more efficient to run our demanding, scale-out, high performance computing and machine learning workloads in the cloud,” said
New A1 instances feature custom-designed AWS Graviton processors that deliver significant cost saving for scale-out workloads
Customers increasingly run a diverse set of workloads in the cloud and are looking for solutions that lower costs without compromising performance. Although general purpose processors continue to provide great value for many workloads, new and emerging scale-out workloads like containerized microservices and web tier applications that do not rely on the x86 instruction set can gain additional cost and performance benefits from running on smaller and modern 64-bit Arm processors that work together to share an application’s computational load. Available today, A1 instances feature a custom-designed processor (Graviton) that leverages AWS’s extensive expertise in systems design and cloud infrastructure, making Arm processors available in the cloud for the first time. With A1 instances, customers will benefit from up to a 45 percent cost reduction (compared to other Amazon EC2 general purpose instances) for scale-out workloads. A1 instances are supported by several Linux distributions, including Amazon Linux 2,
SmugMug is a paid image sharing, image hosting and online video platform. “We are constantly striving to make our service more affordable for our fast growing customer base,” said
New P3dn and C5n Instances feature100 Gbps networking performance, accelerating computationally demanding workloads
Today, many customers are turning to AWS compute-optimized C5 instances and GPU-powered P3 instances for some of the most compute intensive workloads in the cloud. From machine learning training, to HPC applications like computational fluid dynamics and weather simulations, to video encoding, these workloads benefit from powerful processors and high speed networking. AWS is the first cloud provider to deliver 100 Gbps networking performance in a secure, scalable, and elastic manner so that customers can use it not just for HPC, but also for analytics, machine learning, big data, and data lake workloads with standard drivers and protocols.
- P3dn instances (available next week) will be the most powerful GPU instances in the cloud for machine learning training. P3 instances already help customers accelerate machine learning model training time from several days to a few hours, and with the 100 Gbps networking performance of the new larger size P3dn instances, customers can further lower their training times to less than an hour by distributing their machine learning workload across multiple GPU instances. Since launching P3 instances in
October 2017, customer demand for higher performance compute has increased as machine learning adoption grows and customers apply it to tackle more complex applications. The new P3dn instances deliver a 4X increase in network throughput compared to existing P3 instances, providing up to 100 Gbps of networking throughput, fast NVMe instance storage, custom IntelCPUs with 96 vCPUs and support for AVX512 instructions, and NVIDIATesla V100 GPUs each with 32 GB of memory. This enables developers to linearly scale their model training performance across multiple instances, accelerate preprocessing and remove data transfer bottlenecks, and rapidly improve the quality of their machine learning models.
fast.ai is an organization dedicated to making the power of deep learning accessible to all. “We have been using Amazon EC2 P3 instances to demonstrate how machine learning models can be trained quickly, easily, and inexpensively. In
- C5n instances (available today) significantly increase the maximum throughput performance available in AWS’s compute-intensive instance family. C5 instances offer up to 25 Gbps of network bandwidth addressing the requirements of a wide range of workloads, but highly distributed and HPC applications can benefit from even higher network performance. C5n instances offer 100 Gbps of network bandwidth, providing four times as much throughput as C5 instances. This performance increase delivered with C5n instances enables previously network bound applications to scale up or scale out effectively on AWS. Customers can also take advantage of higher network performance to accelerate data transfer to and from Amazon Simple Storage Service (Amazon S3), reducing the data ingestion wait time for applications and speeding up delivery of results.
Low-latency Elastic Fabric Adapter aids migration of HPC workloads to AWS
Customers typically rely on fixed-size, on-premises HPC systems. Because HPC systems are capital-intensive and expensive, companies tend to under-procure this capacity, resulting in long wait times that decrease productivity, limit experimentation, and delay critical project work. Moreover, customers are held captive to the available hardware capabilities and technologies until the next infrastructure refresh cycle – forcing developers to adapt their applications to the infrastructure instead of the other way around. EFA (available in preview today) enhances the performance of inter-instance communications that is critical for scaling HPC applications, providing customers the performance they expect from on-premises HPC clusters in the cloud, with the added benefits of AWS’s elasticity and scalability. EFA is integrated with the Message Passing Interface (MPI), which allows HPC applications to scale to tens of thousands of CPU cores without any modification. EFA is available on Amazon EC2 P3dn and C5n instances, and it will be enabled on additional Amazon EC2 instance types in 2019, giving customers added flexibility to choose the right compute configuration for their workloads on-demand, without any upfront planning.
AWS Global Accelerator improves availability and performance of globally distributed applications
As customers scale for a larger and more geographically-diverse set of users, they have to operate with better availability and improved performance. These customers deploy applications in multiple AWS regions for better performance, but this means they have to route users to the right healthy application endpoint. Further, they must regularly scale up or down the application’s endpoints, each with their own IP address, in response to application failures, performance testing, or spikes in traffic. As their applications scale, they also have to update every client that connects to the application (typically done via a Domain Name Service), which increases the time it takes for these clients to discover the new endpoints. AWS Global Accelerator (available today) improves the availability and performance of applications and makes it simple to direct internet traffic from users to application endpoints running in multiple AWS regions. It uses AWS’s vast, highly available and congestion-free global network backbone and edge locations to direct internet traffic from users to the application endpoints. Clients are directed to the right application endpoint based on their geographic location, application health, and customer-configurable routing policies. AWS Global Accelerator also allocates a set of static Anycast IP addresses that are unique per application and do not change, thus removing the need to update clients as the application scales. Application endpoints are continuously monitored and AWS Global Accelerator only directs clients to healthy endpoints without any need to change client configuration.
About Amazon Web Services
For over 12 years, Amazon Web Services has been the world’s most comprehensive and broadly adopted cloud platform. AWS offers over 125 fully featured services for compute, storage, databases, networking, analytics, machine learning and artificial intelligence (AI),