Introducing OCI A4 Standard Instances: Delivering Next Gen Performance with AmpereOne® M

Team Ampere

15 December 2025

Ampere and Oracle announced today the general availability of A4 Standard shapes powered by AmpereOne® M, the latest generation of AmpereOne family of processors.

Building on the strong foundation established by our A1 and A2 generations, A4 Standard instances deliver enhanced performance, greater efficiency, and even better price-performance across a vast array of workloads, from traditional enterprise workloads to demanding AI inference tasks.

OCI Ampere A4: A New Level of Cloud Native Performance

A4 Standard delivers higher per-core performance and greater memory bandwidth than prior Ampere-based OCI A1 and A2 instances. For a 16vcpu (8 OCPU; 1 OCPU=2 AmpereOne M cores) VM configuration, customers can expect sizeable performance improvements on A4 compared to the previous generation A2, as shown in the results published by Oracle. For reference, a 24% performance improvement was measured on SPECrate®2017_int_base (est.) on A4 compared to A2.

With industry-leading pricing at $0.0138 per OCPU-hour and $0.0027 per GB-hour, A4 Standard delivers exceptional price-performance value. Industry standard benchmarks such as SPECrate®2017_int_base (est.) demonstrate 16% higher performance and 27% better price–performance on OCI A4 compared to AMD EPYC Turin–based E6 shapes.

Fig 1: SPECrate®2017_int_base (est.) Performance and Price-Performance for OCI A4 vs. E6 VMs

As organizations accelerate AI adoption, CPU-based inference offers a balance of performance, cost, and operational simplicity, making it a compelling solution for deploying models at scale.

A4 Standard Delivers Strong Performance for Modern LLM Inference

LLMs today demand more than just raw compute. They depend on high memory bandwidth, large token buffers, and fast data movement across cores. AmpereOne M was designed specifically to meet these needs.

With 192 cores, 12 channel DDR5 and a large, distributed cache architecture, it delivers highly scalable inference, predictable multi-tenant performance, and seamless integration into existing cloud environments. This architecture also allows for larger context windows and more complex model graph structures.

Results on STREAM Triad, a popular benchmark that measures memory bandwidth, show A4 Standard throughput at 143 GB/sec vs. E6 at 53 GB/s, a 3.8x performance advantage for A4, underscoring its suitability for LLM inference.

Fig 2: STREAM Triad Performance (GB/s) for OCI A4 vs. OCI E6 VMs

To further enhance LLM deployment and performance, AmpereOne M is supported by Ampere® AI Optimizer (AIO), a framework that helps developers transform, optimize and tune LLMs for Ampere’s architecture. It streamlines quantization, layer fusion, and format conversion to improve throughput and reduce latency, without requiring model rewrites. Ampere also maintains upstream contributions and ecosystem integrations with popular AI frameworks, ensuring that developers can bring models to production with minimal friction.

NVIDIA A10 is the only small AI inference solution in OCI. By comparing the number of simultaneous AI users that can be supported while still meeting a Service Level Agreement (SLA) of 10 tokens/second, a 90-core A4 Standard shape delivers a 2x price–performance advantage over NVIDIA’s A10 Bare Metal GPU instance on Llama 3.1 8B model inference.

Fig 3: Llama 3.1 8B Performance and Price-Performance for 90-core A4 VM vs. A10

These results show how compelling CPU-based inference in the cloud has now become with the OCI Ampere A4 Standard shape. Its price-performance advantage is even stronger when compared to other CPU-based shapes, reinforcing the benefits of the hardware-software co-design with AmpereOne M and AIO.

For more information:

Oracle Blog: Introducing OCI Ampere A4 Standard: Next-Generation Arm-Based Cloud Compute for Performance and Efficiency
Oracle Web Page: OCI Ampere Arm-based Compute
Uber Blog: How Uber, OCI™, and Ampere® Co-Optimized the OCI AmpereOne® M A4 Silicon
SPECrate® is a registered trademark of the Standard Performance Evaluation Corporation. For more information about SPEC, see www.spec.org

Disclaimer
All data and information contained herein is for informational purposes only, and Ampere reserves the right to change it without notice. This document may contain technical inaccuracies, omissions, and typographical errors, and Ampere is under no obligation to update or correct this information.

Ampere makes no representations or warranties of any kind, including express or implied guarantees of noninfringement, merchantability, or fitness for a particular purpose, and assumes no liability of any kind. Where data or information is sourced from or provided by third-party partners, Ampere does not independently verify such data or information and makes no representations or warranties regarding its accuracy, completeness, or reliability, and assumes no liability of any kind related to such third-party content. All information is provided “AS IS.”

This document is not an offer or a binding commitment by Ampere.

Introducing OCI A4 Standard Instances: Delivering Next Gen Performance with AmpereOne® M

OCI Ampere A4: A New Level of Cloud Native Performance

A4 Standard Delivers Strong Performance for Modern LLM Inference

Related Content