

#### Leveraging EPI Outcomes for an Open Source Cloud-Based Services Ecosystem

The Role of SiPearl's ARM-Based CPUs

**Roberto MOSTALLINO** 

Novembre 2024

### Agenda

- #1 Market Opportunities for ARM CPUs in the Cloud
- **#2** SiPearl in the ARM ecosystem
- **#3** Overview of SiPearl contribution to RISER Project
- #4 SiPearl's ARM-Based CPU & Server Blades Seine platform



# Market Opportunities for ARM CPUs in the Cloud

#### Semiconductor Market is expected to growth at 8% CAGR until 2028



#### **Server Market Forecast WW**

Forecast 2022-2028 of Server Revenue by CPU

![](_page_4_Figure_2.jpeg)

X86 shows a linear progress and ARM based server is the second biggest market

### Server/CPU Forecast for ARM in WW

![](_page_5_Figure_1.jpeg)

#### The WW market for ARM CPU will increase by a factor of 3 in terms of shipment and revenue in four years.

### **Server/CPU Forecast for ARM in Europe**

![](_page_6_Figure_1.jpeg)

# The European market for ARM CPU will increase by a factor of 3 in terms of shipment and revenue in 4 years.

#### **Server Market Share Forecast by CPU**

Global server forecast by Instruction Set Architecture (ISA)

![](_page_7_Figure_2.jpeg)

Although x86 will maintain its dominant position in the WW market, the ARM ecosystem will see a 2,5x increase in market share in 4 years.

![](_page_8_Picture_0.jpeg)

# SiPearl in the ARM ecosystem

### **Competitive positioning Highlights**

Market analysts predict a shift of market share, with ARM gaining traction at the cost of x86

![](_page_9_Figure_2.jpeg)

#### Software Ecosystem

#### **C**omparison of Arm and x86

Illustration of factors assessed in this research

- x86 Per-core performance - Arm Programming High Processor language support core count Virtualization Price-performance support ratio Low Software Energy ecosystem efficiency Operating system Ease of processor maturity (Linux) customization Operating system maturity (Windows) Source: Gartner 805675\_C

Since ARM's entry into the server CPU market in 2008, its superior price-performance ratio and energy efficiency has fueled robust growth year after year.

Copyright © SiPearl 2024

11

### **Datacenter Software Ecosystem maturity**

![](_page_11_Figure_1.jpeg)

### **Server Based CPU landscape**

![](_page_12_Figure_1.jpeg)

### SiPearl in the value chain

![](_page_13_Figure_1.jpeg)

![](_page_14_Picture_0.jpeg)

# Overview of SiPearl contribution to RISER Project

### **RISER – project summary and SiPearl role**

#### Scope

- Leverage the upstream of EPI and EUPilot projects to develop the first all-European RISC-V cloud server infrastructure.
- Europe's open strategic autonomy in the semiconductor technology market

#### **SiPearl contribution**

- Seine Platform based on RHEA processor and integrated with the PCIe acceleration board
- Requirement and use case definition for Riser prototype

#### Future

- Usage of Seine Platform for Cloud and Al applications
- Bring up of the demonstration Riser platform
- Development of drivers for Riser accelerator
- Evaluation of different use cases

### Hyperstack Hardware Modules

#### Logical Blocks overlaid on Physical Blocks for a Datacenter-ready Integrated System (DC-Stack)

![](_page_16_Figure_2.jpeg)

### **After Riser: Higher**

**Open Compute Project OCP** 

The OCP focuses on redesigning hardware technology to efficiently support the growing demands on compute infrastructure

OCP Server Project: provides standardized server system specifications for scale computing

#### DC-MHS: Data Center - Modular Hardware System

 Interoperability between key elements of datacenter, by providing consistent interfaces and form factors among modular building blocks

![](_page_17_Figure_6.jpeg)

#### New Model w / DC-MHS

![](_page_17_Figure_8.jpeg)

18

### **SiPearl role in Higher**

• Typical Higher DC-MHS server composed of:

**DC-SCM (Security & Control Module)** 

Interchangeable Host Processor Modules based on European processors technology: HPM-Rhea2, HPM-EUPILOT

**OCP Compliant Network Interface Card (NIC)** 

PCIe Acceleration board inherited from Riser1

• In this context:

SiPearl to develop a Rhea2 based HPM (dual socket)

Exapsys and SiPearl develop the EPI-based DC-SCM (BMC board)

![](_page_18_Figure_9.jpeg)

![](_page_19_Picture_0.jpeg)

## SiPearl's ARM-Based CPU & Server Blades Seine platform

# RHEAT

HPC and AI processor

Designed with

80 arm<sup>®</sup> Neoverse V1 cores with 2 x 256 SVE each

4 x HBM

4 x DDR5 interfaces

![](_page_20_Picture_6.jpeg)

### Rheal, our 1st generation processor

High performance per watt: Arm ISA power efficiency

Very high memory bandwidth

#### **Built-in HBM**

• Ideal performances for AI inference

Unique memory architecture: High Byte/Flops

#### Openess

Arm ecosystem from IoT/edge to HPC and cloud

Fully auditable & backdoor-free

Pre-integration with proven accelerator (AMD, Intel, Nvidia)

Rheal will deliver extraordinary performance and efficiency with an unmatched Byte/Flops ratio.

Confidential – Under NDA - Copyright © SiPearl 2024

![](_page_21_Picture_12.jpeg)

### **Rheal CPU performances**

![](_page_22_Figure_1.jpeg)

(\*)CPU with high HPCG efficiency can perform the same amount of work using less electrical power  $_{23}$ 

#### What are LLMs?

Large Language Models (LLMs) are a category of foundation models trained on immense data sets. They have the potential to improve productivity across industries and academia to solve the world's toughest problems.

![](_page_23_Figure_2.jpeg)

#### LLMs can equal or surpass human performance

- Sentiment analysis
- Clinical Note Summarization
- Scenario generation & recommended measures
- Mental Health support
- Scientific explanation

• ...

LLMs exist as a subset of deep learning models, which are a subset of machine learning models

### **Key considerations**

![](_page_24_Figure_1.jpeg)

### Maturity of Software stack for AI on ARM

![](_page_25_Figure_1.jpeg)

#### Covers Inference, Finetuning, Quantization and Training

#### Common characteristics of HPC and AI /SLM /LLM workloads

#### Challenges are similar for AI/LLM and HPC

Al tends to be more demanding in latency HPC tends to be more demanding in Bandwidth

**HPC domain** 

AI /SLM /LLM domain

![](_page_26_Figure_5.jpeg)

### **Rheal supporting Al**

HBM allows state of the art models to be tackled with lower cost and lower power while minimizing latency.

Rhea supports state of the art precision types (int8, bfloat16) for LLM

![](_page_27_Figure_3.jpeg)

### Server board design as reference design

#### A key step in the development of high-performance microprocessors

- · Internal testing and characterization of silicon
- Software development and integration
- Demos

#### An essential prerequisite for the server design of our direct customers

Reduced design costs

#### A platform to be used by our partners, EU & local projects, customers

- Software development and integration
- Performance evaluation

# A design meeting all the functional requirements of different uses

**#1** Reference design

**#2** Validation & testing

**#3** Multiple demos

A multifonctional, flexible & versatile platform

# In response: a single modular reference server solution

#### **Overview of Seine Platform**

![](_page_30_Figure_2.jpeg)

![](_page_30_Picture_3.jpeg)

- #1 EVB Board
- #2 Power Board
- #3 Rear-IO board
- #4 PCIe Riser cards
- #5 ACDC power supply
- #6 GPU boards connected to PCIe
- #7 PCIe IO boards (Example: NIC) connected to PCIe riser cards

### A modular solution for every application (1/3)

#### Config #1 Single-CPU with 2xGPU for...

![](_page_31_Picture_2.jpeg)

- Al inference: Bert, Stable Diffusion, Llama7b
- Offloading memory operation workload
- Al inference & Training

### A modular solution for every application (2/3)

![](_page_32_Picture_1.jpeg)

![](_page_32_Picture_2.jpeg)

### A modular solution for every application (3/3)

#### Config #3

#### Single-CPU with 1xGPU & 1xSATA

![](_page_33_Picture_3.jpeg)

![](_page_33_Picture_4.jpeg)

### Seine server is now available and ready for Rheal

![](_page_34_Picture_1.jpeg)