How to Choose a GPU?

Introduction

Novita AI offers a variety of GPU container cloud computing specifications to meet different virtualization capability needs, supporting a wide range of business applications and service scenarios. Each computing specification is divided into several families based on hardware capability differences. Each family utilizes different Intel processors, CPU/memory ratios, GPU cards, cloud disk types, and network card virtualization methods to achieve differentiated computing, storage, and network performance. Each family is further subdivided into various instance specifications, with higher specifications offering stronger performance.

How to Choose a GPU

Model	Memory	Single Precision (FP32)	Half Precision (FP16)	Description
Tesla P40	24 GB	11.76 T	11.76 T	Based on the earlier Pascal architecture, suitable for tasks requiring large memory and using versions of cuda before 11.x.
TITAN XP	12 GB	12.15 T	12.15 T	An older model under the Pascal architecture, a suitable entry-level choice for beginners.
1080 Ti	11 GB	11.34 T	11.34 T	A product of the same generation as TITAN XP, suitable for entry-level users, though its 11 GB memory may be limiting in some cases.
2080Ti	11 GB	13.45 T	53.8 T	Turing architecture GPU, offers good performance, especially suitable for mixed precision computing scenarios, with a high cost-performance ratio.
V100	16/32 GB	15.7 T	125 T	A high-end product designed for professional computing scenarios, especially suitable for high half-precision computing tasks, leading the previous generation of compute cards.
3060	12 GB	12.74 T	About 24 T	If the memory of 1080 Ti does not meet the requirements, 3060 provides a good alternative, especially suitable for beginners and requires the use of cuda 11.x.
A4000	16 GB	19.17 T	About 76 T	Balanced memory and computing power, suitable for intermediate users. Requires the use of cuda 11.x environment.
3080Ti	12 GB	34.10 T	About 70 T	Excellent performance output makes it an ideal choice for scenarios not requiring extreme memory. Requires the use of cuda 11.x.
A5000	24 GB	27.77 T	About 117 T	High-performance GPU, suitable for scenarios requiring large memory and high half-precision computing power. Requires the use of cuda 11.x.
3090	24 GB	35.58 T	About 71 T	Provides excellent performance and memory quota, suitable for a wide range of application scenarios, the first choice for cost-effectiveness. Requires the use of cuda 11.x.
A40	48 GB	37.42 T	149.7 T	Huge memory capacity, computing power close to 3090, suitable for computing tasks with extremely high memory requirements. Requires the use of cuda 11.x.
A100 SXM4	40/80 GB	19.5 T	312 T	Top professional computing GPU, with huge memory and half-precision computing capability, suitable for the most complex computing tasks. Supports NVLink, optimized for multi-card parallel computing. Requires the use of cuda 11.x.
4090	24 GB	82.58 T	165.2 T	A new generation high-performance GPU, provides excellent single precision and half precision computing capability, suitable for scenarios with high cost-performance ratio. Apart from relatively small memory, it has almost no obvious shortcomings.

Notes

GPU instance prices and configuration options (system disk, data disk, public IP, etc.) can be viewed on the console page. These prices are for reference only, and the actual order on the Novita AI console shall prevail.
Instance specifications vary by region and availability zone; please refer to the actual display on the console.
GPU instances can only be changed within the same specification family. For detailed information, please refer to the document on modifying instance specifications.
For purchasing instances, please refer to the sections on purchasing GPU container cloud computing instances and purchasing high-performance computing GPU instances.

Actual supply is subject to the GPU Market.

Get started

Model APIs

Agent Sandbox

GPUs

Observability

Resources

Introduction