Model | Micro- architecture |
Launch | Core | Core clock (MHz) |
Shaders | Memory | Processing power (TFLOPS)[a] | CUDA compute capability[b] |
TDP (W) |
Notes, form factor | ||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CUDA cores (total) |
Base clock (MHz) | Max boost clock (MHz)[c] |
Bus type | Bus width (bit) |
Size (GB) |
Clock (MT/s) |
Bandwidth (GB/s) |
Half precision Tensor Core FP32 Accumulate |
Single precision (MAD or FMA) |
Double precision (FMA) | ||||||||
C870 GPU Computing Module[d] | Tesla | May 2, 2007 | 1× G80 | 600 | 128 | 1,350 | — | GDDR3 | 384 | 1.5 | 1,600 | 76.8 | No | 0.3456 | No | 1.0 | 170.9 | Internal PCIe GPU (full-height, dual-slot) |
D870 Deskside Computer[d] | May 2, 2007 | 2× G80 | 600 | 256 | 1,350 | — | GDDR3 | 2× 384 | 2× 1.5 | 1,600 | 2× 76.8 | No | 0.6912 | No | 1.0 | 520 | Deskside or 3U rack-mount external GPUs | |
S870 GPU Computing Server[d] | May 2, 2007 | 4× G80 | 600 | 512 | 1,350 | — | GDDR3 | 4× 384 | 4× 1.5 | 1,600 | 4× 76.8 | No | 1.3824 | No | 1.0 | 1U rack-mount external GPUs, connect via 2× PCIe (×16) | ||
C1060 GPU Computing Module[e] | April 9, 2009 | 1× GT200 | 602 | 240 | 1,296[2] | — | GDDR3 | 512 | 4 | 1,600 | 102.4 | No | 0.62208 | 0.07776 | 1.3 | 187.8 | Internal PCIe GPU (full-height, dual-slot) | |
S1070 GPU Computing Server "400 configuration"[e] | June 1, 2008 | 4× GT200 | 602 | 960 | 1296 | — | GDDR3 | 4× 512 | 4× 4 | 1,538.4 | 4× 98.5 | No | 2.4883 | 0.311 | 1.3 | 800 | 1U rack-mount external GPUs, connect via 2× PCIe (×8 or ×16) | |
S1070 GPU Computing Server "500 configuration"[e] | June 1, 2008 | 1,440 | — | No | 2,.7648 | 0.3456 | ||||||||||||
S1075 GPU Computing Server[e][3] | June 1, 2008 | 4× GT200 | 602 | 960 | 1,440 | — | GDDR3 | 4× 512 | 4× 4 | 1,538.4 | 4× 98.5 | No | 2.7648 | 0.3456 | 1.3 | 1U rack-mount external GPUs, connect via 1× PCIe (×8 or ×16) | ||
Quadro Plex 2200 D2 Visual Computing System[f] | July 25, 2008 | 2× GT200GL | 648 | 480 | 1,296 | — | GDDR3 | 2× 512 | 2× 4 | 1,600 | 2× 102.4 | No | 1.2442 | 0.1555 | 1.3 | Deskside or 3U rack-mount external GPUs with 4 dual-link DVI outputs | ||
Quadro Plex 2200 S4 Visual Computing System[f] | July 25, 2008 | 4× GT200GL | 648 | 960 | 1,296 | — | GDDR3 | 4× 512 | 4× 4 | 1,600 | 4× 102.4 | No | 2.4883 | 0.311 | 1.3 | 1,200 | 1U rack-mount external GPUs, connect via 2× PCIe (×8 or ×16) | |
C2050 GPU Computing Module[4] | Fermi | July 25, 2011 | 1× GF100 | 575 | 448 | 1,150 | — | GDDR5 | 384 | 3[g] | 3000 | 144 | No | 1.0304 | 0.5152 | 2.0 | 247 | Internal PCIe GPU (full-height, dual-slot) |
M2050 GPU Computing Module[5] | July 25, 2011 | — | 3,092 | 148.4 | No | 225 | ||||||||||||
C2070 GPU Computing Module[4] | July 25, 2011 | 1× GF100 | 575 | 448 | 1,150 | — | GDDR5 | 384 | 6[g] | 3,000 | 144 | No | 1.0304 | 0.5152 | 2.0 | 247 | Internal PCIe GPU (full-height, dual-slot) | |
C2075 GPU Computing Module[6] | July 25, 2011 | — | 3,000 | 144 | No | 225 | ||||||||||||
M2070/M2070Q GPU Computing Module[7] | July 25, 2011 | — | 3,132 | 150.336 | No | 225 | ||||||||||||
M2090 GPU Computing Module[8] | July 25, 2011 | 1× GF110 | 650 | 512 | 1,300 | — | GDDR5 | 384 | 6[g] | 3700 | 177.6 | No | 1.3312 | 0.6656 | 2.0 | 225 | Internal PCIe GPU (full-height, dual-slot) | |
S2050 GPU Computing Server | July 25, 2011 | 4× GF100 | 575 | 1792 | 1150 | — | GDDR5 | 4× 384 | 4× 3[g] | 3 | 4× 148.4 | No | 4.1216 | 2.0608 | 2.0 | 900 | 1U rack-mount external GPUs, connect via 2× PCIe (×8 or ×16) | |
S2070 GPU Computing Server | July 25, 2011 | — | 4× 6[g] | No | ||||||||||||||
K10 GPU accelerator[9] | Kepler | May 1, 2012 | 2× GK104 | — | 3,072 | 745 | ? | GDDR5 | 2× 256 | 2× 4 | 5,000 | 2× 160 | No | 4.577 | 0.1907 | 3.0 | 225 | Internal PCIe GPU (full-height, dual-slot) |
K20 GPU accelerator[10][11] | November 12, 2012 | 1× GK110 | — | 2,496 | 706 | 758 | GDDR5 | 320 | 5 | 5,200 | 208 | No | 3.524 | 1.175 | 3.5 | 225 | Internal PCIe GPU (full-height, dual-slot) | |
K20X GPU accelerator[12] | November 12, 2012 | 1× GK110 | — | 2,688 | 732 | ? | GDDR5 | 384 | 6 | 5,200 | 250 | No | 3.935 | 1.312 | 3.5 | 235 | Internal PCIe GPU (full-height, dual-slot) | |
K40 GPU accelerator[13] | October 8, 2013 | 1× GK110B | — | 2,880 | 745 | 875 | GDDR5 | 384 | 12[g] | 6,000 | 288 | No | 4.291–5.040 | 1.430–1.680 | 3.5 | 235 | Internal PCIe GPU (full-height, dual-slot) | |
K80 GPU accelerator[14] | November 17, 2014 | 2× GK210 | — | 4,992 | 560 | 875 | GDDR5 | 2× 384 | 2× 12 | 5,000 | 2× 240 | No | 5.591–8.736 | 1.864–2.912 | 3.7 | 300 | Internal PCIe GPU (full-height, dual-slot) | |
M4 GPU accelerator[15][16] | Maxwell | November 10, 2015 | 1× GM206 | — | 1,024 | 872 | 1,072 | GDDR5 | 128 | 4 | 5,500 | 88 | No | 1.786–2.195 | 0.05581–0.06861 | 5.2 | 50–75 | Internal PCIe GPU (half-height, single-slot) |
M6 GPU accelerator[17] | August 30, 2015 | 1× GM204-995-A1 | — | 1536 | 722 | 1,051 | GDDR5 | 256 | 8 | 4,600 | 147.2 | No | 2.218–3.229 | 0.0693–0.1009 | 5.2 | 75–100 | Internal MXM GPU | |
M10 GPU accelerator[18] | May 18th, 2016 | 4× GM107 | — | 2,560 | 1,033 | ? | GDDR5 | 4× 128 | 4× 8 | 5,188 | 4× 83 | No | 5.289 | 0.1653 | 5.2 | 225 | Internal PCIe GPU (full-height, dual-slot) | |
M40 GPU accelerator[16][19] | November 10, 2015 | 1× GM200 | — | 3,072 | 948 | 1,114 | GDDR5 | 384 | 12 or 24 | 6,000 | 288 | No | 5.825–6.844 | 0.182–0.2139 | 5.2 | 250 | Internal PCIe GPU (full-height, dual-slot) | |
M60 GPU accelerator[20] | August 30, 2015 | 2× GM204-895-A1 | — | 4,096 | 899 | 1,178 | GDDR5 | 2× 256 | 2× 8 | 5,000 | 2× 160 | No | 7.365–9.650 | 0.2301–0.3016 | 5.2 | 225–300 | Internal PCIe GPU (full-height, dual-slot) | |
P4 GPU accelerator[21] | Pascal | September 13, 2016 | 1× GP104 | — | 2,560 | 810 | 1,063 | GDDR5 | 256 | 8 | 6,000 | 192.0 | No | 4.147–5.443 | 0.1296–0.1701 | 6.1 | 50-75 | PCIe card |
P6 GPU accelerator[22][23] | March 24, 2017 | 1× GP104-995-A1 | — | 2,048 | 1,012 | 1,506 | GDDR5 | 256 | 16 | 3,003 | 192.2 | No | 6.169 | 0.1928 | 6.1 | 90 | MXM card | |
P40 GPU accelerator[21] | September 13, 2016 | 1× GP102 | — | 3,840 | 1,303 | 1,531 | GDDR5 | 384 | 24 | 7,200 | 345.6 | No | 10.007–11.758 | 0.3127–0.3674 | 6.1 | 250 | PCIe card | |
P100 GPU accelerator (mezzanine)[24][25] | April 5, 2016 | 1× GP100-890-A1 | — | 3,584 | 1,328 | 1,480 | HBM2 | 4,096 | 16 | 1,430 | 732 | No | 9.519–10.609 | 4.760–5.304 | 6.0 | 300 | SXM card | |
P100 GPU accelerator (16 GB card)[26] | June 20, 2016 | 1× GP100 | — | 1126 | 1303 | No | 8,071‒9,340 | 4,036‒4,670 | 250 | PCIe card | ||||||||
P100 GPU accelerator (12 GB card)[26] | June 20, 2016 | — | 3,072 | 12 | 549 | No | 8.071‒9.340 | 4.036‒4.670 | ||||||||||
V100 GPU accelerator (mezzanine)[27][28][29] | Volta | May 10, 2017 | 1× GV100-895-A1 | — | 5120 | Unknown | 1,455 | HBM2 | 4,096 | 16 or 32 | 1,750 | 900 | 119.192 | 14.899 | 7.450 | 7.0 | 300 | SXM card |
V100 GPU accelerator (PCIe card)[27][28][29] | June 21, 2017 | 1× GV100 | — | Unknown | 1,370 | 112.224 | 14.028 | 7.014 | 250 | PCIe card | ||||||||
V100 GPU accelerator (PCIe FHHL card) | March 27, 2018 | 1× GV100 | — | 937 | 1,290 | 16 | 1,620 | 829.44 | 105.68 | 13.21 | 6.605 | 250 | PCIe FHHL card | |||||
T4 GPU accelerator (PCIe card)[30][31] | Turing | September 12, 2018 | 1× TU104-895-A1 | — | 2,560 | 585 | 1,590 | GDDR6 | 256 | 16 | 5,000 | 320 | 64.8 | 8.1 | Unknown | 7.5 | 70 | PCIe card |
A2 GPU accelerator (PCIe card)[32] | Ampere | November 10, 2021 | 1× GA107 | — | 1,280 | 1,440 | 1,770 | GDDR6 | 128 | 16 | 6,252 | 200 | 18.124 | 4.531 | 0.14 | 8.6 | 40-60 | PCIe card (half height, single-slot) |
A10 GPU accelerator (PCIe card)[33] | April 12, 2021 | 1× GA102-890-A1 | — | 9,216 | 885 | 1,695 | GDDR6 | 384 | 24 | 6,252 | 600 | 124.96 | 31.24 | 0.976 | 8.6 | 150 | PCIe card (single-slot) | |
A16 GPU accelerator (PCIe card)[34] | April 12, 2021 | 4× GA107 | — | 4× 1,280 | 885 | 1,695 | GDDR6 | 4× 128 | 4× 16 | 7,242 | 4× 200 | 4x 18.432 | 4× 4.608 | 1.0848 | 8.6 | 250 | PCIe card (dual-slot) | |
A30 GPU accelerator (PCIe card)[35] | April 12, 2021 | 1× GA100 | — | 3,584 | 930 | 1,440 | HBM2 | 3,072 | 24 | 1,215 | 933.1 | 165.12 | 10.32 | 5.161 | 8.0 | 165 | PCIe card (dual-slot) | |
A40 GPU accelerator (PCIe card)[36] | October 5, 2020 | 1× GA102 | — | 10,752 | 1,305 | 1,740 | GDDR6 | 384 | 48 | 7,248 | 695.8 | 149.68 | 37.42 | 1.168 | 8.6 | 300 | PCIe card (dual-slot) | |
A100 GPU accelerator (PCIe card)[37][38] | May 14, 2020[39] | 1× GA100-883AA-A1 | — | 6,912 | 765 | 1410 | HBM2 | 5,120 | 40 or 80 | 1,215 | 1,555 | 312.0 | 19.5 | 9.7 | 8.0 | 250 | PCIe card (dual-slot) | |
H100 GPU accelerator (PCIe card)[40] | Hopper | March 22, 2022[41] | 1× GH100[42] | — | 14,592 | 1,065 | 1,755 CUDA 1620 TC | HBM2E | 5120 | 80 | 1,000 | 2,039 | 756.449 | 51.2 | 25.6 | 9.0 | 350 | PCIe card (dual-slot) |
H100 GPU accelerator (SXM card) | — | 16,896 | 1,065 | 1,980 CUDA 1,830 TC | HBM3 | 5,120 | 80 | 1,500 | 3,352 | 989.43 | 66.9 | 33.5 | 9.0 | 700 | SXM card | |||
L40 GPU accelerator[43] | Ada Lovelace | October 13, 2022 | 1× AD102[44] | — | 18,176 | 735 | 2,490 | GDDR6 | 384 | 48 | 2,250 | 864 | 362.066 | 90.516 | 1.414 | 8.9 | 300 | PCIe card (dual-slot) |
L4 GPU accelerator[45][46] | March 21, 2023[47] | 1x AD104[48] | — | 7,424 | 795 | 2,040 | GDDR6 | 192 | 24 | 1,563 | 300 | 121.0 | 30.3 | 0.49 | 8.9 | 72 | HHHL single slot PCIe card |
Notes
- ^ To calculate the processing power see Tesla (microarchitecture)#Performance, Fermi (microarchitecture)#Performance, Kepler (microarchitecture)#Performance, Maxwell (microarchitecture)#Performance, or Pascal (microarchitecture)#Performance. A number range specifies the minimum and maximum processing power at, respectively, the base clock and maximum boost clock.
- ^ Core architecture version according to the CUDA programming guide.
- ^ GPU Boost is a default feature that increases the core clock rate while remaining under the card's predetermined power budget. Multiple boost clocks are available, but this table lists the highest clock supported by each card.[1]
- ^ a b c Specifications not specified by Nvidia assumed to be based on the GeForce 8800 GTX
- ^ a b c d Specifications not specified by Nvidia assumed to be based on the GeForce GTX 280
- ^ a b Specifications not specified by Nvidia assumed to be based on the Quadro FX 5800
- ^ a b c d e f With ECC on, a portion of the dedicated memory is used for ECC bits, so the available user memory is reduced by 12.5%. (e.g. 4 GB total memory yields 3.5 GB of user available memory.)
References
- ^ "Nvidia GPU Boost For Tesla" (PDF). January 2014. Retrieved 7 December 2015.
- ^ "Tesla C1060 Computing Processor Board" (PDF). Nvidia.com. Retrieved 2015-12-11.
- ^ "Difference between Tesla S1070 and S1075". 31 October 2008. Retrieved January 29, 2017.
S1075 has one interface card
- ^ a b "Tesla C2050 and Tesla C2070 Computing Processor" (PDF). Nvidia.com. Retrieved 2015-12-11.
- ^ "Tesla M2050 and Tesla M2070/M2070Q Dual-Slot Computing Processor Modules" (PDF). Nvidia.com. Retrieved 2015-12-11.
- ^ "Tesla C2075 Computing Processor Board" (PDF). Nvidia.com. Retrieved 2015-12-11.
- ^ Hand, Randall (2010-08-23). "NVidia Tesla M2050 & M2070/M2070Q Specs OnlineVizWorld.com". VizWorld.com. Retrieved 2015-12-11.
- ^ "Tesla M2090 Dual-Slot Computing Processor Module" (PDF). Nvidia.com. Retrieved 2015-12-11.
- ^ "Tesla K10 GPU accelerator" (PDF). Nvidia.com. Retrieved 2015-12-11.
- ^ "Tesla K20 GPU active accelerator" (PDF). Nvidia.com. Retrieved 2015-12-11.
- ^ "Tesla K20 GPU accelerator" (PDF). Nvidia.com. Retrieved 2015-12-11.
- ^ "Tesla K20X GPU accelerator" (PDF). Nvidia.com. Retrieved 2015-12-11.
- ^ "Tesla K40 GPU accelerator" (PDF). Nvidia.com. Retrieved 2015-12-11.
- ^ "Tesla K80 GPU accelerator" (PDF). Images.nvidia.com. Retrieved 2015-12-11.
- ^ "Nvidia Announces Tesla M40 & M4 Server Cards - Data Center Machine Learning". Anandtech.com. Retrieved 2015-12-11.
- ^ a b "Accelerating Hyperscale Datacenter Applications with Tesla GPUs | Parallel Forall". Devblogs.nvidia.com. 2015-11-10. Retrieved 2015-12-11.
- ^ "Tesla M6" (PDF). Images.nvidia.com. Retrieved 2016-05-28.
- ^ "Tesla M10" (PDF). Images.nvidia.com. Retrieved 2016-10-29.
- ^ "Tesla M40" (PDF). Images.nvidia.com. Retrieved 2015-12-11.
- ^ "Tesla M60" (PDF). Images.nvidia.com. Retrieved 2016-05-27.
- ^ a b Smith, Ryan (13 September 2016). "Nvidia Announces Tesla P40 & Tesla P4 - Network Inference, Big & Small". Anandtech. Retrieved 13 September 2016.
- ^ "Tesla P6" (PDF). www.nvidia.com. Retrieved 2019-03-07.
- ^ "Tesla P6 Specs". www.techpowerup.com. Retrieved 2019-03-07.
- ^ Smith, Ryan (5 April 2016). "Nvidia Announces Tesla P100 Accelerator - Pascal GP100 for HPC". Anandtech.com. Anandtech.com. Retrieved 5 April 2016.
- ^ Harris, Mark. "Inside Pascal: Nvidia's Newest Computing Platform". Retrieved 13 September 2016.
- ^ a b Smith, Ryan (20 June 2016). "NVidia Announces PCI Express Tesla P100". Anandtech.com. Retrieved 21 June 2016.
- ^ a b Smith, Ryan (10 May 2017). "The Nvidia GPU Technology Conference 2017 Keynote Live Blog". Anandtech. Retrieved 10 May 2017.
- ^ a b Smith, Ryan (10 May 2017). "NVIDIA Volta Unveiled: GV100 GPU and Tesla V100 Accelerator Announced". Anandtech. Retrieved 10 May 2017.
- ^ a b Oh, Nate (20 June 2017). "NVIDIA Formally Announces V100: Available later this Year". Anandtech.com. Retrieved 20 June 2017.
- ^ "NVIDIA TESLA T4 TENSOR CORE GPU". NVIDIA. Retrieved 17 October 2018.
- ^ "NVIDIA Tesla T4 Tensor Core Product Brief" (PDF). www.nvidia.com. Retrieved 2019-07-10.
- ^ "NVIDIA TESLA A2 TENSOR CORE GPU".
- ^ "NVIDIA TESLA A10 TENSOR CORE GPU".
- ^ "NVIDIA TESLA A16 TENSOR CORE GPU".
- ^ "NVIDIA TESLA A30 TENSOR CORE GPU".
- ^ "NVIDIA TESLA A40 TENSOR CORE GPU".
- ^ "NVIDIA TESLA A100 TENSOR CORE GPU". NVIDIA. Retrieved 14 January 2021.
- ^ "NVIDIA Tesla A100 Tensor Core Product Brief" (PDF). www.nvidia.com. Retrieved 2020-09-22.
- ^ Smith, Ryan (May 14, 2020). "NVIDIA Ampere Unleashed: NVIDIA Announces New GPU Architecture, A100 GPU, and Accelerator". AnandTech.
- ^ "NVIDIA H100 Tensor Core GPU". NVIDIA. Retrieved 15 April 2024.
- ^ Mujtaba, Hassan (22 March 2022). "NVIDIA Hopper GH100 GPU Unveiled: The World's First & Fastest 4nm Data Center Chip, Up To 4000 TFLOPs Compute, HBM3 3 TB/s Memory". Wccftech. Retrieved 15 April 2024.
- ^ "NVIDIA H100 PCIe 80 GB Specs". TechPowerUp. 21 March 2023. Retrieved 15 April 2024.
- ^ "NVIDIA L40 GPU for Data Center". NVIDIA. 18 May 2023. Retrieved 15 April 2024.
- ^ "NVIDIA L40 Specs". TechPowerUp. 13 October 2022. Retrieved 15 April 2024.
- ^ "NVIDIA L4 Tensor Core GPU". NVIDIA. Retrieved 15 April 2024.
- ^ "NVIDIA ADA GPU Architecture" (PDF). nvidia.com. Retrieved 15 April 2024.
- ^ "NVIDIA and Google Cloud Deliver Powerful New Generative AI Platform, Built on the New L4 GPU and Vertex AI". NVIDIA Corporation. 21 March 2023. Retrieved 15 April 2024.
- ^ "NVIDIA L4 Specs". TechPowerUp. 21 March 2023. Retrieved 15 April 2024.