Nvidia Tesla - Misplaced Pages

(Redirected from Nvidia tesla) Nvidia's line of general purpose GPUs This article is about GPGPU cards. For the GPU microarchitecture, see Tesla (microarchitecture). "Tesla P100" redirects here. For the line of performance cars by Tesla Motors (P100D), see Tesla Model S and Tesla Model X.

Nvidia Tesla

Manufacturer	Nvidia
Introduced	May 2, 2007; 17 years ago (2007-05-02)
Discontinued	The brand Tesla discontinued in May 2020; 4 years ago (2020-05), now branded as Nvidia Data Center GPUs
Type	General purpose graphics cards

Nvidia Tesla is the former name for a line of products developed by Nvidia targeted at stream processing or general-purpose graphics processing units (GPGPU), named after pioneering electrical engineer Nikola Tesla. Its products began using GPUs from the G80 series, and have continued to accompany the release of new chips. They are programmable using the CUDA or OpenCL APIs.

The Nvidia Tesla product line competed with AMD's Radeon Instinct and Intel Xeon Phi lines of deep learning and GPU cards.

Nvidia retired the Tesla brand in May 2020, reportedly because of potential confusion with the brand of cars. Its new GPUs are branded Nvidia Data Center GPUs as in the Ampere-based A100 GPU.

Nvidia DGX servers feature Nvidia GPGPUs.

Overview

Offering computational power much greater than traditional microprocessors, the Tesla products targeted the high-performance computing market. As of 2012, Nvidia Teslas power some of the world's fastest supercomputers, including Summit at Oak Ridge National Laboratory and Tianhe-1A, in Tianjin, China.

Tesla cards have four times the double precision performance of a Fermi-based Nvidia GeForce card of similar single precision performance. Unlike Nvidia's consumer GeForce cards and professional Nvidia Quadro cards, Tesla cards were originally unable to output images to a display. However, the last Tesla C-class products included one Dual-Link DVI port.

Applications

Tesla products are primarily used in simulations and in large-scale calculations (especially floating-point calculations), and for high-end image generation for professional and scientific fields.

In 2013, the defense industry accounted for less than one-sixth of Tesla sales, but Sumit Gupta predicted increasing sales to the geospatial intelligence market.

Specifications

Model	Micro- architecture	Launch	Core	Core clock (MHz)	Shaders			Memory					Processing power (TFLOPS)			CUDA compute capability	TDP (W)	Notes, form factor
Model	Micro- architecture	Launch	Core	Core clock (MHz)	CUDA cores (total)	Base clock (MHz)	Max boost clock (MHz)	Bus type	Bus width (bit)	Size (GB)	Clock (MT/s)	Bandwidth (GB/s)	Half precision Tensor Core FP32 Accumulate	Single precision (MAD or FMA)	Double precision (FMA)	CUDA compute capability	TDP (W)	Notes, form factor
C870 GPU Computing Module	Tesla	May 2, 2007	1× G80	600	128	1,350	—	GDDR3	384	1.5	1,600	76.8	No	0.3456	No	1.0	170.9	Internal PCIe GPU (full-height, dual-slot)
D870 Deskside Computer		May 2, 2007	2× G80	600	256	1,350	—	GDDR3	2× 384	2× 1.5	1,600	2× 76.8	No	0.6912	No	1.0	520	Deskside or 3U rack-mount external GPUs
S870 GPU Computing Server		May 2, 2007	4× G80	600	512	1,350	—	GDDR3	4× 384	4× 1.5	1,600	4× 76.8	No	1.3824	No	1.0		1U rack-mount external GPUs, connect via 2× PCIe (×16)
C1060 GPU Computing Module		April 9, 2009	1× GT200	602	240	1,296	—	GDDR3	512	4	1,600	102.4	No	0.62208	0.07776	1.3	187.8	Internal PCIe GPU (full-height, dual-slot)
S1070 GPU Computing Server "400 configuration"		June 1, 2008	4× GT200	602	960	1296	—	GDDR3	4× 512	4× 4	1,538.4	4× 98.5	No	2.4883	0.311	1.3	800	1U rack-mount external GPUs, connect via 2× PCIe (×8 or ×16)
S1070 GPU Computing Server "500 configuration"		June 1, 2008	4× GT200	602	960	1,440	—	GDDR3	4× 512	4× 4	1,538.4	4× 98.5	No	2,.7648	0.3456	1.3	800
S1075 GPU Computing Server		June 1, 2008	4× GT200	602	960	1,440	—	GDDR3	4× 512	4× 4	1,538.4	4× 98.5	No	2.7648	0.3456	1.3		1U rack-mount external GPUs, connect via 1× PCIe (×8 or ×16)
Quadro Plex 2200 D2 Visual Computing System		July 25, 2008	2× GT200GL	648	480	1,296	—	GDDR3	2× 512	2× 4	1,600	2× 102.4	No	1.2442	0.1555	1.3		Deskside or 3U rack-mount external GPUs with 4 dual-link DVI outputs
Quadro Plex 2200 S4 Visual Computing System		July 25, 2008	4× GT200GL	648	960	1,296	—	GDDR3	4× 512	4× 4	1,600	4× 102.4	No	2.4883	0.311	1.3	1,200	1U rack-mount external GPUs, connect via 2× PCIe (×8 or ×16)
C2050 GPU Computing Module	Fermi	July 25, 2011	1× GF100	575	448	1,150	—	GDDR5	384	3	3000	144	No	1.0304	0.5152	2.0	247	Internal PCIe GPU (full-height, dual-slot)
M2050 GPU Computing Module		July 25, 2011	1× GF100	575	448	1,150	—	GDDR5	384	3	3,092	148.4	No	1.0304	0.5152	2.0	225	Internal PCIe GPU (full-height, dual-slot)
C2070 GPU Computing Module		July 25, 2011	1× GF100	575	448	1,150	—	GDDR5	384	6	3,000	144	No	1.0304	0.5152	2.0	247	Internal PCIe GPU (full-height, dual-slot)
C2075 GPU Computing Module		July 25, 2011					—				3,000	144	No				225
M2070/M2070Q GPU Computing Module		July 25, 2011					—				3,132	150.336	No				225
M2090 GPU Computing Module		July 25, 2011	1× GF110	650	512	1,300	—	GDDR5	384	6	3700	177.6	No	1.3312	0.6656	2.0	225	Internal PCIe GPU (full-height, dual-slot)
S2050 GPU Computing Server		July 25, 2011	4× GF100	575	1792	1150	—	GDDR5	4× 384	4× 3	3	4× 148.4	No	4.1216	2.0608	2.0	900	1U rack-mount external GPUs, connect via 2× PCIe (×8 or ×16)
S2070 GPU Computing Server		July 25, 2011	4× GF100	575	1792	1150	—	GDDR5	4× 384	4× 6	3	4× 148.4	No	4.1216	2.0608	2.0	900
K10 GPU accelerator	Kepler	May 1, 2012	2× GK104	—	3,072	745	?	GDDR5	2× 256	2× 4	5,000	2× 160	No	4.577	0.1907	3.0	225	Internal PCIe GPU (full-height, dual-slot)
K20 GPU accelerator		November 12, 2012	1× GK110	—	2,496	706	758	GDDR5	320	5	5,200	208	No	3.524	1.175	3.5	225	Internal PCIe GPU (full-height, dual-slot)
K20X GPU accelerator		November 12, 2012	1× GK110	—	2,688	732	?	GDDR5	384	6	5,200	250	No	3.935	1.312	3.5	235	Internal PCIe GPU (full-height, dual-slot)
K40 GPU accelerator		October 8, 2013	1× GK110B	—	2,880	745	875	GDDR5	384	12	6,000	288	No	4.291–5.040	1.430–1.680	3.5	235	Internal PCIe GPU (full-height, dual-slot)
K80 GPU accelerator		November 17, 2014	2× GK210	—	4,992	560	875	GDDR5	2× 384	2× 12	5,000	2× 240	No	5.591–8.736	1.864–2.912	3.7	300	Internal PCIe GPU (full-height, dual-slot)
M4 GPU accelerator	Maxwell	November 10, 2015	1× GM206	—	1,024	872	1,072	GDDR5	128	4	5,500	88	No	1.786–2.195	0.05581–0.06861	5.2	50–75	Internal PCIe GPU (half-height, single-slot)
M6 GPU accelerator		August 30, 2015	1× GM204-995-A1	—	1536	722	1,051	GDDR5	256	8	4,600	147.2	No	2.218–3.229	0.0693–0.1009	5.2	75–100	Internal MXM GPU
M10 GPU accelerator		May 18th, 2016	4× GM107	—	2,560	1,033	?	GDDR5	4× 128	4× 8	5,188	4× 83	No	5.289	0.1653	5.2	225	Internal PCIe GPU (full-height, dual-slot)
M40 GPU accelerator		November 10, 2015	1× GM200	—	3,072	948	1,114	GDDR5	384	12 or 24	6,000	288	No	5.825–6.844	0.182–0.2139	5.2	250	Internal PCIe GPU (full-height, dual-slot)
M60 GPU accelerator		August 30, 2015	2× GM204-895-A1	—	4,096	899	1,178	GDDR5	2× 256	2× 8	5,000	2× 160	No	7.365–9.650	0.2301–0.3016	5.2	225–300	Internal PCIe GPU (full-height, dual-slot)
P4 GPU accelerator	Pascal	September 13, 2016	1× GP104	—	2,560	810	1,063	GDDR5	256	8	6,000	192.0	No	4.147–5.443	0.1296–0.1701	6.1	50-75	PCIe card
P6 GPU accelerator		March 24, 2017	1× GP104-995-A1	—	2,048	1,012	1,506	GDDR5	256	16	3,003	192.2	No	6.169	0.1928	6.1	90	MXM card
P40 GPU accelerator		September 13, 2016	1× GP102	—	3,840	1,303	1,531	GDDR5	384	24	7,200	345.6	No	10.007–11.758	0.3127–0.3674	6.1	250	PCIe card
P100 GPU accelerator (mezzanine)		April 5, 2016	1× GP100-890-A1	—	3,584	1,328	1,480	HBM2	4,096	16	1,430	732	No	9.519–10.609	4.760–5.304	6.0	300	SXM card
P100 GPU accelerator (16 GB card)		June 20, 2016	1× GP100	—		1126	1303		4,096	16		732	No	8,071‒9,340	4,036‒4,670		250	PCIe card
P100 GPU accelerator (12 GB card)		June 20, 2016	1× GP100	—		1126	1303		3,072	12		549	No	8.071‒9.340	4.036‒4.670		250	PCIe card
V100 GPU accelerator (mezzanine)	Volta	May 10, 2017	1× GV100-895-A1	—	5120	Unknown	1,455	HBM2	4,096	16 or 32	1,750	900	119.192	14.899	7.450	7.0	300	SXM card
V100 GPU accelerator (PCIe card)		June 21, 2017	1× GV100	—		Unknown	1,370			16 or 32	1,750	900	112.224	14.028	7.014		250	PCIe card
V100 GPU accelerator (PCIe FHHL card)		March 27, 2018	1× GV100	—		937	1,290			16	1,620	829.44	105.68	13.21	6.605		250	PCIe FHHL card
T4 GPU accelerator (PCIe card)	Turing	September 12, 2018	1× TU104-895-A1	—	2,560	585	1,590	GDDR6	256	16	5,000	320	64.8	8.1	Unknown	7.5	70	PCIe card
A2 GPU accelerator (PCIe card)	Ampere	November 10, 2021	1× GA107	—	1,280	1,440	1,770	GDDR6	128	16	6,252	200	18.124	4.531	0.14	8.6	40-60	PCIe card (half height, single-slot)
A10 GPU accelerator (PCIe card)		April 12, 2021	1× GA102-890-A1	—	9,216	885	1,695	GDDR6	384	24	6,252	600	124.96	31.24	0.976	8.6	150	PCIe card (single-slot)
A16 GPU accelerator (PCIe card)		April 12, 2021	4× GA107	—	4× 1,280	885	1,695	GDDR6	4× 128	4× 16	7,242	4× 200	4x 18.432	4× 4.608	1.0848	8.6	250	PCIe card (dual-slot)
A30 GPU accelerator (PCIe card)		April 12, 2021	1× GA100	—	3,584	930	1,440	HBM2	3,072	24	1,215	933.1	165.12	10.32	5.161	8.0	165	PCIe card (dual-slot)
A40 GPU accelerator (PCIe card)		October 5, 2020	1× GA102	—	10,752	1,305	1,740	GDDR6	384	48	7,248	695.8	149.68	37.42	1.168	8.6	300	PCIe card (dual-slot)
A100 GPU accelerator (PCIe card)		May 14, 2020	1× GA100-883AA-A1	—	6,912	765	1410	HBM2	5,120	40 or 80	1,215	1,555	312.0	19.5	9.7	8.0	250	PCIe card (dual-slot)
H100 GPU accelerator (PCIe card)	Hopper	March 22, 2022	1× GH100	—	14,592	1,065	1,755 CUDA 1620 TC	HBM2E	5120	80	1,000	2,039	756.449	51.2	25.6	9.0	350	PCIe card (dual-slot)
H100 GPU accelerator (SXM card)	Hopper	March 22, 2022	1× GH100	—	16,896	1,065	1,980 CUDA 1,830 TC	HBM3	5,120	80	1,500	3,352	989.43	66.9	33.5	9.0	700	SXM card
L40 GPU accelerator	Ada Lovelace	October 13, 2022	1× AD102	—	18,176	735	2,490	GDDR6	384	48	2,250	864	362.066	90.516	1.414	8.9	300	PCIe card (dual-slot)
L4 GPU accelerator	Ada Lovelace	March 21, 2023	1x AD104	—	7,424	795	2,040	GDDR6	192	24	1,563	300	121.0	30.3	0.49	8.9	72	HHHL single slot PCIe card

Notes

To calculate the processing power see Tesla (microarchitecture)#Performance, Fermi (microarchitecture)#Performance, Kepler (microarchitecture)#Performance, Maxwell (microarchitecture)#Performance, or Pascal (microarchitecture)#Performance. A number range specifies the minimum and maximum processing power at, respectively, the base clock and maximum boost clock.
Core architecture version according to the CUDA programming guide.
GPU Boost is a default feature that increases the core clock rate while remaining under the card's predetermined power budget. Multiple boost clocks are available, but this table lists the highest clock supported by each card.
^ Specifications not specified by Nvidia assumed to be based on the GeForce 8800 GTX
^ Specifications not specified by Nvidia assumed to be based on the GeForce GTX 280
^ Specifications not specified by Nvidia assumed to be based on the Quadro FX 5800
^ With ECC on, a portion of the dedicated memory is used for ECC bits, so the available user memory is reduced by 12.5%. (e.g. 4 GB total memory yields 3.5 GB of user available memory.)

References

Casas, Alex (19 May 2020). "NVIDIA Drops Tesla Brand To Avoid Confusion With Tesla". Wccftech. Retrieved 8 July 2020.
"NVIDIA Supercomputing Solutions".
"NVIDIA A100 GPUs Power the Modern Data Center". NVIDIA. Retrieved 8 July 2020.
"High Performance Computing - Supercomputing with Tesla GPUs".
"Professional Workstation Solutions".
Tesla Technical Brief (PDF)
"Nvidia chases defense, intelligence ISVs with GPUs". www.theregister.com. Retrieved 8 July 2020.
"Nvidia GPU Boost For Tesla" (PDF). January 2014. Retrieved 7 December 2015.
"Tesla C1060 Computing Processor Board" (PDF). Nvidia.com. Retrieved 11 December 2015.
"Difference between Tesla S1070 and S1075". 31 October 2008. Retrieved 29 January 2017. S1075 has one interface card
^ "Tesla C2050 and Tesla C2070 Computing Processor" (PDF). Nvidia.com. Retrieved 11 December 2015.
"Tesla M2050 and Tesla M2070/M2070Q Dual-Slot Computing Processor Modules" (PDF). Nvidia.com. Retrieved 11 December 2015.
"Tesla C2075 Computing Processor Board" (PDF). Nvidia.com. Retrieved 11 December 2015.
Hand, Randall (23 August 2010). "NVidia Tesla M2050 & M2070/M2070Q Specs OnlineVizWorld.com". VizWorld.com. Retrieved 11 December 2015.
"Tesla M2090 Dual-Slot Computing Processor Module" (PDF). Nvidia.com. Retrieved 11 December 2015.
"Tesla K10 GPU accelerator" (PDF). Nvidia.com. Retrieved 11 December 2015.
"Tesla K20 GPU active accelerator" (PDF). Nvidia.com. Retrieved 11 December 2015.
"Tesla K20 GPU accelerator" (PDF). Nvidia.com. Retrieved 11 December 2015.
"Tesla K20X GPU accelerator" (PDF). Nvidia.com. Retrieved 11 December 2015.
"Tesla K40 GPU accelerator" (PDF). Nvidia.com. Retrieved 11 December 2015.
"Tesla K80 GPU accelerator" (PDF). Images.nvidia.com. Retrieved 11 December 2015.
"Nvidia Announces Tesla M40 & M4 Server Cards - Data Center Machine Learning". Anandtech.com. Retrieved 11 December 2015.
^ "Accelerating Hyperscale Datacenter Applications with Tesla GPUs | Parallel Forall". Devblogs.nvidia.com. 10 November 2015. Retrieved 11 December 2015.
"Tesla M6" (PDF). Images.nvidia.com. Retrieved 28 May 2016.
"Tesla M10" (PDF). Images.nvidia.com. Retrieved 29 October 2016.
"Tesla M40" (PDF). Images.nvidia.com. Retrieved 11 December 2015.
"Tesla M60" (PDF). Images.nvidia.com. Retrieved 27 May 2016.
^ Smith, Ryan (13 September 2016). "Nvidia Announces Tesla P40 & Tesla P4 - Network Inference, Big & Small". Anandtech. Retrieved 13 September 2016.
"Tesla P6" (PDF). www.nvidia.com. Retrieved 7 March 2019.
"Tesla P6 Specs". www.techpowerup.com. Retrieved 7 March 2019.
Smith, Ryan (5 April 2016). "Nvidia Announces Tesla P100 Accelerator - Pascal GP100 for HPC". Anandtech.com. Anandtech.com. Retrieved 5 April 2016.
Harris, Mark. "Inside Pascal: Nvidia's Newest Computing Platform". Retrieved 13 September 2016.
^ Smith, Ryan (20 June 2016). "NVidia Announces PCI Express Tesla P100". Anandtech.com. Retrieved 21 June 2016.
^ Smith, Ryan (10 May 2017). "The Nvidia GPU Technology Conference 2017 Keynote Live Blog". Anandtech. Retrieved 10 May 2017.
^ Smith, Ryan (10 May 2017). "NVIDIA Volta Unveiled: GV100 GPU and Tesla V100 Accelerator Announced". Anandtech. Retrieved 10 May 2017.
^ Oh, Nate (20 June 2017). "NVIDIA Formally Announces V100: Available later this Year". Anandtech.com. Retrieved 20 June 2017.
"NVIDIA TESLA T4 TENSOR CORE GPU". NVIDIA. Retrieved 17 October 2018.
"NVIDIA Tesla T4 Tensor Core Product Brief" (PDF). www.nvidia.com. Retrieved 10 July 2019.
"NVIDIA TESLA A2 TENSOR CORE GPU".
"NVIDIA TESLA A10 TENSOR CORE GPU".
"NVIDIA TESLA A16 TENSOR CORE GPU".
"NVIDIA TESLA A30 TENSOR CORE GPU".
"NVIDIA TESLA A40 TENSOR CORE GPU".
"NVIDIA TESLA A100 TENSOR CORE GPU". NVIDIA. Retrieved 14 January 2021.
"NVIDIA Tesla A100 Tensor Core Product Brief" (PDF). www.nvidia.com. Retrieved 22 September 2020.
Smith, Ryan (14 May 2020). "NVIDIA Ampere Unleashed: NVIDIA Announces New GPU Architecture, A100 GPU, and Accelerator". AnandTech.
"NVIDIA H100 Tensor Core GPU". NVIDIA. Retrieved 15 April 2024.
Mujtaba, Hassan (22 March 2022). "NVIDIA Hopper GH100 GPU Unveiled: The World's First & Fastest 4nm Data Center Chip, Up To 4000 TFLOPs Compute, HBM3 3 TB/s Memory". Wccftech. Retrieved 15 April 2024.
"NVIDIA H100 PCIe 80 GB Specs". TechPowerUp. 21 March 2023. Retrieved 15 April 2024.
"NVIDIA L40 GPU for Data Center". NVIDIA. 18 May 2023. Retrieved 15 April 2024.
"NVIDIA L40 Specs". TechPowerUp. 13 October 2022. Retrieved 15 April 2024.
"NVIDIA L4 Tensor Core GPU". NVIDIA. Retrieved 15 April 2024.
"NVIDIA ADA GPU Architecture" (PDF). nvidia.com. Retrieved 15 April 2024.
"NVIDIA and Google Cloud Deliver Powerful New Generative AI Platform, Built on the New L4 GPU and Vertex AI". NVIDIA Corporation. 21 March 2023. Retrieved 15 April 2024.
"NVIDIA L4 Specs". TechPowerUp. 21 March 2023. Retrieved 15 April 2024.

External links

Nvidia

GeForce (List of GPUs)

Fixed pixel pipeline

Pre-GeForce

Vertex and pixel shaders

GeForce 3

4 Ti

Unified shaders

Unified shaders & NUMA

Ray tracing & Tensor Cores

Software and technologies

Multimedia acceleration	NVENC (video encoding) NVDEC (video decoding) PureVideo (video decoding)
Software	Cg (shading language) CUDA Nvidia GameWorks OptiX (ray tracing API) PhysX (physics SDK) Nvidia Omniverse (3D graphics) Nvidia RTX (ray tracing platform) Nvidia System Tools VDPAU (video decode API)
Technologies	Nvidia 3D Vision (stereo 3D) Nvidia G-Sync (variable refresh rate) Nvidia Optimus (GPU switching) Nvidia Surround (multi-monitor) MXM (module/socket) SXM (module/socket) NVLink (protocol) Scalable Link Interface (multi-GPU) TurboCache (framebuffer in system memory) Video Super Resolution (live video upscaling)
GPU microarchitectures	Celsius Kelvin Rankine Curie Tesla Fermi Kepler Maxwell Pascal Volta Turing Ampere Hopper Ada Lovelace Blackwell Rubin

Other products

Graphics Workstation cards	Nvidia Quadro Quadro Plex
GPGPU	Nvidia Tesla DGX
Console components	NV2A (Xbox) RSX 'Reality Synthesizer' (PlayStation 3) Tegra NX-SoC (Nintendo Switch)
Nvidia Shield	Shield Portable Shield Tablet Shield Android TV GeForce Now
SoCs and embedded	GoForce Drive Jetson Tegra
CPUs	Project Denver
Computer chipsets	nForce

Company

Key people	Jen-Hsun Huang Chris Malachowsky Curtis Priem David Kirk Bill Dally Debora Shoquist Ranga Jayaraman Jonah M. Alben
Acquisitions	3dfx Interactive Ageia ULi Bright Computing Cumulus Networks DeepMap Icera Mellanox Technologies Mental Images PortalPlayer Exluna MediaQ Stexar

Graphics processing unit

GPU

Desktop	Intel GT Xe Arc Nvidia GeForce Quadro Tesla Tegra AMD Radeon Radeon Pro Instinct Matrox InfiniteReality NEC µPD7220 3dfx Voodoo S3 Glaze3D Apple silicon
Mobile	Adreno Apple silicon Mali PowerVR VideoCore Vivante Imageon Intel 2700G

Architecture

Components

Memory

DMA
Framebuffer
SGRAM
- GDDR
- GDDR2
- GDDR3
- GDDR4
- GDDR5
- GDDR6
- GDDR7
HBM
- HBM2
- HBM2E
- HBM3
- HBM-PIM
- HBM3E
Memory bandwidth
Memory controller
Shared graphics memory
Texture memory
VRAM

Form factor

Performance

Misc

Categories:

Overview

Applications

Specifications

See also

References

External links