Amped Up: HPC Centers Ride A100 GPUs to Accelerate Science

Supercomputers put AI in the loop, moving into the exascale era with the NVIDIA Ampere architecture.

by DION HARRIS

Six supercomputer centers around the world are among the first to adopt the NVIDIA Ampere architecture. They’ll use it to bring science into the exascale era in fields from astrophysics to virus microbiology.

The high performance computing centers scattered across the U.S. and Germany will use a total of nearly 13,000 A100 GPUs.

Together these GPUs pack more than 250 petaflops in peak performance for simulations that use 64-bit floating point math. For AI inference jobs that use mixed precision math and leverage the A100 GPU’s support for sparsity, they deliver a whopping 8.07 exaflops.

Researchers will harness that horsepower to drive science forward in many dimensions. They plan to simulate larger models, train and deploy deeper networks, and pioneer an emerging hybrid field of AI-assisted simulations.

Argonne deployed one of the first NVIDIA DGX-A100 systems. Photo courtesy of Argonne National Laboratory.

For example, Argonne’s researchers will seek a COVID-19 vaccine by simulating a key part of a protein spike on a coronavirus that’s made up of as many as 1.5 million atoms.

The molecule “is a beast, but the A100 lets us accelerate simulations of these subsystems so we can understand how this virus infects humans,” said Arvind Ramanathan, a computational biologist at Argonne National Laboratory that will use a cluster of 24 NVIDIA DGX A100 systems.

In other efforts, “we will see substantial improvement in drug discovery by scanning millions and billions of drugs at a time. And we may see things we could never see before, like how two proteins bind to one another,” he said.

A100 Puts AI in the Scientific Loop

“Much of this work is hard to simulate on a computer, so we use AI to intelligently guide where and when we will sample next,” said Ramanathan.

It’s part of an emerging trend of scientists using AI to steer simulations. The GPUs then will speed up the time to process biological samples by “at least two orders of magnitude,” he added.

Across the country, the National Energy Research Scientific Computing Center (NERSC) is poised to become the largest of the first wave of A100 users. The center in Berkeley, Calif., is working with Hewlett Packard Enterprise to deploy 6,200 of the GPUs in Perlmutter, its pre-exascale system.

“Across NERSC’s science and algorithmic areas, we have increased performance by up to 5x when comparing a single V100 GPU to a KNL CPU node on our current-generation Cori system, and we expect even greater gains with the A100 on Perlmutter,” said Sudip Dosanjh, NERSC’s director.

Exascale Computing Team Works on Simulations, AI

A team dedicated to exascale computing at NERSC has defined nearly 30 projects for Perlmutter that use large-scale simulations, data analytics or deep learning. Some projects blend HPC with AI, such as one using reinforcement learning to control light source experiments. Another employs generative models to reproduce expensive simulations at high-energy physics detectors.

Two of NERSC’s HPC applications already prototyped use of the A100 GPU’s double-precision Tensor Cores. They’re seeing significant increases in performance over previous generation Volta GPUs.

Software optimized for the 10,000-way parallelism Perlmutter’s GPUs offer will be ready to run on future exascale systems, Christopher Daley, an HPC performance engineer at NERSC said in a talk at GTC Digital. NERSC supports nearly a thousand scientific applications in areas such as astrophysics, Earth science, fusion energy and genomics.

“On Perlmutter, we need compilers that support all the programming models our users need and expect — MPI, OpenMP, OpenACC, CUDA and optimized math libraries. The NVIDIA HPC SDK checks all of those boxes,” said Nicholas Wright, NERSC’s chief architect.

German Effort to Map the Brain

AI will be the focus of some of the first applications for the A100 on a new 70-petaflops system designed by France’s Atos for the Jülich Supercomputing Center in western Germany.

One, called Deep Rain, aims to make fast, short-term weather predictions, complementing traditional systems that use large, relatively slow simulations of the atmosphere. Another project plans to construct an atlas of fibers in the human brain, assembled with deep learning from thousands of high-resolution 2D brain images.

The new A100 system at Jülich also will help researchers push the edges of understanding the strong forces binding quarks, the sub-atomic building blocks of matter. At the macro scale, a climate science project will model the Earth’s surface and subsurface water flow.

“Many of these applications are constrained by memory,” said Dirk Pleiter, a theoretical physicist who manages a research team in applications-oriented technology development at Jülich. “So, what is extremely interesting for us is the increased memory footprint and memory bandwidth of the A100,” he said.

The new GPU’s ability to accelerate double-precision math by up to 2.5x is another feature researchers are keen to harness. “I’m confident when people realize the opportunities of more compute performance, they will have a strong incentive to use GPUs,” he added.

Data-Hungry System Likes Fast NVLink

Some 230 miles south of Jülich, the Karlsruhe Institute of Technology (KIT) is partnering with Lenovo to build a new 17-petaflops system that will pack 740 A100 GPUs on an NVIDIA Mellanox 200 Gbit/s InfiniBand network. It will tackle grand challenges that include:

-Atmospheric simulations at the kilometer scale for climate science

-Research to fight COVID-19, including support for Folding@home

-Explorations of particle physics beyond the Higgs boson for the Large Hadron Collider

-Research on next-generation materials that could replace lithium-ion batteries

AI applications in robotics, language processing and renewable energy

“We focus on data-intensive simulations and AI workflows, so we appreciate the third-generation NVLink connecting the new GPUs,” said Martin Frank, director of KIT’s supercomputing center and a professor of computational science and math.

“We also look forward to the multi-instance GPU feature that effectively gives us up to 28 GPUs per node instead of four — that will greatly benefit many of our applications,” he added.

Just outside Munich, the computer center for the Max Planck Institute is creating with Lenovo a system called Raven-GPU, powered by 768 NVIDIA A100 GPUs. It will support work in fields like astrophysics, biology, theoretical chemistry and advanced materials science. The research institute aims to have Raven-GPU installed by the end of the year and is taking requests now for support porting applications to the A100.

Indiana System Counters Cybersecurity Threats

Finally, Indiana University is building Big Red 200, a 6 petaflops system expected to become the fastest university-owned supercomputer in the U.S. It will use 256 A100 GPUs.

Announced in June, it’s among the first academic centers to adopt the Cray Shasta technology from Hewlett Packard Enterprise that others will use in future exascale systems.

Big Red 200 will apply AI to counter cybersecurity threats. It also will tackle grand challenges in genetics to help enable personalized healthcare as well as work in climate modeling, physics and astronomy.

Photo at top: Shyh Wang Hall at UC Berkeley will be the home of NERSC’s Perlmutter supercomputer.

Supercomputers put AI in the loop, moving into the exascale era with the NVIDIA Ampere architecture.

by DION HARRIS

The high performance computing centers scattered across the U.S. and Germany will use a total of nearly 13,000 A100 GPUs.

Argonne deployed one of the first NVIDIA DGX-A100 systems. Photo courtesy of Argonne National Laboratory.

For example, Argonne’s researchers will seek a COVID-19 vaccine by simulating a key part of a protein spike on a coronavirus that’s made up of as many as 1.5 million atoms.

A100 Puts AI in the Scientific Loop

“Much of this work is hard to simulate on a computer, so we use AI to intelligently guide where and when we will sample next,” said Ramanathan.

It’s part of an emerging trend of scientists using AI to steer simulations. The GPUs then will speed up the time to process biological samples by “at least two orders of magnitude,” he added.

Exascale Computing Team Works on Simulations, AI

Two of NERSC’s HPC applications already prototyped use of the A100 GPU’s double-precision Tensor Cores. They’re seeing significant increases in performance over previous generation Volta GPUs.

German Effort to Map the Brain

AI will be the focus of some of the first applications for the A100 on a new 70-petaflops system designed by France’s Atos for the Jülich Supercomputing Center in western Germany.

Data-Hungry System Likes Fast NVLink

-Atmospheric simulations at the kilometer scale for climate science

-Research to fight COVID-19, including support for Folding@home

-Explorations of particle physics beyond the Higgs boson for the Large Hadron Collider

-Research on next-generation materials that could replace lithium-ion batteries

AI applications in robotics, language processing and renewable energy

“We also look forward to the multi-instance GPU feature that effectively gives us up to 28 GPUs per node instead of four — that will greatly benefit many of our applications,” he added.

Indiana System Counters Cybersecurity Threats

Finally, Indiana University is building Big Red 200, a 6 petaflops system expected to become the fastest university-owned supercomputer in the U.S. It will use 256 A100 GPUs.

Announced in June, it’s among the first academic centers to adopt the Cray Shasta technology from Hewlett Packard Enterprise that others will use in future exascale systems.

Photo at top: Shyh Wang Hall at UC Berkeley will be the home of NERSC’s Perlmutter supercomputer.

Amped Up: HPC Centers Ride A100 GPUs to Accelerate Science

etetewtgae

Top Rated

Mazda make global golf tournament ‘Mazda AJGA’ A Pathway to Pro Golf from U.S. to Thailand for the first time ever

Bridgestone Receives “The Best Supplier of Overall Performance in 2023 (Truck Business)” Award, As a Strong Partnership with Hino

FIRST BESPOKE LIMITED EDITION IN INDIA CURATED BY BENTLEY MULLINER

OUTRIGGER Koh Samui Beach Resort Introduces Exclusive Laser Tag Experience

BENTAYGA EWB - INFINITE CHOICE, CURATED BY MULLINER

Escape to Bliss: The Spa at The Standard, Hua Hin Launches a VIP Pass to Better Wellness

Be the first to test drive the Volvo EX30 at the 45th Bangkok International Motor Show

Nippon Express (South Asia & Oceania) to Exhibit at Future Mobility Asia

Kia Sales (Thailand) unveils the full line-up of The Kia EV5, Thailand’s first-ever all-electric versatile mid-size SUV, with special launch price starting from 1,249,000 baht, at the 45th Bangkok International Motor Show.

Continental Increases Earnings in 2023 and Targets Further Improvement This Year

Product Information: NEW MG CYBERSTER

MG gets closer to Young Consumers with #DareToBeYou, a Marketing Breakthrough on Self-Expression

OMODA & JAECOO Officially Launches in Thailand, Unveiling Four New Car Models to Provide Better Alternatives for Thai Drivers. Set to Hit the Market Mid-Year!

TSMC and Synopsys Bring Breakthrough NVIDIA Computational Lithography Platform to Production

Bridgestone Delivers Utmost High-Performance Driving Experience On-Road & Off-Road with Premium All-Terrain Tire, “BRIDGESTONE DUELER ALL-TERRAIN A/T002” in 10 Sizes

The Standard x Corona Sunsets: Songkran Edition is Here! Splash Into the Thai New Year at The Standard, Hua Hin

DLSS 3.5 and Full Ray Tracing Coming To Black Myth: Wukong, NARAKA: BLADEPOINT and Portal with RTX; Star Wars™ Outlaws Launching With DLSS 3 and Ray Tracing

Informa - Tarsus Group and the Rubber Authority of Thailand, are organizing "TyreXpo Asia 2024" with the goal of leading Thailand to become the hub of the rubber industry in ASEAN.

AI Decoded: Demystifying AI and the Hardware, Software and Tools That Power It

Shining Brighter Together: Google’s Gemma Optimized to Run on NVIDIA GPUs

Say What? Chat With RTX Brings Custom Chatbot to NVIDIA RTX AI PCs

Igniting the Future: TensorRT-LLM Release Accelerates AI Inference Performance, Adds Support for New Models Running on RTX-Powered Windows 11 PCs

Climate Solutions Prize: Continental Honors Winner of Tech Challenge on Pioneering Sustainable Materials

MG reaffirms MG4 ELECTRIC success with the launch of MG4 XPOWER with official price announcement at the Motor Show

Striking Performance: Large Language Models up to 4x Faster on RTX With TensorRT-LLM for Windows

MG launches three “100th Anniversary Special Edition” models in celebration of its significant milestone

“BRIDGESTONE ECOPIA EP150 with the Ultimate Customizationof Cutting-Edge ENLITEN® Technology” Selected as Original Equipment to Power “New Xpander HEV and New Xpander Cross HEV” from Mitsubishi Motors

NVIDIA DLSS & GeForce RTX: List Of All Games, Engines And Applications Featuring GeForce RTX-Powered Technology And Features

Mazda make global golf tournament ‘Mazda AJGA’ A Pathway to Pro Golf from U.S. to Thailand for the first time ever