The DGX GH200 AI supercomputer is targeted toward developing and supporting large language models. Google Cloud, Meta, and Microsoft already have access to it. Credit: Nvidia Nvidia has unveiled a new DGX GH200 AI supercomputer, underpinned by its new Grace Hopper superchip and targeted toward developing and supporting large language models. “DGX GH200 AI supercomputers integrate Nvidia’s most advanced accelerated computing and networking technologies to expand the frontier of AI,” Nvidia CEO Jensen Huang said in a blog post. The supercomputer, according to Huang, combines the company’s GH200 Grace Hopper superchip and Nvidia’s NVLink and Switch System, to allow the development of large language models for generative AI language applications, recommender systems, and data analytics workloads. Nvidia’s DGX GH200 uses the NVLink interconnect technology to combine 256 Grace Hopper superchips into a single graphics processing unit (GPU) in order to extract “1 exaflop of performance and 144 terabytes of shared memory — nearly 500 times more memory than the previous generation NVIDIA DGX A100, which was introduced in 2020.” The chip maker will emulate the strategy it took with its DGX Pods in making the new supercomputer available. Earlier in March, Huang said the company struck a deal to make its DGX systems available through multiple cloud providers, rather than installing the necessary hardware on-premises. Currently, Microsoft, Meta, and Google Cloud have access to the new supercomputer, the company said. Nvidia also said the new Grace Hopper superchip that fuels the DGX GH200 AI supercomputer is in full production mode and systems with the superchip are expected to be available later this year. The company also said that it was using the new Grace Hopper superchip to help SoftBank design next-generation distributed data centers that will be capable of handling generative AI and 6G applications. These data centers will be distributed across Japan, the companies said in a blog post. Earlier in March, the company had launched new data processing units (DPUs) and GPUs, including the BlueField 3 DPU. Related content news Cisco patches actively exploited zero-day flaw in Nexus switches The moderate-severity vulnerability has been observed being exploited in the wild by Chinese APT Velvet Ant. By Lucian Constantin Jul 02, 2024 1 min Network Switches Network Security news Nokia to buy optical networker Infinera for $2.3 billion Customers struggling with managing systems able to handle the scale and power needs of soaring generative AI and cloud operations is fueling the deal. By Evan Schuman Jul 02, 2024 4 mins Mergers and Acquisitions Networking news French antitrust charges threaten Nvidia amid AI chip market surge Enforcement of charges could significantly impact global AI markets and customers, prompting operational changes. By Prasanth Aby Thomas Jul 02, 2024 3 mins Technology Industry GPUs Cloud Computing news Lenovo adds new AI solutions, expands Neptune cooling range to enable heat reuse Lenovo’s updated liquid cooling addresses the heat generated by data centers running AI workloads, while new services help enterprises get started with AI. By Lynn Greiner Jul 02, 2024 4 mins Cooling Systems Generative AI Data Center PODCASTS VIDEOS RESOURCES EVENTS NEWSLETTERS Newsletter Promo Module Test Description for newsletter promo module. Please enter a valid email address Subscribe