• MLCommons introduces new AI benchmark tests measuring speed of AI chips and systems in generating responses from large language models.
  • Servers built on Nvidia’s H100 chips, including systems from Google, Supermicro, and Nvidia itself, led both new benchmarks in raw performance.

On Wednesday, MLCommons introduced new AI benchmark tests that measure how quickly hardware can respond to users’ queries, in an effort to gauge and improve efficiency.

New AI benchmark tests by MLCommons

On Wednesday, AI benchmark group MLCommons unveiled a series of tests and released results evaluating the speed and efficiency of top-tier hardware in responding to user interactions.

Among the new benchmarks introduced by MLCommons, two measure the responsiveness of AI chips and systems in generating outputs, offering insight into how quickly AI applications such as ChatGPT can respond to user queries.

Also read: Eliyan raises US$60 million for chiplet interconnects that speed up AI chips

One of the newly introduced benchmarks measures the speed of question-and-answer scenarios for large language models; it is built on Llama 2, a 70-billion-parameter model developed by Meta Platforms. MLCommons also expanded its MLPerf suite of benchmarking tools with a second test, a text-to-image generator based on Stability AI’s Stable Diffusion XL model.

Server performance showdown: Nvidia leads the pack

In terms of raw performance, servers equipped with Nvidia’s H100 chips, including those from Google, Supermicro, and Nvidia itself, stood out as frontrunners in the latest benchmarks.

Also read: Nvidia’s next-generation data centres to work with cloud providers

Several server manufacturers introduced designs based on Nvidia’s less powerful L40S chip, while Krai submitted a design for the image generation benchmark featuring a Qualcomm AI chip, which consumes less power than Nvidia’s state-of-the-art processors.

Intel also showcased its Gaudi2 accelerator chips, touting the results as impressive. However, while raw performance is vital, the energy consumption of advanced AI chips remains a significant challenge.