Analysis Nvidia on Tuesday unveiled the Rubin CPX, a GPU designed specifically to accelerate extremely long-context AI workflows like those seen in code assistants such as Microsoft's GitHub Copilot, while simultaneously cutting back on pricey and power-hungry high-bandwidth memory (HBM).
The first indication that Nvidia might be moving in this direction came when CEO Jensen Huang unveiled Dynamo during his GTC keynote in spring. The framework brought mainstream attention to the idea of disaggregated inference.
As you may already be aware, inference on large language models (LLMs) can be broken into two phases: a compute-intensive prefill phase and a memory-bandwidth-bound decode phase.
Traditionally, both prefill and decode have taken place on the same GPU. Disaggregated inference splits the two phases across separate pools of hardware, so each can run on accelerators tuned to its particular bottleneck.
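To make the distinction concrete, here is a minimal toy sketch (not Nvidia's or Dynamo's implementation) of why the two phases stress different resources. The dimensions, the weight matrix W, and the simplified attention-score step are illustrative assumptions, not real model internals.

```python
import numpy as np

D = 1024           # hidden dimension, assumed for illustration
PROMPT_LEN = 4096  # a long-context prompt, the kind of workload Rubin CPX targets

# Stand-in for model weights (hypothetical, single projection for brevity)
W = np.random.randn(D, D).astype(np.float32)

def prefill(prompt_tokens: np.ndarray) -> np.ndarray:
    # Prefill: all prompt tokens are processed together in one large
    # matrix multiply. Arithmetic intensity is high, so the GPU's
    # compute units, not memory bandwidth, are the bottleneck.
    return prompt_tokens @ W  # (PROMPT_LEN, D) @ (D, D)

def decode_step(kv_cache: np.ndarray, new_token: np.ndarray) -> np.ndarray:
    # Decode: one token at a time, but the entire cached context must be
    # streamed from memory on every step, so throughput is limited by
    # memory bandwidth rather than raw FLOPS.
    return kv_cache @ new_token  # reads the whole cache per generated token

prompt = np.random.randn(PROMPT_LEN, D).astype(np.float32)
cache = prefill(prompt)                              # compute-heavy phase
scores = decode_step(cache, np.random.randn(D).astype(np.float32))  # bandwidth-heavy phase
```

The asymmetry is the point: run prefill on compute-rich parts and decode on bandwidth-rich parts, and neither sits idle waiting on the resource it doesn't need.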