Dedicated GPU Memory - Search News

Nvidia shrinks LLM memory 20x without changing model weights

Nvidia's KV Cache Transform Coding (KVTC) compresses LLM key-value cache by 20x without model changes, cutting GPU memory costs and time-to-first-token by up to 8x for multi-turn AI applications.

Nvidia BlueField-4 STX adds a context memory layer to storage to close the agentic AI throughput gap

Nvidia's BlueField-4 STX reference architecture inserts a dedicated context memory layer between GPUs and traditional storage, claiming 5x token throughput and 4x energy efficiency for agentic AI ...

Kioxia Announces New SSD Model Optimized for AI GPU-Initiated Workloads

"Kioxia fully supports the NVIDIA Storage-Next initiative and will deliver purpose-built SSDs to effectively address the need for GPU-accessible memory," said Makoto Hamada, Senior Director of the SSD ...

Hosted on MSN

Intel is following AMD in adding a crucial feature to Core Ultra — especially if you're using local AI

AMD has had a feature on its APUs for a while now that's attractive not just to gamers, but also to local AI users: Variable Graphics Memory. Now, Intel is following suit by adding a similar feature to ...

Ars Technica

SteamOS tested on dedicated GPUs: No, it’s not always faster than Windows

I wrote a couple of weeks ago about my personal homebrew Steam Machine, a self-built desktop under my TV featuring an AMD Ryzen 7 8700G processor and a Radeon 780M integrated GPU. I wouldn’t recommend ...

CIO

The memory demand crunch: creating a device strategy that meets the challenge

A structural shift in memory supply driven by AI infrastructure demand is pushing device costs sharply higher. Here's what CIOs need to know right now – and what they need to do.
