Nvidia says its new AI tools are like a chip foundry for large language models

Nvidia's GPT language model drawing links between associated words.
(Image credit: Nvidia)

At Nvidia's GTC keynote today, CEO Jen-Hsun Huang announced that the company will soon be rolling out a collection of large language model (LLM) frameworks, known as Nvidia AI Foundations.

Jen-Hsun is so confident about the AI Foundations package, he's calling it a "TSMC for custom, large language models." Definitely not a comparison I was expecting to hear today, but I guess it fits alongside Huang's wistful comments about AI having had it's "iPhone moment."

The Foundations package includes the Picasso and BioNeMo services that will serve the media and medical industries respectively, as well as NeMo: a framework aimed at businesses looking to integrate large language models into their workflows.

NeMo is "for building custom language, text-to-text generative models" that can inform what Nvidia calls "intelligent applications".

With a little something called P-Tuning, companies will be able to train their own custom language models to create more apt branded content, compose emails with personalised writing styles, and summarise financial documents so us humans don't have waste away staring at numbers all day—that sounds like a nightmare for me.

Hopefully it'll take some weight off the everyman, and stop your boss shouting "BUNG IT IN THE CHATBOT THING," because that's supposedly faster.

Your next upgrade

(Image credit: Future)

Best CPU for gaming: The top chips from Intel and AMD
Best gaming motherboard: The right boards
Best graphics card: Your perfect pixel-pusher awaits
Best SSD for gaming: Get into the game ahead of the rest

NeMo's language models include 8 billion, 43 billion, 530 billion parameter versions, meaning there will be distinct tiers to pick from with vastly differing power levels.

For context, Chat GPT's original GPT-3 subsisted on 175 billion parameters, and although OpenAI isn't telling people how many parameters GPT-4 is working with at the moment, AX Semantics guesses around 1 trillion.

So, no it's not quite going to be a direct ChatGPT competitor, and may not have the same depth of parameters, but as a framework for designing large language models its sure going to change the face of every industry it touches. That's for certain.

Katie Wickens
Hardware Writer

Screw sports, Katie would rather watch Intel, AMD and Nvidia go at it. Having been obsessed with computers and graphics for three long decades, she took Game Art and Design up to Masters level at uni, and has been demystifying tech and science—rather sarcastically—for three years since. She can be found admiring AI advancements, scrambling for scintillating Raspberry Pi projects, preaching cybersecurity awareness, sighing over semiconductors, and gawping at the latest GPU upgrades. She's been heading the PCG Steam Deck content hike, while waiting patiently for her chance to upload her consciousness into the cloud.