Editor's note: This post is part of the AI Decoded series, which demystifies AI by making the technology more accessible, and showcases new hardware, software, tools and accelerations for RTX PC users.

Skyscrapers start with strong foundations. The same goes for apps powered by AI.

A foundation model is an AI neural network trained on immense amounts of raw data, generally with unsupervised learning.

It's a type of artificial intelligence model trained to understand and generate human-like language. Imagine giving a computer a huge library of books to read and learn from, so it can understand the context and meaning behind words and sentences, just like a human does.


A foundation model's deep knowledge base and ability to communicate in natural language make it useful for a broad range of applications, including text generation and summarization, copilot production and computer code analysis, image and video creation, and audio transcription and speech synthesis.

ChatGPT, one of the most notable generative AI applications, is a chatbot built with OpenAI's GPT foundation model. Now in its fourth version, GPT-4 is a large multimodal model that can ingest text or images and generate text or image responses.

Online apps built on foundation models typically access the models from a data center. But many of these models, and the applications they power, can now run locally on PCs and workstations with NVIDIA GeForce and NVIDIA RTX GPUs.

Foundation Model Uses

Foundation models can perform a variety of functions, including:

  • Language processing: understanding and generating text
  • Code generation: analyzing and debugging computer code in many programming languages
  • Visual processing: analyzing and generating images
  • Speech: converting text to speech and transcribing speech to text

They can be used as is or with further refinement. Rather than training an entirely new AI model for each generative AI application - a costly and time-consuming endeavor - users commonly fine-tune foundation models for specialized use cases.

Pretrained foundation models are remarkably capable out of the box, and techniques such as prompting and data retrieval - notably retrieval-augmented generation, or RAG - extend what they can do without retraining. Foundation models also excel at transfer learning, which means they can be further trained to perform a second task related to their original purpose.

For example, a general-purpose large language model (LLM) designed to converse with humans can be further trained to act as a customer service chatbot capable of answering inquiries using a corporate knowledge base.
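To make that concrete, below is a minimal fine-tuning sketch using the open-source Hugging Face Transformers and Datasets libraries. The base model, the training file and the hyperparameters are placeholders, and real deployments often use parameter-efficient methods such as LoRA rather than updating every weight - this is not the workflow of any specific product mentioned in this post.

```python
# Minimal fine-tuning sketch (assumes the Hugging Face `transformers` and
# `datasets` packages; model name, data file and settings are placeholders).
from datasets import load_dataset
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          DataCollatorForLanguageModeling, Trainer,
                          TrainingArguments)

base_model = "mistralai/Mistral-7B-v0.1"       # any open pretrained foundation model
tokenizer = AutoTokenizer.from_pretrained(base_model)
tokenizer.pad_token = tokenizer.eos_token       # some tokenizers ship without a pad token
model = AutoModelForCausalLM.from_pretrained(base_model)

# Hypothetical training data: support Q&A text exported from a corporate
# knowledge base as JSON lines, each record holding a single "text" field.
dataset = load_dataset("json", data_files="support_examples.jsonl")["train"]

def tokenize(example):
    return tokenizer(example["text"], truncation=True, max_length=512)

tokenized = dataset.map(tokenize, remove_columns=dataset.column_names)

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="support-chatbot",
                           per_device_train_batch_size=1,
                           num_train_epochs=1),
    train_dataset=tokenized,
    # For causal LMs, this collator copies the inputs as labels, so the
    # pretrained weights are adjusted rather than learned from scratch.
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
```

The key point is that training starts from the pretrained checkpoint, so only a relatively small, task-specific dataset is needed to specialize the model.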

Enterprises across industries are fine-tuning foundation models to get the best performance from their AI applications.

Types of Foundation Models

More than 100 foundation models are in use - a number that continues to grow. LLMs and image generators are the two most popular types of foundation models. And many of them are free for anyone to try - on any hardware - in the NVIDIA API Catalog.

LLMs are models that understand natural language and can respond to queries. Google's Gemma is one example; it excels at text comprehension, transformation and code generation. When asked about the astronomer Cornelius Gemma, it shared that his "contributions to celestial navigation and astronomy significantly impacted scientific progress." It also provided information on his key achievements, legacy and other facts.

Building on the Gemma family and accelerated with NVIDIA TensorRT-LLM on RTX GPUs, Google's CodeGemma brings powerful yet lightweight coding capabilities to the community. CodeGemma models are available as 7B and 2B pretrained variants that specialize in code completion and code generation tasks.

Mistral AI's Mistral LLM can follow instructions, complete requests and generate creative text. In fact, it helped brainstorm the headline for this blog, including the requirement that it use a variation of the series' name "AI Decoded," and it assisted in writing the definition of a foundation model.

[Image: Hello, world, indeed.]

Meta's Llama 2 is a cutting-edge LLM that generates text and code in response to prompts.

Mistral and Llama 2 are available in the NVIDIA ChatRTX tech demo, running on RTX PCs and workstations. ChatRTX lets users personalize these foundation models by connecting them to personal content - such as documents, doctors' notes and other data - through RAG. It's accelerated by TensorRT-LLM for quick, contextually relevant answers. And because it runs locally, results are fast and secure.
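ChatRTX's internals aren't published here, but the RAG pattern it illustrates is simple: embed the user's documents, retrieve the passages most relevant to a question, and hand them to the LLM as context. The sketch below shows that pattern with the open-source sentence-transformers library; the documents, the embedding model choice and the `local_llm` call are hypothetical stand-ins, not ChatRTX's actual implementation.

```python
# Minimal retrieval-augmented generation (RAG) sketch.
# Assumes the open-source `sentence-transformers` and `numpy` packages;
# the documents are placeholders, and `local_llm` stands in for whatever
# locally hosted model actually answers the prompt.
import numpy as np
from sentence_transformers import SentenceTransformer

documents = [
    "Invoices are archived under Finance > 2024 > Q1.",
    "The VPN client must be restarted after every password change.",
    "Expense reports are due on the last business day of the month.",
]

embedder = SentenceTransformer("all-MiniLM-L6-v2")
doc_vectors = embedder.encode(documents, normalize_embeddings=True)

def retrieve(question: str, k: int = 2) -> list[str]:
    """Return the k documents most similar to the question."""
    q = embedder.encode([question], normalize_embeddings=True)[0]
    scores = doc_vectors @ q          # cosine similarity (vectors are normalized)
    return [documents[i] for i in np.argsort(scores)[::-1][:k]]

question = "When are expense reports due?"
context = "\n".join(retrieve(question))
prompt = f"Answer using only this context:\n{context}\n\nQuestion: {question}"
# The prompt would then be sent to a locally running LLM, e.g.:
# answer = local_llm.generate(prompt)
```

Because retrieval happens over the user's own files and the model runs locally, the documents never have to leave the PC.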

Image generators like Stability AI's Stable Diffusion XL and SDXL Turbo let users create stunning, realistic visuals from text prompts. Stability AI's video generator, Stable Video Diffusion, uses a generative diffusion model to synthesize video sequences with a single image as a conditioning frame.
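For readers who want to try text-to-image generation on their own RTX hardware, a minimal sketch with the open-source Diffusers library looks like the following; the prompt and output path are just examples.

```python
# Sketch of local image generation with Stable Diffusion XL using the
# open-source `diffusers` library; prompt and file name are placeholders.
import torch
from diffusers import StableDiffusionXLPipeline

pipe = StableDiffusionXLPipeline.from_pretrained(
    "stabilityai/stable-diffusion-xl-base-1.0",
    torch_dtype=torch.float16,        # half precision to fit consumer GPU memory
).to("cuda")                          # runs on the local RTX GPU

image = pipe("a glass skyscraper at sunrise, photorealistic").images[0]
image.save("skyscraper.png")
```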

Multimodal foundation models can simultaneously process more than one type of data - such as text and images - to generate more sophisticated outputs.

A multimodal model that works with both text and images could let users upload an image and ask questions about it. These types of models are quickly working their way into real-world applications like customer service, where they can serve as faster, more user-friendly versions of traditional manuals.
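As an illustration of that pattern, the sketch below asks a question about a photo using an open visual-question-answering checkpoint (BLIP) from Hugging Face Transformers; the image path and question are hypothetical, and any image-and-text foundation model could fill the same role.

```python
# Sketch of visual question answering with an open multimodal model.
# Assumes the Hugging Face `transformers` and `Pillow` packages; the
# image file is a placeholder for a photo supplied by the user.
from PIL import Image
from transformers import BlipProcessor, BlipForQuestionAnswering

processor = BlipProcessor.from_pretrained("Salesforce/blip-vqa-base")
model = BlipForQuestionAnswering.from_pretrained("Salesforce/blip-vqa-base")

image = Image.open("washing_machine_panel.jpg").convert("RGB")  # hypothetical photo
question = "Which button starts the quick-wash cycle?"

inputs = processor(image, question, return_tensors="pt")
output_ids = model.generate(**inputs)
print(processor.decode(output_ids[0], skip_special_tokens=True))
```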

[Image: Many foundation models are free to try - on any hardware - in the NVIDIA API Catalog.]

Kosmos-2 is Microsoft's groundbreaking multimodal model designed to understand and reason about visual elements in images.

Think Globally, Run AI Models Locally

GeForce RTX and NVIDIA RTX GPUs can run foundation models locally.

The results are fast and secure. Rather than relying on cloud-based services, users can harness apps like ChatRTX to process sensitive data on their local PC without sharing the data with a third party or needing an internet connection.
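As a rough sketch of what local inference looks like in code, the example below downloads an open LLM once and then runs it entirely on the local GPU with the Hugging Face Transformers pipeline; the model name is just one example of an openly available checkpoint.

```python
# Sketch of running an open foundation model locally with the Hugging Face
# `transformers` pipeline; the model is one example of an open checkpoint.
import torch
from transformers import pipeline

generator = pipeline(
    "text-generation",
    model="mistralai/Mistral-7B-Instruct-v0.2",  # downloaded once, then cached locally
    torch_dtype=torch.float16,                   # half precision for consumer GPUs
    device_map="auto",                           # places the weights on the local GPU
)

result = generator("Explain what a foundation model is in one sentence.",
                   max_new_tokens=60)
print(result[0]["generated_text"])               # prompt and output stay on the machine
```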

Users can choose from a rapidly growing catalog of open foundation models to download and run on their own hardware. This lowers costs compared with using cloud-based apps and APIs, and it eliminates latency and network connectivity issues.

Generative AI is transforming gaming, videoconferencing and interactive experiences of all kinds. Make sense of what's new and what's next by subscribing to the AI Decoded newsletter.

Categories: Generative AI
Tags: AI Decoded | Artificial Intelligence | GeForce | NVIDIA RTX

