Oops! Something went wrong while submitting the form.
🚨
NEW
Industry
Democratizing AI Compute, Part 4: CUDA is the incumbent, but is it any good?
Answering the question of whether CUDA is “good” is much trickier than it sounds.
February 20, 2025
/
Chris Lattner
,
Read
🚨
NEW
🔥
Popular
Company
MAX 25.1 - Introducing MAX Builds
February 18, 2025
/
Modular Team
,
Read
🚨
NEW
Industry
Democratizing AI Compute, Part 3: How did CUDA succeed?
If we as an ecosystem hope to make progress, we need to understand how the CUDA software empire became so dominant.
February 12, 2025
/
Chris Lattner
,
Read
🚨
NEW
🔥
Popular
Product
Paged Attention & Prefix Caching Now Available in MAX Serve
PagedAttention & Prefix Caching Now Available in MAX Serve
February 6, 2025
/
Ehsan M. Kermani
,
Read
🚨
NEW
Industry
Democratizing AI Compute, Part 2: What exactly is “CUDA”?
February 5, 2025
/
Chris Lattner
,
Read
🚨
NEW
🔥
Popular
Industry
Democratizing AI Compute, Part 1: DeepSeek’s Impact on AI
Part 1 of an article that explores the future of hardware acceleration for AI beyond CUDA, framed in the context of the release of DeepSeek
January 30, 2025
/
Chris Lattner
,
Read
🚨
NEW
Developer
Agentic Building Blocks: Creating AI Agents with MAX Serve and OpenAI Function Calling
Agentic Building Blocks: Creating AI Agents with MAX Serve and OpenAI Function Calling
January 30, 2025
/
Ehsan M. Kermani
,
Read
🚨
NEW
Developer
Use MAX with Open WebUI for RAG and Web Search
Learn how quickly MAX and Open WebUI get you up-and-running with RAG, web search, and Llama 3.1 on GPU
January 23, 2025
/
Bill Welense
,
Read
🚨
NEW
Developer
Hands-on with Mojo 24.6
Mojo 24.6 introduces key improvements in argument conventions, memory management, and reference tracking, enhancing code clarity and safety with features like 'mut' for mutable arguments, 'origins' for references, and new collection types.
January 21, 2025
/
Ehsan M. Kermani
,
Read
🚨
NEW
Developer
Evaluating Llama Guard with MAX 24.6 and Hugging Face
Imagine unlocking a world of open innovation while ensuring secure, reliable, and enterprise-ready Gen AI deployments—MAX 24.6 enables enterprise AI teams to seamlessly run a vast range of cutting-edge AI models from Hugging Face on NVIDIA GPUs.