Oops! Something went wrong while submitting the form.
🚨
NEW
🔥
Popular
Company
MAX 25.1 - Introducing MAX Builds
February 18, 2025
/
Modular Team
,
Read
🚨
NEW
🔥
Popular
Industry
Democratizing AI Compute, Part 3: How did CUDA succeed?
If we as an ecosystem hope to make progress, we need to understand how the CUDA software empire became so dominant.
February 12, 2025
/
Chris Lattner
,
Read
🚨
NEW
🔥
Popular
Product
Paged Attention & Prefix Caching Now Available in MAX Serve
PagedAttention & Prefix Caching Now Available in MAX Serve
February 6, 2025
/
Ehsan M. Kermani
,
Read
🚨
NEW
🔥
Popular
Industry
Democratizing AI Compute, Part 2: What exactly is “CUDA”?
February 5, 2025
/
Chris Lattner
,
Read
🚨
NEW
🔥
Popular
Industry
Democratizing AI Compute, Part 1: DeepSeek’s Impact on AI
Part 1 of an article that explores the future of hardware acceleration for AI beyond CUDA, framed in the context of the release of DeepSeek
January 30, 2025
/
Chris Lattner
,
Read
🚨
NEW
🔥
Popular
Developer
Agentic Building Blocks: Creating AI Agents with MAX Serve and OpenAI Function Calling
Agentic Building Blocks: Creating AI Agents with MAX Serve and OpenAI Function Calling
January 30, 2025
/
Ehsan M. Kermani
,
Read
🚨
NEW
🔥
Popular
Developer
Use MAX with Open WebUI for RAG and Web Search
Learn how quickly MAX and Open WebUI get you up-and-running with RAG, web search, and Llama 3.1 on GPU
January 23, 2025
/
Bill Welense
,
Read
🚨
NEW
🔥
Popular
Developer
Hands-on with Mojo 24.6
Mojo 24.6 introduces key improvements in argument conventions, memory management, and reference tracking, enhancing code clarity and safety with features like 'mut' for mutable arguments, 'origins' for references, and new collection types.
January 21, 2025
/
Ehsan M. Kermani
,
Read
🚨
NEW
Developer
Evaluating Llama Guard with MAX 24.6 and Hugging Face
Imagine unlocking a world of open innovation while ensuring secure, reliable, and enterprise-ready Gen AI deployments—MAX 24.6 enables enterprise AI teams to seamlessly run a vast range of cutting-edge AI models from Hugging Face on NVIDIA GPUs.
December 19, 2024
/
Bill Welense
,
Read
🚨
NEW
🔥
Popular
Engineering
MAX GPU: State of the Art Throughput on a New GenAI platform
Measuring state of the art GPU performance compared to vLLM on Modular's MAX 24.6