All Articles  (X)

Clear
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.

🚨

NEW

🔥

Popular

Company

MAX 25.1 - Introducing MAX Builds

February 18, 2025

/

Modular Team

Read

🚨

NEW

🔥

Popular

Industry

Democratizing AI Compute, Part 3: How did CUDA succeed?

If we as an ecosystem hope to make progress, we need to understand how the CUDA software empire became so dominant.

February 12, 2025

/

Chris Lattner

Read

🚨

NEW

🔥

Popular

Product

Paged Attention & Prefix Caching Now Available in MAX Serve

PagedAttention & Prefix Caching Now Available in MAX Serve

February 6, 2025

/

Ehsan M. Kermani

Read

🚨

NEW

🔥

Popular

Industry

Democratizing AI Compute, Part 2: What exactly is “CUDA”?

February 5, 2025

/

Chris Lattner

Read

🚨

NEW

🔥

Popular

Industry

Democratizing AI Compute, Part 1: DeepSeek’s Impact on AI

Part 1 of an article that explores the future of hardware acceleration for AI beyond CUDA, framed in the context of the release of DeepSeek

January 30, 2025

/

Chris Lattner

Read

🚨

NEW

🔥

Popular

Developer

Agentic Building Blocks: Creating AI Agents with MAX Serve and OpenAI Function Calling

Agentic Building Blocks: Creating AI Agents with MAX Serve and OpenAI Function Calling

January 30, 2025

/

Ehsan M. Kermani

Read

🚨

NEW

🔥

Popular

Developer

Use MAX with Open WebUI for RAG and Web Search

Learn how quickly MAX and Open WebUI get you up-and-running with RAG, web search, and Llama 3.1 on GPU

January 23, 2025

/

Bill Welense

Read

🚨

NEW

🔥

Popular

Developer

Hands-on with Mojo 24.6

Mojo 24.6 introduces key improvements in argument conventions, memory management, and reference tracking, enhancing code clarity and safety with features like 'mut' for mutable arguments, 'origins' for references, and new collection types.

January 21, 2025

/

Ehsan M. Kermani

Read

🚨

NEW

Developer

Evaluating Llama Guard with MAX 24.6 and Hugging Face

Imagine unlocking a world of open innovation while ensuring secure, reliable, and enterprise-ready Gen AI deployments—MAX 24.6 enables enterprise AI teams to seamlessly run a vast range of cutting-edge AI models from Hugging Face on NVIDIA GPUs.

December 19, 2024

/

Bill Welense

Read

🚨

NEW

🔥

Popular

Engineering

MAX GPU: State of the Art Throughput on a New GenAI platform

Measuring state of the art GPU performance compared to vLLM on Modular's MAX 24.6

December 17, 2024

/

Max Hutchinson

Tyler Kenney

Read

🤔

No results for this query