Blog

Thank you! Your submission has been received!

Oops! Something went wrong while submitting the form.

🚨

NEW

Product

Modular 25.6: Unifying the latest GPUs from NVIDIA, AMD, and Apple

We’re excited to announce Modular Platform 25.6 – a major milestone in our mission to build AI’s unified compute layer. With 25.6, we’re delivering the clearest proof yet of our mission: a unified compute layer that spans from laptops to the world’s most powerful datacenter GPUs. The platform now delivers:

September 22, 2025

Modular Team

Read

🚨

NEW

Product

Modular Platform 25.5: Introducing Large Scale Batch Inference

Modular Platform 25.5 is here, and introduces Large Scale Batch Inference: a highly asynchronous, at-scale batch API built on open standards and powered by Mammoth. We're launching this new capability through our partner SF Compute, enabling high-volume AI performance with a fast, accurate, and efficient platform that seamlessly scales workloads across any hardware.

August 5, 2025

Modular Team

Read

🚨

NEW

Product

AI Agents for AWS Marketplace

Modular Inc. announces MAX High-Performance GenAI Serving and MAX Code Repo Agent now available in AWS Marketplace's new AI Agents and Tools category, delivering 10x performance improvements and streamlined AI deployment for enterprises.

July 16, 2025

Modular Team

Read

🚨

NEW

Product

Modular 25.4: One Container, AMD and NVIDIA GPUs, No Lock-In

We're excited to announce Modular Platform 25.4, a major release that brings the full power of AMD GPUs to our entire platform. This release marks a major leap toward democratizing access to high-performance AI by enabling seamless portability to AMD GPUs.

June 18, 2025

Modular Team

Read

🚨

NEW

Product

Introducing Mammoth: Enterprise-Scale GenAI Deployments Made Simple

Introducing Mammoth, a distributed AI serving tool built specifically for the realities of enterprise AI deployment.

June 10, 2025

Modular Team

Read

🚨

NEW

Product

Modular Platform 25.3: 450K+ Lines of Open Source Code and pip Packaging

Announcing Modular Platform 25.3: our largest open source release, with 450k+ lines of high-performance AI kernels, plus pip install modular.

May 6, 2025

Modular Team

Read

🚨

NEW

Product

A New, Simpler License for MAX and Mojo

New licensing terms for MAX and Mojo that allows for unlimited non-commercial usage

April 23, 2025

Modular Team

Read

🚨

NEW

Product

MAX 25.2: Unleash the power of your H200's–without CUDA!

We’re excited to announce MAX 25.2, a major update that unlocks industry-leading performance on the largest language models–built from the ground up without CUDA.

March 25, 2025

Modular Team

Read

🚨

NEW

Product

MAX 25.1 - Introducing MAX Builds

February 18, 2025

Modular Team

Read

🚨

NEW

Product

Paged Attention & Prefix Caching Now Available in MAX Serve

PagedAttention & Prefix Caching Now Available in MAX Serve

February 6, 2025

Ehsan M. Kermani

Read

Sign up for our newsletter

Get all our latest news, announcements and updates delivered directly to your inbox. Unsubscribe at anytime.

Thank you for your submission.

Your report has been received and is being reviewed by the Sales team. A member from our team will reach out to you shortly.

Thank you,

Modular Sales Team

Start building with Modular

Get started - Docs

Blog

Modular 25.6: Unifying the latest GPUs from NVIDIA, AMD, and Apple

Modular Platform 25.5: Introducing Large Scale Batch Inference

AI Agents for AWS Marketplace

Modular 25.4: One Container, AMD and NVIDIA GPUs, No Lock-In

Introducing Mammoth: Enterprise-Scale GenAI Deployments Made Simple

Modular Platform 25.3: 450K+ Lines of Open Source Code and pip Packaging

A New, Simpler License for MAX and Mojo

MAX 25.2: Unleash the power of your H200's–without CUDA!

MAX 25.1 - Introducing MAX Builds

Paged Attention & Prefix Caching Now Available in MAX Serve

Sign up for our newsletter

Quick start resources