SERVICES YOU CAN LAUNCH WITH THE RAFAY PLATFORM

Rafay-powered Bare Metal GPUs as a Service (BMaaS)

Rafay-powered Bare Metal GPUs as a Service (BMaaS) gives customers direct access to the host operating system on dedicated GPU servers, for teams running large distributed training jobs or highly opinionated software stacks that need control a virtualized environment cannot offer.

The platform delivers each server with secure access, networked with the customer's other rented servers and the right storage mounted, running with zero virtualization overhead for maximum performance on AI training, inference, and HPC.

Automated Bring-Up: Rafay matches the request to inventory, boots and discovers the server, installs the OS, and configures drivers and services.
Networking & Tenant Isolation: North-south through the Tenant Access Network, east-west over InfiniBand, with EVPN and Pkey isolation via NVIDIA UFM so one tenant's traffic never reaches another's.‍
Storage, Access & Firewall: Per-tenant storage mounted to the server, SSH via password or key, a public IP from the tenant pool, and automated security groups and NAT rules.

Request a demo

Download PDF

Teal geometric pattern with repeating triangular shapes forming an angular design on a white background.

Bare Metal-as-a-Service in the Rafay Platform

Check out the end-user experience in this quick click-through demonstration.

Learn more

Simplify and Automate GPU Operations

Rafay automates how GPU servers are deployed, managed, and scaled across multiple tenants and environments.

Automated Bare Metal Bring-Up

Match servers from inventory, reserve IPs, boot and discover, install the OS, and configure drivers and services in one workflow

Network Automation & Tenant Isolation

Per-tenant VRFs, north-south tenant access, and optional firewall rules, programmed automatically as tenants onboard

High-Speed Interconnects

InfiniBand and Ethernet with RDMA for low-latency distributed training and HPC

Latest GPU Hardware

NVIDIA H100, A100, L40S and more, up to 8 GPUs per server with NVLink and NVSwitch

“We are able to deliver new, innovative products and services to the global market faster and manage them cost-effectively with Rafay.”

Joe Vaughan

Chief Technology Officer

MoneyGram

Drive Efficiency and Business Value

Stand up a governed, multi-tenant bare metal service in weeks, not the months a hand-built platform requires

Move beyond commodity rentals with premium SKUs, performance SLAs, analytics, and integrated billing

Deliver air-gapped, compliant GPU environments tailored for regulated industries and national AI projects

Reallocate expensive GPUs to the next tenant in minutes, not weeks, with per-second billing to maximize utilization

Featured Resources

Operationalizing AI Fabrics with Aviz ONES, NVIDIA Spectrum-X, and Rafay

Discover the new AI operations model available to enterprises that enables self-service consumption and cloud-native orchestration for developers.

Learn More

The Definitive GPU PaaS Reference Architecture

Understand what it takes to deliver the right GPU infrastructure to your business.

Learn More

Unlock Your AI Potential with Cisco and Rafay: Transform AI PODs into a Self-Service GPU Cloud

Cisco provides AI-optimized infrastructure. Rafay makes it usable across teams, tenants, and use cases in days.

Learn More

The CIO’s guide to scalable, compliant, and developer-ready AI deployment

Orchestrating the future of AI: The CIO’s guide to scalable, compliant, and developer-ready AI deployment

Learn More

Rafay Named Outperformer in 2025 GigaOm Radar Report for Managed Kubernetes

The latest Radar report from GigaOm, Managed Kubernetes Rafay is ranked as an “Outperformer” for its solution.

Learn More

Building AI Value within Borders

Rafay's central orchestration platform facilitates efficient, self-service infrastructure and AI application management.

Learn More

GPU cloud evaluation report

Evaluating how the Rafay Platform delivers a GPU cloud for enterprises and cloud service providers by PivotNine.

Learn More

How Enterprise Platform Teams Can Accelerate AI/ML Initiatives

This paper explores the key challenges that organizations experience supporting these initiatives, as well as best practices for successfully leveraging Kubernetes to accelerate AI/ML projects.

Learn More

Whitepaper

Hybrid Cloud Meets Kubernetes

Learn how to Streamline Kubernetes Ops in Hybrid Clouds with AWS & Rafay

DOWNLOAD More Resources

Start a conversation with Rafay

Talk with Rafay experts to assess your infrastructure, explore your use cases, and see how teams like yours operationalize AI/ML and cloud-native initiatives with self-service and governance built in.

Start a Conversation