Back

EVENT

Cloud Native Day With Kubernetes

Cloud-native technologies are fast becoming an integral part of IT environments as organizations continuously accelerate their development efforts to meet business demands. Cloud-Native Days 2021 explores the cloud-native ecosystem beyond Kubernetes and ways in which organizations can leverage cloud-native technologies to move faster and more securely.Sessions will be geared toward practitioners, managers and C-level executives, and led by industry thought leaders and doers in the cloud-native space. Attendees will walk away with a better understanding of cloud-native and its impact on the IT landscape.Mohan Atreya - Rafay Systems' SVP of Products and Services presented the following:Streamlining Amazon EKS Operations & ManagementKubernetes is the de facto container orchestration tool, and Amazon AWS is the leading cloud platform. But when you have to rapidly scale your Kubernetes deployments in AWS, you may find that Amazon EKS demands skills and headcount that your organization doesn’t have. View this recording to learn:

How to simplify multi-cluster management, even across global AWS regions
How to use GitOps & Cluster Blueprints to automate deployments
How to secure access to clusters across your organization, and
How to gain single-pane-of-glass Kubernetes visibility of all of your clusters no matter where they reside: in AWS, on-prem, or at the edge.

Start a conversation

Request a demo

Teal geometric pattern with repeating triangular shapes forming an angular design on a white background.

Rafay's Valued Partnerships:

Featured Resources

Unlock Your AI Potential with Cisco and Rafay: Transform AI PODs into a Self-Service GPU Cloud

Cisco provides AI-optimized infrastructure. Rafay makes it usable across teams, tenants, and use cases in days.

Learn More

GPU cloud evaluation report

Evaluating how the Rafay Platform delivers a GPU cloud for enterprises and cloud service providers by PivotNine.

Learn More

From GPUs to Revenue: A Practical Guide to AI Factory Builds

This white paper breaks down what it actually takes to turn GPU investments into measurable business outcomes.

Learn More

AI Token Factory

AI Token Factory extends the Rafay Platform to deliver AI services through APIs and token-metered consumption. Production-ready AI APIs run on GPU infrastructure while maintaining governance, multi-tenancy, and operational control. Token-metered consumption provides visibility into usage and enables internal chargeback or monetization models.

Learn More

AI Factory FAQs

Learn how Rafay helps companies go from idle and expensive GPUs to building fully-scaled AI factories to accelerate AI and ML innovations.

Who uses AI factories?

AI factories are used by enterprises, cloud service providers, and sovereign AI clouds that need to scale AI workloads efficiently, maximize GPU utilization, and deliver AI as a production service rather than isolated projects. You can see how Rafay worked with Canadian telecommunications provider Telus in this case study.

What role does Rafay play in AI factories?

Rafay provides the control plane for AI factories, handling orchestration, multi-tenancy, governance, and self-service access to AI infrastructure across cloud, on-prem, and sovereign environments.

Is Rafay an AI factory?

Rafay is not a GPU manufacturer or model provider. Rafay provides an infrastructure orchestration and consumption platform that enables organizations to operate AI factories by turning AI infrastructure into a governed, self-service platform. Learn more about AI factories here: https://rafay.co/ai-and-cloud-native-blog/what-is-an-ai-factory

Does Rafay support NVIDIA NIMs/NIM?

Yes, Rafay supports NVIDIA NIM (NVIDIA Inference Microservices). NIM is NVIDIA’s proprietary solution for delivering packaged inferencing capabilities. It comes pre-configured with NVIDIA’s in-house models and has been optimized for use with a wide range of open-source models, including Meta’s Llama variants. While NIM is often viewed as an alternative to the open-source kServe package, Rafay’s platform supports both NIM and kServe. This flexibility allows customers to choose their preferred inference endpoint and deploy it effortlessly on GPU instances using the Rafay platform. By supporting multiple inferencing solutions, Rafay enables organizations to leverage the most suitable tools for their specific AI/ML needs while maintaining a consistent and manageable infrastructure.

How is Rafay different from Run.AI?

Run:AI focuses on providing fractional/virtualized GPU consumption and a proprietary scheduler optimized for AI/GenAI workloads, replacing the default Kubernetes scheduler. Rafay, however, provides a more comprehensive platform that manages the full lifecycle of underlying Kubernetes clusters and environments. Rafay offers an out-of-the-box experience to deploy and consume Run:AI on Rafay’s GPU PaaS, while also providing its own GPU virtualization and AI-friendly Kubernetes scheduler for customers preferring a single-vendor solution. Essentially, Rafay can either complement Run:AI’s offerings or provide a standalone solution that covers similar functionalities along with broader infrastructure management capabilities, giving customers flexibility in their AI infrastructure choices.

Does Rafay offer a GPU PaaS?

Yes, Rafay provides infrastructure orchestration and workflow automation for cloud-native (Kubernetes) and AI use cases for enterprises, cloud providers, neoclouds, and Sovereign AI clouds. Rafay helps companies deploy a Platform-as-a-Service (PaaS) experience that supports both CPU-only and GPU-accelerated compute environments. Platform teams can quickly set up and deliver customized self-service experiences for developers and data scientists, typically within days or weeks. This flexible platform allows end-users to easily access the computational resources they need, whether it’s standard CPU processing or more powerful GPU capabilities. Rafay’s solution streamlines the deployment and management of diverse computing environments, making it easier for organizations to support a wide range of applications, from standard software to complex AI/ML projects.