The Kubernetes Current Blog

Announcing Rafay’s Templates for AI and Generative AI

We constantly hear from our customers about wanting their developers to experiment with Generative AI. No organization wants to be left behind and they are all trying to find ways to empower their developers and application teams to be able to experiment with use cases powered especially by Generative AI.

According to recent Gartner research, > 80% of enterprises will have used Generative AI APIs or Deployed Generative AI-Enabled Applications by 2026.

We have been listening to our customers and are happy to announce Rafay’s Templates for AI & Generative AI. Platform teams can now provide their developers with a self service experience for infrastructure so that developers can experiment with new and innovative AI and Generative AI use cases.

Requirements for Self Service

In our conversations with platform teams, developers and key technology partners, a few key requirements bubbled up to the top as critical requirements to provide this self service experience with transparent enforcement of critical controls.

  1. Developer Self Service: This was emphasized as the most important. They wanted a self service experience for their developers because they do not want any bottlenecks for experimentation. Platform and Ops teams are also swamped supporting other priorities.
  2. Powered by Standards based IaC: Organizations have made significant investments in Infrastructure as Code (IaC) and they they wanted the self service experience to be backed by their preferred IaC such as Terraform.
  3. Infrastructure Provider: Most of the organizations that we spoke with were either on AWS or Azure.
  4. Access to Multiple Models: We consistently heard that organizations would like to experiment with different models for different use cases. Given how fast the Generative AI landscape is evolving, it is sensible to not be locked into a provider that can only support a single model.
  5. Customize the Model: Organizations mentioned that they need the ability to further tune/train a foundational model with custom data to ensure it can be optimized for their use case.
  6. Security Organizations said they were uncomfortable about using public/open models until they have guarantees and clarity on whether their data would not be used for public use.

Our first version of the “Templates for AI and Gen AI” is focused on AWS as the infrastructure provider.

We will also be releasing updates to the templates with a focus on Azure in a few weeks.

Self Service Experience

This is essentially a “2-step” process. The first step involves the platform engineer publishing the environment templates for the organization. The second step involves the developer/data scientist “consuming” the environment templates to provision the environments so that they can use it. The diagram below shows the high level steps.

Rafay's Templates for AI and Generative AI

Learn More/Try It

Are you interested in learning more about Rafay’s “Templates for AI and Gen AI”?

  1. Read through the documentation for the Templates form
  2. Watch videos of some of our examples
  3. Schedule a demo
  4. Meet us and watch a live demo at upcoming conferences and industry events
  5. If you would like to try this yourself, you can sign up for a Free Org.
The Infrastructure Templates for AI and Gen AI require Rafay Environment Manager. This is currently feature flagged and needs to be enabled in your Org. Contact us to get Environment Manager enabled in your Org.


Trusted by leading companies