GKE and Cloud Run @Next’25

Abdellfetah SGHIOUAR
5 min readMar 24, 2025

--

Google Cloud Next 2025

Google Cloud Next 2025 is around the corner. Next is Google’s Cloud flagship event and this year it’s happening April 9–11 in Las Vegas, USA. Four days of technical content, product updates, demos and various activities.

I scrolled through the long agenda looking for GKE and Cloud Run content and curated it in this article to make it easier for you to find. Don’t forget to bookmark this article and come back to check it in a few weeks. Once the recordings of the sessions below are on Youtube I will list them here.

Keynotes

DEVKEY-Developer Keynote: You can just build things

As usual the highly anticipated developers Keynote with speakers like Richard Seroter, Stephanie Wong and Paige Baily and many more is looking very promising. So make sure you don’t miss that one.

Kubernetes Engine Breakout sessions

BRK2–177-From AWS to Google Cloud: Expand your cloud toolkit

If you are an AWS developer and looking for guidance to understand how to migrate your application to Google Cloud this is the session for you. The speakers will cover how to move an AWS application to GKE and moving a database to CloudSQL using the Database Migration Service.

BRK2–126-Build your next-generation AI/ML platform with Ray on GKE

Learn how to Leverage the best of Ray and Google Kubernetes Engine (GKE) to build your next-generation machine learning (ML) platform.

BRK2–128-Cluster Director with GKE: Optimal performance at max scale

Managing massive deployments of accelerators for AI and high performance computing (HPC) workloads can be complex. This talk dives into running AI-optimized Google Kubernetes Engine (GKE) clusters that streamline infrastructure provisioning, workload orchestration, and ongoing operations for tens of thousands of accelerators.

BRK2–083-Scale your ML platform from zero to hero

This session explores patterns for productionizing AI applications on Google Kubernetes Engine (GKE).

BRK2–084-How Shopify runs their biggest business event of the year with GKE

Join this session where Shopify engineers will discuss how they leverage the latest Google Kubernetes Engine (GKE) innovations to build robust, scalable platforms that not only handle everyday traffic with ease but also gracefully absorb unpredictable spikes during peak events like Black Friday and Cyber Monday.

BRK2–125-Serve open models on TPUs and GKE with superior portability and price-performance

Facing challenges with the cost and performance of your AI inference workloads? This talk presents TPUs and Google Kubernetes Engine (GKE) as a solution for achieving both high throughput and low latency while optimizing costs with open source models and libraries. Learn how to leverage TPUs to scale massive inference workloads efficiently.

BRK3–028-Monitor performance for LLM training and inference workloads on GKE

This session unveils how the Google Cloud Observability suite provides a comprehensive solution for monitoring leading AI model servers like Ray, NVIDIA Triton, vLLM, TGI, and others.

BRK3–034-How Anthropic is pushing the computing limits of AI at scale with GKE

In this session, we’ll explore Google’s latest developments in Google Kubernetes Engine (GKE) that enable unprecedented scale and performance for AI workloads. We’ll dive into how Anthropic leverages these capabilities to manage mega-scale Kubernetes clusters, orchestrate diverse workloads, and achieve breakthrough efficiency optimizations.

BRK3–032-Build an inferencing platform on GKE with Argo CD and fleets

This session provides a look into how Abridge built a secure and scalable inferencing platform on Google Kubernetes Engine (GKE). We’ll demonstrate how they leverage GKE fleets, Teams, Argo CD, and multi-cluster orchestration to manage and deploy inferencing workloads that span multiple clusters

Cloud Run Breakout sessions

BRK2–175-The ultimate Cloud Run guide: From zero to production

Developers love Cloud Run. In this demo-driven talk, you’ll discover why Cloud Run offers simplicity alongside flexibility for running your code. We’ll begin with a couple of basic getting-started concepts

BRK2–071-Run high-availability multi-region services with Cloud Run

Come join us as we take a deep dive into using Cloud Run for high-availability applications that are resilient to regional outages with no additional costs or complexity. Learn how you can minimize service disruptions to ensure your business continues to operate smoothly, even during a regional outage, with minimal toil on Cloud Run

BRK2–070-Unleash the power of serverless GPUs with Cloud Run

Dive into the world of serverless GPUs with Cloud Run. This talk explores how Cloud Run delivers on-demand GPUs with unprecedented flexibility and cost efficiency. Learn how you can achieve optimal performance and resource utilization with rapid autoscaling and scaling to zero

BRK2–063-Build a serverless toolkit: Empower developers with Cloud Run

Learn how a team of developers built a serverless toolkit with Cloud Run to simplify application development and deployment. This session shares best practices from Shopify for creating a robust toolkit that empowers developers to seamlessly ship serverless applications while integrating with essential Google Cloud services and adhering to security best practices. Discover how to enhance scalability, reduce toil, and boost productivity.

BRK2–065-Enterprise-grade security and scale for serverless workloads with Cloud Run

This session dives into the latest advancements in securing and managing your Cloud Run workloads at enterprise scale. Join us to learn about new features and techniques to meet the highest security standards, strategies for managing large-scale deployments, and solutions to common issues like IP exhaustion. Plus, one of our customers will share their firsthand experience managing a massive fleet of Cloud Run workloads.

BRK2–004-Master serverless gen AI with Gemini and Cloud Run

Join us for an interactive session where we’ll build, deploy, and scale inference apps. Imagine creating and launching generative AI apps that deliver personalized recommendations and stunning images, all with the unparalleled efficiency and scalability of serverless computing. You’ll learn how to build gen AI apps effortlessly using Gemini Code Assist; deploy gen AI apps in minutes on Cloud Run, using Vertex AI or on-demand, scale-to-zero serverless GPUs; and optimize the performance and cost of AI workloads by implementing best practices.

Have a good time at Next. I will be there so make sure to stop by and say HI :)

--

--

Abdellfetah SGHIOUAR
Abdellfetah SGHIOUAR

Written by Abdellfetah SGHIOUAR

Google Cloud Engineer with a focus on Serverless, Kubernetes, and Devops Methodologies. A supporter and contributor to OSS. Podcast Host @cloudcareers.dev

No responses yet