The Kubernetes AI Toolchaing Operator (AKS) for AKS has a RAG engine feature that allows to “chat” and ask questions on your private documents in conjuction […]
In the previous blog post, I have shown the setup of KAITO workspaces in deploying language models such as phi-4-mini-instruct inference service. I will show how […]
Reading Time: 10 minutesWhilst at KubeCon Atlanta, the Kubernetes project announced something significant: Ingress NGINX is being retired in March 2026. The project is winding down the […]
Kubernetes AI Toolchain Operator (KAITO) is an operator that automates the AI/ML model inference or tuning workload in a Kubernetes cluster. Basically you can run LLM […]
Reading Time: 5 minutesRecently while browsing through the Azure CLI extensions repo (yes, that’s how I spend my evenings), I stumbled across a pair of pull […]
This post will explore the new support in Azure Application Gateway for Containers (AGC) for Web Application Firewall (WAF) as documented in https://aka.ms/agc/waf. This blog is part of a […]
Reading Time: 5 minutesRecently, I was tasked with implementing distributed tracing for a microservices platform running on Azure Kubernetes Service (AKS). The requirements were clear: use […]
Reading Time: 2 minutesIntro I recently ran into capacity issues in Azure and saw firsthand how AKS VMSS node pools behave when you use deallocated nodes […]