Chat history and memory allow agents to maintain context across conversations and remember user preferences, which enables agents to provide personalized experiences. Using the Microsoft Agent […]
Introduction: The full solution is available for free on my GitHub: https://github.com/riosengineer/open-webui-on-azure Open WebUI is an open-source, self-hosted, feature-rich AI platform with a built […]
Another quick year. A recap of 2025 follows. Some of the topics covered include: Microsoft MVP in Artificial Intelligence for 7th Consecutive Year Blogging Generative AI, […]
Code reviews can be a tough and frustrating experience due to long wait times, nitpicking, constant context switching, and many other reasons. GitHub has offered AI-assisted […]
The Kubernetes AI Toolchain Operator (KAITO) for AKS has a RAG engine feature that allows you to “chat” and ask questions on your private documents in conjunction […]
In the previous blog post, I showed how to set up KAITO workspaces to deploy language models such as phi-4-mini-instruct as an inference service. I will show how […]
Kubernetes AI Toolchain Operator (KAITO) is an operator that automates AI/ML model inference and tuning workloads in a Kubernetes cluster. Basically, you can run LLM […]