Three months into production, our RAG system started failing at 2AM. Not gracefully—complete outages. The problem wasn’t the models or the embeddings. It was the architecture. After rebuilding it twice, here’s what I learned about building RAG systems that actually work in production. Figure 1: Production RAG Architecture Overview The Night Everything Broke It was […]
Read more →Category: Microsoft
Microsoft Corporation (NASDAQ: MSFT) is an American multinational corporation headquartered in Redmond, Washington, United States that develops, manufactures, licenses, and supports a wide range of products and services predominantly related to computing through its various product divisions. Established on April 4, 1975 to develop and sell BASIC interpreters for the Altair 8800, Microsoft rose to dominate the home computer operating system market with MS-DOS in the mid-1980s, followed by the Microsoft Windows line of operating systems.
Microsoft would also come to dominate the office suite market with Microsoft Office. The company has diversified in recent years into the video game industry with the Xbox and its successor, the Xbox 360 as well as into the consumer electronics and digital services market with Zune, MSN and the Windows Phone OS. The ensuing rise of stock in the company’s 1986 initial public offering (IPO) made an estimated three billionaires and 12,000 millionaires…
CDA (Clinical Document Architecture): The XML Standard for Medical Documents
What is CDA and Why It Matters CDA Document Structure Sample CDA Document Structure .NET CDA Parsing Implementation CDA Document Generation Common CDA Sections (C-CDA) CDA vs FHIR Documents Standards and References Related Articles in This Series Conclusion
Read more →Azure Machine Learning: A Solutions Architect’s Guide to Enterprise MLOps
The journey from experimental machine learning models to production-ready AI systems represents one of the most challenging transitions in modern software engineering. Having spent over two decades architecting enterprise solutions, I’ve witnessed the evolution from manual model deployment to sophisticated MLOps platforms. Azure Machine Learning stands at the forefront of this transformation, offering a comprehensive […]
Read more →Difference between workload managed identity, Pod Managed Identity and AKS Managed Identity
Azure Kubernetes Service(AKS) offers several options for managing identities within Kubernetes clusters, including AKS Managed Identity, Pod Managed Identity, and Workload Managed Identity. Here’s a comparison of these three options: Key Features AKS Managed Identity Pod Managed Identity Workload Managed Identity Overview A built-in feature of AKS that allows you to assign an Azure AD […]
Read more →Private Kubernetes cluster in AKS with Azure Private Link
Today, we’ll take a look at a new feature in AKS called Azure Private Link, which allows you to connect to AKS securely and privately over the Microsoft Azure backbone network. In the past, connecting to AKS from an on-premises network or other virtual network required using a public IP address, which posed potential security […]
Read more →Running LLMs on Kubernetes: Production Deployment Guide
Deploying LLMs on Kubernetes requires careful planning. After deploying 25+ LLM models on Kubernetes, I’ve learned what works. Here’s the complete guide to running LLMs on Kubernetes in production. Figure 1: Kubernetes LLM Architecture Why Kubernetes for LLMs Kubernetes offers significant advantages for LLM deployment: Scalability: Auto-scale based on demand Resource management: Efficient GPU and […]
Read more →