Nithin Mohan TK

Batch Inference Optimization: Maximizing Throughput and Minimizing Costs

Posted on February 8, 2025 by Nithin Mohan TK 18 min read

Introduction: Batch inference optimization is critical for cost-effective LLM deployment at scale. Processing requests individually wastes GPU resources—the model loads weights once but processes only a single sequence. Batching multiple requests together amortizes this overhead, dramatically improving throughput and reducing per-request costs. This guide covers the techniques that make batch inference efficient: dynamic batching strategies, […]

Read more →

GitOps with a comparison between Flux and ArgoCD and which one is better for use in Azure AKS

Posted on February 6, 2025 by Nithin Mohan TK 4 min read

GitOps has emerged as a powerful paradigm for managing Kubernetes clusters and deploying applications. Two popular tools for implementing GitOps in Kubernetes are Flux and ArgoCD. Both tools have similar functionalities, but they differ in terms of their architecture, ease of use, and integration with cloud platforms like Azure AKS. In this blog, we will […]

Read more →

Mastering Hybrid Cloud with Google Anthos: Unified Kubernetes Management Across Any Environment

Posted on February 5, 2025 by Nithin Mohan TK 10 min read

Introduction: Google Anthos provides a unified platform for managing applications across on-premises data centers, Google Cloud, and other cloud providers. This comprehensive guide explores Anthos’s enterprise capabilities, from GKE Enterprise and Config Management to Service Mesh and multi-cluster networking. After implementing hybrid cloud architectures for enterprises with complex compliance and data residency requirements, I’ve found […]

Read more →

HL7 v2: The Messaging Standard That Powers Healthcare IT

Posted on February 3, 2025 by Nithin Mohan TK 13 min read

Executive Summary HL7 v2.x remains the most widely deployed healthcare messaging standard globally, powering 95% of hospital interfaces despite being developed in the 1980s. This deep dive explores why HL7 v2 continues to dominate healthcare IT, how it works at a technical level, and how modern .NET developers can implement robust v2 interfaces for Irish […]

Read more →

LLM Monitoring and Alerting: Building Observability for Production AI Systems

Posted on February 3, 2025 by Nithin Mohan TK 20 min read

Introduction: LLM monitoring is essential for maintaining reliable, cost-effective AI applications in production. Unlike traditional software where errors are obvious, LLM failures can be subtle—degraded output quality, increased hallucinations, or slowly rising costs that go unnoticed until the monthly bill arrives. Effective monitoring tracks latency, token usage, error rates, output quality, and cost metrics in […]

Read more →

Machine Learning Fundamentals: A Comprehensive Guide to Enterprise AI Foundations

Posted on February 3, 2025 by Nithin Mohan TK 9 min read

Discover the foundations of machine learning from an enterprise architect’s perspective. Learn core ML concepts, the ML workflow, and practical Python implementations to kickstart your AI journey.

Read more →

Searching in

Author: Nithin Mohan TK

Batch Inference Optimization: Maximizing Throughput and Minimizing Costs

GitOps with a comparison between Flux and ArgoCD and which one is better for use in Azure AKS

Mastering Hybrid Cloud with Google Anthos: Unified Kubernetes Management Across Any Environment

HL7 v2: The Messaging Standard That Powers Healthcare IT

LLM Monitoring and Alerting: Building Observability for Production AI Systems