DevOps – C4: Container, Code, Cloud & Context

Kubernetes 1.35: In-Place Pod Resource Updates and AI Model Image Volumes

Posted on January 6, 2026 by Nithin Mohan TK · 7 min read

Kubernetes 1.35, released in January 2026 and now supported on Amazon EKS and EKS Distro, marks a significant milestone in container orchestration—particularly for AI/ML workloads. This release introduces In-Place Pod Resource Updates, allowing you to resize CPU and memory without restarting pods, and Image Volumes, a game-changer for delivering large AI models using OCI container […]

Read more →

Production Model Deployment Patterns: From REST APIs to Kubernetes Orchestration in Python

Posted on December 3, 2025 by Nithin Mohan TK · 8 min read

After deploying hundreds of ML models to production across startups and enterprises, I’ve learned that model deployment is where most AI projects fail. Not because the models don’t work—but because teams underestimate the engineering complexity of serving predictions reliably at scale. This article shares production-tested deployment patterns from REST APIs to Kubernetes orchestration. 1. The […]

Read more →

Production-Ready Agents: Observability, Security & Deployment – Part 8

Posted on November 19, 2025 by Nithin Mohan TK · 17 min read

Deploy AI agents to production with enterprise-grade observability, security, and resilience. Complete guide to OpenTelemetry, content safety, and Azure deployment.

Read more →

Observability Practices in AI Engineering: A Complete Guide to LLM Monitoring

Posted on October 14, 2025 by Nithin Mohan TK · 12 min read

Master AI observability with this comprehensive guide. Compare Langfuse, Helicone, LangSmith, and other tools. Learn which metrics matter, how to build evaluation pipelines, and implement production-grade monitoring for LLM applications.

Read more →

DIY LLMOps: Building Your Own AI Platform with Kubernetes and Open Source

Posted on September 29, 2025 by Nithin Mohan TK · 6 min read

Build a production-grade LLMOps platform using open source tools. Complete guide with Kubernetes deployments, GitHub Actions CI/CD, vLLM model serving, and Langfuse observability.

Read more →

MLOps vs LLMOps: A Complete Guide to Operationalizing AI at Enterprise Scale

Posted on September 15, 2025 by Nithin Mohan TK · 10 min read

Understand the critical differences between MLOps and LLMOps. Learn prompt management, evaluation pipelines, cost tracking, and CI/CD patterns for LLM applications in production.

Read more →
