April 2024 – Page 2 – C4: Container, Code, Cloud & Context

Testing LLM Applications: Unit Tests, Integration Tests, and Evaluation

Posted on April 8, 2024 by Nithin Mohan TK 14 min read

Introduction: Testing LLM applications presents unique challenges compared to traditional software. Outputs are non-deterministic, quality is subjective, and the same input can produce different but equally valid responses. This guide covers practical testing strategies: unit testing with mocked LLM responses, integration testing with real API calls, evaluation frameworks for quality assessment, and regression testing to […]

Read more →

Vector Search Optimization: HNSW, IVF, and Hybrid Retrieval

Posted on April 8, 2024 by Nithin Mohan TK 12 min read

Introduction: Vector search powers semantic retrieval in RAG systems, recommendation engines, and similarity search applications. But naive vector search doesn’t scale—searching millions of vectors with brute force is too slow for production. This guide covers optimization techniques: HNSW indexes for fast approximate search, IVF partitioning for large datasets, product quantization for memory efficiency, hybrid search […]

Read more →

Introduction to Tokenization

Posted on April 6, 2024 by Nithin Mohan TK 5 min read

The moment I truly understood tokenization was not when I read about it in a textbook, but when I watched a production NLP pipeline fail catastrophically because of an edge case the tokenizer could not handle. After two decades of building enterprise systems, I have learned that tokenization—the seemingly simple act of breaking text into […]

Read more →

Function Calling Deep Dive: Building LLM-Powered Tools and Agents

Posted on April 1, 2024 by Nithin Mohan TK 9 min read

Introduction: Function calling transforms LLMs from text generators into action-taking agents. Instead of just describing what to do, the model can actually do it—query databases, call APIs, execute code, and interact with external systems. OpenAI’s function calling (now called “tools”) and similar features from Anthropic and others let you define available functions, and the model […]

Read more →

Searching in

Month: April 2024

Testing LLM Applications: Unit Tests, Integration Tests, and Evaluation

Vector Search Optimization: HNSW, IVF, and Hybrid Retrieval

Introduction to Tokenization

Function Calling Deep Dive: Building LLM-Powered Tools and Agents