March 2024 – Page 2 – C4: Container, Code, Cloud & Context

Structured Output from LLMs: JSON Mode, Function Calling, and Instructor

Posted on March 12, 2024 by Nithin Mohan TK 8 min read

Introduction: Getting LLMs to return structured data instead of free-form text is essential for building reliable applications. Whether you need JSON for API responses, typed objects for downstream processing, or specific formats for data extraction, structured output techniques ensure consistency and parseability. This guide covers the major approaches: JSON mode, function calling, the Instructor library, […]

Read more →

LLM Testing and Evaluation: Building Confidence in AI Applications

Posted on March 8, 2024 by Nithin Mohan TK 11 min read

Introduction: LLM applications are notoriously hard to test. Outputs are non-deterministic, “correct” is often subjective, and traditional unit tests don’t apply. Yet shipping untested LLM features is risky—prompt changes can break functionality, model updates can degrade quality, and edge cases can embarrass your product. This guide covers practical testing strategies: building evaluation datasets, implementing automated […]

Read more →

Prompt Injection Defense: Sanitization, Detection, and Output Validation

Posted on March 5, 2024 by Nithin Mohan TK 3 min read

Introduction: Prompt injection is the most significant security vulnerability in LLM applications. Attackers craft inputs that manipulate the model into ignoring instructions, leaking system prompts, or performing unauthorized actions. Unlike traditional injection attacks, prompt injection exploits the model’s inability to distinguish between instructions and data. This guide covers practical defense strategies: input sanitization, injection detection, […]

Read more →

Streaming LLM Responses: Building Real-Time AI Applications

Posted on March 5, 2024 by Nithin Mohan TK 8 min read

Introduction: Waiting 10-30 seconds for an LLM response feels like an eternity. Streaming changes everything—users see tokens appear in real-time, creating the illusion of instant response even when generation takes just as long. Beyond UX, streaming enables early termination (stop generating when you have enough), progressive processing (start working with partial responses), and better error […]

Read more →

Searching in

Month: March 2024

Structured Output from LLMs: JSON Mode, Function Calling, and Instructor

LLM Testing and Evaluation: Building Confidence in AI Applications

Prompt Injection Defense: Sanitization, Detection, and Output Validation

Streaming LLM Responses: Building Real-Time AI Applications