#evals

2 pages tagged evals.

2/2

Prompting

Prompt engineering patterns, RAG, evaluations, few-shot, chain-of-thought, and structured output — foundational techniques for extracting reliable, structured behavior from LLMs.

05-25-2026#prompting#rag#llm

LLM Evaluations

Build production evaluation pipelines for LLM applications — golden datasets, LLM-as-judge, rubrics, statistical significance, regression detection, and evals vs tests.

05-25-2026#evals#evaluation#llm-as-judge