#evals
2 pages tagged evals.
2/2
Prompting
Prompt engineering patterns, RAG, evaluations, few-shot, chain-of-thought, and structured output — foundational techniques for extracting reliable, structured behavior from LLMs.
LLM Evaluations
Build production evaluation pipelines for LLM applications — golden datasets, LLM-as-judge, rubrics, statistical significance, regression detection, and evals vs tests.