LangSmith¶
Status¶
π Todo β send sources when ready.
Learning roadmap¶
- Setup β API key, env vars, automatic tracing for LangChain
- Tracing β runs, projects, tags, metadata
- Tracing non-LangChain code with
@traceable - Datasets β creating from production traces or manually
- Evaluators β built-in (LLM-as-judge, exact match), custom
- Running evaluations β
evaluate()API - Playground β testing prompts against datasets
- Production monitoring & alerting
- Annotation queues for human review
- Cost & latency analysis
What I want to be able to do after this topic¶
- Get full visibility into any LLM pipeline I build.
- Build a regression eval suite that runs on every PR.
- Debug a misbehaving agent by tracing through its runs.
- Track cost and latency in production.