Skip to content

LangSmith

Status

πŸ“‹ Todo β€” send sources when ready.

Learning roadmap

  • Setup β€” API key, env vars, automatic tracing for LangChain
  • Tracing β€” runs, projects, tags, metadata
  • Tracing non-LangChain code with @traceable
  • Datasets β€” creating from production traces or manually
  • Evaluators β€” built-in (LLM-as-judge, exact match), custom
  • Running evaluations β€” evaluate() API
  • Playground β€” testing prompts against datasets
  • Production monitoring & alerting
  • Annotation queues for human review
  • Cost & latency analysis

What I want to be able to do after this topic

  • Get full visibility into any LLM pipeline I build.
  • Build a regression eval suite that runs on every PR.
  • Debug a misbehaving agent by tracing through its runs.
  • Track cost and latency in production.