By Vivek Trivedy, Product Manager @ LangChain
Evals are training data for Agents
In classical machine learning, training data guides the model’s learning process. Each training example contributes a gradient that updates the model’s weights toward “correctness.” We have a similar learning loop for agents.
Evals encode the behavior we want our agent to exhibit in production. They’re the "training d...
