Agent Evaluation Playbook
Learn the process behind making reliable agents
Join me to learn more about “How to Think About Evaluating Agents” on July 8, in collaboration with DAIR.AI.
What we will cover:
- The Evaluation Playbook: From defining metrics to building an evaluation flywheel
- Integrated Observability: Building CI/CD-style pipelines for agents
- Agent Leaderboard v2: Our new leaderboard on balancing cost, latency, and performance in real-world agents
If you are looking to build a mature eval system, you'll walk away with concrete strategies for building high-performance AI agents.
July 8th
8:30 – 9:15 pm (India) and 8:00 – 8:45 am (PDT)
Register now


