How we stopped YOLOing our MCP tool descriptions with role-play-based evals hume.ai 1 points by twitchard 2 days ago