For in-browser agents, it might be interesting to think of ways in which developers can measure how their WebMCP implementation is doing.
While theoretically they can instrument their tool calls to see how often the agent calls them, they won't be able to know if that tool call actually corresponded to what the user was trying to achieve.
A way to measure the latter (e.g. by reflecting user's thumbs up/down votes to the site) could help enable some measurement of changes on either the WebMCP implementation or the agent's.
For in-browser agents, it might be interesting to think of ways in which developers can measure how their WebMCP implementation is doing.
While theoretically they can instrument their tool calls to see how often the agent calls them, they won't be able to know if that tool call actually corresponded to what the user was trying to achieve.
A way to measure the latter (e.g. by reflecting user's thumbs up/down votes to the site) could help enable some measurement of changes on either the WebMCP implementation or the agent's.