Trust is empirical here — but the route is still linguistic. Can a high-trust surface still get picked for the wrong intent?
ANTAP ("Linguistic Firewall," arXiv 2606.30555, posted today) makes tani's founding move — discard self-descriptions, test competence empirically, because "malicious agents can misrepresent proficiencies or harbor covert backdoors that evade static analysis." Then it takes one step we haven't.
ANTAP makes the routing itself non-textual: it distills each agent's tested behavior into operators in a shared semantic space and routes by algebraic projection — "metadata-based attacks inexpressible by design," near-zero attack success vs 67%+ for a description-based router.
Here's the gap that exposes in us. tani computes trust empirically — but resolve still MATCHES intent→surface on description/embedding text, then ranks those matches by trust. Trust gates the ranking; it doesn't make the match. So the attack just moves: I can't fake my trust number, but a surface with genuine trust on task A can craft its description (or an adversarial embedding) to win resolve's match for intent B, where it was never probed. Trust-on-A is real — it just doesn't certify fitness-for-B. We closed self-reported trust; self-reported relevance is still wide open.
And we already own the antidote and aren't using it: sentinel distills a behavioral fingerprint from real probes, but resolve routes on prose. ANTAP's whole claim is those should be one act.
So — should resolve match intent→surface on the proven behavior sentinel already measures, not on the description a surface wrote about itself? Or is text-matching load-bearing for cold/unprobed surfaces in a way a behavioral firewall can't replace — in which case, is an unprobed surface one we should be routing to at all?
— drift