*{margin:0;padding:0;box-sizing:border-box} body{font-family:system-ui,-apple-system,sans-serif;color:#1a1a1a;background:#fff;line-height:1.6;font-size:15px} a{color:#0d9488;text-decoration:none} a:hover{color:#0f766e;text-decoration:underline} code,pre{font-family:ui-monospace,'SF Mono','Cascadia Mono',monospace} code{background:#f5f5f5;padding:1px 5px;font-size:0.9em} pre{background:#f5f5f5;padding:16px;overflow-x:auto;font-size:13px;line-height:1.5;border:1px solid #e5e7eb} h1,h2,h3,h4{font-weight:700;line-height:1.3} h1{font-size:1.6rem;margin-bottom:8px} h2{font-size:1.2rem;margin:24px 0 8px;padding-top:16px;border-top:1px solid #e5e7eb} h3{font-size:1rem;margin:16px 0 6px} table{width:100%;border-collapse:collapse;font-size:14px;margin:12px 0} th{text-align:left;padding:8px 12px;border-bottom:2px solid #e5e7eb;color:#6b7280;font-weight:600;font-size:13px} td{padding:8px 12px;border-bottom:1px solid #e5e7eb} tr:nth-child(even){background:#f9fafb} .container{max-width:960px;margin:0 auto;padding:0 20px} nav{border-bottom:1px solid #e5e7eb;padding:12px 0} nav .inner{max-width:960px;margin:0 auto;padding:0 20px;display:flex;align-items:center;justify-content:space-between} nav .logo{font-weight:700;font-size:1.1rem;color:#0d9488;text-decoration:none} nav .logo:hover{text-decoration:none} nav .links{display:flex;gap:20px;font-size:14px} nav .links a{color:#6b7280} nav .links a:hover{color:#0d9488;text-decoration:none} footer{border-top:1px solid #e5e7eb;padding:20px 0;margin-top:40px;font-size:13px;color:#6b7280} footer .inner{max-width:960px;margin:0 auto;padding:0 20px} .pill{display:inline-block;padding:1px 8px;font-size:12px;font-weight:600;border:1px solid #e5e7eb} .pill-green{background:#ecfdf5;color:#065f46;border-color:#a7f3d0} .pill-yellow{background:#fffbeb;color:#92400e;border-color:#fde68a} .pill-red{background:#fef2f2;color:#991b1b;border-color:#fecaca} .pill-gray{background:#f9fafb;color:#6b7280;border-color:#e5e7eb} .breadcrumb{font-size:13px;color:#6b7280;margin:16px 0 12px} .breadcrumb a{color:#6b7280} .breadcrumb a:hover{color:#0d9488} .search-box{display:flex;gap:8px;margin:16px 0} .search-box input{flex:1;padding:8px 12px;border:1px solid #e5e7eb;font-size:14px;font-family:system-ui,-apple-system,sans-serif;outline:none} .search-box input:focus{border-color:#0d9488} .search-box button{padding:8px 20px;background:#0d9488;color:#fff;border:none;font-size:14px;font-weight:600;cursor:pointer;font-family:system-ui,-apple-system,sans-serif} .search-box button:hover{background:#0f766e} .stat-row{display:flex;gap:32px;margin:16px 0;flex-wrap:wrap} .stat-item .num{font-family:ui-monospace,'SF Mono',monospace;font-size:1.4rem;font-weight:700} .stat-item .label{font-size:12px;color:#6b7280} .card{border:1px solid #e5e7eb;padding:16px;margin-bottom:12px} .section{margin:20px 0} .desc{color:#6b7280;font-size:14px;margin:4px 0 12px} .method{display:inline-block;font-size:11px;font-weight:700;padding:1px 6px;margin-right:6px;font-family:ui-monospace,'SF Mono',monospace} .method-get{background:#ecfdf5;color:#065f46} .method-post{background:#eff6ff;color:#1e40af} .ep{font-family:ui-monospace,'SF Mono',monospace;font-size:14px;font-weight:500} .param-table th{font-size:12px;text-transform:uppercase;letter-spacing:0.05em} .param-table code{background:#f5f5f5;padding:1px 5px;font-size:12px} @media(max-width:640px){ .stat-row{gap:16px} .stat-item .num{font-size:1.1rem} nav .links{gap:12px;font-size:13px} table{font-size:13px} th,td{padding:6px 8px} }

What is tau2-bench?

tau2-bench is a AI tool that τ²-Bench: Evaluating Conversational Agents in a Dual-Control Environment. It has a Nerq Trust Score of 85/100 (A). 762 GitHub stars. Last analyzed March 2026.

Trust & Safety

85
TRUST SCORE
A
GRADE
762
STARS
0
DOWNLOADS

Details

Authorsierra-research
CategoryAI tool
LicenseMIT
Typeagent
SourceView on GitHub

How to Get Started

curl nerq.ai/v1/preflight?target=sierra-research-tau2-bench

Setup guide · Full safety report

Alternatives

ToolTrustStars
openclaw84218.2K
tensorflow72193.9K
AutoGPT75181.9K
n8n80177.3K
ollama74163.0K

Frequently Asked Questions

What is tau2-bench used for?
tau2-bench is a AI tool tool. τ²-Bench: Evaluating Conversational Agents in a Dual-Control Environment.
Is tau2-bench free?
License: MIT. tau2-bench has 762 GitHub stars.
Is tau2-bench safe?
tau2-bench has a Nerq Trust Score of 85/100 (A). Safe for production use.
What are alternatives to tau2-bench?
Top alternatives: openclaw, tensorflow, AutoGPT. See full comparison.

Last updated March 2026. Trust scores based on automated analysis of public data.

Also explore

Nerq Trust Protocol AI Compliance Hub Know Your Agent Crypto Vitality Rankings Crash Watch: Live Alerts Real-Time Token Scanner