Actions: sierra-research/tau2-bench

Deploy Leaderboard to GitHub Pages

18 workflow runs

Qwen3 max thinking submission (#149)
Deploy Leaderboard to GitHub Pages #17: Commit 70e700c pushed by benshi34
1m 6s main
Fix toolorchestrator trajectory loading (#120)
Deploy Leaderboard to GitHub Pages #16: Commit 337326e pushed by benshi34
54s main
Submit Toolorchestra to leaderboard - Revised (#119)
Deploy Leaderboard to GitHub Pages #15: Commit 1b67be7 pushed by benshi34
57s main
Leaderboard/qwen 3 max thinking preview (#117)
Deploy Leaderboard to GitHub Pages #14: Commit 5704c21 pushed by benshi34
1m 27s main
support custom scaffold on leaderboard view (#101)
Deploy Leaderboard to GitHub Pages #13: Commit b00cd42 pushed by benshi34
1m 42s main
Deploy Leaderboard to GitHub Pages
Deploy Leaderboard to GitHub Pages #10: Manually run by benshi34
1m 25s main
Deploy Leaderboard to GitHub Pages
Deploy Leaderboard to GitHub Pages #9: Manually run by benshi34
5m 38s main
submission: qwen3.5-max results (#67)
Deploy Leaderboard to GitHub Pages #6: Commit 9df0df8 pushed by victorb-sierra
1m 2s main
Fix mobile view (#56)
Deploy Leaderboard to GitHub Pages #4: Commit db6ce6a pushed by victorb-sierra
54s main
Deploy Leaderboard to GitHub Pages
Deploy Leaderboard to GitHub Pages #2: Manually run by victorb-sierra
54s main
[Feature] tau2 leaderboard (#53)
Deploy Leaderboard to GitHub Pages #1: Commit bb6a9e4 pushed by victorb-sierra
59s main