Pull requests: web-arena-x/webarena
Pull requests list
Support HuggingFace completion prompts for non-Llama models
#266
opened May 23, 2026 by
mitre88
Add MiniMax as LLM provider for web agent evaluation
#250
opened Mar 24, 2026 by
octo-patch
3 of 4 tasks
add trajectory JSONL streaming for persistence/debug/replay
#240
opened Nov 27, 2025 by
thanay-sisir
add automatic 3-attempt retry for LLM parse failures
#239
opened Nov 27, 2025 by
thanay-sisir
add exact text match to TYPE action equivalence in is_equivalent
#238
opened Nov 26, 2025 by
thanay-sisir
Unify Consecutive Repeating Action Detection in early_stop
#237
opened Nov 26, 2025 by
thanay-sisir
prioritize recent obs content via suffix truncation
#236
opened Nov 26, 2025 by
thanay-sisir
Wrong scroll action specification in the prompts vs. in the parsing function
#110
opened Mar 13, 2024 by
michalspiegel