Add stricter basic economy policy restrictions#94
Add stricter basic economy policy restrictions#94chrisgorgo wants to merge 1 commit intosierra-research:devfrom
Conversation
Policy changes: - Basic economy reservations cannot have flight segments modified regardless of subsequent cabin changes - Cabin upgrades do not remove flight modification restrictions - Basic economy cancellation limited to: 24hr window, airline cancellation, or insurance with covered reason (health/weather) - Basic economy refunds limited to airline credits unless user has insurance Task fixes: - Task 7: Accept basic economy cancellation refusal, expected total $1,004 - Task 39: Add health reason for cancellation - Task 44: Add health reason and explicit credit card payment preference DB updates: - K1NW8N: basic_economy -> economy, insurance: yes - OWZ4XL: basic_economy -> economy 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>
|
Thanks for the PR. The fact that it is possible to upgrade and then cancel is not an unknown loophole but a known feature of the policy (as you pointed out, task 7 uses it). Are you saying that task 7 is incorrect? (and same for the two other tasks? - I think your numbering if off btw). |
|
Sorry I should've added more context. What we observed is that Opus 4.5 started finding creative ways to use the loopholes in questions that did not expect them. For example in questions 36 and 31. So in general there was inconsistency of when the loopholes were expected and when they would not be allowed. The PR makes the policy clear and adjusts the the other questions to not have to rely on the loopholes. |
|
The fact that you can upgrade to economy and then do the change is not something we were not aware of. The tasks were designed with this in mind. What would be most helpful would be to be able to see the trajectories where the model is using this rule (upgrade and then cancel) in way that leads to a failure of the benchmark.
Here also, could you share trajectories? Are we talking about the same issue or a different one? |
|
Hello, and sorry for imposing, but doesn't the policy say clearly that: In other cases, all reservations, including basic economy, can change cabin without changing the flights. So the solution is not-policy compliant because changing the flight details is implicitly excluded. |
|
@chrisgorgo @victorb-sierra I'd actually suggest that for tasks 31 and 36, we can add more instructions for the user to refuse cabin upgrades if the agent asks about it. That way, we keep the expected outcomes the same to maintain backward compatibility while dealing with the case of the agent trying out this upgrade strategy. What do you think? |
|
This could solve the current problem (although I am not sure if it would be truly backward compatible), but I am worried that future models might start using loopholes in other questions. Is keeping an existing loophole in the policy realistic? Another alternative is to come up with a more robust grading. |
|
Relevant for RFC #128 (Tau3 domain fixes) |
Summary
Added explicit policy restrictions for basic economy reservations to prevent agents from using upgrade workarounds to bypass cancellation rules.
Problem
Agents were finding loopholes in basic economy policies by:
This allowed circumventing the stricter basic economy cancellation rules.
Changes
Policy Updates (
policy.md)Database Updates (
db.json)basic_economytoeconomyto align with updated task expectationsTask Updates (
tasks.json)Impact
Agents will now correctly enforce basic economy restrictions without allowing upgrade-then-cancel workarounds. The evaluation expects agents to explain policy limitations rather than find loopholes.
🤖 Generated with Claude Code