Description
SWEBenchMultimodal uses several hyperparameters/ constant values and constant values in its implementation in benchmarks/swebenchmultimodal
A constant value is any value int or str that must be set to correctly parametrized/run the benchmark. It can appear as input of a method or as default argument of a method.
Here are example of what can be a constant value:
- "validation"
- "2023_all"
- "finish_with_patch"
- "lite"
- "wentingzhao/commit0_combined"
- ....
Those values are hyper parameters of the full evaluation workflow.
Painpoint
It's not easy to check if all such values are correctly parametrized for a benchmark because they are scattered in many different places.
Solution
For SWEBenchMultimodal
- Go through the code in benchmarks/swebenchmultimodal and find ALL such constant values
- Redefine all those constant values in benchmarks/swebenchmultimodal/constants.py and make sure all other files import those values from the constant.py which constitutes the single source of truth.