all of which follow the algorithm introduced in Section 3.2 of our paper. Before using a benchmark, run check_benchmark.py to verify that its format is valid. To reproduce the ...