Posted on 2024-01-04, 18:26. Authored by Yoav Ger, Eliya Nachmani, Lior Wolf, Nitzan Shahar
To verify that quantization did not affect performance, we ran two additional experiments in which we quantized the RL parameters with as few as 3 bins (α and β each quantized to 3 evenly spaced bins) and as many as 10 bins (α and β each quantized to 10 evenly spaced bins). Performance was similar across all three quantization settings (the original setting and these two).
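As a rough illustration of what quantizing a parameter into evenly spaced bins can look like, here is a minimal Python sketch. It is not the authors' code: the function name `quantize_evenly`, the parameter ranges (α in [0, 1], β in [0, 10]), and the choice of bin centers as representative values are all assumptions made for the example.

```python
import numpy as np

def quantize_evenly(values, low, high, n_bins):
    """Map continuous values to one of n_bins evenly spaced bins on [low, high].

    Returns the bin index of each value and the center of that bin.
    Values outside [low, high] fall into the first or last bin.
    """
    values = np.asarray(values, dtype=float)
    edges = np.linspace(low, high, n_bins + 1)   # n_bins + 1 evenly spaced edges
    # Use only the interior edges so indices run from 0 to n_bins - 1.
    idx = np.digitize(values, edges[1:-1])
    centers = (edges[:-1] + edges[1:]) / 2       # midpoint of each bin
    return idx, centers[idx]

# Hypothetical parameter values and ranges, chosen only for illustration.
alpha = np.array([0.05, 0.55, 0.95])   # learning rate, assumed range [0, 1]
beta = np.array([0.5, 4.2, 9.7])       # inverse temperature, assumed range [0, 10]

for n_bins in (3, 10):
    alpha_idx, alpha_q = quantize_evenly(alpha, 0.0, 1.0, n_bins)
    beta_idx, beta_q = quantize_evenly(beta, 0.0, 10.0, n_bins)
    print(n_bins, alpha_idx, beta_idx)
```

Rerunning the same pipeline with `n_bins` set to 3 and to 10 mirrors the comparison described above; only the bin count changes between runs.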