When quantizing floating-point models whose layers have no bias (e.g., nn.Linear(in_features=10, out_features=2, bias=False)), bias correction will currently add a bias to the layer. The new bias is then exported with the state dictionary. However, when this modified state dictionary is loaded into a fresh instance of the original model, PyTorch raises a key-mismatch error, because neither the floating-point model nor the quantized model has a bias until bias correction has been run.
The issue can be worked around by running bias correction before loading the modified state dictionary, but a more flexible solution may be to add support for this case into the state-dictionary loading mechanism itself.
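A minimal sketch of the mismatch and the workaround, assuming a toy model and a hypothetical `apply_bias_correction` stand-in for the library's bias-correction pass (the real pass computes a corrected bias; here it simply registers a zero bias on bias-free layers):

```python
import torch
import torch.nn as nn

class Net(nn.Module):
    """Original float model with a bias-free linear layer."""
    def __init__(self):
        super().__init__()
        self.fc = nn.Linear(in_features=10, out_features=2, bias=False)

def apply_bias_correction(model):
    # Hypothetical stand-in for the library's bias-correction pass:
    # it attaches a bias parameter to layers that had none.
    fc = model.fc
    fc.bias = nn.Parameter(torch.zeros(fc.out_features))

corrected = Net()
apply_bias_correction(corrected)
state = corrected.state_dict()  # now contains 'fc.bias'

# Loading into a fresh instance of the original model fails: the fresh
# model has no 'fc.bias' entry, so load_state_dict (strict=True) raises.
err = None
try:
    Net().load_state_dict(state)
except RuntimeError as e:
    err = e

# Workaround: run bias correction first, then load the state dictionary.
fixed = Net()
apply_bias_correction(fixed)
fixed.load_state_dict(state)  # succeeds
```

The workaround works because bias correction recreates the extra `fc.bias` parameter before loading, so the model's parameter set matches the keys in the exported state dictionary.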