Add FP16 support to ptq_evaluate.py and update README argument list #1174
Conversation
Signed-off-by: hkayann <[email protected]>
Just 2 quick comments, otherwise looks good!
Thanks again for this.
-    '--dtype', default='float', choices=['float', 'bfloat16'], help='Data type to use')
+    '--dtype',
+    default='float',
+    choices=['float', 'bfloat16', 'half'],
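As an aside, here is a minimal sketch of how the flag could map onto an actual torch dtype once `half` is renamed to `float16` (the `DTYPE_MAP` lookup is an illustrative assumption, not the script's real code):

```python
import argparse

import torch

# Hypothetical lookup from the CLI string to a torch dtype; the real
# ptq_evaluate.py may resolve the string differently.
DTYPE_MAP = {
    'float': torch.float32,
    'bfloat16': torch.bfloat16,
    'float16': torch.float16,  # torch.float16 is the same dtype as torch.half
}

parser = argparse.ArgumentParser()
parser.add_argument(
    '--dtype',
    default='float',
    choices=list(DTYPE_MAP.keys()),
    help='Data type to use')

args = parser.parse_args([])      # empty list just to show the default
print(DTYPE_MAP[args.dtype])      # torch.float32
```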
I think `half` and `float16` are equivalent, and for consistency reasons I would prefer `float16`. If I am missing something, please let me know.
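As a quick illustration (not part of this PR), PyTorch itself treats the two names as aliases for the same dtype:

```python
import torch

# 'half' is just an alias for 'float16'; both refer to the same dtype object.
assert torch.half is torch.float16
assert torch.zeros(1, dtype=torch.half).dtype == torch.float16
print(torch.half)  # prints: torch.float16
```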
Yes, they are functionally equivalent. My initial thought was that using `half` could help prevent typos, since `float16` and `bfloat16` differ only by the letter b.
Now I am trying to save custom-bit-width models, for example an FP16 variant with a 9-bit mantissa and a 6-bit exponent, but that does not seem to be possible with the available PyTorch dtypes.
Unfortunately I'm not sure I can fully help with the second issue, since we mostly focus on minifloat quantization with 8 bits or fewer.
In the meantime, would you mind changing `half` to `float16`? I understand the potential for typos, but I still prefer to be consistent across the codebase, and we never (or rarely) use `half` instead of `float16`.
Thanks!
I can simulate it for now, so I have a working workaround, which is something. I have made all the requested changes. Many thanks again.
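For reference, a minimal sketch of what such a simulation could look like in plain PyTorch, fake-quantizing float32 tensors onto the grid of a custom sign/exponent/mantissa format (the helper name, IEEE-style bias, and saturation behaviour are assumptions, not the author's actual workaround):

```python
import torch


def fake_quantize_minifloat(x: torch.Tensor, exp_bits: int, mant_bits: int) -> torch.Tensor:
    """Restrict ``x`` to the values of a custom (1, exp_bits, mant_bits) float format.

    The tensor stays in float32; only the representable grid changes, which is
    enough to simulate a format that has no native PyTorch dtype.
    """
    bias = 2 ** (exp_bits - 1) - 1          # IEEE-style exponent bias (assumed)
    max_exp = (2 ** exp_bits - 2) - bias    # top exponent code reserved for inf/nan
    min_exp = 1 - bias                      # smallest normal exponent

    sign = torch.sign(x)
    mag = x.abs().clamp(min=torch.finfo(torch.float32).tiny)

    # Per-element exponent, clamped so values below 2**min_exp share one step
    # size (this models gradual underflow / subnormals).
    exp = torch.clamp(torch.floor(torch.log2(mag)), min=min_exp, max=max_exp)

    # Round the mantissa to ``mant_bits`` fractional bits at that exponent.
    step = 2.0 ** (exp - mant_bits)
    q = torch.round(mag / step) * step

    # Saturate at the largest representable normal value.
    max_val = (2.0 - 2.0 ** -mant_bits) * 2.0 ** max_exp
    return sign * torch.clamp(q, max=max_val)


# Example: 1 sign bit, 6 exponent bits, 9 mantissa bits (16 bits total).
w = torch.randn(4, 4)
print(fake_quantize_minifloat(w, exp_bits=6, mant_bits=9))
```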
Signed-off-by: hkayann <[email protected]>
Thanks for this again :) I will let the tests run and then merge it.
Reason for this PR
Changes Made in this PR
Testing Summary
Risk Highlight
Checklist
dev branch.