[Illustrative; not for merge] How to prefer float16 as the main float type #1802
Currently, if your intention is to build a CoreML model which targets float16 (very likely if you're targeting the ANE), the process for tracing and converting the model is, as far as I understand it: "trace in float32; casts to float16 are added during conversion, and the optimization passes then try to elide those casts".
This does have the (admittedly minor) downside of having to load a float32 model (and start with 32-bit weights), only to ultimately throw half of those bits away.
It also relies on the optimization passes successfully eliding all of those casts, and you have to wait (slightly) for those passes to run.
The main downside for me came when trying to debug conversion failures: the ops were cluttered with casts, and looked more different from the initial torchscript than they needed to be.
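For reference, the status-quo workflow looks roughly like this (a minimal sketch; the tiny `Sequential` model, input shape, and names are stand-ins for illustration, not taken from this PR):

```python
import torch
import coremltools as ct

# Status quo: trace in float32, then ask the converter to produce a float16 model.
model = torch.nn.Sequential(
    torch.nn.Conv2d(3, 8, kernel_size=3, padding=1),
    torch.nn.ReLU(),
).eval()
example = torch.rand(1, 3, 64, 64)            # float32 example input
traced = torch.jit.trace(model, example)

mlmodel = ct.convert(
    traced,
    inputs=[ct.TensorType(name="x", shape=example.shape)],
    compute_precision=ct.precision.FLOAT16,   # fp32->fp16 casts are inserted, then elided by passes
    convert_to="mlprogram",
)
```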
To simplify all this, I changed the convention everywhere I could find it, from

"Python floats will be interpreted as np.float32"

to

"Python floats will be interpreted as np.float16".

I'm not proposing to merge this or anything, but the changes didn't need to be made in many places in order to successfully compile stable-diffusion from a model that was traced by torchscript in float16.
Note: PyTorch's CPU device doesn't implement the float16 operations needed here, so this trick requires tracing the model in float16 on the MPS device instead.
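For comparison, the tracing side of that workflow with this change would look roughly like the sketch below. It assumes a machine with an MPS device, and the tiny `Sequential` model is again just a stand-in:

```python
import torch
import coremltools as ct

# With the patched convention: trace the model in float16 directly.
# The trace runs on the MPS device because the CPU backend lacks float16 kernels.
device = torch.device("mps")
model = torch.nn.Sequential(
    torch.nn.Conv2d(3, 8, kernel_size=3, padding=1),
    torch.nn.ReLU(),
).eval().half().to(device)
example = torch.rand(1, 3, 64, 64, dtype=torch.float16, device=device)
traced = torch.jit.trace(model, example)

# The converter now sees float16 weights and activations from the start,
# so there are no fp32->fp16 casts to insert and later optimize away.
mlmodel = ct.convert(
    traced,
    inputs=[ct.TensorType(name="x", shape=example.shape)],
    convert_to="mlprogram",
)
```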
If it turns out it's not just me who finds this useful:
I wonder whether this could be exposed somehow as a configurable option, e.g. a "default float width"?
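Purely as an illustration of the idea, such an option might be spelled something like the following. The `default_float_dtype` argument is hypothetical and does not exist in coremltools; `traced` is the float16 torchscript trace from the sketch above:

```python
import numpy as np
import coremltools as ct

# Hypothetical API sketch only: a converter-level knob for the default float width.
# `default_float_dtype` is NOT an existing coremltools parameter.
mlmodel = ct.convert(
    traced,                                   # the float16 trace from the previous sketch
    inputs=[ct.TensorType(name="x", shape=(1, 3, 64, 64))],
    convert_to="mlprogram",
    default_float_dtype=np.float16,           # i.e. "Python floats will be interpreted as np.float16"
)
```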