Endianness support is missing #97

leofang · 2022-02-28T05:26:48Z

Complaints on endianness has been something I've recurrently seen (ex: CuPy cupy/cupy#3652 and mpi4py mpi4py/mpi4py#177), and I anticipate at some point we'd start receiving bug reports on this. Apparently there is at least a few communities out there (astropy and hdf5) that prefer (or could work with) non-native (that is, big) endianness data. This causes problems if two libraries exchange but do not communicate the endianness for how to interpret the data.

I suggest two possible solutions:

Add an endianness enum (big, little, native, etc) and include it in DLDataType as a new struct member:
- Cons: This will be an API/ABI incompatible change, unfortunately.
Alternatively, we could apply a bitwise mask to DLDataType::code to make it carry this information:
- The mask should be designed such that when not applied it refers to the little endianness, the de facto standard used by all projects so far

cc: @tqchen @rgommers @tirthasheshpatel

The text was updated successfully, but these errors were encountered:

rgommers · 2022-02-28T09:28:05Z

FITS as an astrophysics-specific big-endian format is annoying indeed. Your mpi4py link also seems to come from that community. I can't think of many other fields where this comes up, but it's hard to be sure.

A few thoughts:

Ignoring it completely yields correctness issues; transparently converting to native endianness in I/O routines as CuPy just did makes sense.
Right now, DLPack implementations should check for this and raise an exception on the exporter side. Not doing so looks like a pretty severe bug to me.
Solution 1 above: there's a longer list of info we'd like to see captures (like readonly), when that is done this could be taken along - it's not an extra/separate ABI break.
Solution 2 above: not a fan of such an obscure solution.
It's not clear to me that either solution will be generally helpful. If the vast majority of libraries supporting DLPack do not support non-native byteorder, then an exception will still be raised on the importer side once endianness info is present. Hence not much change compared to the current state where the exporter must raise. So it's probably okay to add if it can be taken along in a larger set of changes, but is not worth a separate (breaking or convoluted) change.

tirthasheshpatel · 2022-02-28T09:39:41Z

It's not clear to me that either solution will be generally helpful. If the vast majority of libraries supporting DLPack do not support non-native byteorder, then an exception will still be raised on the importer side once endianness info is present. Hence not much change compared to the current state where the exporter must raise. So it's probably okay to add if it can be taken along in a larger set of changes, but is not worth a separate (breaking or convoluted) change

I agree with this. If and when we get to adding something like readonly, we can also add this along with it.

tqchen · 2022-02-28T14:51:48Z

While non-native byteorder is useful for serialization and less for in-memory exchange.

As of now the implicit assumption seems to be that the data should always follow the native-byteorder in the system. So perhaps one possible actionable item

A2: confirm this fact(that data field should follow the machine native byteorder) and document it as part of the standard. Note that given almost all libraries operate under the native-order for computation.

We do need to acknowledge that for serialization/networking having a fixed(non-machine native) endian is useful. I personally would consider that to be outside the scope(as the main purpose is fast in-memory exchange for computing). We can always run explicit byteswaping(which is needed anyway as most frameworks need that) in the scenario of serialization.

rgommers · 2022-03-03T14:23:26Z

We do need to acknowledge that for serialization/networking having a fixed(non-machine native) endian is useful. I personally would consider that to be outside the scope(as the main purpose is fast in-memory exchange for computing). We can always run explicit byteswaping(which is needed anyway as most frameworks need that) in the scenario of serialization.

I think the question here is whether to automatically byteswap or not. If users get FITS or HDF5 file with non-native byteorder and byteswapping is not done upon loading that data into memory (as is the case now), then there are two options:

Byteswapping is done automatically upon exporting a DLPack capsule,
The exporter raises an informative error and asks the user to byteswap.

I think I have a slight preference for (2).

tqchen · 2022-03-03T17:44:12Z

I also prefer (2) as it simpler

tirthasheshpatel · 2022-03-03T17:46:29Z

I think I have a slight preference for (2)

That's what NumPy does right now:

>>> import numpy as np
>>> x = np.array([1,2,3], dtype='>i2')
>>> np.from_dlpack(x)
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
TypeError: DLPack only supports native byte swapping.

And what others should also do IMO. I don't think it is unreasonable/unintuitive to ask the user to first convert the data to native byte ordering before exporting. So, +1 for (2)

leofang · 2022-03-18T15:03:26Z

Thanks for the discussion and the consensus during my absence. I agree with you all that

The endianness info can be piggybacked in a larger change
For now the exporter should raise if the tensor/array is not in little endianness to ask users to swap themselves

leofang · 2023-01-17T03:20:27Z

cc: @seberg (in case you have other thoughts since you're playing with endianness 🙂)

I think this issue can be closed by a simple doc update, since no change is required on the DLPack side. If no one else is interested in submitting a PR before the end of this month, I'll give it a shot.

seberg · 2023-01-17T08:02:01Z

No opinion, to me it seems like probably a valid but low-prio addition. And it is simple in that if you flag it on the dtype you get the "unsupported dtype" error for free.

leofang mentioned this issue Feb 28, 2022

Handle byte order markers at the beginning of format strings mpi4py/mpi4py#179

Merged

tirthasheshpatel mentioned this issue Feb 28, 2022

Future ABI compatibility #34

Closed

tirthasheshpatel mentioned this issue Mar 8, 2022

Add a note about non-native endianness #98

Merged

tirthasheshpatel mentioned this issue Mar 25, 2022

[ABI break] Add new structs with version info and readonly flag #101

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Endianness support is missing #97

Endianness support is missing #97

leofang commented Feb 28, 2022 •

edited

Loading

rgommers commented Feb 28, 2022

tirthasheshpatel commented Feb 28, 2022

tqchen commented Feb 28, 2022

rgommers commented Mar 3, 2022

tqchen commented Mar 3, 2022

tirthasheshpatel commented Mar 3, 2022

leofang commented Mar 18, 2022

leofang commented Jan 17, 2023

seberg commented Jan 17, 2023

Endianness support is missing #97

Endianness support is missing #97

Comments

leofang commented Feb 28, 2022 • edited Loading

rgommers commented Feb 28, 2022

tirthasheshpatel commented Feb 28, 2022

tqchen commented Feb 28, 2022

rgommers commented Mar 3, 2022

tqchen commented Mar 3, 2022

tirthasheshpatel commented Mar 3, 2022

leofang commented Mar 18, 2022

leofang commented Jan 17, 2023

seberg commented Jan 17, 2023

leofang commented Feb 28, 2022 •

edited

Loading