Fetch float16 and float128 with h5grove as binary using `dtype=safe` #1561

axelboc · 2024-02-01T14:02:01Z

No description provided.

axelboc · 2024-02-01T14:10:14Z

packages/app/src/providers/h5grove/__snapshots__/h5grove-api.test.ts.snap

      0,
      1,
      2,
      3,
      4,
-      null,
+      Infinity,


The maximum finite float128 value gets converted to float64 Infinity. I would have expected NumPy to clamp it to the maximum finite float64 value instead. 🤔

Now that I think of it, a similar thing happens with the smallest float128 value greater than 0 (the value of the float128_scalar dataset in the sample file): h5grove sends 0 in float64, not the smallest float64 value greater than 0, as I would have expected.

> The maximum finite float128 value gets converted to float64 Infinity

Sounds like a safe behaviour. It would be weird to convert a finite value to another while you know it's not the same value.

> It would be weird to convert a finite value to another while you know it's not the same value.

That's exactly what happens when you cast an int64 to int32, though. Obviously inf cannot be represented in int32, but still, in float, it's a special value... The rounding to 0 makes more sense in that regard.

I'm just asking in case the code in h5grove is wrong somehow. Maybe the float128 values are cast by Python before NumPy can convert them properly, or something of the sort. 🤷
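For reference, the same narrowing behaviour can be observed directly in JS/TS. This is only an analogy: JS has no float128, so the sketch below narrows float64 to float32 with `Math.fround` instead, and it says nothing about how h5grove or NumPy perform the cast internally. It shows overflow to `Infinity`, underflow to `0`, and, by contrast, integer narrowing wrapping around to another finite value.

```typescript
// Analogy only: JS has no float128, so this narrows float64 -> float32 instead.
// Math.fround rounds a number to the nearest float32 value.
const overflow = Math.fround(Number.MAX_VALUE); // largest finite float64
const underflow = Math.fround(Number.MIN_VALUE); // smallest positive float64

// Integer narrowing, by contrast, yields another finite value:
// storing 2^31 in an Int32Array wraps modulo 2^32.
const wrapped = new Int32Array([2 ** 31])[0];

console.log(overflow, underflow, wrapped); // Infinity 0 -2147483648
```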

loichuder · 2024-02-07T13:43:30Z

Sorry, it took me a while to get back to this, and I need some context to refresh my memory.

Why does using `dtype=safe` make it possible to fetch float16 and float128 as binary?

axelboc · 2024-02-08T08:03:04Z

> Sorry, it took me a while to get back to this, and I need some context to refresh my memory.
>
> Why does using `dtype=safe` make it possible to fetch float16 and float128 as binary?

No worries!

There's no Float16Array and Float128Array in JS, so if we fetched float16/128 datasets with format=bin without dtype=safe, we'd get binary we don't understand (at least without using custom libraries like https://github.com/petamoriken/float16 or https://github.com/munrocket/double.js).

When fetching float16/128 datasets with format=bin and dtype=safe, h5grove converts them to float32/64 respectively. With this knowledge, we can adjust our provider logic to say that when fetching float16, the response buffer should be passed into a Float32Array and when fetching float128, into a Float64Array. (Before, we were saying that float16/128 datasets had no corresponding typed array, which led to fetching them as JSON).
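The mapping described above could be sketched roughly as follows. This is a minimal illustration, not the actual provider code in this PR: `FloatSize` and `decodeSafeFloats` are hypothetical names, and the only behaviour assumed is the one stated above, namely that with `format=bin` and `dtype=safe` a float16 dataset arrives as float32 and a float128 dataset as float64.

```typescript
// Hypothetical names throughout; this is not the actual h5web provider code.
type FloatSize = 16 | 32 | 64 | 128;

// Assumption (from the explanation above): with `dtype=safe`, the binary
// response holds float32 values for float16/32 datasets and float64 values
// for float64/128 datasets.
function decodeSafeFloats(
  buffer: ArrayBuffer,
  size: FloatSize,
): Float32Array | Float64Array {
  return size <= 32 ? new Float32Array(buffer) : new Float64Array(buffer);
}

// Usage: pretend this buffer is the body of a `format=bin&dtype=safe`
// response for a float16 dataset (the values are exactly representable).
const fakeResponse = new Float32Array([0.5, 1.5, -2]).buffer;
const values = decodeSafeFloats(fakeResponse, 16);

console.log(values instanceof Float32Array, Array.from(values));
```

Without `dtype=safe`, the same buffer would contain raw 16-bit or 128-bit floats, which neither typed array can interpret.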


axelboc changed the title ~~Fetch float16 and float128 as binary using dtype=safe~~ Fetch float16 and float128 with h5grove as binary using dtype=safe Feb 1, 2024


axelboc requested a review from loichuder February 1, 2024 14:13

axelboc force-pushed the main branch from 00840fc to 5cc6049 Compare February 6, 2024 15:29

axelboc force-pushed the safe branch 2 times, most recently from 02f2b71 to 44dbe02 Compare February 7, 2024 12:18

axelboc force-pushed the safe branch from 44dbe02 to c343383 Compare February 8, 2024 07:52

loichuder approved these changes Feb 8, 2024


Fetch float16 and float128 as binary using dtype=safe

5e2c8b9

axelboc force-pushed the safe branch from c343383 to 5e2c8b9 Compare February 8, 2024 13:27

axelboc merged commit a09481f into main Feb 8, 2024
8 checks passed

axelboc deleted the safe branch February 8, 2024 13:31
