Replace JSON serialization with cloudpickle and make reference semantics explicit. #153

tbenthompson · 2023-02-06T16:34:47Z

See #138 for older discussion.

Copied from the changelog:

replaced JSON serialization with cloudpickle. This allows extracting a much
wider range of objects from the notebook subprocess.
Reference semantics have changed.
- Old behavior of tb.get(name) and tb[name]:
  - a reference would be returned for non-JSON-serializable objects.
  - a value would be returned for JSON-serializable objects.
- Old behavior of tb.ref(name) was identical to tb.get(name).
- However, now almost all objects are serializable and as a result, under
  the old semantics, a reference would almost never be returned. Therefore,
  when a reference is desired, we now require explicitly requesting a
  reference. The new behavior of tb.get(name) and tb[name] is to always
  return the deserialized object and to never return a reference. The new
  behavior of tb.ref(name) is to always return a reference.

I think this is a substantial improvement because:

this PR allows serializing a much wider range of objects
makes the API substantially clearer about when a reference will or will not be returned.

I understand that this is a mildly breaking change and adds a dependency and that must be weighed against the benefits. I would argue that the change is very worthwhile!

In my case, I wanted to be able to extract pandas DataFrames and various custom objects from a notebook under test.

codecov-commenter · 2023-02-17T07:17:38Z

Codecov Report

Merging #153 (0b65427) into main (62d7bd9) will increase coverage by 6.93%.
The diff coverage is 88.23%.

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #153      +/-   ##
==========================================
+ Coverage   91.13%   98.07%   +6.93%     
==========================================
  Files           7        8       +1     
  Lines         361      363       +2     
==========================================
+ Hits          329      356      +27     
+ Misses         32        7      -25

tbenthompson added 3 commits February 6, 2023 11:12

Use cloudpickle instead of JSON serialization.

d3b80be

Explicit reference semantics.

c9e2ce9

Describe the changes in the changelog.

0b65427

tbenthompson force-pushed the main branch from 796c373 to 0b65427 Compare February 6, 2023 16:35

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Replace JSON serialization with cloudpickle and make reference semantics explicit. #153

Replace JSON serialization with cloudpickle and make reference semantics explicit. #153

tbenthompson commented Feb 6, 2023 •

edited

Loading

codecov-commenter commented Feb 17, 2023

Replace JSON serialization with cloudpickle and make reference semantics explicit. #153

Are you sure you want to change the base?

Replace JSON serialization with cloudpickle and make reference semantics explicit. #153

Conversation

tbenthompson commented Feb 6, 2023 • edited Loading

codecov-commenter commented Feb 17, 2023

Codecov Report

tbenthompson commented Feb 6, 2023 •

edited

Loading