[Feature]: Add optional lazy look-up of extractor properties #247

pauladkisson · 2023-09-20T20:15:17Z

Current Behavior

Every ImagingExtractor (IE) looks up all its properties (num_channels, num_frames, etc.) during __init__ and stores them as private attributes (self._num_channels, self._num_frames, etc.).

Issue

This is efficient if you want to look up the properties many times for one file, but inefficient for MultiImagingExtractor that only needs to look up properties for 1 of its many IE's.

Proposed Change

Add an option to the base ImagingExtractor class for precomputed_properties=True and then if you pre-compute the properties, I/O happens during __init__ and they get stored as attributes. But if precomputed_properties=False then I/O happens lazily and you can skip the consistency checks for MultiImagingExtractors.

This class is just an IO function disguissed as a class (and IO wrapper). You probably eliminated the attribute access because it was inefficient. You could just had used the logic of get_video inside of BrukerTiffSinglePlaneImagingExtractor get_video instead of using the MultiImagingExtractor approach.

That would probably be harder to mantain, a cost. But if we are propagating a lazy look up feature to ALL extractors well that is also a cost.

I think you approach you guys used there is fine, keep this IO wrapper class as a private / utility object. That way you get the benefits but do not need to propagate the costs anywhere else. I am less keen on propagating this behavior to the whole library because some extractor might be used the way you used _BrukerTiffSinglePlaneImagingExtractor.

h-mayorquin · 2023-09-20T22:50:50Z

I think it would be better to implement a general object that behaves similar to _BrukerTiffSinglePlaneImagingExtractor for classes that are meant to be composed / concatenated. The only method that it should have is basically get_video and num_frames as a property and they will be meant to be used as convenience IO wrappers to be composed by MultiImaging. That way, you don't leak this complexity anywhere else than where it is needed.

pauladkisson · 2023-09-20T22:51:25Z

I don't really get the pushback here.

MultiImagingExtractor is a very general purpose object used to concatenate individual IE's by frames regardless of file format usu. bc the files are split into chunks in time. If lazy property look-up is better for MultiImagingExtractor then that is a relevant use case for all of the IE's since they all can (and many do) use MultiImagingExtractor.

If you think MultiImagingExtractor is a bad idea in general, then we should discuss that instead. And tbh it seemed pretty intuitive to me.

pauladkisson · 2023-09-20T23:01:49Z

I think it would be better to implement a general object that behaves similar to _BrukerTiffSinglePlaneImagingExtractor for classes that are meant to be composed / concatenated. The only method that it should have is basically get_video and num_frames as a property and they will be meant to be used as convenience IO wrappers to be composed by MultiImaging. That way, you don't leak this complexity anywhere else than where it is needed.

That's an interesting idea. But it doesn't work as well for formats like ScanImage that can have multi-file or single-file structure. If MultiImagingExtractor can't call its IEs' property methods then it won't be able to stand alone, and you would have to inherit it separately for every format that wants to support the multi-file case. Not the end of the world, but definitely a cost.

h-mayorquin · 2023-09-20T23:02:30Z

if lazy property look-up is better for MultiImagingExtractor then that is a relevant use case for all of the IE's since they all can (and many do) use MultiImagingExtractor.

I disagree with this. The use case of lazy look up is when you use MultiImagingExtractor to build an extractor from many files to avoid inefficiencies as it was done in an ad-hoc way for _BrukerTiffSinglePlaneImagingExtractor above. Not all extractors are meant to be composed in that way and therefore the feature does not need to be as general. If we can avoid complexity in our most general object, we should. The more general the object the more wary we should be of adding extra functionality.

h-mayorquin · 2023-09-20T23:43:15Z

Do we have data for volumetric ScanImage in google drive for some conversion project?

That's an interesting idea. But it doesn't work as well for formats like ScanImage that can have multi-file or single-file structure.

Seems simple to me. Use the simple IO wrapper class for the multi-file case, expand the simple IO wrapper class to a full one for the single file case.

If MultiImagingExtractor can't call its IEs' property methods then it won't be able to stand alone, and you would have to inherit it separately for every format that wants to support the multi-file case. Not the end of the world, but definitely a cost.

MultiImagingExtractor should be able to call IE properties. The simple classes IO wrapper classes that you compose with MultiImagingExtractor are the ones that don't need those methods.

simple_private_wrapper_io_class -> add_the_rest_of_the_methods -> fully fleshed `ImagingExtractor` for the single file case
simple_private_wrapper_io_classs -> compose many with `MultiImagingExtractor` -> fully fleshed `ImaginExtractor` for the mlutiple file case.

This is confined and ad-hoc for every format, you don't propagate complexity outsides of where it is needed. You can adjust what's best for performance depending on the layout of the files (if there are many the io_wrapper can be more or less light) and can chose in general what's best per format instead of thinking on a general interface for the abstract class in advance. Once you have some examples, then you can generalize and propagate to the general class.

pauladkisson · 2023-09-21T00:15:41Z

Do we have data for volumetric ScanImage in google drive for some conversion project?

Yes, it's under mouseV1-to-nwb

pauladkisson added the enhancement New feature or request label Sep 20, 2023

pauladkisson mentioned this issue Sep 20, 2023

Multi-Plane and Multi-Channel Support for ScanImage #241

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature]: Add optional lazy look-up of extractor properties #247

[Feature]: Add optional lazy look-up of extractor properties #247

pauladkisson commented Sep 20, 2023 •

edited

Loading

h-mayorquin commented Sep 20, 2023 •

edited

Loading

CodyCBakerPhD commented Sep 20, 2023 •

edited

Loading

h-mayorquin commented Sep 20, 2023

h-mayorquin commented Sep 20, 2023 •

edited

Loading

h-mayorquin commented Sep 20, 2023

pauladkisson commented Sep 20, 2023

pauladkisson commented Sep 20, 2023

h-mayorquin commented Sep 20, 2023

h-mayorquin commented Sep 20, 2023

pauladkisson commented Sep 21, 2023

[Feature]: Add optional lazy look-up of extractor properties #247

[Feature]: Add optional lazy look-up of extractor properties #247

Comments

pauladkisson commented Sep 20, 2023 • edited Loading

Current Behavior

Issue

Proposed Change

See Also

Do you have any interest in helping implement the feature?

Code of Conduct

h-mayorquin commented Sep 20, 2023 • edited Loading

CodyCBakerPhD commented Sep 20, 2023 • edited Loading

h-mayorquin commented Sep 20, 2023

h-mayorquin commented Sep 20, 2023 • edited Loading

h-mayorquin commented Sep 20, 2023

pauladkisson commented Sep 20, 2023

pauladkisson commented Sep 20, 2023

h-mayorquin commented Sep 20, 2023

h-mayorquin commented Sep 20, 2023

pauladkisson commented Sep 21, 2023

pauladkisson commented Sep 20, 2023 •

edited

Loading

h-mayorquin commented Sep 20, 2023 •

edited

Loading

CodyCBakerPhD commented Sep 20, 2023 •

edited

Loading

h-mayorquin commented Sep 20, 2023 •

edited

Loading