You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Currently, measurements are applied sequentially to a given dataframe. This might be not optimal - besides, this approach doesn't bring any benefit, it only slows things down. A better would be to somehow aggregate the measurements so that they get executed on a single pass through data. This might require implementing slightly different approach to Measure.scala using some kind of MeasurementBuilder class.
Except for Measure class, some other things might be changed here, see #79 (comment) for example.
The text was updated successfully, but these errors were encountered:
Background
Currently, measurements are applied sequentially to a given dataframe. This might be not optimal - besides, this approach doesn't bring any benefit, it only slows things down. A better would be to somehow aggregate the measurements so that they get executed on a single pass through data. This might require implementing slightly different approach to
Measure.scala
using some kind of MeasurementBuilder class.Except for Measure class, some other things might be changed here, see #79 (comment) for example.
The text was updated successfully, but these errors were encountered: