
Improve stream rendering performance #641

Open · satabin wants to merge 10 commits into 1.11.x from json/render

Conversation

@satabin (Member) commented Oct 24, 2024

This change addresses #634 in two ways:

  • A change to the generic pretty printer that avoids:
    • boxing integers when computing the layout,
    • instantiating many tuples that are immediately discarded,
    • creating intermediate chunks, by fusing the annotation and rendering phases.
  • Re-introducing a direct compact renderer for JSON, which is much simpler than going through the generic printer with no groups.

With these changes, compact rendering should be back to its 1.10 performance, and the pretty-printing case improves a bit.
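To make the boxing and tuple points concrete, here is a minimal standalone sketch of the fusion idea. The names (`Evt`, `Word`, `Line`, `FusionSketch`) are hypothetical illustrations, not the actual fs2-data internals:

```scala
sealed trait Evt
final case class Word(s: String) extends Evt
case object Line extends Evt // a potential line break

object FusionSketch {
  val maxWidth = 80

  // Unfused: a first pass annotates every event with its width, building an
  // intermediate list of tuples that the second pass immediately deconstructs
  // and discards, boxing each Int along the way.
  def renderUnfused(events: List[Evt]): String = {
    val annotated: List[(Int, Evt)] = events.map {
      case w @ Word(s) => (s.length, w)
      case Line        => (1, Line)
    }
    val sb = new StringBuilder
    var col = 0
    annotated.foreach {
      case (width, Word(s)) => col += width; sb.append(s)
      case (width, Line) =>
        if (col + width > maxWidth) { col = 0; sb.append('\n') }
        else { col += width; sb.append(' ') }
    }
    sb.toString
  }

  // Fused: the width is computed and consumed in the same pass, so it stays a
  // primitive int and no intermediate tuples (or chunks) are ever allocated.
  def renderFused(events: List[Evt]): String = {
    val sb = new StringBuilder
    var col = 0
    events.foreach {
      case Word(s) =>
        col += s.length; sb.append(s)
      case Line =>
        if (col + 1 > maxWidth) { col = 0; sb.append('\n') }
        else { col += 1; sb.append(' ') }
    }
    sb.toString
  }
}
```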

Here are some benchmark results, with the compact rendering in 1.11.1 as the baseline; 1.11.2 represents the results with this change.

```mermaid
xychart-beta
  title "Rendering of int array"
  x-axis ["Pretty 1.10", "Pretty 1.11.1", "Pretty 1.11.2", "Compact 1.11.1", "Compact 1.11.2"]
  y-axis "Factor" 0 --> 1.2
  bar [0.02, 1.12, 0.71, 1, 0.004]
```
```mermaid
xychart-beta
  title "Rendering of int object"
  x-axis ["Pretty 1.10", "Pretty 1.11.1", "Pretty 1.11.2", "Compact 1.11.1", "Compact 1.11.2"]
  y-axis "Factor" 0 --> 1.2
  bar [0.01, 1.11, 0.85, 1, 0.01]
```

@satabin requested a review from a team as a code owner on October 24, 2024
@satabin added the enhancement, json, and regression labels on Oct 24, 2024
@satabin (Member, Author) commented Oct 24, 2024

@recons This PR should solve your problem with compact rendering once merged and released.

@satabin force-pushed the json/render branch 3 times, most recently from 2829f8f to a0c50b0 on October 24, 2024
@ybasket (Collaborator) left a comment

Looks good, just a few minor comments.

Comment on lines +26 to +28

```scala
(List
  .range(0, 1000000)
  .map(i => Token.NumberValue(i.toString())) :+ Token.EndArray))
```
@ybasket (Collaborator):

Suggested change:

```diff
-(List
-  .range(0, 1000000)
-  .map(i => Token.NumberValue(i.toString())) :+ Token.EndArray))
+(List.tabulate(1000000)(i => Token.NumberValue(i.toString())) :+ Token.EndArray))
```

Minor, but IMHO, that makes the intent a bit clearer.

I also would love to avoid appending to that long list, but I assume you want a single chunk? Otherwise, using Stream's ++, this could be simplified further.

@satabin (Member, Author):

Yeah, I want a single chunk, to see the overhead of processing one chunk.
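For illustration, a small sketch (assuming fs2's standard `Stream` API; the `tokens` list is a stand-in for the benchmark data) of why the whole list is built up front: `Stream.emits` wraps it in a single chunk, whereas concatenating streams with `++` keeps the chunk boundary.

```scala
import fs2.Stream

object ChunkSketch extends App {
  val tokens = List("a", "b", "c") // stand-in for the benchmark's token list

  // Stream.emits puts the whole list into one chunk, which is what the
  // benchmark wants in order to measure per-chunk processing overhead.
  val single = Stream.emits(tokens)
  assert(single.chunks.toList.size == 1)

  // Appending with ++ preserves chunk boundaries, so this stream has two
  // chunks instead of one.
  val appended = Stream.emits(tokens) ++ Stream.emit("d")
  assert(appended.chunks.toList.size == 2)
}
```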

```scala
annctx.groups.unsnoc match {
  case Some((OpenGroup(ghpl, gindent, group), groups)) =>
    annctx.groups = groups.snoc(OpenGroup(ghpl, gindent, group.append(evt)))
  case None => // should never happen
```
@ybasket (Collaborator):

Do we have a better option than silently ignoring this bug? fs2 itself uses assert() to catch some bugs. Not ideal, because AFAICT we would still produce a semi-valid result here, but let's at least have the discussion (and maybe a decision for all of fs2-data).

@satabin (Member, Author):

Yeah, I don't like cases like this one, where we cannot statically ensure they never happen. And I don't like crashing with an assert either. I will try to find something better here. But it was already like that before 😬
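One loud-failure option along the lines ybasket mentions, sketched here for illustration only (it reuses the identifiers from the hunk quoted above and is not necessarily what the PR ends up doing):

```scala
annctx.groups.unsnoc match {
  case Some((OpenGroup(ghpl, gindent, group), groups)) =>
    annctx.groups = groups.snoc(OpenGroup(ghpl, gindent, group.append(evt)))
  case None =>
    // invariant: events are only pushed while a group is open, so an empty
    // group deque here means a bug in the printer itself; fail loudly
    // instead of silently dropping the event
    assert(false, s"pushed $evt with no open group")
}
```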

```scala
}

private def renderIndentBegin(ctx: RenderingContext): Unit = {
  ctx.lines = NonEmptyList(ctx.lines.head + (" " * indentSize), ctx.lines.tail)
```
@ybasket (Collaborator):

Do you think a StringBuilder allocated to the right capacity could help speed things up here? Like first appending the head, then indentSize spaces?

@satabin (Member, Author):

I think it would actually be way better to cache the padding. When rendering structured data, the same indent size will be encountered over and over again. I will try to reuse the padding already computed for a given indent depth, rather than doing this.
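A minimal sketch of that caching idea (the `PaddingCache` name is hypothetical; it assumes padding grows linearly with indent depth):

```scala
import scala.collection.mutable.ArrayBuffer

// Memoize the padding string per indent depth, so the string concatenation
// is performed at most once per depth ever encountered.
final class PaddingCache(indentSize: Int) {
  private val cache = ArrayBuffer("") // depth 0 has no padding

  def apply(depth: Int): String = {
    // grow the cache lazily up to the requested depth
    while (cache.size <= depth)
      cache += cache.last + (" " * indentSize)
    cache(depth)
  }
}
```

With something like this, the renderer could look up the padding for the current depth instead of rebuilding `" " * indentSize` on every indent.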
