
Promote 0.9.28 #1889

Merged
merged 45 commits into filodb:main on Nov 13, 2024

Conversation

amolnayak311
Contributor

Promote 0.9.28

sandeep6189 and others added 30 commits August 8, 2024 16:29
…b#1831)

* misc(core): Adding unit tests for histograms for StitchRvsExec
* Fix bugs caused by stitching empty and non-empty data.

The current Transient row can only handle a (Long, Double) schema, but histograms, averages, and other result types use different schemas. Create a NaNRowReader that can handle any schema.
---------

Co-authored-by: Yu Zhang <[email protected]>
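A minimal, self-contained sketch of the NaNRowReader idea described above, assuming a simplified row and column model rather than FiloDB's actual RowReader API: the point is that an "empty" row can answer for any column type instead of being hard-coded to (Long, Double).

```scala
// Illustrative stand-ins only; not FiloDB's RowReader or TransientRow types.
sealed trait ColumnType
case object LongColumn extends ColumnType
case object DoubleColumn extends ColumnType
case object HistogramColumn extends ColumnType

// A "NaN row" that can answer for any column type when one side of a stitch is empty.
class NaNRowSketch(schema: Seq[ColumnType]) {
  def getLong(col: Int): Long = 0L
  def getDouble(col: Int): Double = Double.NaN
  // A histogram "NaN" is modeled here as an empty bucket array.
  def getHistogram(col: Int): Array[Double] = Array.empty[Double]
}

object NaNRowSketchDemo extends App {
  val row = new NaNRowSketch(Seq(LongColumn, HistogramColumn))
  println(row.getDouble(1).isNaN)       // true
  println(row.getHistogram(1).isEmpty)  // true
}
```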
…ilodb#1819)" (filodb#1838)

* Revert "feat(core): Now metadata queries support _type_ filter (filodb#1819)"
This reverts commit 8ce88de.
…rved field (filodb#1842)

Addition of the _type_ field to the index is now configurable per cluster, false by default for now.
Also, if the part key already has a _type_ field we don't index it, since it is a reserved field that we populate ourselves.
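A hedged sketch of the gating described in this commit; the flag and helper below are illustrative assumptions, not FiloDB's actual configuration keys or indexing code. The reserved _type_ field is only indexed when the per-cluster flag is on, and any _type_ label already present in the part key is dropped rather than indexed.

```scala
// Illustrative gating logic; names are assumptions, not FiloDB's real settings.
object TypeFieldIndexingSketch extends App {
  val TypeLabel = "_type_"
  val typeFieldIndexingEnabled = false  // assumed per-cluster setting, false by default

  def labelsToIndex(partKeyLabels: Map[String, String], schemaName: String): Map[String, String] = {
    // Drop any user-supplied _type_: it is a reserved field we populate ourselves.
    val withoutReserved = partKeyLabels - TypeLabel
    if (typeFieldIndexingEnabled) withoutReserved + (TypeLabel -> schemaName)
    else withoutReserved
  }

  println(labelsToIndex(Map("app" -> "api", TypeLabel -> "bogus"), "prom-counter"))
}
```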
…ilodb#1819)" (filodb#1838)

* Revert "feat(core): Now metadata queries support _type_ filter (filodb#1819)"
This reverts commit 8ce88de.
…er (filodb#1819)"

Cherry-pick and revert a hotfix commit from main to prevent conflicts during downstream merges.
)

New behavior:

This change adds support for the Tantivy indexing library as an alternative to Lucene for time series indexing. In several cases it has been found to outperform Lucene, especially in memory usage and the predictability of memory spikes.

This feature is opt-in via a configuration setting to avoid any unexpected changes during upgrade. For the moment only the raw time series index is supported. Downsample support may come in a future PR.

BREAKING CHANGES

Building this change requires a working Rust and C compiler, since the Tantivy code is written in Rust. The README docs have been updated to reflect this.

There are no runtime breaking changes.
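A rough sketch of what the opt-in backend selection could look like, assuming a config-driven switch that defaults to Lucene; the trait, class names, and config value below are illustrative, not FiloDB's actual index API or setting.

```scala
// Illustrative sketch only: names are assumptions, not FiloDB's abstractions.
trait PartKeyIndexSketch { def name: String }
class LuceneIndexSketch extends PartKeyIndexSketch { val name = "lucene" }
class TantivyIndexSketch extends PartKeyIndexSketch { val name = "tantivy" }

object IndexBackendSketch extends App {
  // Choose the backend from configuration; anything other than the opt-in
  // value falls back to Lucene, preserving current behavior on upgrade.
  def forConfig(backend: String): PartKeyIndexSketch = backend match {
    case "tantivy" => new TantivyIndexSketch
    case _         => new LuceneIndexSketch
  }

  println(forConfig("tantivy").name) // tantivy
  println(forConfig("lucene").name)  // lucene
}
```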
… aggregation metric if applicable based on the given tags (filodb#1844)
Add a qualifier to imports to support slightly older Rust versions.
Tag the cargo metadata with the minimum tested version to give a better error.
Rust names the x86-64 architecture x86_64, while Java names it amd64.
This mismatch causes errors during library load because the two sides can't agree on the file path.

The fix is to normalize the Rust name into the Java name so the loader can locate the output
binaries.
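A small sketch of the normalization described above, assuming a simple lookup from Rust/LLVM architecture names to the names the JVM reports in os.arch; only the x86_64 to amd64 case comes from this commit, the aarch64 entry is an added assumption.

```scala
// Sketch of arch-name normalization between Rust build output and the JVM.
object ArchNameSketch extends App {
  def rustArchToJava(rustArch: String): String = rustArch match {
    case "x86_64"  => "amd64"   // Rust/LLVM name -> value Java reports in os.arch
    case "aarch64" => "aarch64" // commonly already agrees on the JVM (assumption)
    case other     => other     // pass through anything we don't recognize
  }

  println(rustArchToJava("x86_64")) // amd64
}
```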
If you're targeting an older Linux distro, the default glibc version
being linked against may be too high to produce runnable images.

cargo zigbuild supports specifying a specific glibc version to use
for a link target by appending ".<version>" to the target triple.  For
example, "x86_64-unknown-linux-gnu.2.17" will target v2.17.

For the most part this just works, but we need to strip this suffix when
looking for output binaries. This fix adds that logic.
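A sketch of the suffix handling, assuming the glibc version is appended as a trailing ".<version>" on the target triple as described above; the regex and object name are illustrative.

```scala
// Strip a cargo-zigbuild glibc suffix from a target triple before locating binaries.
object TargetTripleSketch extends App {
  // "x86_64-unknown-linux-gnu.2.17" -> "x86_64-unknown-linux-gnu"
  def stripGlibcSuffix(triple: String): String =
    triple.replaceFirst("""\.\d+(\.\d+)*$""", "")

  println(stripGlibcSuffix("x86_64-unknown-linux-gnu.2.17"))
  println(stripGlibcSuffix("x86_64-unknown-linux-gnu")) // unchanged
}
```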
feat(query): cherry-picking shard-level failover metrics
… segments (filodb#1864)

Columns in the column cache hold a reference to the mmapped file data that backs
the segment. These segments can be deleted during segment merging, but if a column
for that segment is in the cache, it prevents the mmap from closing and releasing RAM.

To fix this we subscribe to notifications of segment list changes and clear the
column cache when they occur, so stale segments can be reclaimed.
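A conceptual sketch of the fix, assuming a cache keyed by (segment, column) and a callback invoked on segment-list changes; this is a simplified model, not the actual Tantivy or FiloDB cache code.

```scala
// Simplified model: prune the column cache whenever the live segment list changes,
// so cached columns stop pinning mmapped files for segments that were merged away.
import scala.collection.mutable

object ColumnCacheSketch extends App {
  private val cache = mutable.Map.empty[(String, String), Array[Byte]]

  def put(segment: String, column: String, data: Array[Byte]): Unit =
    cache((segment, column)) = data

  // Invoked from a (hypothetical) subscription to segment-list change events.
  def onSegmentListChanged(liveSegments: Set[String]): Unit = {
    val stale = cache.keys.filter { case (segment, _) => !liveSegments.contains(segment) }.toSeq
    cache --= stale
  }

  put("seg-1", "labels", Array[Byte](1, 2, 3))
  put("seg-2", "labels", Array[Byte](4, 5, 6))
  onSegmentListChanged(Set("seg-2"))  // seg-1 was merged away; drop its cached columns
  println(cache.keys.toList)          // List((seg-2,labels))
}
```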
…1855)

* Fixed mismatched schema regarding fixedVectorLen.
* Do not compare against colIds on schema match.
---------

Co-authored-by: Yu Zhang <[email protected]>
…1855) (filodb#1865)

* Fixed mismatched schema regarding fixedVectorLen.
* Do not compare against colIds on schema match.
---------

Co-authored-by: Yu Zhang <[email protected]>
…l aggregated metric (filodb#1863)

* misc(query): increment a counter when the query plan is updated with the next-level aggregated metric

* Add a unit test to verify the metric is incremented as expected
)

indexValues was falling well behind Lucene for a few reasons:

1. We were copying results directly into Java objects, which incurred a lot of JNI back-and-forth overhead
2. When querying the entire index we were looking at docs instead of the reverse index, which increased
   the count of items to process

This PR does a few things:

1. Add perf benchmarks for the missing functions
2. Add a new IndexCollector trait that can be used to walk the index instead of the docs
3. Replace the JNI object usage in indexValues with byte-serialized data
4. Glue all these optimizations together.

With this, Tantivy is still a bit behind Lucene for this path, but it's almost 100x faster
than before.
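A simplified sketch of the optimization direction described above, assuming a toy reverse index and a simple length-prefixed encoding; it only illustrates walking value terms instead of documents and returning one serialized buffer instead of per-value JNI objects. None of the names below are the real Tantivy or FiloDB code.

```scala
// Toy illustration: walk a (value -> doc count) reverse index and return a single
// byte-serialized buffer. Encoding is length-prefixed UTF-8 (values assumed < 128 bytes).
import java.nio.charset.StandardCharsets

object IndexValuesSketch extends App {
  // field -> (value -> document count): stand-in for a term/reverse index
  val reverseIndex: Map[String, Map[String, Int]] =
    Map("app" -> Map("api" -> 120, "ingest" -> 45))

  def indexValuesBytes(field: String): Array[Byte] = {
    val values = reverseIndex.getOrElse(field, Map.empty).keys.toSeq.sorted
    values.flatMap { v =>
      val bytes = v.getBytes(StandardCharsets.UTF_8)
      Array(bytes.length.toByte) ++ bytes   // length prefix, then UTF-8 bytes
    }.toArray
  }

  println(indexValuesBytes("app").length)   // one compact buffer for the caller to decode
}
```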
…Experience logicalPlan update (filodb#1869)

* Support multiple aggregation rules for a single PromQL query.

Example of a query for which the hierarchical logical-plan update is now supported:

sum(metric1:::suffix1{}) + sum(metric2:::suffix2{})
Current behavior:
Metadata queries do not support filtering by metric type using the _type_ filter.
New behavior:
Added support for that feature. Note that only new documents in the persisted downsample index will have the type field; the index needs to be rebuilt if full support is needed. This can be skipped if metadata queries don't hit the downsample index.
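A hedged sketch of a metadata query restricted by the reserved _type_ filter. The case classes below are defined locally and only mirror the general shape of column filters; they are not FiloDB's actual query API, and the schema name is an example value.

```scala
// Illustrative filter shapes only; not FiloDB's query classes.
object TypeFilterSketch extends App {
  sealed trait Filter
  final case class Equals(value: String) extends Filter
  final case class ColumnFilter(column: String, filter: Filter)

  // "label values for series where app=api, restricted to the prom-counter schema"
  val filters = Seq(
    ColumnFilter("app", Equals("api")),
    ColumnFilter("_type_", Equals("prom-counter"))
  )

  println(filters)
}
```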
…rved field (filodb#1842)

Addition of the _type_ field to the index is now configurable per cluster, false by default for now.
Also, if the part key already has a _type_ field we don't index it, since it is a reserved field that we populate ourselves.
sherali42 and others added 15 commits October 15, 2024 14:55
When bootstrapping the raw index we skip over tracking items with invalid schemas,
signified by partId = -1. However, today we still index them, which can create query
errors later on like the following:

```
java.lang.IllegalStateException: This shouldn't happen since every document should have a partIdDv
	at filodb.core.memstore.PartIdCollector.collect(PartKeyLuceneIndex.scala:963)
	at org.apache.lucene.search.Weight$DefaultBulkScorer.scoreAll(Weight.java:305)
	at org.apache.lucene.search.Weight$DefaultBulkScorer.score(Weight.java:247)
	at org.apache.lucene.search.BulkScorer.score(BulkScorer.java:38)
	at org.apache.lucene.search.IndexSearcher.search(IndexSearcher.java:776)
	at org.apache.lucene.search.IndexSearcher.search(IndexSearcher.java:551)
	at filodb.core.memstore.PartKeyLuceneIndex.$anonfun$searchFromFilters$1(PartKeyLuceneIndex.scala:635)
	at filodb.core.memstore.PartKeyLuceneIndex.$anonfun$searchFromFilters$1$adapted(PartKeyLuceneIndex.scala:635)
	at filodb.core.memstore.PartKeyLuceneIndex.withNewSearcher(PartKeyLuceneIndex.scala:279)
	at filodb.core.memstore.PartKeyLuceneIndex.searchFromFilters(PartKeyLuceneIndex.scala:635)
	at filodb.core.memstore.PartKeyLuceneIndex.partIdsFromFilters(PartKeyLuceneIndex.scala:591)
	at filodb.core.memstore.TimeSeriesShard.labelValuesWithFilters(TimeSeriesShard.scala:1782)
```

This fix ensures that we don't index part keys we skip during bootstrap, so that the in-memory
shard and index are consistent with each other.
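A minimal sketch of the consistency rule this fix enforces, with illustrative types: part keys rejected during bootstrap (signalled by partId = -1 for an invalid schema) are also excluded from indexing so the shard and index agree.

```scala
// Illustrative types; not FiloDB's actual bootstrap or indexing code.
object BootstrapIndexSketch extends App {
  final case class PartKeyRecord(partId: Int, labels: Map[String, String])

  val recovered = Seq(
    PartKeyRecord(7, Map("app" -> "api")),
    PartKeyRecord(-1, Map("app" -> "bad-schema"))  // skipped by the shard during bootstrap
  )

  // Index only what the shard actually tracks; otherwise queries can hit
  // documents with no matching partId and fail, as in the stack trace above.
  val toIndex = recovered.filter(_.partId != -1)
  println(toIndex.map(_.partId))  // List(7)
}
```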
…erience (filodb#1873)

* fix(query): removing max/min aggregations from hierarchical query experience
Merge develop to integration 0.9.28
Cherry-pick RepeatValueVector optimization
amolnayak311 merged commit 2f31ec1 into filodb:main on Nov 13, 2024
1 check passed