Move regex opt to after lookup #2973

gouthamve · 2020-08-04T09:06:48Z

When we use the caching client (which is what is used in most cases), we
load the entire row (tableName+HashKey) irrespective of what the
rangeKey parameters are. This means that with the optimisation, we are
loading the same row multiple times and then operating on the same data.
This PR moves the optimisation to after the data is loaded, which should
be faster.
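
To make the reasoning concrete, here is a minimal, self-contained sketch. It is not the Cortex code: `countingStore`, `queryPerValue`, and `queryThenFilter` are hypothetical names, and the store only counts how often the full row gets loaded, assuming (as described above) that every query for a given tableName+HashKey loads the entire row whatever the rangeKey parameters are.

```go
package main

import "fmt"

// entry is a hypothetical index entry holding one label value.
type entry struct {
	labelValue string
}

// countingStore stands in for a caching index client. Assumption taken from
// the description: every query loads the whole row, regardless of the
// range-key filter that was requested.
type countingStore struct {
	row      []entry
	rowLoads int
}

// query loads the full row, then applies the optional value filter.
// An empty wantValue means "no filter" in this sketch.
func (s *countingStore) query(wantValue string) []entry {
	s.rowLoads++
	var out []entry
	for _, e := range s.row {
		if wantValue == "" || e.labelValue == wantValue {
			out = append(out, e)
		}
	}
	return out
}

// queryPerValue models the old placement: one lookup per value in the set.
func queryPerValue(s *countingStore, values []string) []entry {
	var out []entry
	for _, v := range values {
		out = append(out, s.query(v)...)
	}
	return out
}

// queryThenFilter models the new placement: one lookup, then an in-memory filter.
func queryThenFilter(s *countingStore, values []string) []entry {
	want := make(map[string]struct{}, len(values))
	for _, v := range values {
		want[v] = struct{}{}
	}
	var out []entry
	for _, e := range s.query("") {
		if _, ok := want[e.labelValue]; ok {
			out = append(out, e)
		}
	}
	return out
}

func main() {
	row := []entry{{"a"}, {"b"}, {"c"}, {"d"}}
	values := []string{"a", "b", "c"}

	before := &countingStore{row: row}
	after := &countingStore{row: row}

	fmt.Println("per value:  ", len(queryPerValue(before, values)), "entries,", before.rowLoads, "row loads")
	fmt.Println("then filter:", len(queryThenFilter(after, values)), "entries,", after.rowLoads, "row loads")
}
```

Both variants return the same entries in this sketch; only the number of row loads differs, which is the cost the PR is trying to avoid.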

Signed-off-by: Goutham Veeramachaneni [email protected]

What this PR does:

Which issue(s) this PR fixes:
Fixes #

Checklist

Tests updated
Documentation added
CHANGELOG.md updated - the order of entries should be [CHANGE], [FEATURE], [ENHANCEMENT], [BUGFIX]

gouthamve · 2020-08-04T09:22:45Z

➜  cortex git:(fix-2906) ✗ benchcmp old.txt new.txt 
benchmark                                     old ns/op     new ns/op     delta
BenchmarkParseIndexEntries500-8               342057        291739        -14.71%
BenchmarkParseIndexEntries2500-8              1718683       1516488       -11.76%
BenchmarkParseIndexEntries10000-8             6568014       6463280       -1.59%
BenchmarkParseIndexEntries50000-8             34212971      34284051      +0.21%
BenchmarkParseIndexEntriesRegexSet500-8       137045        89742         -34.52%
BenchmarkParseIndexEntriesRegexSet2500-8      647045        441737        -31.73%
BenchmarkParseIndexEntriesRegexSet10000-8     2772264       1881870       -32.12%
BenchmarkParseIndexEntriesRegexSet50000-8     14044719      10336336      -26.40%

benchmark                                     old allocs     new allocs     delta
BenchmarkParseIndexEntries500-8               1503           1504           +0.07%
BenchmarkParseIndexEntries2500-8              7503           7504           +0.01%
BenchmarkParseIndexEntries10000-8             30005          30007          +0.01%
BenchmarkParseIndexEntries50000-8             150007         150008         +0.00%
BenchmarkParseIndexEntriesRegexSet500-8       1503           1522           +1.26%
BenchmarkParseIndexEntriesRegexSet2500-8      7503           7522           +0.25%
BenchmarkParseIndexEntriesRegexSet10000-8     30005          30023          +0.06%
BenchmarkParseIndexEntriesRegexSet50000-8     150007         150024         +0.01%

benchmark                                     old bytes     new bytes     delta
BenchmarkParseIndexEntries500-8               96289         96320         +0.03%
BenchmarkParseIndexEntries2500-8              481975        482008        +0.01%
BenchmarkParseIndexEntries10000-8             1928507       1928559       +0.00%
BenchmarkParseIndexEntries50000-8             9606750       9606808       +0.00%
BenchmarkParseIndexEntriesRegexSet500-8       88222         88693         +0.53%
BenchmarkParseIndexEntriesRegexSet2500-8      441163        441627        +0.11%
BenchmarkParseIndexEntriesRegexSet10000-8     1764739       1765087       +0.02%
BenchmarkParseIndexEntriesRegexSet50000-8     8804033       8804335       +0.00%

Hrm, not such a big change. Interesting, but still worth having imo.
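
For reference, the delta column is the relative change from old to new. A minimal sketch of that arithmetic, with `deltaPercent` as a hypothetical helper name and the figures copied from the BenchmarkParseIndexEntriesRegexSet500-8 ns/op row above:

```go
package main

import "fmt"

// deltaPercent returns the relative change from oldV to newV, in percent.
// Negative values mean the new run was faster (or smaller).
func deltaPercent(oldV, newV float64) float64 {
	return (newV - oldV) / oldV * 100
}

func main() {
	// ns/op figures from the BenchmarkParseIndexEntriesRegexSet500-8 row.
	fmt.Printf("%+.2f%%\n", deltaPercent(137045, 89742)) // prints "-34.52%"
}
```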

jtlisi

LGTM

pracucci

Is it worth a CHANGELOG entry? If so, could you rebase on master and add the CHANGELOG entry, please?

pstibrany · 2020-08-10T12:04:55Z

pkg/chunk/chunk_store.go

Several comments here:

Nit: the if conditions can be merged into a single if statement.

No unit test covers the continue branch. I think we should cover it with tests.

If labelValue is one of the matched values, we still run the regex on it on the next line. We can skip matcher.Matches in this case and save some CPU cycles.

pstibrany · 2020-08-20T06:25:01Z

This looks useful, and would be great to merge. @gouthamve would you find some time to look at the comments and get the PR into mergeable state?

fixes cortexproject#2906

Signed-off-by: Goutham Veeramachaneni <[email protected]>

pstibrany

LGTM, thanks!

pull-request-size bot added the size/L label Aug 4, 2020

gouthamve marked this pull request as ready for review August 4, 2020 09:29

jtlisi approved these changes Aug 4, 2020

pracucci reviewed Aug 5, 2020

gouthamve force-pushed the fix-2906 branch from c430e12 to ae1ea17 Compare August 6, 2020 08:42

pstibrany reviewed Aug 10, 2020

gouthamve force-pushed the fix-2906 branch 2 times, most recently from 49a7cb6 to 8e64367 Compare August 25, 2020 08:44

gouthamve added 4 commits August 25, 2020 10:46

Add benchmark

98e2b15

Signed-off-by: Goutham Veeramachaneni <[email protected]>

Add changelog entry

2f0d684

Signed-off-by: Goutham Veeramachaneni <[email protected]>

Address feedback

50fc7f5

Signed-off-by: Goutham Veeramachaneni <[email protected]>

gouthamve force-pushed the fix-2906 branch from 8e64367 to 50fc7f5 Compare August 25, 2020 08:47

pstibrany approved these changes Aug 25, 2020

Merge branch 'master' into fix-2906

2a4cd1d

pstibrany merged commit 5ea4cc6 into cortexproject:master Aug 25, 2020
