Reduce memory by multiple GBs with many classic queues #6684

ansd · 2022-12-15T16:37:03Z

Store directory names as binary instead of string.

This commit saves >1GB of memory per 100,000 classic queues v2. With longish node names, the memory savings are even much higher.

This commit is especially a prerequisite for scalable MQTT where every subscribing MQTT connection creates its own classic queue.

So, with 3 million MQTT subscribers, this commit saves >30GB of memory.

This commits stores file names as binaries and converts back to file:filename() when passed to file API functions. This is to reduce risk of breaking behaviour for path names containing unicode chars on certain platforms.

Prior to this commit, a classic queue process state contained many lists:

              {passthrough,rabbit_variable_queue,
                           {vqstate,{0,[],[]},
                                    {0,[],[]},
                                    {delta,undefined,0,0,undefined},
                                    {0,[],[]},
                                    {0,[],[]},
                                    0,0,#{},#{},undefined,rabbit_classic_queue_index_v2,
                                    {qi,{resource,<<"/">>,queue,
                                                  <<"mqtt-subscription-nuc_bench_sub_23092_1622527410qos1">>},
                                        "/home/david/scratch/rabbit/test/rabbit@nuc/mnesia/rabbit@nuc/msg_stores/vhosts/628WB79CIFDYO9LJI6DKMI09L/queues/8Q5F1PPMH3BD1C7F6CQXGLGME",
                                        #{},0,#{},#{},#{},#{},
                                        #Fun<rabbit_variable_queue.2.85400881>,
                                        #Fun<rabbit_variable_queue.3.85400881>},
                                    {qs,"/home/david/scratch/rabbit/test/rabbit@nuc/mnesia/rabbit@nuc/msg_stores/vhosts/628WB79CIFDYO9LJI6DKMI09L/queues/8Q5F1PPMH3BD1C7F6CQXGLGME",
                                        undefined,64,#{},0,#{},undefined,undefined},
                                    {{client_msstate,<0.712.0>,
                                                     <<189,37,176,225,122,247,125,208,243,230,50,155,167,150,
                                                       206,58>>,
                                                     #{},
                                                     {state,#Ref<0.1127660195.458620929.200967>,
                                                            "/home/david/scratch/rabbit/test/rabbit@nuc/mnesia/rabbit@nuc/msg_stores/vhosts/628WB79CIFDYO9LJI6DKMI09L/msg_store_persistent"},
                                                     rabbit_msg_store_ets_index,
                                                     "/home/david/scratch/rabbit/test/rabbit@nuc/mnesia/rabbit@nuc/msg_stores/vhosts/628WB79CIFDYO9LJI6DKMI09L/msg_store_persistent",
                                                     <0.715.0>,#Ref<0.1127660195.458620929.200968>,
                                                     #Ref<0.1127660195.458620929.200965>,
                                                     #Ref<0.1127660195.458620929.200969>,
                                                     #Ref<0.1127660195.458620929.200970>,
                                                     {4000,800}},
                                     {client_msstate,<0.708.0>,
                                                     <<210,61,62,233,91,44,113,162,15,92,141,3,168,202,67,57>>,
                                                     #{},
                                                     {state,#Ref<0.1127660195.458620929.200936>,
                                                            "/home/david/scratch/rabbit/test/rabbit@nuc/mnesia/rabbit@nuc/msg_stores/vhosts/628WB79CIFDYO9LJI6DKMI09L/msg_store_transient"},
                                                     rabbit_msg_store_ets_index,
                                                     "/home/david/scratch/rabbit/test/rabbit@nuc/mnesia/rabbit@nuc/msg_stores/vhosts/628WB79CIFDYO9LJI6DKMI09L/msg_store_transient",
                                                     <0.709.0>,#Ref<0.1127660195.458620929.200937>,
                                                     #Ref<0.1127660195.458620929.200934>,
                                                     #Ref<0.1127660195.458620929.200938>,
                                                     #Ref<0.1127660195.458620929.200939>,
                                                     {4000,800}}},
                                    true,0,4096,0,0,0,0,0,0,infinity,0,0,0,0,0,0,
                                    {rates,0.0,0.0,0.0,0.0,-576460124199912220},
                                    #{},#{},#{},#{},0,0,0,0,4096,default,2,#{},<<"/">>,false}},

Alternatives to the implementation in this commit:

Store common directory list prefix only once (e.g. put it into persistent_term) and store per queue directory names in ETS.
Use file:filename_all() instead of file:filename() and pass binaries to the file module functions. However this might be brittle on some platforms since these binaries are interpreted as "raw filenames". Using raw filenames requires more changes to classic queues which we want to avoid to reduce risk.

The downside of the implemenation in this commit is that the binary gets converted to a list sometimes.
This happens whenever a file is flushed or a new file gets created for example.

Following perf tests did not show any regression in performance:

java -jar target/perf-test.jar -s 10 -x 1 -y 0 -u q -f persistent -z 30
java -jar target/perf-test.jar -s 10000 -x 1 -y 0 -u q -f persistent -z 30
java -jar target/perf-test.jar -s 10 -x 100 -qp q%d -qpf 1 -qpt 100 -y 0 -f persistent -z 60 -c 1000

Furthermore rabbit_file did not show up in the CPU flame graphs either.
It showed up only once with 0.1% in a page fault flame graph.

deps/rabbit/src/rabbit_msg_store.erl

lhoguin · 2022-12-16T09:45:08Z

The approach seems sensible, but I'll echo what JP said, careful about Windows and weird file names. Let's test this thoroughly.

Store directory names as binary instead of string. This commit saves >1GB of memory per 100,000 classic queues v2. With longish node names, the memory savings are even much higher. This commit is especially a prerequisite for scalalbe MQTT where every subscribing MQTT connection creates its own classic queue. So, with 3 million MQTT subscribers, this commit saves >30 GB of memory. This commits stores file names as binaries and converts back to file:filename() when passed to file API functions. This is to reduce risk of breaking behaviour for path names containing unicode chars on certain platforms. Alternatives to the implementation in this commit: 1. Store common directory list prefix only once (e.g. put it into persistent_term) and store per queue directory names in ETS. 2. Use file:filename_all() instead of file:filename() and pass binaries to the file module functions. However this might be brittle on some platforms since these binaries are interpreted as "raw filenames". Using raw filenames requires more changes to classic queues which we want to avoid to reduce risk. The downside of the implemenation in this commit is that the binary gets converted to a list sometimes. This happens whenever a file is flushed or a new file gets created for example. Following perf tests did not show any regression in performance: ``` java -jar target/perf-test.jar -s 10 -x 1 -y 0 -u q -f persistent -z 30 java -jar target/perf-test.jar -s 10000 -x 1 -y 0 -u q -f persistent -z 30 java -jar target/perf-test.jar -s 10 -x 100 -qp q%d -qpf 1 -qpt 100 -y 0 -f persistent -z 60 -c 1000 ``` Furthermore `rabbit_file` did not show up in the CPU flame graphs either.

lukebakken · 2023-01-04T16:52:38Z

Test process for Windows:

Build zip file, extract locally
Create the C:\ProgramData\RabbitMQ Sérvér directory
Start powershell, cd to extracted RabbitMQ build
Set the following $env:RABBITMQ_BASE='C:\ProgramData\RabbitMQ Sérvér'
Create C:\ProgramData\RabbitMQ Sérvér\rabbitmq-env-conf.bat with this content:
```
set LOG=debug
set RABBITMQ_ALLOW_INPUT=true
```
Start RabbitMQ, in management UI, create a v1 and v2 queue

Start two instances of PerfTest, like this:

mvn exec:java '-Dexec.mainClass=com.rabbitmq.perf.PerfTest' '-Dexec.args=--predeclared --queue v2'

Note that publishing works fine
Kill erlang Stop-Process -Name werl
Restart RabbitMQ
PerfTest reconnects, RabbitMQ starts normally

Reduce memory by multiple GBs with many classic queues (backport #6684)

mergify bot added the bazel label Dec 15, 2022

ansd force-pushed the cqv2-memory branch 2 times, most recently from 6ad153c to 838c9ef Compare December 15, 2022 21:55

ansd marked this pull request as ready for review December 15, 2022 22:10

ansd requested a review from lhoguin December 15, 2022 22:10

michaelklishin added the backport-v3.11.x label Dec 15, 2022

michaelklishin added this to the 3.11.6 milestone Dec 15, 2022

ansd mentioned this pull request Dec 16, 2022

Fix potential crash if using UTF8 directory names. rabbitmq/osiris#114

Merged

lhoguin reviewed Dec 16, 2022

View reviewed changes

deps/rabbit/src/rabbit_msg_store.erl Show resolved Hide resolved

ansd added 2 commits January 4, 2023 07:59

Test Unicode char in queue names

b0d0308

lukebakken force-pushed the cqv2-memory branch from acef343 to b0d0308 Compare January 4, 2023 15:59

lukebakken approved these changes Jan 4, 2023

View reviewed changes

michaelklishin merged commit 3089f33 into main Jan 4, 2023

michaelklishin deleted the cqv2-memory branch January 4, 2023 17:00

mergify bot mentioned this pull request Jan 4, 2023

Reduce memory by multiple GBs with many classic queues (backport #6684) #6795

Merged

michaelklishin added a commit that referenced this pull request Jan 4, 2023

Merge pull request #6795 from rabbitmq/mergify/bp/v3.11.x/pr-6684

b3a175e

Reduce memory by multiple GBs with many classic queues (backport #6684)

ansd mentioned this pull request Jan 17, 2023

Native MQTT #5895

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Reduce memory by multiple GBs with many classic queues #6684

Reduce memory by multiple GBs with many classic queues #6684

Uh oh!

ansd commented Dec 15, 2022 •

edited

Loading

Uh oh!

Uh oh!

lhoguin commented Dec 16, 2022

Uh oh!

lukebakken commented Jan 4, 2023 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Reduce memory by multiple GBs with many classic queues #6684

Reduce memory by multiple GBs with many classic queues #6684

Uh oh!

Conversation

ansd commented Dec 15, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

lhoguin commented Dec 16, 2022

Uh oh!

lukebakken commented Jan 4, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

ansd commented Dec 15, 2022 •

edited

Loading

lukebakken commented Jan 4, 2023 •

edited

Loading