Read only uses #9237

alphaprinz · 2025-10-07T16:16:42Z

Describe the Problem

Use new 'read only' pool (connected to cnpg replicated read-only cluster) in order to alleviate load from main, writable pg cluster.

Explain the Changes

Move some background queries to read-only pool:
-db cleancer
-object reclaimer's first find
-scrubber's first find

Issues: Fixed #xxx / Gap #xxx

Testing Instructions:

Doc added/updated
Tests added

Summary by CodeRabbit

New Features
- Query API accepts an optional preferred-pool parameter to route individual queries to a specific database pool.
Improvements
- Read operations now prefer read-only replicas to distribute load.
- Queries fall back and retry on the primary pool when a preferred pool is unavailable, improving resilience.
- New configuration toggle to enable/disable read-only replica usage.
Observability
- Added debug logging for query host selection and retry behavior.

coderabbitai · 2025-10-07T16:16:52Z

Walkthrough

Adds per-query pool selection and read-only replica usage: executeSQL accepts preferred_pool; MDStore read operations pass { preferred_pool: 'read_only' }; Postgres client routes queries to named pools, falls back to default, and may retry failed read queries on the default pool. A new config flag POSTGRES_USE_READ_ONLY controls read-only host usage.

Changes

Cohort / File(s)	Summary
TypeScript interface `src/sdk/nb.d.ts`	`DBCollection.executeSQL` options now include optional `preferred_pool?: string`.
MDStore read-query sites `src/server/object_services/md_store.js`	Multiple read/query operations updated to pass `{ preferred_pool: 'read_only' }` (`find_unreclaimed_objects`, `find_deleted_objects`, `iterate_all_chunks`, `find_deleted_chunks`, `has_any_blocks_for_chunk`, `has_any_parts_for_chunk`).
Postgres client & pool routing `src/util/postgres_client.js`	`executeSQL` and call sites accept `options.preferred_pool`; `get_pool` supports named pool with fallback to default; pools carry `retry_with_default_pool`; `_do_query` can retry on default pool when configured; read-only host resolution respects `POSTGRES_USE_READ_ONLY`; added debug logging and propagated retry flag into pool instances.
Config `config.js`	New exported flag `POSTGRES_USE_READ_ONLY` (default `true`) added to control read-only replica usage.

Sequence Diagram(s)

sequenceDiagram
    participant Caller
    participant MDStore
    participant PostgresTable
    participant PoolMgr as Pool Manager
    participant ReadPool as read_only
    participant DefaultPool as default

    Caller->>MDStore: read request
    MDStore->>PostgresTable: executeSQL(query, params, {preferred_pool:'read_only'})
    PostgresTable->>PoolMgr: get_pool('read_only')
    alt read_only exists
        PoolMgr-->>PostgresTable: ReadPool
        PostgresTable->>ReadPool: run query
        alt success
            ReadPool-->>PostgresTable: result
        else error and ReadPool.instance.retry_with_default_pool
            PostgresTable->>DefaultPool: retry query on default (logged)
            DefaultPool-->>PostgresTable: result/error
        end
    else read_only missing
        PoolMgr-->>PostgresTable: DefaultPool (fallback)
        PostgresTable->>DefaultPool: run query
        DefaultPool-->>PostgresTable: result
    end
    PostgresTable-->>MDStore: result/error
    MDStore-->>Caller: return result/error

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~25 minutes

Potential attention areas:

src/util/postgres_client.js — verify get_pool fallback behavior, propagation of retry_with_default_pool, _do_query retry logic and added debug logging.
src/server/object_services/md_store.js — confirm only read paths were changed and the intended methods pass preferred_pool.
src/sdk/nb.d.ts and JSDoc — ensure declaration/signature matches runtime executeSQL options.

Possibly related PRs

performance - read only pool POC #9197 — Similar changes to src/util/postgres_client.js for read_only pool handling and pool selection.

Suggested labels

size/L

Suggested reviewers

dannyzaken
jackyalbo

Pre-merge checks and finishing touches

❌ Failed checks (1 warning, 1 inconclusive)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 0.00% which is insufficient. The required threshold is 80.00%.	You can run `@coderabbitai generate docstrings` to improve docstring coverage.
Title check	❓ Inconclusive	The title 'Read only uses' is vague and generic, failing to clearly specify what is being modified or the primary change being implemented.	Use a more descriptive title such as 'Route background queries to read-only PostgreSQL pool' or 'Add read-only pool routing for db cleaner and scrubber operations' to clearly convey the main change.

✅ Passed checks (1 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.

✨ Finishing touches

📝 Generate docstrings

🧪 Generate unit tests (beta)

Create PR with unit tests
Post copyable unit tests in a comment

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

Actionable comments posted: 1

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (2)

src/util/postgres_client.js (1)
632-642: Review the fallback behavior in get_pool.

The fallback logic on lines 636-637 silently falls back to the default pool when a requested pool key is invalid:
if (key && key !== this.pool_key) {
    return this.get_pool();
}
Concerns:

Silent fallback masks configuration issues: If code requests a non-existent pool (e.g., typo like 'read_onyl'), it silently uses the default pool instead of failing fast.

Misleading error message: After the fallback attempt, line 639 throws an error mentioning the requested key, but the fallback might have already succeeded with this.pool_key.

Recommendation:
Consider removing the fallback or adding a warning log when falling back, so configuration issues are visible:
 get_pool(key = this.pool_key) {
     const pool = this.client.get_pool(key);
     if (!pool) {
-        //if original get_pool was no for the default this.pool_key, try also this.pool_key
+        // If original get_pool was not for the default this.pool_key, try also this.pool_key
         if (key && key !== this.pool_key) {
+            dbg.warn(`Pool ${key} not found, falling back to default pool ${this.pool_key}`);
             return this.get_pool();
         }
         throw new Error(`The postgres clients pool ${key} disconnected`);
     }
     return pool;
 }
Alternatively, remove the fallback entirely to fail fast on invalid pool names.
src/server/object_services/md_store.js (1)

814-835: Route mapReduce operations to read-only pool for consistent replica distribution.

The list_objects and list_object_versions methods use mapReduce for queries with delimiters (lines 814-824 and 864-874), but these calls don't pass preferred_pool to route to the read-only pool. While the find() operations support routing via preferred_pool: 'read_only' (as seen elsewhere in md_store.js), the mapReduce calls should be consistent.

To fix:

Update mapReduceListObjects() and mapReduceAggregate() in postgres_client.js to extract options.preferred_pool and pass it to this.single_query() (currently they call this.single_query(mr_q) without the pool parameter)

Pass preferred_pool: 'read_only' in the options object when calling this._objects.mapReduce() at lines 814-824 and 864-874 in md_store.js

📜 Review details

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 2367740 and d699655.

📒 Files selected for processing (4)

src/sdk/nb.d.ts (1 hunks)
src/server/bg_services/db_cleaner.js (1 hunks)
src/server/object_services/md_store.js (5 hunks)
src/util/postgres_client.js (5 hunks)

🧰 Additional context used

🧠 Learnings (1)

📚 Learning: 2025-08-08T13:12:46.728Z

Learnt from: naveenpaul1
PR: noobaa/noobaa-core#9182
File: src/upgrade/upgrade_scripts/5.20.0/remove_mongo_pool.js:9-17
Timestamp: 2025-08-08T13:12:46.728Z
Learning: In upgrade script src/upgrade/upgrade_scripts/5.20.0/remove_mongo_pool.js for noobaa-core, rely on structural detection (e.g., pool.mongo_info, and resource_type === 'INTERNAL') with name-prefix fallback for removing legacy mongo/internal pools, instead of depending solely on config.INTERNAL_STORAGE_POOL_NAME or config.DEFAULT_POOL_NAME. Handle multi-system stores and remove all matching pools in one change.

Applied to files:

src/server/object_services/md_store.js

🧬 Code graph analysis (1)

src/server/bg_services/db_cleaner.js (1)

src/server/object_services/md_store.js (3)

dbg (14-14)

P (13-13)

config (28-28)

🔇 Additional comments (10)

src/server/object_services/md_store.js (6)

779-779: LGTM! Read-only pool routing for unreclaimed objects.

Correctly routes the find operation to the read-only pool, which aligns with the PR objective of offloading background read queries from the main cluster.

1127-1127: LGTM! Read-only pool routing for deleted objects query.

The SQL query correctly uses the read-only pool via the options parameter, consistent with the pattern established in this PR.

1603-1603: LGTM! Read-only pool routing for chunk iteration.

Appropriately routes the chunk iteration query to the read-only pool for the scrubber's background operations.

1778-1779: LGTM! Read-only pool routing for deleted chunks.

Correctly adds preferred_pool: 'read_only' to the find options for deleted chunks cleanup query.

1787-1788: LGTM! Read-only pool routing for chunk existence checks.

Both has_any_blocks_for_chunk and has_any_parts_for_chunk correctly route to the read-only pool. These existence checks are read-only operations suitable for read replicas.

Also applies to: 1796-1797

2039-2040: LGTM! Read-only pool routing for deleted blocks.

Correctly routes the deleted blocks query to the read-only pool for background cleanup operations.

src/util/postgres_client.js (3)

252-252: Helpful debug logging for pool routing.

The added debug log that includes pg_client.options?.host will be useful for verifying which pool (and host) is being used for each query, aiding in troubleshooting read-only pool routing.

726-733: LGTM! Clean implementation of preferred_pool in executeSQL.

The JSDoc and implementation correctly support routing queries to a preferred pool. The pattern of passing options.preferred_pool to get_pool() is clean and extensible.

937-937: LGTM! Consistent preferred_pool support in find and findOne.

Both methods correctly pass options.preferred_pool to get_pool(), enabling read-only pool routing for these common query methods. The implementation is consistent with the executeSQL pattern.

Also applies to: 954-954

src/sdk/nb.d.ts (1)

769-769: LGTM! TypeScript interface correctly updated.

The addition of preferred_pool?: string to the executeSQL options parameter aligns with the implementation in postgres_client.js and properly documents the new functionality for TypeScript consumers.

src/server/bg_services/db_cleaner.js

coderabbitai

Actionable comments posted: 0

🧹 Nitpick comments (1)

src/util/postgres_client.js (1)
635-638: Add warning log when falling back to default pool.

The silent fallback could mask misconfigurations. If the read_only pool is expected but unavailable, queries will silently route to the default pool, defeating the load distribution purpose.

Apply this diff to add visibility:
         //if original get_pool was not for the default this.pool_key, try also this.pool_key
         if (key && key !== this.pool_key) {
+            dbg.warn(`Pool ${key} not available for table ${this.name}, falling back to default pool ${this.pool_key}`);
             return this.get_pool();
         }

📜 Review details

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between d699655 and af555fb.

📒 Files selected for processing (1)

src/util/postgres_client.js (5 hunks)

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (3)

GitHub Check: Build Noobaa Image
GitHub Check: run-package-lock-validation
GitHub Check: run-jest-unit-tests

🔇 Additional comments (4)

src/util/postgres_client.js (4)

251-252: LGTM! Helpful debug logging for pool routing verification.

The host logging will be useful for confirming that queries are being routed to the correct pool (read-only vs. default).

954-954: LGTM! Consistent with find implementation.

The preferred_pool support in findOne is consistent with the find implementation. Same eventual consistency considerations apply.

937-937: LGTM. Routing find queries to read replicas is appropriate for background maintenance operations.

The callers using preferred_pool: 'read_only' are background operations (find_unreclaimed_objects, find_deleted_objects, chunk/block existence checks) where eventual consistency is acceptable. These align with the stated use cases (object reclaimer, scrubber) and do not impact critical read paths.

726-733: Review comment is incorrect—executeSQL is designed for standalone queries and not used within transactions.

The code shows that executeSQL and PgTransaction have separate execution paths:

PgTransaction (lines 335-378) uses its own query() method with a dedicated pg_client connection for transactional queries

executeSQL (line 731) gets its own pool connection via pool.connect() for standalone queries

All executeSQL call sites in md_store.js are standalone queries, never invoked within transaction contexts

The preferred_pool option is appropriate for directing standalone queries to specific pools (e.g., read-only)

There is no mixing of transaction and standalone query execution paths, and no risk of routing queries to incorrect pools during transactions.

Likely an incorrect or invalid review comment.

dannyzaken

In general, this looks good. A few general comments\questions

We should consider the case where the secondary is down. Maybe we should fallback to the primary?
We should make sure that the rest of the code behaves nicely if the secondary returns stale data (e.g., object_mds that were marked as reclaimed but were not reflected yet in the secondary). From a quick look at the code, it looks ok, but I didn't look too deeply into it.

dannyzaken · 2025-10-29T11:53:23Z

@alphaprinz I also think we should have a config to disable this feature and revert to the old behavior. It can be done in a different PR, so approving this in the meantime

coderabbitai

Actionable comments posted: 1

🧹 Nitpick comments (2)

src/util/postgres_client.js (2)
639-642: Fallback logic is correct but comment could be clearer.

The fallback mechanism works as intended, but the comment "try also this.pool_key" could be more explicit that it's falling back to the table's default pool key.

Consider this clearer comment:
-            //if original get_pool was not for the default this.pool_key, try also this.pool_key
+            // If requested pool doesn't exist and wasn't the table's default pool, fallback to table's default pool
             if (key && key !== this.pool_key) {
                 return this.get_pool();
             }
1510-1514: Consider implementing the requested configuration option.

Based on the PR comment from dannyzaken, there's a request for a configuration option to disable the new read-only behavior and revert to previous behavior. While approved for a separate PR, consider whether adding a simple config flag now (e.g., config.ENABLE_READ_ONLY_POOL) would prevent future breaking changes.

Would you like me to generate a script to check how the preferred_pool: 'read_only' option is being used across the codebase?

📜 Review details

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between af555fb and 2f4a70a.

📒 Files selected for processing (3)

src/sdk/nb.d.ts (1 hunks)
src/server/object_services/md_store.js (5 hunks)
src/util/postgres_client.js (8 hunks)

🚧 Files skipped from review as they are similar to previous changes (2)

src/sdk/nb.d.ts
src/server/object_services/md_store.js

🧰 Additional context used

🧠 Learnings (4)

📚 Learning: 2025-08-08T13:12:46.728Z

Learnt from: naveenpaul1
Repo: noobaa/noobaa-core PR: 9182
File: src/upgrade/upgrade_scripts/5.20.0/remove_mongo_pool.js:9-17
Timestamp: 2025-08-08T13:12:46.728Z
Learning: In upgrade script src/upgrade/upgrade_scripts/5.20.0/remove_mongo_pool.js for noobaa-core, rely on structural detection (e.g., pool.mongo_info, and resource_type === 'INTERNAL') with name-prefix fallback for removing legacy mongo/internal pools, instead of depending solely on config.INTERNAL_STORAGE_POOL_NAME or config.DEFAULT_POOL_NAME. Handle multi-system stores and remove all matching pools in one change.

Applied to files:

src/util/postgres_client.js

📚 Learning: 2025-08-11T06:12:12.318Z

Learnt from: naveenpaul1
Repo: noobaa/noobaa-core PR: 9182
File: src/upgrade/upgrade_scripts/5.20.0/remove_mongo_pool.js:6-22
Timestamp: 2025-08-11T06:12:12.318Z
Learning: In the noobaa-core upgrade script src/upgrade/upgrade_scripts/5.20.0/remove_mongo_pool.js, bucket migration from the internal mongo pool to a new default pool is planned to be handled in separate future PRs with comprehensive testing, rather than being included directly in the pool removal script.

Applied to files:

src/util/postgres_client.js

📚 Learning: 2025-08-08T13:10:36.141Z

Learnt from: naveenpaul1
Repo: noobaa/noobaa-core PR: 9182
File: src/server/system_services/pool_server.js:1314-1317
Timestamp: 2025-08-08T13:10:36.141Z
Learning: In src/server/system_services/pool_server.js (and config usage), the constant config.INTERNAL_STORAGE_POOL_NAME has been removed from the system. Future logic should not depend on this constant and should instead use config.DEFAULT_POOL_NAME or structural markers (e.g., pool.resource_type === 'INTERNAL' or pool.mongo_info) to identify internal/mongo pools.

Applied to files:

src/util/postgres_client.js

📚 Learning: 2025-08-08T13:13:42.192Z

Learnt from: naveenpaul1
Repo: noobaa/noobaa-core PR: 9182
File: src/server/system_services/pool_server.js:1304-1307
Timestamp: 2025-08-08T13:13:42.192Z
Learning: In noobaa-core, do not rely on config.INTERNAL_STORAGE_POOL_NAME to detect backingstore pools. In src/server/system_services/pool_server.js, prefer checking pool.hosts_pool_info.backingstore or pool.cloud_pool_info.backingstore; fallback to prefix-matching the pool name against config.DEFAULT_POOL_NAME ('backingstores') to handle potential suffixes.

Applied to files:

src/util/postgres_client.js

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (3)

GitHub Check: run-package-lock-validation
GitHub Check: Build Noobaa Image
GitHub Check: run-jest-unit-tests

🔇 Additional comments (6)

src/util/postgres_client.js (6)

252-252: LGTM: Debug logging added for pool routing.

The debug log provides useful information for troubleshooting pool selection and retry behavior.

730-730: LGTM: Type definition updated correctly.

The preferred_pool option is properly documented as optional in the typedef.

737-737: LGTM: Pool selection correctly uses preferred_pool option.

The implementation properly routes queries to the preferred pool when specified.

941-941: LGTM: Pool selection propagated correctly in find operations.

Both find() and findOne() correctly pass the preferred_pool option to route queries appropriately.

Also applies to: 958-958

1753-1754: LGTM: Retry flag correctly propagated to pool instance.

The flag is properly propagated from pool configuration to the pool instance, making it available in _do_query().

1510-1514: Configuration is correct—no infinite recursion risk.

The retry_with_default_pool: true setting for the read_only pool is properly configured. The default pool (lines 1502-1505) does not have this flag, preventing infinite recursion. The read_only pool will safely retry failed queries using the default pool as intended.

coderabbitai · 2025-10-31T23:19:52Z

src/util/postgres_client.js

+        if (pg_client.retry_with_default_pool) {
+            dbg.warn("retrying with default pool. q = ", q);
+            return _do_query(PostgresClient.instance().get_pool('default'), q, transaction_counter);
+        }


⚠️ Potential issue | 🟠 Major

Risk of infinite recursion if default pool also has retry flag set.

If the default pool's retry_with_default_pool is set to true, this creates infinite recursion. The code should either:

Explicitly pass retry_with_default_pool: false when getting the default pool, or

Add a guard to prevent retrying if already using the default pool.

Additionally, the original error details are lost before the retry. Consider logging the original error with more context.

Apply this diff to prevent infinite recursion:

- if (pg_client.retry_with_default_pool) { + if (pg_client.retry_with_default_pool && pg_client !== PostgresClient.instance().get_pool('default')) { dbg.warn("retrying with default pool. q = ", q); return _do_query(PostgresClient.instance().get_pool('default'), q, transaction_counter); }

Or alternatively, ensure the default pool instance never has retry_with_default_pool set:

if (pg_client.retry_with_default_pool) { - dbg.warn("retrying with default pool. q = ", q); + dbg.warn("retrying with default pool. original error = ", err.message, ", q = ", q); - return _do_query(PostgresClient.instance().get_pool('default'), q, transaction_counter); + const default_pool = PostgresClient.instance().get_pool('default'); + // Ensure we don't retry infinitely if default pool also has retry flag + const safe_pool = { ...default_pool, retry_with_default_pool: false }; + return _do_query(safe_pool, q, transaction_counter); }

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

if (pg_client.retry_with_default_pool) {

dbg.warn("retrying with default pool. q = ", q);

return _do_query(PostgresClient.instance().get_pool('default'), q, transaction_counter);

}

if (pg_client.retry_with_default_pool && pg_client !== PostgresClient.instance().get_pool('default')) {

dbg.warn("retrying with default pool. q = ", q);

return _do_query(PostgresClient.instance().get_pool('default'), q, transaction_counter);

}

🤖 Prompt for AI Agents

In src/util/postgres_client.js around lines 271 to 274, the retry branch unconditionally calls _do_query against the default pool which can cause infinite recursion if the default pool also has retry_with_default_pool=true and loses the original error context; change the retry to (a) call get_pool('default') with retry_with_default_pool explicitly set to false or (b) add a guard that detects when the current pool is already the default and skip retrying, and also log the original error before attempting the retry so the original error details are preserved for debugging.

Signed-off-by: Amit Prinz Setter <[email protected]>

coderabbitai

Actionable comments posted: 1

♻️ Duplicate comments (1)

src/util/postgres_client.js (1)

271-274: Infinite recursion risk remains unaddressed.

The retry logic still lacks a guard to prevent infinite recursion if the default pool is configured with retry_with_default_pool: true. While the current configuration doesn't set this flag on the default pool, the code remains fragile and could break if the pool configuration changes.

As previously suggested, consider either:

Explicitly ensure the default pool never retries: const default_pool = PostgresClient.instance().get_pool('default'); default_pool.retry_with_default_pool = false;

Add a guard: if (pg_client.retry_with_default_pool && pg_client !== PostgresClient.instance().get_pool('default'))

Additionally, the original error is not logged before retry, making debugging difficult.

🧹 Nitpick comments (1)

src/util/postgres_client.js (1)
1528-1532: Read-only host handling implements requested disable option.

The logic correctly honors POSTGRES_USE_READ_ONLY to disable read-only pool usage as requested in PR comments. However, when POSTGRES_USE_READ_ONLY is true but no read-only host is configured (no POSTGRES_HOST_RO or POSTGRES_HOST_RO_PATH), the read_only pool silently connects to the read-write database (line 1528 fallback). Consider logging a warning when this fallback occurs to avoid confusion.

Example:
 let host_ro = process.env.POSTGRES_HOST_RO || fs_utils.try_read_file_sync(process.env.POSTGRES_HOST_RO_PATH) || host;
+if (config.POSTGRES_USE_READ_ONLY && host_ro === host && (process.env.POSTGRES_HOST_RO || process.env.POSTGRES_HOST_RO_PATH)) {
+    dbg.warn('POSTGRES_USE_READ_ONLY is enabled but no read-only host configured, falling back to read-write host');
+}
 //if POSTGRES_USE_READ_ONLY is off, switch to regular host
 if (!config.POSTGRES_USE_READ_ONLY) {
     host_ro = host;
 }

📜 Review details

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between dc83dbd and 5be01b0.

📒 Files selected for processing (4)

config.js (1 hunks)
src/sdk/nb.d.ts (1 hunks)
src/server/object_services/md_store.js (5 hunks)
src/util/postgres_client.js (9 hunks)

🚧 Files skipped from review as they are similar to previous changes (2)

src/server/object_services/md_store.js
src/sdk/nb.d.ts

🧰 Additional context used

🧠 Learnings (5)

📓 Common learnings

Learnt from: naveenpaul1
Repo: noobaa/noobaa-core PR: 9182
File: src/server/system_services/pool_server.js:1304-1307
Timestamp: 2025-08-08T13:13:42.192Z
Learning: In noobaa-core, do not rely on config.INTERNAL_STORAGE_POOL_NAME to detect backingstore pools. In src/server/system_services/pool_server.js, prefer checking pool.hosts_pool_info.backingstore or pool.cloud_pool_info.backingstore; fallback to prefix-matching the pool name against config.DEFAULT_POOL_NAME ('backingstores') to handle potential suffixes.

📚 Learning: 2025-08-08T13:12:46.728Z

Learnt from: naveenpaul1
Repo: noobaa/noobaa-core PR: 9182
File: src/upgrade/upgrade_scripts/5.20.0/remove_mongo_pool.js:9-17
Timestamp: 2025-08-08T13:12:46.728Z
Learning: In upgrade script src/upgrade/upgrade_scripts/5.20.0/remove_mongo_pool.js for noobaa-core, rely on structural detection (e.g., pool.mongo_info, and resource_type === 'INTERNAL') with name-prefix fallback for removing legacy mongo/internal pools, instead of depending solely on config.INTERNAL_STORAGE_POOL_NAME or config.DEFAULT_POOL_NAME. Handle multi-system stores and remove all matching pools in one change.

Applied to files:

src/util/postgres_client.js

📚 Learning: 2025-08-11T06:12:12.318Z

Learnt from: naveenpaul1
Repo: noobaa/noobaa-core PR: 9182
File: src/upgrade/upgrade_scripts/5.20.0/remove_mongo_pool.js:6-22
Timestamp: 2025-08-11T06:12:12.318Z
Learning: In the noobaa-core upgrade script src/upgrade/upgrade_scripts/5.20.0/remove_mongo_pool.js, bucket migration from the internal mongo pool to a new default pool is planned to be handled in separate future PRs with comprehensive testing, rather than being included directly in the pool removal script.

Applied to files:

src/util/postgres_client.js

📚 Learning: 2025-08-08T13:10:36.141Z

Learnt from: naveenpaul1
Repo: noobaa/noobaa-core PR: 9182
File: src/server/system_services/pool_server.js:1314-1317
Timestamp: 2025-08-08T13:10:36.141Z
Learning: In src/server/system_services/pool_server.js (and config usage), the constant config.INTERNAL_STORAGE_POOL_NAME has been removed from the system. Future logic should not depend on this constant and should instead use config.DEFAULT_POOL_NAME or structural markers (e.g., pool.resource_type === 'INTERNAL' or pool.mongo_info) to identify internal/mongo pools.

Applied to files:

src/util/postgres_client.js

📚 Learning: 2025-08-08T13:13:42.192Z

Learnt from: naveenpaul1
Repo: noobaa/noobaa-core PR: 9182
File: src/server/system_services/pool_server.js:1304-1307
Timestamp: 2025-08-08T13:13:42.192Z
Learning: In noobaa-core, do not rely on config.INTERNAL_STORAGE_POOL_NAME to detect backingstore pools. In src/server/system_services/pool_server.js, prefer checking pool.hosts_pool_info.backingstore or pool.cloud_pool_info.backingstore; fallback to prefix-matching the pool name against config.DEFAULT_POOL_NAME ('backingstores') to handle potential suffixes.

Applied to files:

src/util/postgres_client.js

🧬 Code graph analysis (2)

config.js (1)

src/util/postgres_client.js (1)

config (29-29)

src/util/postgres_client.js (1)

config.js (1)

config (7-7)

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (3)

GitHub Check: run-package-lock-validation
GitHub Check: run-jest-unit-tests
GitHub Check: Build Noobaa Image

🔇 Additional comments (6)

src/util/postgres_client.js (6)

252-252: Helpful debugging addition.

The debug log provides useful context about which pool/host is being used and whether retry is enabled. The log3 level ensures this won't impact production performance.

636-646: Pool fallback logic looks good.

The fallback from a preferred pool to the table's default pool is correctly implemented with protection against infinite recursion. This ensures queries can proceed even when the preferred pool is temporarily unavailable (e.g., disconnected).

Note: The fallback handles disconnected pools (where pool.instance is null) but not invalid pool names, which will throw an error. This is likely intentional since only 'default', 'md', and 'read_only' are valid pool names.

730-737: Good API extension for per-query pool selection.

The preferred_pool parameter is properly documented and implemented, allowing callers to route specific queries to designated pools. The integration with the fallback logic in get_pool ensures robustness when the preferred pool is unavailable.

941-941: Consistent preferred_pool support in find/findOne.

The preferred_pool parameter is consistently propagated through find and findOne methods, matching the pattern established in executeSQL. This enables read-only pool usage for read queries.

Also applies to: 958-958

1510-1514: Appropriate read-only pool configuration.

The retry_with_default_pool: true flag on the read_only pool enables automatic fallback to the writable database when the read-only replica is unavailable, which is the correct behavior for resilience.

1757-1758: Necessary flag propagation for retry mechanism.

Propagating retry_with_default_pool from pool configuration to pool instance enables the retry logic in _do_query to access this flag. The comment clearly explains the purpose.

coderabbitai · 2025-11-18T15:39:20Z

config.js

+//whether to use read-only postgres replica cluster
+//ro host is set by operator in process.env.POSTGRES_HOST_RO
+config.POSTGRES_USE_READ_ONLY = true;


⚠️ Potential issue | 🟠 Major

Consider defaulting to false for backward compatibility.

The new POSTGRES_USE_READ_ONLY flag defaults to true, which enables read-only pool usage by default for all deployments. According to the PR objectives, the configuration option was requested to allow disabling the new behavior and reverting to previous behavior, which suggests false might be more appropriate as the default for backward compatibility.

If true is intentional, ensure this breaking change is clearly documented in upgrade notes.

🤖 Prompt for AI Agents

In config.js around lines 254 to 256, the new POSTGRES_USE_READ_ONLY flag is defaulting to true which changes runtime behavior; change the default to false for backward compatibility (i.e., set POSTGRES_USE_READ_ONLY = false) so deployments continue to use the previous non-read-only pool unless explicitly opted-in, and if true is intentional, add clear upgrade-note documentation instead of changing the default.

alphaprinz requested a review from dannyzaken October 7, 2025 16:16

pull-request-size bot added the size/M label Oct 7, 2025

alphaprinz force-pushed the read_only_uses branch from 0a008b6 to d699655 Compare October 14, 2025 03:50

alphaprinz marked this pull request as ready for review October 17, 2025 22:47

coderabbitai bot reviewed Oct 17, 2025

View reviewed changes

src/server/bg_services/db_cleaner.js Outdated Show resolved Hide resolved

coderabbitai bot reviewed Oct 20, 2025

View reviewed changes

dannyzaken reviewed Oct 29, 2025

View reviewed changes

dannyzaken approved these changes Oct 29, 2025

View reviewed changes

alphaprinz force-pushed the read_only_uses branch from af555fb to 2f4a70a Compare October 31, 2025 23:15

coderabbitai bot reviewed Oct 31, 2025

View reviewed changes

alphaprinz mentioned this pull request Nov 11, 2025

add POSTGRES_HOST_RO to core host if using cnpg cluster noobaa/noobaa-operator#1725

Open

2 tasks

alphaprinz force-pushed the read_only_uses branch 2 times, most recently from 425aea0 to dc83dbd Compare November 12, 2025 18:26

alphaprinz added 6 commits November 18, 2025 07:28

use read only pool - db cleaner

dc652f8

Signed-off-by: Amit Prinz Setter <[email protected]>

use read only pool - object reclaimer first find

127272a

Signed-off-by: Amit Prinz Setter <[email protected]>

use read only pool - scrubber first find

b16aed5

Signed-off-by: Amit Prinz Setter <[email protected]>

use read only pool - fix logs

6f1195c

Signed-off-by: Amit Prinz Setter <[email protected]>

use read only pool - add retry with default pool

51cb9ae

Signed-off-by: Amit Prinz Setter <[email protected]>

add POSTGRES_USE_READ_ONLY flag to disable using read only replica

5be01b0

Signed-off-by: Amit Prinz Setter <[email protected]>

alphaprinz force-pushed the read_only_uses branch from dc83dbd to 5be01b0 Compare November 18, 2025 15:29

coderabbitai bot reviewed Nov 18, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Read only uses #9237

Read only uses #9237

alphaprinz commented Oct 7, 2025 •

edited by coderabbitai bot

Loading

Uh oh!

coderabbitai bot commented Oct 7, 2025 •

edited

Loading

Uh oh!

coderabbitai bot left a comment

Uh oh!

Uh oh!

coderabbitai bot left a comment

Uh oh!

dannyzaken left a comment

Uh oh!

dannyzaken commented Oct 29, 2025

Uh oh!

coderabbitai bot left a comment

Uh oh!

coderabbitai bot Oct 31, 2025

Uh oh!

coderabbitai bot left a comment

Uh oh!

coderabbitai bot Nov 18, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Read only uses #9237

Are you sure you want to change the base?

Read only uses #9237

Conversation

alphaprinz commented Oct 7, 2025 • edited by coderabbitai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Describe the Problem

Explain the Changes

Issues: Fixed #xxx / Gap #xxx

Testing Instructions:

Summary by CodeRabbit

Uh oh!

coderabbitai bot commented Oct 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Sequence Diagram(s)

Estimated code review effort

Possibly related PRs

Suggested labels

Suggested reviewers

Pre-merge checks and finishing touches

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

dannyzaken left a comment

Choose a reason for hiding this comment

Uh oh!

dannyzaken commented Oct 29, 2025

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Oct 31, 2025

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Nov 18, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

alphaprinz commented Oct 7, 2025 •

edited by coderabbitai bot

Loading

coderabbitai bot commented Oct 7, 2025 •

edited

Loading