
Conversation

monishkadas-ms
Collaborator

No description provided.

monishkadas-ms and others added 30 commits April 30, 2025 15:33
Added the feature for size- and time-based flushing of the buffer. Added config options for max_interval and max_size; once either one is reached, the events stored in the buffer are flushed.

Updated kusto.rb
Updated kusto.rb and ingestor.rb with the implementation of the memory-based buffer and flushing, and removed the file-based buffer.
The max_size config now refers to the size of the buffer (defaults to 10MB) instead of the number of events in the buffer.
Updated the kusto_spec.rb test
Removed the temp file buffer and added retries to prevent data loss. Added exponential backoff to the retries and removed max_retries.
Updated upload_async and upload to raise errors during network downtime, so the rescue block in the buffer flush is triggered, the data sent for flushing can be restored, and the flush can be attempted later.

Added a file buffer to store data ONLY when a flush fails due to network issues. Once the network is back online, each file in the buffer is flushed and deleted first, and the regular in-memory buffer is used after that.
Updated buffer_flush and upload().
Updated failed_items_path to be a required config parameter. If set to "nil", the failed items are not persisted to local storage; otherwise the items are stored in the provided file path after max_retries.
#@logger.info("Ingestion result: #{ingest_result}")
}
.rescue{ |e|
@logger.error("Ingestion failed: #{e.message}")

Multiple threads can fail simultaneously and call persist_batch without synchronization, which can lead to directory-creation race conditions and file write conflicts.

Collaborator Author

The directory is created when the @file_persistence instance is initialized in kusto.rb, so it should not be an issue. I did add a write mutex to persist_batch to ensure thread-safe file saving.
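A minimal sketch of that change, assuming @write_mutex is a Mutex created once in initialize (the method shape is illustrative, not the exact plugin code; the filename pattern matches the snippet quoted below):

require 'json'
require 'securerandom'

# Serialize file writes across flush threads. @write_mutex is
# assumed to be created once, in initialize.
def persist_batch(batch)
  @write_mutex.synchronize do
    filename = ::File.join(@failed_dir, "failed_batch_#{Time.now.to_i}_#{SecureRandom.uuid}.json")
    ::File.write(filename, JSON.dump(batch))
  end
end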


module LogStash; module Outputs; class KustoOutputInternal
class FilePersistence
attr_reader :failed_dir


The @failed_dir instance variable is shared across all threads without synchronization; we may need mutex protection. One solution: directory creation can be moved to initialize to prevent race conditions when multiple threads create the directory.

Collaborator Author

The directory creation is already in initialize, and we only create one instance of @file_persistence in our code, so I don't think a mutex for directory creation would be useful. It would help to add one for persist_batch, though, since multiple threads can try to save failed files.
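For context, a sketch of that single-initialization path, assuming the constructor receives the directory path (constructor shape is illustrative):

require 'fileutils'

# Directory creation happens exactly once, when the single
# @file_persistence instance is constructed in kusto.rb, so
# per-write directory races cannot occur. mkdir_p is idempotent.
def initialize(failed_dir)
  @failed_dir  = failed_dir
  @write_mutex = Mutex.new   # guards persist_batch (see above)
  ::FileUtils.mkdir_p(@failed_dir)
end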

::File.write(filename, JSON.dump(batch))
end

def load_batches


In load_batches, each file is read entirely into memory and parsed as JSON inside a map, which means all contents end up in memory at once since map returns an array containing all the results. This can lead to OOM errors if the number of failed files is large.

Collaborator Author

Good point! We are now loading only one batch at a time.
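A sketch of the one-at-a-time variant, assuming callers consume batches through a block rather than an array (the method name each_batch is hypothetical):

require 'json'

# Yield one (file, batch) pair at a time instead of materializing
# every parsed batch into a single array with map.
def each_batch
  return unless Dir.exist?(@failed_dir)
  Dir.glob(::File.join(@failed_dir, 'failed_batch_*.json')).each do |file|
    yield file, JSON.parse(::File.read(file))  # one batch in memory at a time
  end
end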


def persist_batch(batch)
filename = ::File.join(@failed_dir, "failed_batch_#{Time.now.to_i}_#{SecureRandom.uuid}.json")
::File.write(filename, JSON.dump(batch))


Correct me if I am wrong, but File.write is not atomic, which means that if the process crashes in the middle of a write you get a corrupted JSON file. When load_batches tries to read it later, JSON.load will fail.

Collaborator Author

Good point! I am writing to a temp file first and using a file rename to atomically move it to the correct location. Added error handling and retries (default 3 at the moment) to keep trying to persist the failed batch in case of transient errors.
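A sketch of that temp-file-then-rename approach; the rescued error class and the @logger call are assumptions, and the retry count matches the stated default of 3:

require 'json'
require 'securerandom'

# Write to a temp file in the target directory, then rename it into
# place. File.rename is atomic on the same filesystem, so readers
# never observe a half-written JSON file.
def persist_batch(batch, retries = 3)
  filename = ::File.join(@failed_dir, "failed_batch_#{Time.now.to_i}_#{SecureRandom.uuid}.json")
  tmpname  = "#{filename}.tmp"
  attempts = 0
  begin
    ::File.write(tmpname, JSON.dump(batch))
    ::File.rename(tmpname, filename)  # atomic move into place
  rescue SystemCallError => e
    attempts += 1
    retry if attempts < retries
    @logger.error("Failed to persist batch after #{retries} attempts: #{e.message}")  # logger assumed
  end
end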


def load_batches
return [] unless Dir.exist?(@failed_dir)
Dir.glob(::File.join(@failed_dir, 'failed_batch_*.json')).map do |file|


Continuing from the previous comment, we need to add error handling for corrupted/partial JSON files. Also, how are we handling files that are deleted between the glob and the read?

Collaborator Author

Yeah. Added error handling for deleted files, and also for corrupted files (they are moved to a quarantine dir so they don't block future processing, instead of being ignored/deleted immediately).
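A sketch of that error handling, assuming a quarantine subdirectory under the failed-items path (the helper name read_batch and the @logger call are hypothetical):

require 'fileutils'
require 'json'

# Skip files deleted between glob and read; move files with corrupt
# JSON to a quarantine dir instead of deleting them outright.
def read_batch(file)
  JSON.parse(::File.read(file))
rescue Errno::ENOENT
  nil  # deleted by another thread between glob and read; skip
rescue JSON::ParserError => e
  quarantine = ::File.join(@failed_dir, 'quarantine')
  ::FileUtils.mkdir_p(quarantine)
  ::FileUtils.mv(file, quarantine)
  @logger.warn("Quarantined corrupted batch file #{file}: #{e.message}")
  nil
end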

def process_failed_batches
@file_persistence.load_batches.each do |file, batch|
begin
@buffer_state[:flush_mutex].lock
tanmaya-panda1 Jul 1, 2025

The flush_mutex only protects the flush operation, but the race condition happens before the mutex is acquired.

Consider the following:
T1 calls process_failed_batches
T1 executes load_batches and gets the list of files
T2 calls process_failed_batches
T2 executes load_batches and also gets the same list of files
T1 acquires flush_mutex and processes file1
T2 waits for flush_mutex
T1 deletes file1 and releases the mutex
T2 acquires flush_mutex and tries to process file1, but it's already deleted.

Collaborator Author

The process_failed_batches fn is only called once, on startup, from buffer_initialize, which itself is only called once at the beginning, so multiple threads would not call it. Still, in case we end up using multiple threads later, I did add a rename-to-'filename.processing' step to make sure multiple threads don't try to load/delete the same file. There is also error handling in place for corrupted files and for failed file loads. I removed the load_batches fn in file_persistence.rb and added the logic directly in process_failed_batches, because Ruby was still trying to fetch the entire list of files when enumerating instead of loading one at a time. That is fixed now.
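A sketch of that rename-to-claim step, assuming read_batch from the quarantine sketch above and a flush entry point (both hypothetical names):

# Claim each file by renaming it before processing, so two threads
# can never load or delete the same batch.
def process_failed_batches
  Dir.glob(::File.join(@failed_dir, 'failed_batch_*.json')).each do |file|
    claimed = "#{file}.processing"
    begin
      ::File.rename(file, claimed)  # atomic claim; raises if already taken
    rescue Errno::ENOENT
      next                          # another thread claimed it first
    end
    batch = read_batch(claimed)     # see the quarantine sketch above
    next if batch.nil?
    flush_batch(batch)              # hypothetical flush entry point
    ::File.delete(claimed)
  end
end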

return items_flushed
end

def process_failed_batches


Also check the case where process_failed_batches is called from multiple places, e.g. on startup and from a timer: both threads can retrieve the same file and try to flush it, leading to file-already-deleted errors or duplicate flushes to Kusto.

Collaborator Author

As mentioned above, that fn is only called once, on startup, at the moment. If we do add a feature to run background recovery on a timer, it would make sense to add another mutex, but ideally it should still run on a single timer thread.
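If that timer feature ever lands, a minimal sketch of the guard might look like this (entirely hypothetical; nothing like it exists in this PR):

# A dedicated mutex keeps a startup call and a timer thread from
# processing the failed-items directory at the same time.
@recovery_mutex = Mutex.new  # created once, e.g. in initialize

def start_recovery_timer(interval = 60)
  Thread.new do
    loop do
      sleep interval
      next unless @recovery_mutex.try_lock  # skip a cycle instead of queueing
      begin
        process_failed_batches
      ensure
        @recovery_mutex.unlock
      end
    end
  end
end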
