feat: feedback tool #36

NicolasBelissent · 2025-10-16T13:32:39Z

Summary by cubic

Adds a feedback collection tool and automatic implicit feedback to LangSmith so teams can capture both explicit user feedback and behavior signals from tool calls with minimal setup.

New Features
- Added meta_collect_tool_feedback with strict validation and ToolSet auto-inclusion; supports custom base_url.
- Implicit feedback to LangSmith: session tracking, quick refinement/task switch detection, suitability scoring, and safe payload sanitization.
- Auto-init via LANGSMITH_API_KEY; configurable with STACKONE_IMPLICIT_FEEDBACK_ENABLED, STACKONE_IMPLICIT_FEEDBACK_PROJECT, STACKONE_IMPLICIT_FEEDBACK_TAGS; programmatic configure_implicit_feedback available.
- Tool.execute and Tool.call accept options to attach feedback_session_id, feedback_user_id, feedback_metadata without affecting API requests.
Migration
- If you subclass StackOneTool, update execute(...) to include the options parameter and pass it to super().execute(..., options=options).

Add meta_collect_tool_feedback tool for collecting user feedback about StackOne tool performance. The tool is automatically included in the toolset and can be invoked by AI agents after user consent. Features: - Feedback collection tool with Pydantic validation - Automatic whitespace trimming and input cleaning - Support for custom base URLs - Integration with StackOneToolSet - OpenAI and LangChain format support Also includes implicit feedback system for automatic behavioral feedback to LangSmith: - Behavior analyzer for detecting quick refinements - Session tracking for tool call history - LangSmith client integration - Configurable via environment variables Breaking changes: - Updated StackOneTool.execute() signature to include options parameter - Updated meta tools execute methods to support options parameter 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <[email protected]>

Copilot

Pull Request Overview

Adds implicit behavioral feedback instrumentation (LangSmith integration) and a user feedback collection tool to the SDK.

Introduces implicit feedback manager, analyzer, session tracking, sanitization utilities, and automatic dispatch from StackOneTool.execute.
Adds dedicated feedback meta tool (meta_collect_tool_feedback) and wires it into toolset plus README documentation.
Extends tool execution API to accept options for feedback context propagation.

Reviewed Changes

Copilot reviewed 16 out of 17 changed files in this pull request and generated 4 comments.

Show a summary per file

File	Description
tests/test_implicit_feedback.py	Adds tests for behavior analyzer and implicit feedback manager with stub client.
tests/test_feedback.py	Adds comprehensive validation, execution, integration, and live submission tests for feedback tool and implicit feedback options.
stackone_ai/toolset.py	Injects feedback tool into loaded tools.
stackone_ai/models.py	Adds execution options handling and implicit feedback dispatch to StackOneTool.execute/call.
stackone_ai/meta_tools.py	Updates meta tools' execute signature to include options.
stackone_ai/implicit_feedback/utils.py	Adds sanitization helpers for payloads.
stackone_ai/implicit_feedback/session.py	Adds session tracker and event derivation logic.
stackone_ai/implicit_feedback/manager.py	Implements singleton manager coordinating analysis and LangSmith publishing.
stackone_ai/implicit_feedback/langsmith_client.py	Optional LangSmith client wrapper.
stackone_ai/implicit_feedback/data.py	Defines dataclasses for tool call records, quality signals, and events.
stackone_ai/implicit_feedback/analyzer.py	Implements behavior analysis (quick refinement, task switch, suitability score).
stackone_ai/implicit_feedback/init.py	Exposes implicit feedback public API.
stackone_ai/feedback/tool.py	Implements feedback collection tool with validation and integration.
stackone_ai/feedback/init.py	Exposes feedback tool factory.
stackone_ai/init.py	Re-exports implicit feedback configuration functions.
README.md	Documents implicit feedback system, options usage, and feedback collection tool.

_{Tip: Customize your code reviews with copilot-instructions.md. Create the file or learn how to get started.}

Copilot · 2025-10-16T13:34:54Z

stackone_ai/meta_tools.py

+        def execute(
+            self, arguments: str | JsonDict | None = None, *, options: JsonDict | None = None
+        ) -> JsonDict:
            return execute_filter(arguments)


These overrides bypass StackOneTool.execute, so implicit feedback instrumentation and option propagation (feedback_session_id, etc.) are lost for meta tool calls. Either remove the override and let base execute run (after adapting underlying functions to accept the parsed kwargs), or forward options and manually invoke the implicit feedback manager similar to StackOneTool.execute before returning.

Suggested change

return execute_filter(arguments)

# Forward options and manually invoke implicit feedback instrumentation if available

result = execute_filter(arguments)

# If feedback instrumentation is available, invoke it here

# For example:

# if hasattr(self, "_implicit_feedback_manager") and options is not None:

# self._implicit_feedback_manager.handle_feedback(self, arguments, result, options)

# Propagate options if needed (e.g., feedback_session_id)

# If options are needed in the result, add them here

return result

Copilot · 2025-10-16T13:34:54Z

stackone_ai/meta_tools.py

+        def execute(
+            self, arguments: str | JsonDict | None = None, *, options: JsonDict | None = None
+        ) -> JsonDict:
            return execute_tool(arguments)


These overrides bypass StackOneTool.execute, so implicit feedback instrumentation and option propagation (feedback_session_id, etc.) are lost for meta tool calls. Either remove the override and let base execute run (after adapting underlying functions to accept the parsed kwargs), or forward options and manually invoke the implicit feedback manager similar to StackOneTool.execute before returning.

Copilot · 2025-10-16T13:34:55Z

stackone_ai/feedback/tool.py

+    def execute(
+        self, arguments: str | JsonDict | None = None, *, options: JsonDict | None = None
+    ) -> JsonDict:
+        """
+        Execute the feedback tool with enhanced validation.
+
+        Args:
+            arguments: Tool arguments as string or dict
+            options: Execution options
+
+        Returns:
+            Response from the API
+
+        Raises:
+            StackOneError: If validation or API call fails
+        """
+        try:
+            # Parse input
+            if isinstance(arguments, str):
+                raw_params = json.loads(arguments)
+            else:
+                raw_params = arguments or {}
+
+            # Validate with Pydantic
+            parsed_params = FeedbackInput(**raw_params)
+
+            # Build validated request body
+            validated_arguments = {
+                "feedback": parsed_params.feedback,
+                "account_id": parsed_params.account_id,
+                "tool_names": parsed_params.tool_names,
+            }
+
+            # Use the parent execute method with validated arguments
+            return super().execute(validated_arguments, options=options)
+
+        except json.JSONDecodeError as exc:
+            raise StackOneError(f"Invalid JSON in arguments: {exc}") from exc
+        except ValueError as exc:
+            raise StackOneError(f"Validation error: {exc}") from exc
+        except Exception as error:
+            if isinstance(error, StackOneError):
+                raise
+            raise StackOneError(f"Error executing feedback tool: {error}") from error


When validation fails (exceptions before calling super().execute), implicit feedback instrumentation in StackOneTool.execute is skipped, so failed attempts are not recorded. To preserve consistent telemetry, move validation ahead of building arguments but always delegate to a wrapper that calls super().execute within a try/finally that dispatches feedback (or re-use StackOneTool.execute for JSON parsing and only apply additional pydantic validation).

Copilot · 2025-10-16T13:34:55Z

README.md

+configure_implicit_feedback(
+    api_key="/path/to/langsmith.key",


Example passes a filesystem path as api_key; configure_implicit_feedback expects the actual LangSmith API key string, not a file path. Replace '/path/to/langsmith.key' with an API key value or adjust the example to show reading the key file first.

Suggested change

configure_implicit_feedback(

api_key="/path/to/langsmith.key",

# If your API key is stored in a file, read it first:

with open("/path/to/langsmith.key", "r") as f:

langsmith_api_key = f.read().strip()

configure_implicit_feedback(

api_key=langsmith_api_key,

cubic-dev-ai

6 issues found across 17 files

Prompt for AI agents (all 6 issues)


Understand the root cause of the following 6 issues and fix them.


<file name="stackone_ai/implicit_feedback/utils.py">

<violation number="1" location="stackone_ai/implicit_feedback/utils.py:28">
Building list(value.items()) materializes the entire mapping before truncation, so large payloads are still fully copied; iterate lazily (e.g., via itertools.islice) to honor MAX_ITEMS.</violation>

<violation number="2" location="stackone_ai/implicit_feedback/utils.py:39">
Converting sequences with list(value) copies all elements before truncation, negating MAX_ITEMS and causing large-memory copies; iterate lazily instead.</violation>
</file>

<file name="stackone_ai/implicit_feedback/analyzer.py">

<violation number="1" location="stackone_ai/implicit_feedback/analyzer.py:43">
This guard prevents quick refinement detection whenever session_id is missing, even though the tracker keeps history for user-level sessions via the user_id fallback. Drop the session_id check so quick refinements are still surfaced for those records.</violation>
</file>

<file name="stackone_ai/meta_tools.py">

<violation number="1" location="stackone_ai/meta_tools.py:197">
Adding options to MetaExecuteTool.execute without forwarding it causes any supplied feedback metadata to be discarded instead of reaching the underlying tool or implicit feedback logging. Please propagate options through execute_tool and into tool.execute.</violation>
</file>

<file name="stackone_ai/models.py">

<violation number="1" location="stackone_ai/models.py:128">
Feedback keys must be removed from kwargs even when options already defines them; otherwise they still get forwarded to the API request and can break the call.</violation>
</file>

<file name="stackone_ai/implicit_feedback/manager.py">

<violation number="1" location="stackone_ai/implicit_feedback/manager.py:209">
create_run returns None, so run_id is always falsy and this early return prevents publishing any implicit feedback. Please assign a stable run_id yourself (e.g., include record.call_id in the payload and reuse it) before dispatching feedback.</violation>
</file>

_{React with 👍 or 👎 to teach cubic. Mention @cubic-dev-ai to give feedback, ask questions, or re-run the review.}

cubic-dev-ai · 2025-10-16T13:51:46Z

stackone_ai/implicit_feedback/utils.py

+        return sanitized
+
+    if isinstance(value, Sequence) and not isinstance(value, (bytes, bytearray, str)):
+        sanitized_sequence = [sanitize_value(item, depth=depth + 1) for item in list(value)[:MAX_ITEMS]]


Converting sequences with list(value) copies all elements before truncation, negating MAX_ITEMS and causing large-memory copies; iterate lazily instead.

Prompt for AI agents

Address the following comment on stackone_ai/implicit_feedback/utils.py at line 39: <comment>Converting sequences with list(value) copies all elements before truncation, negating MAX_ITEMS and causing large-memory copies; iterate lazily instead.</comment> <file context> @@ -0,0 +1,49 @@ + return sanitized + + if isinstance(value, Sequence) and not isinstance(value, (bytes, bytearray, str)): + sanitized_sequence = [sanitize_value(item, depth=depth + 1) for item in list(value)[:MAX_ITEMS]] + if len(value) > MAX_ITEMS: + sanitized_sequence.append("<truncated>") </file context>

✅ Addressed in 81c14bc

cubic-dev-ai · 2025-10-16T13:51:46Z

stackone_ai/implicit_feedback/utils.py

+
+    if isinstance(value, Mapping):
+        sanitized: dict[str, Any] = {}
+        for key, item in list(value.items())[:MAX_ITEMS]:


Building list(value.items()) materializes the entire mapping before truncation, so large payloads are still fully copied; iterate lazily (e.g., via itertools.islice) to honor MAX_ITEMS.

Prompt for AI agents

Address the following comment on stackone_ai/implicit_feedback/utils.py at line 28: <comment>Building list(value.items()) materializes the entire mapping before truncation, so large payloads are still fully copied; iterate lazily (e.g., via itertools.islice) to honor MAX_ITEMS.</comment> <file context> @@ -0,0 +1,49 @@ + + if isinstance(value, Mapping): + sanitized: dict[str, Any] = {} + for key, item in list(value.items())[:MAX_ITEMS]: + key_str = str(key) + if key_str.lower() in SENSITIVE_KEYS: </file context>

✅ Addressed in 81c14bc

cubic-dev-ai · 2025-10-16T13:51:46Z

stackone_ai/implicit_feedback/analyzer.py

+    def _detect_quick_refinement(
+        self, history: Sequence[ToolCallRecord], current: ToolCallRecord
+    ) -> tuple[bool, float | None]:
+        if not current.session_id or not history:


This guard prevents quick refinement detection whenever session_id is missing, even though the tracker keeps history for user-level sessions via the user_id fallback. Drop the session_id check so quick refinements are still surfaced for those records.

Prompt for AI agents

Address the following comment on stackone_ai/implicit_feedback/analyzer.py at line 43: <comment>This guard prevents quick refinement detection whenever session_id is missing, even though the tracker keeps history for user-level sessions via the user_id fallback. Drop the session_id check so quick refinements are still surfaced for those records.</comment> <file context> @@ -0,0 +1,83 @@ + def _detect_quick_refinement( + self, history: Sequence[ToolCallRecord], current: ToolCallRecord + ) -> tuple[bool, float | None]: + if not current.session_id or not history: + return False, None + </file context>

✅ Addressed in 81c14bc

cubic-dev-ai · 2025-10-16T13:51:46Z

stackone_ai/meta_tools.py


-        def execute(self, arguments: str | JsonDict | None = None) -> JsonDict:
+        def execute(
+            self, arguments: str | JsonDict | None = None, *, options: JsonDict | None = None


Adding options to MetaExecuteTool.execute without forwarding it causes any supplied feedback metadata to be discarded instead of reaching the underlying tool or implicit feedback logging. Please propagate options through execute_tool and into tool.execute.

Prompt for AI agents

Address the following comment on stackone_ai/meta_tools.py at line 197: <comment>Adding options to MetaExecuteTool.execute without forwarding it causes any supplied feedback metadata to be discarded instead of reaching the underlying tool or implicit feedback logging. Please propagate options through execute_tool and into tool.execute.</comment> <file context> @@ -193,7 +193,9 @@ def __init__(self) -> None: - def execute(self, arguments: str | JsonDict | None = None) -> JsonDict: + def execute( + self, arguments: str | JsonDict | None = None, *, options: JsonDict | None = None + ) -> JsonDict: return execute_filter(arguments) </file context>

cubic-dev-ai · 2025-10-16T13:51:46Z

stackone_ai/models.py

+        merged_params = dict(params)
+        feedback_options = dict(options or {})
+        for key in cls._FEEDBACK_OPTION_KEYS:
+            if key in merged_params and key not in feedback_options:


Feedback keys must be removed from kwargs even when options already defines them; otherwise they still get forwarded to the API request and can break the call.

Prompt for AI agents

Address the following comment on stackone_ai/models.py at line 128: <comment>Feedback keys must be removed from kwargs even when options already defines them; otherwise they still get forwarded to the API request and can break the call.</comment> <file context> @@ -110,6 +120,15 @@ def __init__( + merged_params = dict(params) + feedback_options = dict(options or {}) + for key in cls._FEEDBACK_OPTION_KEYS: + if key in merged_params and key not in feedback_options: + feedback_options[key] = merged_params.pop(key) + return merged_params, feedback_options </file context>

✅ Addressed in d7dbf5c

cubic-dev-ai · 2025-10-16T13:51:46Z

stackone_ai/implicit_feedback/manager.py

+        if isinstance(run, Mapping):
+            run_id = run.get("id") or run.get("run_id") or run_id
+
+        if not run_id:


create_run returns None, so run_id is always falsy and this early return prevents publishing any implicit feedback. Please assign a stable run_id yourself (e.g., include record.call_id in the payload and reuse it) before dispatching feedback.

Prompt for AI agents

Address the following comment on stackone_ai/implicit_feedback/manager.py at line 209: <comment>create_run returns None, so run_id is always falsy and this early return prevents publishing any implicit feedback. Please assign a stable run_id yourself (e.g., include record.call_id in the payload and reuse it) before dispatching feedback.</comment> <file context> @@ -0,0 +1,226 @@ + if isinstance(run, Mapping): + run_id = run.get("id") or run.get("run_id") or run_id + + if not run_id: + return + </file context>

✅ Addressed in 81c14bc

- Add feedback tool that sends data to API endpoint - No LangSmith integration - just API calls - Simple validation with Pydantic - Auto-included in StackOneToolSet - 6 focused tests covering validation, execution, and integration

- Remove implicit feedback imports from __init__.py - Fix base class call method to not pass options parameter - All 6 feedback tests now passing

- Remove implicit feedback imports and unreachable code - Fix execute method signature to match base class - All mypy checks now pass

- Remove unused start_time, call_params, end_time variables - All linting checks now pass (Ruff + Mypy)

NicolasBelissent · 2025-10-17T08:53:58Z

@claude review this pr

- Allow account_id to be string or list of strings - Send same feedback to each account individually - Return combined results with success/failure counts - Add comprehensive tests for validation and execution - All 9 tests passing, linting clean

- Adjust formatting of tool.execute calls in test_feedback.py for consistency - Enhance code clarity by aligning dictionary entries - All tests remain passing

- Split large validation test into focused, single-purpose tests - Consolidate execution tests to avoid duplication - Combine multiple account success/error scenarios into one test - Remove redundant integration test class - All 10 tests still passing, better organization

willleeney

LGTM

NicolasBelissent and others added 2 commits October 14, 2025 17:41

changing tests

077139d

Copilot AI review requested due to automatic review settings October 16, 2025 13:32

Copilot AI reviewed Oct 16, 2025

View reviewed changes

Resolve merge conflict in uv.lock by regenerating

a3037c7

cubic-dev-ai bot reviewed Oct 16, 2025

View reviewed changes

NicolasBelissent added 8 commits October 16, 2025 14:52

Fix linting errors: line length, imports, and trailing whitespace

7349349

feat: add minimal feedback collection tool

8a0eaad

- Add feedback tool that sends data to API endpoint - No LangSmith integration - just API calls - Simple validation with Pydantic - Auto-included in StackOneToolSet - 6 focused tests covering validation, execution, and integration

feat: add minimal feedback collection tool

fdaee7b

- Add feedback tool that sends data to API endpoint - No LangSmith integration - just API calls - Simple validation with Pydantic - Auto-included in StackOneToolSet - 6 focused tests covering validation, execution, and integration

cleanup: remove implicit feedback files - focus on feedback tool only

81c14bc

fix: remove implicit feedback imports and fix call method

d7dbf5c

- Remove implicit feedback imports from __init__.py - Fix base class call method to not pass options parameter - All 6 feedback tests now passing

fix: remove unused typing.Any import

c4bd88e

fix: resolve mypy errors

ab50631

- Remove implicit feedback imports and unreachable code - Fix execute method signature to match base class - All mypy checks now pass

fix: remove unused variables after implicit feedback cleanup

59e792e

- Remove unused start_time, call_params, end_time variables - All linting checks now pass (Ruff + Mypy)

NicolasBelissent added 3 commits October 17, 2025 10:51

refactor: format tool.execute calls for improved readability

4e4d527

- Adjust formatting of tool.execute calls in test_feedback.py for consistency - Enhance code clarity by aligning dictionary entries - All tests remain passing

willleeney approved these changes Oct 17, 2025

View reviewed changes

improving test suite

e583a41

NicolasBelissent merged commit 9179918 into main Oct 17, 2025
5 checks passed

NicolasBelissent deleted the feat/implicit-feedback-tool branch October 17, 2025 10:21

github-actions bot mentioned this pull request Sep 26, 2025

chore(main): release stackone-ai 0.3.3 #32

Merged

-            return execute_filter(arguments)
+            # Forward options and manually invoke implicit feedback instrumentation if available
+            result = execute_filter(arguments)
+            # If feedback instrumentation is available, invoke it here
+            # For example:
+            # if hasattr(self, "_implicit_feedback_manager") and options is not None:
+            #     self._implicit_feedback_manager.handle_feedback(self, arguments, result, options)
+            # Propagate options if needed (e.g., feedback_session_id)
+            # If options are needed in the result, add them here
+            return result

		configure_implicit_feedback(
		api_key="/path/to/langsmith.key",

-configure_implicit_feedback(
-    api_key="/path/to/langsmith.key",
+# If your API key is stored in a file, read it first:
+with open("/path/to/langsmith.key", "r") as f:
+    langsmith_api_key = f.read().strip()
+configure_implicit_feedback(
+    api_key=langsmith_api_key,

feat: feedback tool #36

feat: feedback tool #36

Uh oh!

Conversation

NicolasBelissent commented Oct 16, 2025 • edited by cubic-dev-ai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary by cubic

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Copilot AI Oct 16, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Oct 16, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Oct 16, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Oct 16, 2025

Choose a reason for hiding this comment

Uh oh!

cubic-dev-ai bot left a comment

Choose a reason for hiding this comment

Uh oh!

cubic-dev-ai bot Oct 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

cubic-dev-ai bot Oct 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

cubic-dev-ai bot Oct 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

cubic-dev-ai bot Oct 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

cubic-dev-ai bot Oct 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

cubic-dev-ai bot Oct 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

NicolasBelissent commented Oct 17, 2025

Uh oh!

willleeney left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

NicolasBelissent commented Oct 16, 2025 •

edited by cubic-dev-ai bot

Loading

cubic-dev-ai bot Oct 16, 2025 •

edited

Loading

cubic-dev-ai bot Oct 16, 2025 •

edited

Loading

cubic-dev-ai bot Oct 16, 2025 •

edited

Loading

cubic-dev-ai bot Oct 16, 2025 •

edited

Loading

cubic-dev-ai bot Oct 16, 2025 •

edited

Loading

cubic-dev-ai bot Oct 16, 2025 •

edited

Loading