feat: Langfuse Observability Integration – CLI, LangfuseSink Adapter & Docker Hosting #1271
praisonai-triage-agent[bot] wants to merge 3 commits into main
Conversation
- Implement LangfuseSink adapter following TraceSinkProtocol
- Add complete praisonai langfuse CLI command group
- Support local (Docker) and remote Langfuse instances
- Add langfuse>=3.0.0 as optional dependency
- Maps ActionEvents to Langfuse trace/span/event API
- Thread-safe with lazy imports following AGENTS.md guidelines
- Zero changes to core SDK (praisonaiagents)

Closes #1270

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-authored-by: Mervin Praison <MervinPraison@users.noreply.github.com>
@coderabbitai review

/review

@gemini review this PR

✅ Actions performed: Review triggered.
Code Review by Qodo
1. Env host ignored
📝 Walkthrough

Introduces Langfuse observability integration to PraisonAI, including a new CLI command group (…)

Changes
Sequence Diagram

```mermaid
sequenceDiagram
    participant Agent
    participant TraceEmitter
    participant LangfuseSink
    participant LangfuseClient as Langfuse Client
    participant LangfuseServer as Langfuse Server
    Agent->>TraceEmitter: emit(AGENT_START)
    TraceEmitter->>LangfuseSink: emit(ActionEvent)
    LangfuseSink->>LangfuseSink: _handle_event()
    LangfuseSink->>LangfuseClient: trace.create(agent-run)
    LangfuseClient->>LangfuseServer: POST /traces
    LangfuseClient->>LangfuseSink: return trace_id
    LangfuseSink->>LangfuseClient: span.create(root_span)
    LangfuseClient->>LangfuseServer: POST /spans
    Agent->>TraceEmitter: emit(TOOL_START)
    TraceEmitter->>LangfuseSink: emit(ActionEvent)
    LangfuseSink->>LangfuseClient: span.create(tool_span)
    LangfuseClient->>LangfuseServer: POST /spans
    Agent->>TraceEmitter: emit(TOOL_END)
    TraceEmitter->>LangfuseSink: emit(ActionEvent)
    LangfuseSink->>LangfuseClient: span.update(output, status)
    LangfuseClient->>LangfuseServer: PUT /spans
    Agent->>TraceEmitter: emit(AGENT_END)
    TraceEmitter->>LangfuseSink: emit(ActionEvent)
    LangfuseSink->>LangfuseClient: span.update(output, status)
    LangfuseClient->>LangfuseServer: PUT /spans
    LangfuseSink->>LangfuseClient: flush()
    LangfuseClient->>LangfuseServer: batch commit
```
Estimated Code Review Effort: 🎯 4 (Complex) | ⏱️ ~50 minutes

Pre-merge checks: ✅ Passed checks (5 passed)
Code Review
This pull request introduces Langfuse observability integration, adding a new langfuse CLI command for managing server instances and a LangfuseSink for tracing agent workflows. The feedback identifies several technical improvements: the port configuration logic in the CLI is fragile and should use regular expressions, environment variable parsing needs whitespace stripping for robustness, and print statements in the observability adapter should be replaced with standard logging. Additionally, the efficiency of the span lookup mechanism in the LangfuseSink was flagged for potential performance issues during concurrent tool execution.
```python
compose_file = repo_path / "docker-compose.yml"
if compose_file.exists():
    content = compose_file.read_text()
    content = content.replace("3000:3000", f"{port}:3000")
```
The port replacement logic using content.replace("3000:3000", ...) is fragile. It will fail on subsequent runs if the port has already been changed from the default 3000, as the string "3000:3000" will no longer exist in the file. Consider using a regular expression to match any port mapped to 3000.
```diff
-content = content.replace("3000:3000", f"{port}:3000")
+import re
+content = re.sub(r"\d+:3000", f"{port}:3000", content)
```
```python
key, value = line.split('=', 1)
config_data[key] = value
```
When parsing the configuration file, keys and values are not stripped of whitespace. This can lead to duplicate entries or incorrect values if the .env file contains spaces around the = sign (e.g., KEY = VALUE).
```diff
-key, value = line.split('=', 1)
-config_data[key] = value
+key, value = line.split('=', 1)
+config_data[key.strip()] = value.strip()
```
```python
    self._handle_event(event)
except Exception as e:
    # Don't let observability errors break agent execution
    print(f"LangfuseSink error: {e}")
```
Using print for error reporting in a library component or adapter is generally discouraged as it pollutes the standard output of the consuming application. It is better to use the standard logging module, which allows users to configure how and where logs are handled.
```diff
-print(f"LangfuseSink error: {e}")
+import logging
+logging.getLogger(__name__).error(f"LangfuseSink error: {e}")
```
```python
for key in self._spans:
    if key.startswith(f"{agent_name}:{tool_name}:"):
        tool_key = key
```
Iterating over all active spans to find a matching tool name is inefficient (O(N)) and potentially ambiguous if multiple concurrent tools with the same name are running. Consider using a more structured storage for spans, such as a dictionary of lists or stacks keyed by agent_name and tool_name to improve lookup performance and correctness.
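A minimal sketch of the reviewer's suggestion, assuming a stack of open spans per (agent, tool) pair; the class and method names are illustrative, not from the PR. Lookup becomes O(1) and nested invocations of the same tool resolve LIFO instead of scanning every key:

```python
from collections import defaultdict

class SpanRegistry:
    """Hypothetical structured span storage: one stack per (agent, tool) key."""

    def __init__(self):
        self._open = defaultdict(list)  # (agent_name, tool_name) -> stack of spans

    def push(self, agent_name, tool_name, span):
        self._open[(agent_name, tool_name)].append(span)

    def pop(self, agent_name, tool_name):
        # Most-recently-started span for this tool closes first (LIFO).
        stack = self._open.get((agent_name, tool_name))
        return stack.pop() if stack else None

registry = SpanRegistry()
registry.push("researcher", "web_search", "span-1")
registry.push("researcher", "web_search", "span-2")
print(registry.pop("researcher", "web_search"))  # span-2
```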
```python
try:
    self._client.flush()
except Exception as e:
    print(f"LangfuseSink flush error: {e}")
```
```python
public_key: str = ""
secret_key: str = ""
host: str = "https://cloud.langfuse.com"
flush_at: int = 20
flush_interval: float = 10.0
enabled: bool = True

def __post_init__(self):
    """Validate configuration."""
    if self.enabled and not (self.public_key and self.secret_key):
        # Try environment variables if not explicitly set
        import os
        self.public_key = self.public_key or os.getenv("LANGFUSE_PUBLIC_KEY", "")
        self.secret_key = self.secret_key or os.getenv("LANGFUSE_SECRET_KEY", "")
        self.host = self.host or os.getenv("LANGFUSE_HOST", os.getenv("LANGFUSE_BASE_URL", self.host))
```
1. Env host ignored (Bug, Correctness)
LangfuseSinkConfig.__post_init__ cannot apply LANGFUSE_HOST/LANGFUSE_BASE_URL because self.host is initialized to a non-empty default and the code short-circuits before reading env vars. This silently routes traces to the wrong Langfuse instance for self-hosted deployments.
Agent Prompt
### Issue description
`LangfuseSinkConfig` currently never honors `LANGFUSE_HOST` / `LANGFUSE_BASE_URL` because `host` has a truthy default and the code uses `self.host or ...`.
### Issue Context
This breaks self-hosted Langfuse setups that expect env vars to direct the SDK to a custom base URL.
### Fix Focus Areas
- src/praisonai/praisonai/observability/langfuse.py[29-44]
### Suggested fix
- Read env vars first and only fall back to the default if env is not set, e.g.:
- `env_host = os.getenv("LANGFUSE_HOST") or os.getenv("LANGFUSE_BASE_URL")`
- `if env_host: self.host = env_host`
- Alternatively, make `host` default to `""`/`None` and then set `self.host = env_host or "https://cloud.langfuse.com"` in `__post_init__`.
Copy this prompt and use it to remediate the issue with your preferred AI generation tools.
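A minimal sketch of the env-first lookup the prompt describes. The field names mirror the snippet above, but this is a standalone toy, not the PR's actual class:

```python
import os
from dataclasses import dataclass

@dataclass
class LangfuseSinkConfig:
    public_key: str = ""
    secret_key: str = ""
    host: str = ""  # empty default so env vars can take effect

    def __post_init__(self):
        self.public_key = self.public_key or os.getenv("LANGFUSE_PUBLIC_KEY", "")
        self.secret_key = self.secret_key or os.getenv("LANGFUSE_SECRET_KEY", "")
        # Read env vars first; fall back to the cloud default only at the end.
        env_host = os.getenv("LANGFUSE_HOST") or os.getenv("LANGFUSE_BASE_URL")
        self.host = self.host or env_host or "https://cloud.langfuse.com"

os.environ["LANGFUSE_HOST"] = "http://localhost:3000"
print(LangfuseSinkConfig().host)  # http://localhost:3000
```

An explicitly passed `host` still wins over the environment, which preserves the current constructor behavior.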
```python
enabled: bool = True

def __post_init__(self):
    """Validate configuration."""
    if self.enabled and not (self.public_key and self.secret_key):
        # Try environment variables if not explicitly set
        import os
        self.public_key = self.public_key or os.getenv("LANGFUSE_PUBLIC_KEY", "")
        self.secret_key = self.secret_key or os.getenv("LANGFUSE_SECRET_KEY", "")
        self.host = self.host or os.getenv("LANGFUSE_HOST", os.getenv("LANGFUSE_BASE_URL", self.host))
```
2. Enabled ignores missing keys (Bug, Correctness)
LangfuseSinkConfig leaves enabled=True even when no public/secret key is configured, so callers (including praisonai langfuse test) cannot reliably detect that Langfuse tracing is effectively unusable. This causes the test command to proceed and fail later during sink initialization instead of cleanly prompting for credentials.
Agent Prompt
### Issue description
`LangfuseSinkConfig.enabled` stays `True` even when keys are missing. Callers that use `enabled` as a readiness check (e.g., CLI `langfuse test`) will proceed and then fail later.
### Issue Context
The CLI test expects `enabled` to reflect whether credentials are present.
### Fix Focus Areas
- src/praisonai/praisonai/observability/langfuse.py[34-44]
- src/praisonai/praisonai/cli/commands/langfuse.py[389-396]
### Suggested fix
- After env lookup in `__post_init__`, if either key is still empty:
- set `self.enabled = False`, or
- raise a clear config/validation error.
- In `langfuse_test`, also guard explicitly on `config.public_key` and `config.secret_key` (defensive), so the user sees an immediate actionable message.
Copy this prompt and use it to remediate the issue with your preferred AI generation tools.
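A sketch of the first suggested option: after the env lookup, flip `enabled` off when either key is still empty so callers can use it as a readiness check. Field names mirror the snippet above; this is an illustrative toy, not the PR's code:

```python
import os
from dataclasses import dataclass

@dataclass
class LangfuseSinkConfig:
    public_key: str = ""
    secret_key: str = ""
    enabled: bool = True

    def __post_init__(self):
        self.public_key = self.public_key or os.getenv("LANGFUSE_PUBLIC_KEY", "")
        self.secret_key = self.secret_key or os.getenv("LANGFUSE_SECRET_KEY", "")
        if not (self.public_key and self.secret_key):
            self.enabled = False  # tracing is unusable without both keys

# Ensure a clean environment for the demonstration.
os.environ.pop("LANGFUSE_PUBLIC_KEY", None)
os.environ.pop("LANGFUSE_SECRET_KEY", None)
print(LangfuseSinkConfig().enabled)  # False
print(LangfuseSinkConfig(public_key="pk-x", secret_key="sk-y").enabled)  # True
```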
```python
def _initialize_client(self) -> None:
    """Lazy import and initialize Langfuse client."""
    try:
        import langfuse

        # Validate config
        if not (self._config.public_key and self._config.secret_key):
            raise ImportError(
                "Langfuse credentials missing. Set LANGFUSE_PUBLIC_KEY and LANGFUSE_SECRET_KEY "
                "environment variables or pass them to LangfuseSinkConfig."
            )

        self._client = langfuse.Langfuse(
            public_key=self._config.public_key,
            secret_key=self._config.secret_key,
            host=self._config.host,
            flush_at=self._config.flush_at,
            flush_interval=self._config.flush_interval,
        )

    except ImportError:
        raise ImportError(
            "Langfuse is not installed. Install with: pip install praisonai[langfuse]"
        )
```
3. Credential error masked (Bug, Reliability)
LangfuseSink._initialize_client raises ImportError for missing credentials, but then catches ImportError broadly and re-raises as βLangfuse is not installed,β hiding the real configuration error. This misleads users and makes troubleshooting much harder.
Agent Prompt
### Issue description
Missing credentials currently get reported as a missing dependency because `_initialize_client` catches `ImportError` broadly.
### Issue Context
This affects both programmatic usage and `praisonai langfuse test` because initialization failures become misleading.
### Fix Focus Areas
- src/praisonai/praisonai/observability/langfuse.py[74-97]
### Suggested fix
- Use two stages:
1) `try: import langfuse` / `except ImportError as e: raise ImportError("Langfuse is not installed...") from e`
2) Validate keys and raise `ValueError` (or a custom exception) for missing credentials.
- Keep the credential error message specific (public/secret key missing + which env vars to set).
Copy this prompt and use it to remediate the issue with your preferred AI generation tools.
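A sketch of the two-stage initialization the prompt suggests: stage 1 reports a genuinely missing dependency, stage 2 raises ValueError for missing credentials, so the two failures stay distinguishable. The function name and config shape are illustrative:

```python
def initialize_client(config):
    """Hypothetical two-stage init: dependency check, then credential check."""
    try:
        import langfuse  # stage 1: dependency check only
    except ImportError as e:
        raise ImportError(
            "Langfuse is not installed. Install with: pip install praisonai[langfuse]"
        ) from e

    # stage 2: credential validation, deliberately outside the import try/except
    if not (config.public_key and config.secret_key):
        raise ValueError(
            "Langfuse credentials missing: set LANGFUSE_PUBLIC_KEY and "
            "LANGFUSE_SECRET_KEY, or pass keys to LangfuseSinkConfig."
        )

    return langfuse.Langfuse(
        public_key=config.public_key,
        secret_key=config.secret_key,
        host=config.host,
    )
```

Chaining with `from e` also preserves the original traceback, which helps when the import fails for a reason other than absence (e.g. a broken install).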
```python
ports = web_container.get("Publishers", "")
if ":" in ports:
    port = ports.split(":")[0] if "->" in ports else "3000"
else:
    port = "3000"

try:
    import requests
    response = requests.get(f"http://127.0.0.1:{port}", timeout=5)
    if response.status_code < 500:
```
4. Status port extracted wrong (Bug, Correctness)
langfuse_status computes the web UI port as ports.split(':')[0] when a mapping contains ->, which returns the host segment rather than the published port. This produces invalid health-check URLs and incorrect status output.
Agent Prompt
### Issue description
`langfuse_status` extracts the published port incorrectly from the `Publishers` string, causing broken/invalid URLs for the web health check.
### Issue Context
The code already assumes a `...:...->...` structure by checking for `"->"`, but then extracts the wrong segment.
### Fix Focus Areas
- src/praisonai/praisonai/cli/commands/langfuse.py[243-263]
### Suggested fix
- Parse the published port with a small, robust parser (regex recommended), e.g.:
- Find the substring between the last `:` before `->` and the `->`.
- Example approach: `m = re.search(r":(?P<port>\d+)->", ports)`.
- If parsing fails, fall back to `3000` and print a warning that port detection failed.
Copy this prompt and use it to remediate the issue with your preferred AI generation tools.
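The regex approach from the suggested fix, sketched as a small helper. The function name is hypothetical; it assumes a docker-style mapping string such as `"0.0.0.0:8080->3000/tcp"` and falls back to the default when detection fails:

```python
import re

def published_port(ports: str, default: str = "3000") -> str:
    """Extract the published host port: the digits between the last ':' and '->'."""
    m = re.search(r":(?P<port>\d+)->", ports)
    if m:
        return m.group("port")
    return default  # port detection failed; caller may want to warn here

print(published_port("0.0.0.0:8080->3000/tcp"))  # 8080
print(published_port("3000/tcp"))                # 3000 (fallback)
```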
@copilot Do a thorough review of this PR. Read ALL existing reviewer comments above first. Review areas:
Agent-Logs-Url: https://github.com/MervinPraison/PraisonAI/sessions/178e99c1-305d-436d-8385-6e9f0642fec2 Co-authored-by: MervinPraison <454862+MervinPraison@users.noreply.github.com>
Thorough review done. Here's a summary of findings and fixes applied in e4afa3b:

- Bugs fixed
- Tests added
- No backward-compat concerns: all changes are to newly added code. No existing public APIs modified.
Actionable comments posted: 6

🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.
Inline comments:
In `@src/praisonai/praisonai/cli/commands/langfuse.py`:
- Around line 281-362: The config command writes ~/.praisonai/langfuse.env but
the test path (langfuse_test() / LangfuseSinkConfig()) reads os.environ only, so
running "praisonai langfuse config ..." then "praisonai langfuse test" fails;
after writing config_file in the config flow, load the saved keys into the
current process environment (e.g., set os.environ["LANGFUSE_PUBLIC_KEY"],
os.environ["LANGFUSE_SECRET_KEY"], os.environ["LANGFUSE_HOST"] from config_data)
so LangfuseSinkConfig() sees them immediately, and keep the existing file
write/sourcing guidance for new shells.
In `@src/praisonai/praisonai/observability/langfuse.py`:
- Around line 66-67: The current trace/span maps (_traces and _spans) use
agent_name and tool display names which can collide across overlapping runs;
change the keying to a stable per-run + per-invocation key (e.g., combine
agent_id (fallback to agent_name if missing) with a generated invocation_id/uuid
for each tool call) and store that composite key when creating traces/spans;
update all places that create/lookup/close traces/spans (references to _traces,
_spans and the code that currently searches for keys like
"{agent_name}:{tool_name}:*") to use the composite key (pass/return
invocation_id from the tool-start function so tool-completion can use it) and
remove any heuristics that pick the "last" matching key. Ensure tool completion
and agent-run lifecycle handlers accept or propagate the invocation_id so spans
are closed against the exact trace/span entry.
- Around line 255-270: The shutdown in close() sets self._closed before calling
flush(), which prevents flush() from sending the final batch and then ends spans
in _spans afterwards; fix by changing the order so spans are ended and flushed
while the sink is still open (e.g., call self.flush() and end/clear
spans/_traces first, then set self._closed = True), or alternatively ensure
flush() ignores the _closed flag during this shutdown path; update the close()
method references (close(), _closed, flush(), _spans, span.end(), _traces)
accordingly.
- Around line 29-43: The host env vars are only read inside the "missing
credentials" branch so custom LANGFUSE_HOST/LANGFUSE_BASE_URL never override the
default; move or add the host environment lookup out of the credentials check in
__post_init__ (or always apply it before the credential check) so self.host =
os.getenv("LANGFUSE_HOST", os.getenv("LANGFUSE_BASE_URL", self.host)) runs
regardless of whether public_key/secret_key are set, while preserving any
explicitly provided self.host if desired.
- Around line 135-156: The code currently calls removed Langfuse v2 APIs
(self._client.trace(), trace.span(), trace.event(), parent_span.span()) so
update the adapter to use the v3+ observation API: replace
self._client.trace(...) with creating an observation via
self._client.start_observation(...) or using
self._client.start_as_current_observation(as_type="generation") as a context
manager for the trace-level object; replace trace.span(...) and
parent_span.span(...) with start_as_current_observation(as_type="span") (or
start_observation()) to create span observations, call observation.update(...)
to change metadata, and call observation.end() (or let the context manager exit)
to finish them; similarly replace trace.event(...) usages with
start_observation/start_as_current_observation for events and use
.update()/.end() for lifecycle and metadata updates. Ensure you preserve
metadata keys (agent_id, agent_name, schema_version, event.metadata) when
building the observation, and update references to self._traces[agent_name] to
store the observation object rather than a v2 trace.
Review info
Configuration used: defaults
Review profile: CHILL
Plan: Pro
Run ID: 9ec545e4-05b3-47bb-bf9e-4def65c82c37

Files selected for processing (5):
- src/praisonai/praisonai/cli/app.py
- src/praisonai/praisonai/cli/commands/langfuse.py
- src/praisonai/praisonai/observability/__init__.py
- src/praisonai/praisonai/observability/langfuse.py
- src/praisonai/pyproject.toml
```python
result = subprocess.run([
    "docker", "compose", "ps", "--format", "json"
], cwd=repo_path, capture_output=True, text=True, check=True)

if not result.stdout.strip():
    console.print("[yellow]No Langfuse containers running[/yellow]")
    return

# Parse JSON output
import json
containers = []
for line in result.stdout.strip().split('\n'):
    if line:
        containers.append(json.loads(line))
```
Analysis chain

Web query: docker compose ps --format json output structure documentation

Result:
The docker compose ps --format json command outputs a newline-delimited JSON (NDJSON) stream, where each line is a complete JSON object representing one container. It is not a single JSON array. This behavior aligns with docker ps --format json and has been intentional since Docker Compose v2.21+ (via alignment with the docker/cli formatter).

Example output (two containers):

{"ID":"abc123...","Name":"project-service-1","Command":"...","CreatedAt":"2024-10-01T12:00:00Z","ExitCode":0,"Health":"","Image":"nginx","Labels":"...","LocalVolumes":"0","Mounts":"","Names":["project-service-1"],"Networks":"project_default","Ports":"80/tcp","Project":"project","Publishers":[{"URL":"0.0.0.0","TargetPort":80,"PublishedPort":8080,"Protocol":"tcp"}],"RunningFor":"5 minutes ago","Service":"service","Size":"0B","State":"running","Status":"Up 5 minutes"}
{"ID":"def456...","Name":"project-db-1",...}

Key fields in each JSON object (synthesized from official examples and real outputs; the exact set may vary by version and container state):
- ID: container ID (string); Name: container name (string); Names: array of names (usually one element)
- Command: quoted command (string); CreatedAt: creation timestamp (string)
- Project: Compose project name (string, restored post-v2.21); Service: Compose service name (string)
- State: e.g. "running", "exited" (string); Status: detailed status like "Up 5 minutes (healthy)" (string); Health: health status (string, empty if none); ExitCode: exit code if exited (int)
- Image: image name (string); Ports: exposed ports summary (string)
- Publishers: array of port bindings [{URL: string (e.g. "0.0.0.0"), TargetPort: int, PublishedPort: int, Protocol: string}]
- Size, RunningFor, Networks, Mounts, LocalVolumes, Labels: summary strings

Additional fields from docker ps alignment: Labels (serialized map), LocalVolumes, Mounts, Networks, sometimes SizeRw/SizeRootFs. To parse as an array: docker compose ps --format json | jq -s .

Official documentation still shows outdated array examples but confirms the JSON format. The source code uses docker/cli's ContainerWrite with NewContainerFormat("json"), producing NDJSON of container formatter fields. No formal JSON schema exists in the docs; the structure derives from the Go template fields for container ps (see the docker ps --format docs for base fields plus Compose extras like Project/Service).
Citations:
1. https://docs.docker.com/reference/cli/docker/compose/ps/
2. https://docs.docker.com/reference/cli/docker/compose/ps
3. https://docs.docker.com/reference/cli/docker/container/ps/
4. https://manpages.debian.org/testing/docker-compose/docker-compose-ps.1.en.html
5. https://manpages.debian.org/unstable/docker-compose/docker-compose-ps.1.en.html
6. [BUG] docker-compose ps --format json (docker/compose#11784)
7. https://stackoverflow.com/questions/77142618/incorrect-json-format-from-docker-compose-ps
8. https://github.com/docker/compose/blob/v2/cmd/compose/ps.go
9. https://pkg.go.dev/github.com/docker/docker/api/types/container
The Publishers field is an array of objects, not a string.
Docker Compose ps --format json outputs newline-delimited JSON where each Publishers field is an array of objects (e.g., [{"URL":"0.0.0.0","TargetPort":80,"PublishedPort":8080,"Protocol":"tcp"}]), not a colon-delimited string. The code at line 260 will fail with a TypeError when attempting if ":" in ports: on a list, and line 261 will fail attempting .split() on a list. Additionally, line 241 displays the raw Publishers array in the table without formatting.
Also applies to: 249-257
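Given the documented shape above, handling Publishers as a list of objects might look like this sketch; the helper name is hypothetical, and the keys (`TargetPort`, `PublishedPort`) follow the Docker docs quoted above:

```python
def web_port(container: dict, target: int = 3000, default: str = "3000") -> str:
    """Find the published host port for the given container target port."""
    publishers = container.get("Publishers") or []  # list of dicts, not a string
    for pub in publishers:
        if pub.get("TargetPort") == target and pub.get("PublishedPort"):
            return str(pub["PublishedPort"])
    return default  # nothing published for that target port

container = {
    "Service": "web",
    "Publishers": [
        {"URL": "0.0.0.0", "TargetPort": 3000, "PublishedPort": 8080, "Protocol": "tcp"}
    ],
}
print(web_port(container))  # 8080
```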
Tools: Ruff (0.15.7): [error] 202-204: Starting a process with a partial executable path (S607)
```python
config_dir = Path.home() / ".praisonai"
config_file = config_dir / "langfuse.env"

if show:
    # Show current configuration
    if config_file.exists():
        console.print(f"[blue]Configuration file: {config_file}[/blue]")
        content = config_file.read_text()

        table = Table(title="Langfuse Configuration")
        table.add_column("Setting", style="cyan")
        table.add_column("Value", style="green")

        for line in content.strip().split('\n'):
            if '=' in line and not line.startswith('#'):
                key, value = line.split('=', 1)
                # Mask secret key for security
                if "SECRET" in key:
                    value = value[:8] + "..." if len(value) > 8 else "***"
                table.add_row(key, value)

        console.print(table)

        # Also show env vars
        env_vars = []
        for var in ["LANGFUSE_PUBLIC_KEY", "LANGFUSE_SECRET_KEY", "LANGFUSE_HOST", "LANGFUSE_BASE_URL"]:
            value = os.environ.get(var)
            if value:
                if "SECRET" in var:
                    value = value[:8] + "..." if len(value) > 8 else "***"
                env_vars.append((var, value))

        if env_vars:
            console.print("\n[blue]Environment Variables:[/blue]")
            for var, value in env_vars:
                console.print(f"  {var}={value}")
    else:
        console.print("[yellow]⚠️ No configuration file found[/yellow]")

    return

# Set configuration
if not any([public_key, secret_key, host]):
    console.print("[red]At least one option must be provided[/red]")
    console.print("Use --show to view current configuration")
    raise typer.Abort()

# Read existing config
config_data = {}
if config_file.exists():
    content = config_file.read_text()
    for line in content.strip().split('\n'):
        if '=' in line and not line.startswith('#'):
            key, value = line.split('=', 1)
            config_data[key] = value

# Update with new values
if public_key:
    config_data["LANGFUSE_PUBLIC_KEY"] = public_key
if secret_key:
    config_data["LANGFUSE_SECRET_KEY"] = secret_key
if host:
    config_data["LANGFUSE_HOST"] = host

# Write config
config_dir.mkdir(exist_ok=True)

content_lines = [
    "# Langfuse Configuration",
    "# Generated by PraisonAI CLI",
    ""
]
for key, value in config_data.items():
    content_lines.append(f"{key}={value}")

config_file.write_text('\n'.join(content_lines) + '\n')

console.print(f"[green]✅ Configuration saved to {config_file}[/green]")
console.print("\n[blue]💡 To use this configuration, either:[/blue]")
console.print(f"  1. Source it: source {config_file}")
console.print("  2. Export variables manually")
console.print("  3. Set them in your shell profile")
```
config does not actually feed test in a fresh shell.
This writes ~/.praisonai/langfuse.env, but langfuse_test() just instantiates LangfuseSinkConfig(), and that config only reads os.environ. So the documented flow of praisonai langfuse config ... followed by praisonai langfuse test still fails unless the user manually sources the file first.
Also applies to: 389-390
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@src/praisonai/praisonai/cli/commands/langfuse.py` around lines 281 - 362, The
config command writes ~/.praisonai/langfuse.env but the test path
(langfuse_test() / LangfuseSinkConfig()) reads os.environ only, so running
"praisonai langfuse config ..." then "praisonai langfuse test" fails; after
writing config_file in the config flow, load the saved keys into the current
process environment (e.g., set os.environ["LANGFUSE_PUBLIC_KEY"],
os.environ["LANGFUSE_SECRET_KEY"], os.environ["LANGFUSE_HOST"] from config_data)
so LangfuseSinkConfig() sees them immediately, and keep the existing file
write/sourcing guidance for new shells.
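A sketch of the suggested fix: after writing the config file, also export the saved values into the current process so `LangfuseSinkConfig()` sees them immediately. The helper name is hypothetical; `config_data` mirrors the dict built by the config command:

```python
import os

def export_config(config_data: dict) -> None:
    """Load saved Langfuse settings into the current process environment."""
    for key in ("LANGFUSE_PUBLIC_KEY", "LANGFUSE_SECRET_KEY", "LANGFUSE_HOST"):
        if config_data.get(key):
            os.environ[key] = config_data[key]

# Clean slate for the demonstration.
os.environ.pop("LANGFUSE_SECRET_KEY", None)
export_config({"LANGFUSE_PUBLIC_KEY": "pk-demo", "LANGFUSE_HOST": "http://localhost:3000"})
print(os.environ["LANGFUSE_PUBLIC_KEY"])  # pk-demo
```

The file write and the "source it in new shells" guidance stay as-is; this only makes the same-process `config` then `test` flow work.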
```python
public_key: str = ""
secret_key: str = ""
host: str = "https://cloud.langfuse.com"
flush_at: int = 20
flush_interval: float = 10.0
enabled: bool = True

def __post_init__(self):
    """Validate configuration."""
    if self.enabled and not (self.public_key and self.secret_key):
        # Try environment variables if not explicitly set
        import os
        self.public_key = self.public_key or os.getenv("LANGFUSE_PUBLIC_KEY", "")
        self.secret_key = self.secret_key or os.getenv("LANGFUSE_SECRET_KEY", "")
        self.host = self.host or os.getenv("LANGFUSE_HOST", os.getenv("LANGFUSE_BASE_URL", self.host))
```
Environment-based host overrides never take effect here.
host starts with a non-empty default, and the env lookup only runs inside the "missing credentials" branch, so non-default hosts are ignored in normal configurations. This silently forces the default cloud host even when LANGFUSE_HOST / LANGFUSE_BASE_URL are set.
One way to fix it:

```diff
 @dataclass
 class LangfuseSinkConfig:
@@
-    host: str = "https://cloud.langfuse.com"
+    host: Optional[str] = None
@@
     def __post_init__(self):
         """Validate configuration."""
-        if self.enabled and not (self.public_key and self.secret_key):
-            # Try environment variables if not explicitly set
-            import os
-            self.public_key = self.public_key or os.getenv("LANGFUSE_PUBLIC_KEY", "")
-            self.secret_key = self.secret_key or os.getenv("LANGFUSE_SECRET_KEY", "")
-            self.host = self.host or os.getenv("LANGFUSE_HOST", os.getenv("LANGFUSE_BASE_URL", self.host))
+        if not self.enabled:
+            return
+        import os
+        self.public_key = self.public_key or os.getenv("LANGFUSE_PUBLIC_KEY", "")
+        self.secret_key = self.secret_key or os.getenv("LANGFUSE_SECRET_KEY", "")
+        self.host = (
+            self.host
+            or os.getenv("LANGFUSE_HOST")
+            or os.getenv("LANGFUSE_BASE_URL")
+            or "https://cloud.langfuse.com"
+        )
```

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@src/praisonai/praisonai/observability/langfuse.py` around lines 29 - 43, The
host env vars are only read inside the "missing credentials" branch so custom
LANGFUSE_HOST/LANGFUSE_BASE_URL never override the default; move or add the host
environment lookup out of the credentials check in __post_init__ (or always
apply it before the credential check) so self.host = os.getenv("LANGFUSE_HOST",
os.getenv("LANGFUSE_BASE_URL", self.host)) runs regardless of whether
public_key/secret_key are set, while preserving any explicitly provided
self.host if desired.
```python
self._traces: Dict[str, Any] = {}  # agent_name -> langfuse trace
self._spans: Dict[str, Any] = {}   # agent_name -> langfuse span
```
The correlation keys are not unique enough for overlapping runs.
_traces / _spans are keyed by agent_name, and tool completion picks the last "{agent_name}:{tool_name}:*" key it finds. Two same-name agent runs, or two same-name tool invocations finishing out of order, will overwrite or mis-close each other. This needs a stable per-run key (agent_id if present) plus a per-tool invocation id instead of display names.
Also applies to: 114-127, 190-203
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@src/praisonai/praisonai/observability/langfuse.py` around lines 66 - 67, The
current trace/span maps (_traces and _spans) use agent_name and tool display
names which can collide across overlapping runs; change the keying to a stable
per-run + per-invocation key (e.g., combine agent_id (fallback to agent_name if
missing) with a generated invocation_id/uuid for each tool call) and store that
composite key when creating traces/spans; update all places that
create/lookup/close traces/spans (references to _traces, _spans and the code
that currently searches for keys like "{agent_name}:{tool_name}:*") to use the
composite key (pass/return invocation_id from the tool-start function so
tool-completion can use it) and remove any heuristics that pick the "last"
matching key. Ensure tool completion and agent-run lifecycle handlers accept or
propagate the invocation_id so spans are closed against the exact trace/span
entry.
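A minimal sketch of the composite-key scheme the prompt describes, under the assumption that tool-start returns an invocation id and tool-end passes it back; the class and method names are illustrative:

```python
import uuid

class SpanIndex:
    """Hypothetical span storage keyed by (run key, invocation id), not display names."""

    def __init__(self):
        self._spans = {}

    def start_tool(self, run_key, tool_name, span) -> str:
        invocation_id = uuid.uuid4().hex  # unique per tool call
        self._spans[(run_key, invocation_id)] = (tool_name, span)
        return invocation_id  # tool-start hands this back; tool-end supplies it

    def end_tool(self, run_key, invocation_id):
        # Exact lookup: no scanning, no "last matching key" heuristic.
        return self._spans.pop((run_key, invocation_id), (None, None))[1]

index = SpanIndex()
first = index.start_tool("agent-1", "web_search", "span-A")
second = index.start_tool("agent-1", "web_search", "span-B")
# Out-of-order completion still closes exactly the right span:
print(index.end_tool("agent-1", first))   # span-A
print(index.end_tool("agent-1", second))  # span-B
```

The `run_key` would be `agent_id` with a fallback to `agent_name`, as the prompt outlines.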
```python
trace = self._client.trace(
    name=trace_name,
    input=trace_input,
    timestamp=event.timestamp,
    metadata={
        "agent_id": event.agent_id,
        "agent_name": agent_name,
        "schema_version": event.schema_version,
        **(event.metadata if event.metadata else {}),
    }
)
self._traces[agent_name] = trace

# Create root span for the agent
span = trace.span(
    name=agent_name,
    start_time=event.timestamp,
    metadata={
        "agent_id": event.agent_id,
        **(event.metadata if event.metadata else {}),
    }
)
```
π§© Analysis chain
π Script executed:
fd -t f "langfuse.py" src/praisonai/Repository: MervinPraison/PraisonAI
Length of output: 165
🏁 Script executed:

```shell
wc -l src/praisonai/praisonai/observability/langfuse.py
```

Repository: MervinPraison/PraisonAI

Length of output: 120
🏁 Script executed:

```shell
sed -n '135,156p;180,188p;224,244p' src/praisonai/praisonai/observability/langfuse.py
```

Repository: MervinPraison/PraisonAI

Length of output: 1987
🏁 Script executed:

```shell
cat -n src/praisonai/praisonai/observability/langfuse.py
```

Repository: MervinPraison/PraisonAI

Length of output: 12505
🌐 Web query:

> Langfuse 3.0.0 Python SDK API trace span observation documentation

💡 Result:
The Langfuse Python SDK v3 (generally available since June 2025, latest releases v3.14.x as of 2026) is built on OpenTelemetry for tracing LLM applications. Traces are implicitly created by the first root span or generation. Observations include spans (generic operations) and generations (LLM calls), plus specialized types like agent, tool, chain, etc.

## Setup

```
pip install langfuse
```

Set env vars: `LANGFUSE_PUBLIC_KEY`, `LANGFUSE_SECRET_KEY`, optionally `LANGFUSE_BASE_URL`.

```python
from langfuse import get_client
langfuse = get_client()  # singleton client
langfuse.flush()  # for short-lived apps
```

## Core API for Traces, Spans, Observations

### 1. Context Manager (recommended, auto end/flush)

```python
with langfuse.start_as_current_observation(name="my-span", as_type="span") as span:
    span.update(input={"query": "hi"}, output="result")  # optional updates

with langfuse.start_as_current_observation(
    name="llm-call",
    as_type="generation",
    model="gpt-4o",
    input={"messages": [...]},
    model_parameters={"temperature": 0.7},
) as generation:
    generation.update(output="response", usage={"input": 10, "output": 50})
```

Nested spans auto-link via OTEL context propagation. `name` is required.

### 2. Decorator

```python
from langfuse import observe

@observe(name="my-function", as_type="span")
def my_func(): ...
```

### 3. Manual (requires explicit `.end()`)

```python
span = langfuse.start_span(name="manual-span")  # or start_generation
span.update(input=..., output=...)
span.end()
```

### Trace-level

- Use `propagate_attributes(user_id="123", session_id="abc", trace_name="my-trace")` context manager for trace attributes.
- `langfuse.get_current_trace_id()`, `langfuse.get_current_observation_id()`
- `langfuse.create_trace_id(seed="deterministic")` for W3C-compatible IDs.

### Updates on active observations

- `langfuse.update_current_span(input=...)`
- `langfuse.update_current_generation(output=...)`

### Scores

```python
span.score(name="quality", value=0.95)  # or generation.score(...)
```

## Official Sources

- Primary docs: https://langfuse.com/docs/observability/sdk (overview, instrumentation)
- Full API reference: https://python.reference.langfuse.com (Langfuse class, LangfuseSpan, etc.)
- v3 announcement/key changes: https://langfuse.com/changelog/2025-06-05-python-sdk-v3-generally-available
- GitHub releases: https://github.com/langfuse/langfuse-python/releases (v3.14.5 latest)
- Observation types: https://langfuse.com/docs/observability/features/observation-types
Citations:
- 1: https://langfuse.com/changelog/2025-06-05-python-sdk-v3-generally-available
- 2: https://langfuse.com/docs/observability/sdk/overview
- 3: https://langfuse.com/docs/sdk/python/sdk-v3
- 4: https://python.reference.langfuse.com/langfuse
- 5: https://python.reference.langfuse.com/langfuse.html
- 6: https://langfuse.com/docs/observability/sdk/python/example
- 7: https://langfuse.com/docs/observability/features/observation-types
- 8: http://langfuse.com/docs/observability/sdk/advanced-features
- 9: https://langfuse.com/docs/observability/sdk/python/setup
- 10: https://langfuse.com/docs/sdk/python/example
🌐 Web query:

> Langfuse Python SDK v2 API trace span event methods

💡 Result:
In Langfuse Python SDK v2 (released December 17, 2023), the low-level API for creating traces, spans, and events uses the `Langfuse` client class. Initialize with `from langfuse import Langfuse; langfuse = Langfuse(public_key="...", secret_key="...")`.

Key methods (from v2 snapshot docs and PyPI readme):

- Traces: `trace = langfuse.trace(name="my-trace", user_id="user_123", input={"query": "Hello"})`; then `trace.span(...)`, `trace.generation(...)`, `trace.event(...)`; finally `trace.end(output="result")`.
- Spans: `span = langfuse.span(name="my-span", input="data")`; `span.end(output="result")`.
- Events: `event = langfuse.event(name="my-event")`; events are point-in-time and auto-end.

Full low-level SDK docs: https://langfuse.com/docs/sdk/python/low-level-sdk (archived for v2 at python-sdk-v2.docs-snapshot.langfuse.com). v2 uses direct methods like `trace`, `span`, `generation`, `event` on the client, unlike later OpenTelemetry-based versions (v3+) which use `start_as_current_observation(as_type="span")`, etc. See migration guide: https://langfuse.com/docs/observability/sdk/upgrade-path/python-v2-to-v3. GitHub: https://github.com/langfuse/langfuse-python (check v2 tags/releases).
Citations:
- 1: https://python-sdk-v2.docs-snapshot.langfuse.com/
- 2: https://python-sdk-v2.docs-snapshot.langfuse.com/docs/
- 3: https://github.com/langfuse/langfuse-python
- 4: https://langfuse.com/docs/observability/sdk/upgrade-path/python-v2-to-v3
- 5: https://pypi.org/project/langfuse/2.26.2/
- 6: https://pypi.org/project/langfuse/2.22.0/
- 7: https://pypi.org/project/langfuse/2.15.0/
- 8: https://pypi.org/project/langfuse/2.2.2a0/
- 9: https://langfuse.com/docs/sdk/python/low-level-sdk
🏁 Script executed:

```shell
find . -type f \( -name "requirements.txt" -o -name "setup.py" -o -name "setup.cfg" -o -name "pyproject.toml" \) | head -20
```

Repository: MervinPraison/PraisonAI

Length of output: 335
🏁 Script executed:

```shell
grep -r "langfuse" ./src/praisonai/pyproject.toml ./src/praisonai/requirements.txt
```

Repository: MervinPraison/PraisonAI

Length of output: 236
🌐 Web query:

> Langfuse Python SDK v3 start_span start_generation API reference

💡 Result:
In Langfuse Python SDK v3 (legacy version, docs at https://langfuse.com/docs/sdk/python/sdk-v3), the primary methods for creating spans and generations use context managers for automatic lifecycle management and OpenTelemetry context propagation:

- `langfuse.start_as_current_observation(name="span-name", as_type="span")` or `langfuse.start_as_current_span(name="span-name")` (legacy)
- `langfuse.start_as_current_observation(name="gen-name", as_type="generation", model="gpt-4")` or `langfuse.start_as_current_generation(name="gen-name", model="gpt-4")` (legacy)

For manual control (no automatic context setting or end):

- `langfuse.start_observation(name="span-name")`
- `span.start_observation(name="child-span-name")` on a `LangfuseSpan` object

Example:

```python
from langfuse import get_client
langfuse = get_client()

with langfuse.start_as_current_observation(as_type="span", name="process-request") as span:
    span.update(output="Processing complete")
    with langfuse.start_as_current_observation(as_type="generation", name="llm-response", model="gpt-3.5-turbo") as generation:
        generation.update(output="LLM response")

langfuse.flush()
```

Note: SDK v3 is legacy; current version is v4+ with unified `start_observation` API. Official API reference: https://python.reference.langfuse.com (covers latest, shows `LangfuseSpan` and `start_as_current_observation` patterns). The v3 upgrade guide notes migration from `start_span`/`start_generation` to `start_observation(as_type=...)`. For self-hosted, requires Langfuse >=3.125.0.
Citations:
- 1: https://python.reference.langfuse.com/langfuse
- 2: https://python.reference.langfuse.com/langfuse.html
- 3: https://langfuse.com/docs/sdk/python/sdk-v3
- 4: https://langfuse.com/docs/observability/sdk/instrumentation
- 5: https://langfuse.com/changelog/2025-06-05-python-sdk-v3-generally-available
- 6: https://github.com/langfuse/langfuse-python/blob/72aa1565/langfuse/_client/span.py
- 7: https://www.apiautomations.com
- 8: https://get.langfuse.com/docs/observability/sdk/upgrade-path/python-v3-to-v4
🌐 Web query:

> Langfuse Python SDK v3 client trace span event methods backward compatibility v2

💡 Result:
The Langfuse Python SDK v3 is not backward compatible with v2 client methods for traces, spans, and events; official migration guides explicitly state this. In v2, the low-level SDK used methods like `langfuse.trace`, `langfuse.span`, `langfuse.generation`, and `langfuse.event` to create observations, with updates via methods on the returned objects. In v3 (and later v4+), these are replaced by OpenTelemetry-based methods:

- `langfuse.start_as_current_observation(as_type="span" | "generation" | "event", name=...)` with a context manager for auto-end, or `langfuse.start_observation()` for manual `.end()`
- Updates use `.update()` on `LangfuseSpan`, `LangfuseGeneration`, or `LangfuseEvent` objects (e.g., `span.update(input=..., output=...)`)
- Events via `langfuse.create_event(name=...)` returning `LangfuseEvent`
- Trace management via `propagate_attributes` or observation-specific trace updates (some deprecated in v4)

Key breaking changes:

- Required `name` parameter for observations
- Explicit `.end()` or context managers (no auto-end like v2)
- No direct `langfuse.trace`/`langfuse.span`; traces are implicit from the root span
- Context propagation automatic via OTEL
- IDs in W3C format

v2 receives only critical fixes; upgrade requires code changes per integration type (`@observe`, low-level, integrations). See the official v2-to-v3 guide: https://langfuse.com/docs/observability/sdk/upgrade-path/python-v2-to-v3
Citations:
- 1: https://langfuse.com/changelog/2025-06-05-python-sdk-v3-generally-available
- 2: https://langfuse.com/docs/observability/sdk/upgrade-path/python-v2-to-v3
- 3: https://python.reference.langfuse.com/langfuse
- 4: https://python.reference.langfuse.com/langfuse.html
- 5: https://github.com/langfuse/langfuse-python/blob/72aa1565/langfuse/_client/client.py
- 6: https://github.com/langfuse/langfuse-python/blob/72aa1565/langfuse/_client/span.py
Langfuse SDK v3 API incompatibility: code uses removed v2-style methods.
The code uses trace(), trace.span(), and trace.event() methods which were removed in Langfuse v3. While langfuse>=3.0.0 is declared as a dependency, v3 replaced these with start_as_current_observation(as_type="span"|"generation") (context manager) or start_observation() (manual), with automatic OTEL context propagation. This adapter will fail at runtime with AttributeError when instantiated with any Langfuse 3.x or 4.x client.
Affected locations: lines 142 (self._client.trace()), 156 (trace.span()), 187 (parent_span.span()), 231 and 247 (trace.event()).
Migrate to v3+ API using start_as_current_observation() context managers or start_observation() for manual lifecycle control, ensuring updates use .update() method on observation objects and explicit .end() calls or context manager exit.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@src/praisonai/praisonai/observability/langfuse.py` around lines 135 - 156,
The code currently calls removed Langfuse v2 APIs (self._client.trace(),
trace.span(), trace.event(), parent_span.span()) so update the adapter to use
the v3+ observation API: replace self._client.trace(...) with creating an
observation via self._client.start_observation(...) or using
self._client.start_as_current_observation(as_type="generation") as a context
manager for the trace-level object; replace trace.span(...) and
parent_span.span(...) with start_as_current_observation(as_type="span") (or
start_observation()) to create span observations, call observation.update(...)
to change metadata, and call observation.end() (or let the context manager exit)
to finish them; similarly replace trace.event(...) usages with
start_observation/start_as_current_observation for events and use
.update()/.end() for lifecycle and metadata updates. Ensure you preserve
metadata keys (agent_id, agent_name, schema_version, event.metadata) when
building the observation, and update references to self._traces[agent_name] to
store the observation object rather than a v2 trace.
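The v2-to-v3 call mapping can be exercised against a stand-in client that mirrors the v3 observation surface quoted above. `FakeObservation`/`FakeClient` are illustrative test doubles, not Langfuse classes; the real adapter would call the same methods on the actual client.

```python
class FakeObservation:
    """Test double mirroring the v3 observation surface the review cites."""
    def __init__(self, name, **kwargs):
        self.name, self.attrs, self.children, self.ended = name, dict(kwargs), [], False

    def start_observation(self, name, **kwargs):
        # v3: children are created on the parent observation.
        child = FakeObservation(name, **kwargs)
        self.children.append(child)
        return child

    def update(self, **kwargs):
        self.attrs.update(kwargs)

    def end(self):
        self.ended = True


class FakeClient:
    def start_observation(self, name, **kwargs):
        return FakeObservation(name, **kwargs)


# v2 -> v3 call mapping the adapter must follow:
#   client.trace(...)  -> client.start_observation(...)  (root; trace is implicit)
#   trace.span(...)    -> root.start_observation(..., as_type="span")
#   trace.event(...)   -> observation updates / create_event
client = FakeClient()
root = client.start_observation("agent-run", metadata={"agent_id": "a1"})
tool = root.start_observation("tool:search", as_type="span")
tool.update(output="3 results")
tool.end()
root.end()
```

Storing these observation objects in `_traces`/`_spans` (instead of v2 trace handles) keeps the rest of the sink's bookkeeping unchanged.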
```python
def close(self) -> None:
    """Close the sink and release resources."""
    if not self._closed:
        self._closed = True
        self.flush()
        if self._client:
            try:
                # Close any remaining spans
                with self._lock:
                    for span in self._spans.values():
                        try:
                            span.end()
                        except:
                            pass
                    self._spans.clear()
                    self._traces.clear()
```
close() never flushes the spans it ends.
_closed is set before flush(), so the first flush is skipped, and the remaining spans are ended only after that attempted flush. Short-lived processes that rely on close() can therefore drop the final batch.
♻️ Safer shutdown order
```diff
 def close(self) -> None:
     """Close the sink and release resources."""
-    if not self._closed:
-        self._closed = True
-        self.flush()
-        if self._client:
-            try:
-                # Close any remaining spans
-                with self._lock:
-                    for span in self._spans.values():
-                        try:
-                            span.end()
-                        except:
-                            pass
-                    self._spans.clear()
-                    self._traces.clear()
-            except Exception:
-                pass
+    if self._closed:
+        return
+    if self._client:
+        with self._lock:
+            for span in list(self._spans.values()):
+                try:
+                    span.end()
+                except Exception:
+                    pass
+            self._spans.clear()
+            self._traces.clear()
+    self.flush()
+    self._closed = True
```

🧰 Tools

🪛 Ruff (0.15.7)
[error] 267-267: Do not use bare except
(E722)
[error] 267-268: try-except-pass detected, consider logging the exception
(S110)
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@src/praisonai/praisonai/observability/langfuse.py` around lines 255 - 270,
The shutdown in close() sets self._closed before calling flush(), which prevents
flush() from sending the final batch and then ends spans in _spans afterwards;
fix by changing the order so spans are ended and flushed while the sink is still
open (e.g., call self.flush() and end/clear spans/_traces first, then set
self._closed = True), or alternatively ensure flush() ignores the _closed flag
during this shutdown path; update the close() method references (close(),
_closed, flush(), _spans, span.end(), _traces) accordingly.
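The ordering fix above can be modeled in isolation. `SinkShutdown` below is a minimal illustrative model of the corrected `close()` (names simplified from the adapter): end spans first, flush while the sink is still open, and only then mark it closed, so the final batch is never dropped.

```python
import threading


class SinkShutdown:
    """Minimal model of the corrected close() ordering (illustrative)."""

    def __init__(self, client, spans):
        self._client, self._spans = client, spans
        self._traces = {}
        self._lock = threading.Lock()
        self._closed = False
        self.flushed = 0  # counts successful flushes, for demonstration

    def flush(self):
        if self._closed:  # a closed sink must not flush
            return
        self.flushed += 1

    def close(self):
        if self._closed:
            return
        if self._client:
            with self._lock:
                for span in list(self._spans.values()):
                    try:
                        span.end()
                    except Exception:
                        pass
                self._spans.clear()
                self._traces.clear()
        self.flush()         # final batch goes out...
        self._closed = True  # ...before the sink refuses further work
```

With the original ordering (`_closed = True` before `flush()`), `flushed` would stay at 0; with the corrected order it is exactly 1, and a second `close()` is a no-op.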
|
@claude You are the FINAL architecture reviewer. Read ALL comments above from Gemini, Qodo, CodeRabbit, and Copilot carefully before responding.

Phase 1: Review per AGENTS.md
1. Protocol-driven: check heavy implementations vs core SDK
2. Backward compatible: ensure zero feature regressions
3. Performance: no hot-path regressions

Phase 2: FIX Valid Issues
4. For any VALID bugs or architectural flaws found by Gemini, CodeRabbit, Qodo, Copilot, or any other reviewer: implement the fix
5. Push all code fixes directly to THIS branch (do NOT create a new PR)
6. Comment a summary of exact files modified and what you skipped

Phase 3: Final Verdict
7. If all issues are resolved, approve the PR / close the Issue
8. If blocking issues remain, request changes / leave clear action items
|
Claude finished @MervinPraison's task - View job

Final Architecture Review ✅

Phase 1: Architecture Review per AGENTS.md ✅
Phase 2: Fix Valid Issues ✅
Phase 3: Final Verdict ✅

🎯 FINAL VERDICT: APPROVED FOR MERGE ✅

✅ What Was Fixed (Critical Issues)
✅ Architecture Compliance
🗑️ What Was Skipped (Non-Critical)

All critical blockers have been resolved. Architecture follows AGENTS.md principles. No backward compatibility concerns. Ready for production use.

Recommendation: ✅ MERGE AND CLOSE ISSUE

Commit: 6986f78 | Branch: claude/issue-1270-20260402-2212 | Job Run:
- Update LangfuseSink to use Langfuse v3+ API (start_observation vs trace/span/event)
- Fix CLI port parsing for Docker Compose JSON format (Publishers is array not string)
- Fix config->test workflow: load ~/.praisonai/langfuse.env in test command
- Improve agent/tool span collision safety with unique keys
- Add proper UUID-based tool invocation tracking

Co-authored-by: MervinPraison <454862+MervinPraison@users.noreply.github.com>
- `except:` → `except Exception:` in CLI and `close()` method (2 locations)
- `import requests` and `import json` outside loops/repeated calls → `requests_lib` alias at top of function, `json` as module-level import
- `_requests` → `requests_lib` (avoids misleading private-variable convention)
- `ImportError` for missing credentials → `ValueError` (semantically correct)
- `LangfuseSinkConfig` host env-var bug: host default was `"https://cloud.langfuse.com"`, causing `or`-based env-var reading to short-circuit; changed default to `""` and always resolve env vars in `__post_init__`
- `close()` ordering bug: was setting `_closed = True` before calling `flush()`, so `flush()` always no-op'd inside `close()`; fixed to flush first, then mark closed
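The host env-var fix in the last summary can be illustrated with a small dataclass. This is a hedged sketch, not the real `LangfuseSinkConfig`: the class name, field set, and `LANGFUSE_HOST` precedence are assumptions for demonstration.

```python
import os
from dataclasses import dataclass

DEFAULT_HOST = "https://cloud.langfuse.com"


@dataclass
class LangfuseSinkConfigSketch:
    """Illustrative config: host defaults to "" so the env var is always
    consulted, instead of being short-circuited by an `or` against a
    non-empty default."""
    host: str = ""

    def __post_init__(self):
        # Precedence: explicit argument > environment variable > default.
        self.host = self.host or os.environ.get("LANGFUSE_HOST", "") or DEFAULT_HOST
```

With the old default of `"https://cloud.langfuse.com"`, `self.host or env` always took the default and a user's `LANGFUSE_HOST` was silently ignored; with `""`, the env var wins whenever no explicit host is passed.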