agent-debugging

Here are 32 public repositories matching this topic...

liaohch3 / claude-tap

Intercept and inspect Coding Agent API traffic from Claude Code, Codex CLI, Gemini CLI, Cursor CLI, OpenCode, Kimi, Pi, and Hermes in a local trace viewer.

Updated May 30, 2026
Python

najeed / ai-agent-eval-harness

Star

The open-source MultiAgentOps evaluation and verification harness for any industry business workflow.

Updated May 28, 2026
Python

OthmanAdi / langsmith-fetch-skill

Sponsor

Star

🔍 AI observability skill for Claude Code. Debug LangChain/LangGraph agents by fetching execution traces from LangSmith Studio directly in your terminal.

developer-tools observability ai-agents langchain langsmith llm-ops langsmith-tracing developer-tools-ai-agent claude-skills claude-skills-creator claude-skills-hub claude-skills-libary agent-debugging

Updated Apr 6, 2026

cylestio / agent-inspector

Star

Local open-source dev tool to debug, secure, and evaluate LLM agents. Provides static analysis, dynamic security checks, and runtime monitoring - integrates with Cursor and Claude Code.

behavior-analysis agent-trace ai-security-tool agent-security cursor-integration claude-code-plugin agent-debugging

Updated Jan 15, 2026
Python

converra / agent-triage

Star

Diagnose your AI agents in production. Extract policies from prompts, evaluate traces, generate diagnostic reports.

Updated Mar 10, 2026
TypeScript

Ylsssq926 / clawclip

Star

Cut your OpenClaw / ZeroClaw token bill. Find which model earns its cost. Prove whether optimizations actually work. Local, no upload.

hermes ai-agent ai-observability cost-reduction local-ai agent-tools llm-cost token-optimization agent-debugging openclaw zeroclaw hermes-agent agent-analytics prompt-efficiency

Updated May 28, 2026
TypeScript

aaronlab / browsertrace

Star

Local replay debugger for Browser Use failures with screenshots, model I/O, failed-step timelines, and public-safe HTML exports.

Updated May 14, 2026
Python

amitmishrg / agenticlens

Star

Visual debugging, tracing, and replay for agent workflows.

nodejs ai reactjs devtools tracing developer-tools visualizations observability debugging-tools ai-agents log-visualization jsonl ai-observability llm agentic-ai agent-workflows workflow-visualization agent-debugging execution-tracing

Updated Mar 27, 2026
JavaScript

kangjinghang / agent-chatlens

Star

🔍 A beautiful web viewer for AI agent session files. Browse Claude Code & OpenClaw conversations with chat-style UI, timeline visualization, and zero setup.

react visualization typescript developer-tools dark-mode chat-ui claude conversation-analysis jsonl vite ai-agent session-viewer claude-code agent-debugging openclaw jsonl-viewer tool-call-visualization

Updated May 19, 2026
TypeScript

xiaoshuo1988130 / deepseek-compat-kit

Star

Compatibility and diagnostics for DeepSeek V4 tool-calling agents

json-schema llm-proxy deepseek tool-calling openai-compatible agent-debugging deepseek-v4 reasoning-content

Updated May 27, 2026
JavaScript

ChainWatch is a flight data recorder for multi-step AI systems. It's a CLI-based tool that records every step in an AI decision chain, links them together in order, prevents tampering, and allows you to verify the chain's integrity and replay the full decision flow.

ai artificial-intelligence audit-log autonomous-agents ai-agents ai-engineering ai-observability llm llmops ai-tracing agent-observability ai-audit agent-debugging tool-using-agents decision-tracing

Updated Jan 22, 2026
Python

rty90 / Android-Agent-Reliability-Runtime

Star

Android Agent Reliability Runtime A debugging and safety runtime for mobile GUI agents: detect readiness, block unsafe actions, verify progress, diagnose failures, and save reproducible traces.

adb android-automation llm-agent mobile-agent gui-agent agent-observability android-agent agent-runtime agent-debugging ui-automation-testing

Updated May 28, 2026
Python

jigjoy-ai / kaleidoskop

Star

Kaleidoskop — replay your baro/Mozaik agent runs visually. Audit log → hexagonal neural firing in your browser.

visualization typescript multi-agent replay mozaik observability ai-agents baro llm agent-orchestration agent-debugging jigjoy

Updated May 23, 2026
TypeScript

Exploreunive / agentlens

Star

Explain why your agent failed — root-cause debugging, memory attribution, and run divergence for LLM agents.

python memory tracing developer-tools observability ai-agents llm agent-debugging

Updated Mar 31, 2026
Python

valani9 / vstack

Star

AI agents fail like junior teammates—looping on bad ideas, ignoring feedback, escalating commitment. vstack ports 34 of the most-cited organizational-behavior frameworks so you can diagnose your agents the same way you'd diagnose your team.

python docker multi-agent-systems ai-agents fastapi psychological-safety organizational-behavior llmops llm-evaluation model-context-protocol mcp-server agent-evaluation agent-observability agent-debugging after-action-review

Updated May 25, 2026
Python

joshualamerton / AgentLens

Star

A real-time observability and debugging layer for AI agents.

python machine-learning ai machine-learning-algorithms devtools agents ai-agents machine-learning-projects llms ai-devtools agent-debugging

Updated Mar 11, 2026
Python

aryanVijaywargia / Continua

Star

Self-hosted debugging for AI agent runs

react go debugging postgres typescript openapi self-hosted tracing observability ai-agents agent-debugging

Updated May 30, 2026
Go

zengin0201 / AI_Debugger

Star

ai-agents react-flow langchain visual-debugger agent-debugging langchain-debugger

Updated May 27, 2026
Python

daslabhq / scenegrad

Star

TDD for AI agents — watch world state morph step-by-step. Drop-in for Vercel AI SDK / Anthropic SDK / LangChain. Scrubbable trajectories + bulk grid view.

typescript ai tdd evaluation observability trajectory ai-agents llm anthropic vercel-ai-sdk llm-agents agent-evaluation agent-observability agent-debugging

Updated May 12, 2026
TypeScript

mda-diaz / runlens

Star

RunLens helps teams compare and debug AI agent runs with step timelines, run diffs, and cost analysis.

python ai-agents fastapi observability-analyze llmops agent-debugging

Updated Apr 1, 2026
HTML

Improve this page

Add a description, image, and links to the agent-debugging topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the agent-debugging topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

agent-debugging

Here are 32 public repositories matching this topic...

liaohch3 / claude-tap

najeed / ai-agent-eval-harness

OthmanAdi / langsmith-fetch-skill

cylestio / agent-inspector

converra / agent-triage

Ylsssq926 / clawclip

aaronlab / browsertrace

amitmishrg / agenticlens

kangjinghang / agent-chatlens

xiaoshuo1988130 / deepseek-compat-kit

Tarunjit45 / ChainWatch

rty90 / Android-Agent-Reliability-Runtime

jigjoy-ai / kaleidoskop

Exploreunive / agentlens

valani9 / vstack

joshualamerton / AgentLens

aryanVijaywargia / Continua

zengin0201 / AI_Debugger

daslabhq / scenegrad

mda-diaz / runlens

Improve this page

Add this topic to your repo