Pinned
-
atlas-agentic-fraud-lab (Public)
Adversarial Testing Lab for Agentic Safeguards (ATLAS). A synthetic multi-agent eval environment for adversarial fraud decisioning inspired by Anthropic's Project Deal. Measures how model quality, …
Python 1
-
agent-harness-environment (Public)
An eval and observability cockpit for coding agents. It runs policy-controlled coding agents in sandboxed toy repos, tool-use traces, MCP tools, compares harness policies, scores recovery and safet…
Python
-
regulated-agent-launch-kit (Public)
A regulated-agent deployment kit for turning traces, evals, regressions, and approval gates into launch/no-launch decisions.
Python
-
voice-agent-prompt-lab (Public)
A voice agent demo and prompt evaluation harness for insurance first notice of loss (FNOL) claims.
TypeScript
-
coachbench (Public)
A football adversarial-agent arena, inspired by Anthropic's Project Deal, for short red-zone strategy contests. Offensive and defensive coordinator agents compete through simultaneous legal play calls, s…
Python 1
-
google-advanced-data-analytics (Public)
Jupyter notebooks covering statistics, EDA, Python, regression, and machine learning project work on sample TikTok and Waze data.
Jupyter Notebook