WillLewis

Will Lewis WillLewis

AI/ML Product Manager | agentic systems, RAG, evals, decision intelligence | Wharton MBA | UPenn

Achievements

atlas-agentic-fraud-lab atlas-agentic-fraud-lab Public

Adversarial Testing Lab for Agentic Safeguards (ATLAS). A synthetic multi-agent eval environment for adversarial fraud decisioning inspired by Anthropic's Project Deal. Measures how model quality, …

Python 1
agent-harness-environment agent-harness-environment Public

An eval and observability cockpit for coding agents. It runs policy-controlled coding agents in sandboxed toy repos, tool-use traces, MCP tools, compares harness policies, scores recovery and safet…

Python
regulated-agent-launch-kit regulated-agent-launch-kit Public

A regulated-agent deployment kit for turning traces, evals, regressions, and approval gates into launch/no-launch decisions

Python
voice-agent-prompt-lab voice-agent-prompt-lab Public

A voice agent demo and prompt evaluation harness for insurance first notice of loss claims

TypeScript
coachbench coachbench Public

An Anthropic Project Deal inspired football adversarial-agent arena for short red-zone strategy contests. Offensive and defensive coordinator agents compete through simultaneous legal play calls, s…

Python 1
google-advanced-data-analytics google-advanced-data-analytics Public

Jupyter notebooks of statistics, EDA, Python, regression, and machine learning project work done on sample TikTok and Waze data

Jupyter Notebook