Steven Matthiesen slmatthiesen

AI Engineer building agentic systems

👋 /me

const steven = {
  role:    "AI Engineer · CTO @ INTU",
  focus:   ["LLM agents", "agentic issue-fix pipelines", "RAG",
            "multimodal doc + image ingestion", "evals", "observability", "fully agentic systems"],
  stack:   ["React", "Node", "Python", "Rust", "Postgres", "GraphQL"],
  web3:    ["MPC", "DKG", "EVM", "Solana", "Solidity"],
  shipping: "production agent systems",
};

I build agent systems that survive contact with production — tool-using LLMs wired through MCP, grounded by RAG, gated by eval harnesses, and instrumented end-to-end so failures are observable instead of mysterious. Before AI I spent four years deep in Web3, leading an MPC wallet-infrastructure team across cryptography, smart contracts, and Rust.

🔁 /method

   ideas ──▶ evals ──▶ guardrails ──▶ build ──▶ review ──▶ production
     ▲                                                           │
     └─────────── observe · measure · iterate · harden ◀────────┘

Design intent before code: I write the evals and guardrails before a line ships, then let observability close the loop — every production failure feeds the next iteration instead of disappearing.

🧠 /stack

AI / ML

Languages & Core

Platforms

Web3

🚀 /review

🔐 INTU — Web3 onboarding via MPC

CTO · Lead Engineer

Open-source NPM package orchestrating distributed key generation (DKG) and multi-party computation, removing seed phrases from the onboarding flow. Cross-chain transaction flows across EVM networks, bridged to Solana — sending a Solana tx authorized by an EVM signature. Self-hosted The Graph indexers for chains without hosted support.

Rust · Solidity · MPC · EVM · TypeScript

🤖 Agentic Github Issues Fixer

Autonomous coding agent

An agent that triages open GitHub issues, reproduces the bug, drafts a fix, and opens a PR — closing the loop from issue to reviewable change. Proof: medplum/medplum#9293 — an upstream OSS fix landed fully agentically (working branch).

Agents · Tool Use · GitHub API · OSS

▶️ Watch the demo

📈 Algorithmic Futures Trading

Quant Research · WIP

Backtest harness and execution research for systematic futures strategies — applying the same eval + observability discipline I use on AI agents to strategy selection, slippage modeling, and live risk.

Python · Quant · Backtesting · WIP

▶️ Watch the demo

🩺 OpenEMR Clinical Agent

Selected Project · 2026

LLM agent layered onto an open-source EHR that reads patient charts and relays clinical context on demand. Lab-report ingestion pipeline produces summaries with source-page citations, so clinicians can verify any agent-surfaced claim — a RAG pattern tuned for high-stakes clinical use.

RAG · LLM Agents · Citations · Healthcare

▶️ Watch the demo

🍻 Happy Hour Friends — Crowdsourced happy hour finder

Live · 2026

Fully agent-operated site: every update — parsed automatically from the web or submitted by users — passes strict agentic moderation gates (classify → verify, versioned prompts, audited apply path) before going live. The test: can my agent safeguards run the site without my intervention? The product itself is dead-simple — venues and deals in one sortable, filterable view, kept current by crowdsourcing.

Agents · Crowdsourcing · Moderation Gates · Next.js

🍽️ GURUPass / Pass Rewards — Restaurant AI Agents

Lead AI & Blockchain Engineer

Tool-using LLM agents handling order intake and menu Q&A, wired through MCP with structured-output validation. Curated eval set + offline regression harness catches failures before deploy; production traces drive failure-mode analysis. Personalization layer surfaces targeted coupons from purchase history.

MCP · Agents · Evals · Personalization

⚡ Efficiency > token-maxing

I burn a lot of tokens — on purpose. But spending them to look busy is waste.
The craft is signal per token: tight context, sharp evals, and failure modes that are observable instead of mysterious.
Every system above was designed, built, and shipped on a ~$100/month plan.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Steven Matthiesen slmatthiesen

Achievements

Achievements

Block or report slmatthiesen

AI Engineer building agentic systems

👋 /me

🔁 /method

🧠 /stack

🚀 /review

🔐 INTU — Web3 onboarding via MPC

🤖 Agentic Github Issues Fixer

📈 Algorithmic Futures Trading

🩺 OpenEMR Clinical Agent

🍻 Happy Hour Friends — Crowdsourced happy hour finder

🍽️ GURUPass / Pass Rewards — Restaurant AI Agents

⚡ Efficiency > token-maxing

Popular repositories Loading

Uh oh!