AI Portfolio & Achievements

Suraj Varma

AI Systems Integrator

I build and run reliable AI automation. The work is taking existing components, wiring them into pipelines that actually hold up, and keeping them running with the monitoring, error tracking, and infrastructure work that most automation skips.

Selected work, March to June 2026

Tool Integration

Taking APIs like Amadeus, Google Calendar, and Reddit and turning them into pipelines that do real work.

Operational Discipline

Cron, email, monitoring, heartbeat checks. The unglamorous part that keeps automation reliable.

Troubleshooting Depth

Docker networking, auth chains, nginx, systemd. I fix the problems that stop everything else from working.

Reusable Procedures

Skills that capture how I work so I can reuse them: Devil's Advocate, Grill-Me, Diagnose, and others.

Built Systems

Code I wrote, shipped, and still run.

Aegis Gateway: Defensive LLM Request Interception

Built

Python · FastAPI · API-key auth · JWT · rate limiting · policy DSL

A hardened gateway that sits in front of an LLM and checks every request before it goes through: authentication, rate limits, request shape, and logging with sensitive fields redacted. Policy comes from a JSON config that gets validated at startup.

API key auth, plus optional JWT with configurable issuer and audience
Per-user rate limiting, with guards on the request body for Content-Length, schema bounds, and metadata depth
Audit logging that redacts by default, hashing prompts and user IDs
Refuses to start under insecure production settings. Ships with a full runbook and validation reports.
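
A minimal sketch of the redact-by-default idea, not the shipped gateway code. The field names, the `sha256:` prefix, the 16-character truncation, and the `salt` parameter are all assumptions made for illustration.

```python
import hashlib
import json

# Hypothetical field names; the real gateway's log schema is not shown here.
SENSITIVE_FIELDS = {"prompt", "user_id"}


def fingerprint(value: str, salt: str = "") -> str:
    """Return a short SHA-256 digest so entries can be correlated without the raw value."""
    return hashlib.sha256((salt + value).encode("utf-8")).hexdigest()[:16]


def redact(event: dict, salt: str = "") -> dict:
    """Copy an audit event, replacing sensitive fields with their digests."""
    redacted = {}
    for key, value in event.items():
        if key in SENSITIVE_FIELDS and value is not None:
            redacted[key] = "sha256:" + fingerprint(str(value), salt)
        else:
            redacted[key] = value
    return redacted


if __name__ == "__main__":
    event = {"user_id": "u-123", "prompt": "summarise this contract", "status": 200}
    print(json.dumps(redact(event), indent=2))
```

Hashing keeps two log lines from the same user linkable without storing the ID itself. A per-deployment salt makes short, guessable IDs harder to reverse by brute force.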

Multi-Agent Factory

Phase 1 Live

Multi-agent orchestration · Gemini · Sonnet · Codex · Opus · state files

A five-agent pipeline designed to run overnight, so a researched, designed, built, and reviewed product is waiting in the morning.

Scout finds pain points on Reddit, HN, and GitHub. Architect turns one into full specs. Builder ships an MVP, Critic reviews it, and Marketer writes the launch copy.
Agents hand off through state files, with each task routed to the right model for how hard it is.
Everything lands in a morning email. Scout and Architect are running now.

Job Search Automation

Active

Node.js · email parsing · scoring algorithm · HTML dashboard · CV tailoring

Reads job alert emails, pulls out the details, and scores each role from 0 to 100% with hard disqualifiers for fields I will not work in. Matches show up in a searchable dashboard, and for the best ones it drafts a tailored CV and cover letter, then tracks the application from first sighting to outcome.
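
A rough sketch of scoring with hard disqualifiers. The project itself is Node.js; Python is used here only to show the logic. The keywords, weights, and the `excluded-industry` placeholder are hypothetical and do not reflect the real rules.

```python
from dataclasses import dataclass

# Placeholder values for illustration only.
DISQUALIFIERS = {"excluded-industry"}
WEIGHTS = {"python": 30, "automation": 30, "remote": 20, "llm": 20}


@dataclass
class Job:
    title: str
    description: str


def score(job: Job) -> int:
    """Return a 0-100 score; any disqualifier forces 0 regardless of other matches."""
    text = f"{job.title} {job.description}".lower()
    if any(term in text for term in DISQUALIFIERS):
        return 0
    total = sum(weight for keyword, weight in WEIGHTS.items() if keyword in text)
    return min(total, 100)


if __name__ == "__main__":
    print(score(Job("AI Engineer", "Python automation, remote")))
```

Checking disqualifiers before any weights are summed means a strong keyword match can never outvote an exclusion.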

AI Constraint Library: MCP Server

Built

Node.js · Model Context Protocol (MCP) · PostgreSQL · pgvector

An MCP server that records when something gets rejected and turns those events into structured constraints you can search by meaning. You retrieve them by project or task context, and each one moves through a draft, active, and deprecated lifecycle.
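
The lifecycle can be pictured as a small state machine. This is a sketch in Python rather than the server's Node.js, and the set of allowed transitions is an assumption; the description above only names the three states.

```python
from enum import Enum


class Status(Enum):
    DRAFT = "draft"
    ACTIVE = "active"
    DEPRECATED = "deprecated"


# Assumed transitions: drafts can be promoted or dropped, and
# deprecated constraints are terminal.
ALLOWED = {
    Status.DRAFT: {Status.ACTIVE, Status.DEPRECATED},
    Status.ACTIVE: {Status.DEPRECATED},
    Status.DEPRECATED: set(),
}


def transition(current: Status, target: Status) -> Status:
    """Return the new status, or raise if the move is not allowed."""
    if target not in ALLOWED[current]:
        raise ValueError(f"cannot move from {current.value} to {target.value}")
    return target


if __name__ == "__main__":
    print(transition(Status.DRAFT, Status.ACTIVE).value)
```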

OpenClaw Memory Dashboard

Functional MVP

Node.js · Express · Vanilla JS · Tailwind CSS (no build step)

A web dashboard for a markdown-based memory system. It has a category file browser, full-text search across 120-plus files, and a markdown viewer with syntax highlighting. It works on a phone, and listings come back in about 50 to 100 ms.

Daily AI News Brief

Live

Cron · Reddit API · web search · Telegram · Opus

A scheduled job that pulls AI news from r/LocalLLaMA, r/MachineLearning, and r/artificial, adds a web search pass, and sends me the brief on Telegram every morning.

Travel Deal Intelligence

Live

Amadeus API · OzBargain · web search · Cron · scoring algorithm

Scores travel deals every day against my budget, destinations, and airline rules, then surfaces the hotels, booking links, and flights that actually fit.

Trip Logistics Autopilot

Live

Google Calendar API · Cron · automated email

Automatic pre-trip reminders for packing, currency, and documents, sent 7, 5, 3, and 1 days before departure based on a tracked travel calendar.
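
The reminder schedule is simple date arithmetic. A sketch, assuming a daily cron run that already knows the departure date; the function names are illustrative and the calendar lookup and email step are left out.

```python
from datetime import date, timedelta

# Days before departure on which a reminder goes out.
REMINDER_OFFSETS = (7, 5, 3, 1)


def reminder_dates(departure: date, today: date) -> list[date]:
    """Reminder dates for a trip, skipping any that are already in the past."""
    dates = [departure - timedelta(days=n) for n in REMINDER_OFFSETS]
    return [d for d in dates if d >= today]


def due_today(departure: date, today: date) -> bool:
    """True when today's run should send a reminder for this trip."""
    return (departure - today).days in REMINDER_OFFSETS


if __name__ == "__main__":
    print(reminder_dates(date(2026, 6, 20), date(2026, 6, 14)))
```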

Skills I Built

Procedures and orchestration I wrote myself, not installed off the shelf.

Devil's Advocate

Spins up an Advocate and a Skeptic sub-agent to argue both sides of a plan, then reports back on overclaims, backfire risks, and specific objections. I ran it on this portfolio.

Grill-Me

Interrogates a plan with hard questions until it actually holds together, updating the docs as decisions get made.

Write-a-Skill

A repeatable process for building new skills: gather requirements, draft it, review with me, then ship it.

Diagnose

A fixed loop for hard bugs: reproduce, minimise, hypothesise, instrument, fix, then add a regression test. The rule is no fixes without a root cause.

TDD

Test-driven development (red, green, refactor) for both new features and bug fixes.

Triage + To-Issues

Runs issues through a triage state machine, and breaks plans into tickets someone can pick up and finish on their own.

Operational Infrastructure

The pipelines and fixes that keep everything else running.

Email Infrastructure

A nodemailer and Gmail SMTP path I built after the local mail setup turned out to be accepting mail and quietly dropping it. It is now the email route for every automation here.

Cron & Heartbeat Monitoring

A health-check system that tracks timestamps and errors across embedding failover, signal watches, backups, and model health checks.
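
One way to think about the timestamp side of this is a staleness check: each job records when it last succeeded, and anything older than its allowed window gets flagged. A sketch under that assumption; the job names and windows below are made up.

```python
from datetime import datetime, timedelta, timezone


def stale_jobs(
    last_seen: dict[str, datetime],
    max_age: dict[str, timedelta],
    now: datetime,
) -> list[str]:
    """Names of jobs that never reported or reported too long ago, sorted."""
    return sorted(
        name
        for name, limit in max_age.items()
        if name not in last_seen or now - last_seen[name] > limit
    )


if __name__ == "__main__":
    now = datetime.now(timezone.utc)
    seen = {"backup": now - timedelta(hours=30)}
    print(stale_jobs(seen, {"backup": timedelta(hours=26)}, now))
```

Treating a missing timestamp as stale catches jobs that were never scheduled, not only ones that stopped.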

Deterministic Model Routing

A fallback chain that sends each task to the right model by cost and capability, with API keys managed per provider.
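
A sketch of a fixed-order fallback chain. `ProviderError`, the provider names, and the callables are hypothetical stand-ins for real API clients; ordering the chain from cheapest capable model upward is an assumption about how cost and capability are weighed.

```python
from typing import Callable


class ProviderError(Exception):
    """Raised by a provider call that should trigger fallback."""


Provider = tuple[str, Callable[[str], str]]


def route(prompt: str, chain: list[Provider]) -> tuple[str, str]:
    """Try providers in order and return (name, reply) from the first that succeeds."""
    errors = []
    for name, call in chain:
        try:
            return name, call(prompt)
        except ProviderError as exc:
            errors.append(f"{name}: {exc}")
    raise RuntimeError("all providers failed: " + "; ".join(errors))


if __name__ == "__main__":
    def cheap(prompt: str) -> str:
        raise ProviderError("rate limited")

    def strong(prompt: str) -> str:
        return prompt.upper()

    print(route("hello", [("cheap-model", cheap), ("strong-model", strong)]))
```

Because the chain is an ordered list rather than a scoring function, the same task type always tries the same models in the same order, which keeps routing decisions easy to audit.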

Local Heartbeat Triage

A llama.cpp and Qwen2.5 script that does first-pass urgency filtering locally, so simple checks never hit a paid model. Timeout-bounded, with all six tests passing.

Inbound Media Scan

A scanner that checks both the inbound folder and the home directory, which fixed a recurring problem where shared images never reached the image tool.

Wikilink Auto-Linking

Automatic [[wikilink]] resolution across the workspace, so I stopped linking everything by hand.

Tooling Evaluated & Configured

Twenty-plus skills I evaluated, picked, and wired into a working setup. The credit here is in the curation, not the original code.

office-hours · product brainstorm
ceo-review · plan review
investigate · root-cause debugging
retro · engineering retrospective
summarize
tavily · web research
browser-automation
grill-me
diagnose
tdd
triage
to-issues
to-prd
improve-codebase-architecture
zoom-out
+ more

Research & Explorations

Concepts and specs I worked through to figure out what was worth building.

AI Sentinel

An enterprise AI-governance concept: discover, classify, monitor, and report against the EU AI Act, NIST AI RMF, and GDPR.

AI Research Assistant

A design for a personalised research system that pulls from several sources, with an interest profile, trend tracking, and a "why this matters for you" angle.

Second Brain

A full implementation spec on PostgreSQL, pgvector, and FastAPI, organised with the PARA method.

xAI Cookbook Study

Worked through the official Grok cookbook to learn its Realtime API patterns, WebSocket streaming, and function calling.

Knowledge Base

LEARNINGS.md

A 10,000-plus word operational knowledge base of corrections, workflow discoveries, and preferences.

ERRORS.md

A root-cause error log covering Docker networking, cron regressions, and delivery failures, each with prevention steps.

REFLECTIONS.md

Consistent post-task review: what worked, what did not, what to do differently.

Learning Sprint

Daily learning discipline across nine rotating focus areas, with weekly progress reviews.