SANJAY CHAUHAN · AI ENGINEERING LEADERAVAILABLE FOR CONTRACT & CONSULTING

I build AI that shows its receipts.

I build production AI products from scratch by directing AI coding agents, the same way I lead engineers, then prove they work. Eight years shipping healthcare agents and eval-first systems that cite a verified source or abstain, never a confident guess. Available to US teams as an advisor, a fractional lead, or a hands-on builder.

Book a 30-min call See it verify

FILE / SC-001CONFIDENCE: HIGH

SUBJECT

ROLE: AI Engineering Lead
FOCUS: Ships products via agentic dev
PROVES: 3 OSS, Apache-2.0
AVAIL: Contract / consulting

EIGHT YEARS ACROSSAssured Health·Forage AI·Xebia·Headfield

8yrs

shipping production AI and data systems

1M+

records extracted per day, at peak

US states covered by live verification agents

open-source tools shipped, all Apache-2.0

THE PROOFEXHIBIT 01

Most AI asks you to trust it.
Mine hands you the receipt.

A citeproof receipt. Pick a claim and watch it find the exact supporting line, or abstain when no line supports it.

HOW IT WORKS

// SELECT A CLAIM TO VERIFY

ENTAILMENT

VERDICTSUPPORTED

GATEsupporting span found

SCORE0.96

Interactive explainer. The real run is shown below.

en.wikipedia.org/wiki/Coral_reefsnapshot, illustrative

Coral reef, Threats

Destructive fishing practices, including blast fishing and cyanide fishing, damage reef structures and reduce fish populations. Rising ocean temperatures cause coral bleaching, in which corals expel the symbiotic algae living in their tissues. Reefs also buffer coastlines from storm surge and erosion, protecting communities near the shore.

When a claim has no support in the snapshot it is dropped, not guessed. That refusal is the whole product: a local entailment gate against a verified source, anchored to a verbatim span. Abstain over guess.

citeproof running: a question is researched, each cited claim links to a snapshot of the source page with the supporting line highlighted, and unsupported claims are excluded — Not a mockupciteproof, running locally. Same idea, real output.

ENGAGEMENTSEXHIBIT 02

Ways to work together

Three ways in, one conversation. Tell me the problem on a call and we will pick the lightest engagement that solves it.

MOST ASKED

Advisory & eval design

Retainer or sprint

Is your agentic system working, or just demoing well? I build the eval harness that measures it and review the architecture around it.

Eval framework + golden sets
RAG / agent architecture review
Faithfulness + hallucination audits

Fractional AI lead

Part-time, monthly

Embed as your AI engineering lead. I have run teams and shipped production agentic AI, so yours skips the traps and ships sooner.

Senior lead on your team, 10 to 20 hrs/week
Roadmap, hiring bar, code review
EST overlap from India

Build engagements

Fixed scope

You have a scoped problem now. I build it from scratch with agentic development, the RAG pipeline, the agent, or the full product, ship it, and hand it over with the evals that keep it honest.

RAG / retrieval pipelines
Agentic + browser automation
Production deploy on AWS

For the right team, also open to a senior full-time or fractional leadership seat. US-remote, EST overlap, no visa required.

Book a 30-min call

FIELD WORKEXHIBIT 03

Shipped, in production

Production systems I built and led, some end to end with agentic development. The code is proprietary, so the numbers are the proof.

Assured Copilot

Agentic Chrome extension + conversational RAG agent for healthcare credentialing

Built end to end with agentic development, one engineer directing AI agents
Streaming Claude RAG chat that auto-fills payer enrollment and answers payer-policy questions
Semantic field-mapping pipeline with a learning loop that improves from user corrections

Manifest V3Claude APIDjangoDynamoDBLambda

2,006+

health-plan configs

Credentialing Agents Platform

Serverless AI verification across all 50 US states

20+ verification agents (ABMS, CAQH, DEA, OIG, OFAC, NPI, NPDB, state boards)
Pluggable state-specific agent architecture
Async jobs, DynamoDB tracking, EventBridge refresh

Python monorepoLambda + SQSDynamoDBS3

US states covered

Intelligent Browser Unblocker

ML fingerprint synthesis + human-behavior simulation, Forage AI

Resilient extraction across 1M+ profiles per day
Distributed crawling with AWS pipeline orchestration
Bypassed Cloudflare and bot-detection systems at scale

PythonPlaywrightML modelsAWS ECS

1M+

profiles per day

THE EVIDENCE LOCKEREXHIBIT 04

Open source you can run today

Production-grade primitives, all Apache-2.0. Click any repo. Every link resolves.

veriscrape

Fetch, but it tells you the truth.

HTTP returns bytes. veriscrape returns a portable trust verdict alongside them, so downstream systems can decide whether to trust the page.

pypiv0.2.0stars6licenseApache-2.0

$ pip install veriscrapegithub

citeproof

Research that won't cite what it can't verify.

A local deep-research agent. Each cited claim links to a receipt: the page snapshot, highlighted to the supporting line. Unsupported claims are dropped, not guessed.

statusactive devlicenseApache-2.0

$ git clone github.com/san64777/citeproofgithub

acroforge

Turn flat PDFs into real, fillable AcroForms.

Most PDF tools render forms or extract data, not both deterministically. acroforge builds true AcroForms with named fields and calculation logic, zero copyleft surface.

pypiv0.4.0stars8licenseApache-2.0

$ pip install acroforgegithub

eudoxic

Sound judgment from every document.

An AI document workspace for US professionals. Upload anything, it pulls out the parties, dates, amounts, and obligations, then answers questions with citations to the exact document and page. Built solo, from scratch, with agentic development.

statusprivate betastackNext.js, Claudedemoeudoxic.ai

# closed source, demo on requestvisit

METHODEXHIBIT 05

How I work

You are not just buying what I shipped. You are buying how I operate when the model is confidently wrong in production.

05.1

Eval-first

If there is no metric for the change, it does not ship. Golden sets land before code.

05.2

Demos weekly

Working software in front of stakeholders every Friday. Surprises die in the demo, not in production.

05.3

Boring infra

Postgres, Docker, FastAPI, Lambda. The clever part stays in the model layer, not the plumbing.

05.4

Async-first

Real EST overlap, plus progress while your team sleeps. Written decisions over meetings, so the record stays the source of truth.

RECORDEXHIBIT 06

Eight years building production data and AI systems

May 2024 - now

Assured HealthAssociate Engineering Manager

Lead engineering for a production healthcare credentialing platform: Django backend, serverless verification across 50 states, and Assured Copilot, the agentic RAG product I built end to end.

2024

Xebia IT ArchitectsSenior Consultant

Automation architecture and data engineering for enterprise clients.

2019 - 2024

Forage AISenior Data Automation Engineer

Built anti-detection extraction at scale, 1M+ profiles/day, defeating Cloudflare and bot-detection.

2019

Headfield SolutionsTechnical Lead

Led 8 engineers building an NLP-driven real-time social listening platform.

2017 - 2019

Quickcompany, TrizTeck, CollegeduniaDeveloper / Analyst

Legal-court, USPTO patent, and automotive data pipelines. Where the scraping craft started.

B.Tech, Electronics & Communication, Bharati Vidyapeeth College of Engineering, New Delhi, 2013-2017

GET IN TOUCH

Let's talk

If you are building agentic AI and the cost of being confidently wrong is high, that is exactly my work. I reply within 24 hours.

EMAILsan64777@gmail.com BOOK A CALL30-min slot LINKEDIN/in/sanjay-chauhan GITHUB@san64777

Based in New Delhi, US-remote, EST overlap, available for contract and consulting, no visa required.