Skip to content
SANJAY.CHAUHANBook a call
SANJAY CHAUHAN · AI ENGINEERING LEADERAVAILABLE FOR CONTRACT & CONSULTING

I build AI that shows its receipts.

I build production AI products from scratch by directing AI coding agents, the same way I lead engineers, then prove they work. Eight years shipping healthcare agents and eval-first systems that cite a verified source or abstain, never a confident guess. Available to US teams as an advisor, a fractional lead, or a hands-on builder.

FILE / SC-001CONFIDENCE: HIGH
Sanjay ChauhanSUBJECT
ROLE
AI Engineering Lead
FOCUS
Ships products via agentic dev
PROVES
3 OSS, Apache-2.0
AVAIL
Contract / consulting
EIGHT YEARS ACROSSAssured Health·Forage AI·Xebia·Headfield
8yrs
shipping production AI and data systems
1M+
records extracted per day, at peak
50
US states covered by live verification agents
3
open-source tools shipped, all Apache-2.0
THE PROOFEXHIBIT 01

Most AI asks you to trust it.
Mine hands you the receipt.

A citeproof receipt. Pick a claim and watch it find the exact supporting line, or abstain when no line supports it.

HOW IT WORKS
// SELECT A CLAIM TO VERIFY
Supporting line highlighted in the snapshot.
ENTAILMENT
VERDICTSUPPORTED
GATEsupporting span found
SCORE0.96

Interactive explainer. The real run is shown below.

en.wikipedia.org/wiki/Coral_reefsnapshot, illustrative

Coral reef, Threats

Destructive fishing practices, including blast fishing and cyanide fishing, damage reef structures and reduce fish populations. Rising ocean temperatures cause coral bleaching, in which corals expel the symbiotic algae living in their tissues. Reefs also buffer coastlines from storm surge and erosion, protecting communities near the shore.

When a claim has no support in the snapshot it is dropped, not guessed. That refusal is the whole product: a local entailment gate against a verified source, anchored to a verbatim span. Abstain over guess.

Not a mockupciteproof, running locally. Same idea, real output.
citeproof running: a question is researched, each cited claim links to a snapshot of the source page with the supporting line highlighted, and unsupported claims are excluded
Ask a question, citeproof fetches sources, verifies each claim against the page, and opens a receipt with the supporting line highlighted. Claims it cannot verify are excluded, not guessed. See the repo.
ENGAGEMENTSEXHIBIT 02

Ways to work together

Three ways in, one conversation. Tell me the problem on a call and we will pick the lightest engagement that solves it.

MOST ASKED

Advisory & eval design

Retainer or sprint

Is your agentic system working, or just demoing well? I build the eval harness that measures it and review the architecture around it.

  • Eval framework + golden sets
  • RAG / agent architecture review
  • Faithfulness + hallucination audits

Fractional AI lead

Part-time, monthly

Embed as your AI engineering lead. I have run teams and shipped production agentic AI, so yours skips the traps and ships sooner.

  • Senior lead on your team, 10 to 20 hrs/week
  • Roadmap, hiring bar, code review
  • EST overlap from India

Build engagements

Fixed scope

You have a scoped problem now. I build it from scratch with agentic development, the RAG pipeline, the agent, or the full product, ship it, and hand it over with the evals that keep it honest.

  • RAG / retrieval pipelines
  • Agentic + browser automation
  • Production deploy on AWS

For the right team, also open to a senior full-time or fractional leadership seat. US-remote, EST overlap, no visa required.

Book a 30-min call
FIELD WORKEXHIBIT 03

Shipped, in production

Production systems I built and led, some end to end with agentic development. The code is proprietary, so the numbers are the proof.

Assured Copilot

Agentic Chrome extension + conversational RAG agent for healthcare credentialing

  • Built end to end with agentic development, one engineer directing AI agents
  • Streaming Claude RAG chat that auto-fills payer enrollment and answers payer-policy questions
  • Semantic field-mapping pipeline with a learning loop that improves from user corrections
Manifest V3Claude APIDjangoDynamoDBLambda
2,006+
health-plan configs

Credentialing Agents Platform

Serverless AI verification across all 50 US states

  • 20+ verification agents (ABMS, CAQH, DEA, OIG, OFAC, NPI, NPDB, state boards)
  • Pluggable state-specific agent architecture
  • Async jobs, DynamoDB tracking, EventBridge refresh
Python monorepoLambda + SQSDynamoDBS3
50
US states covered

Intelligent Browser Unblocker

ML fingerprint synthesis + human-behavior simulation, Forage AI

  • Resilient extraction across 1M+ profiles per day
  • Distributed crawling with AWS pipeline orchestration
  • Bypassed Cloudflare and bot-detection systems at scale
PythonPlaywrightML modelsAWS ECS
1M+
profiles per day
THE EVIDENCE LOCKEREXHIBIT 04

Open source you can run today

Production-grade primitives, all Apache-2.0. Click any repo. Every link resolves.

Fetch, but it tells you the truth.

HTTP returns bytes. veriscrape returns a portable trust verdict alongside them, so downstream systems can decide whether to trust the page.

pypiv0.2.0stars6licenseApache-2.0
$ pip install veriscrapegithub

Research that won't cite what it can't verify.

A local deep-research agent. Each cited claim links to a receipt: the page snapshot, highlighted to the supporting line. Unsupported claims are dropped, not guessed.

statusactive devlicenseApache-2.0
$ git clone github.com/san64777/citeproofgithub

Turn flat PDFs into real, fillable AcroForms.

Most PDF tools render forms or extract data, not both deterministically. acroforge builds true AcroForms with named fields and calculation logic, zero copyleft surface.

pypiv0.4.0stars8licenseApache-2.0
$ pip install acroforgegithub

Sound judgment from every document.

An AI document workspace for US professionals. Upload anything, it pulls out the parties, dates, amounts, and obligations, then answers questions with citations to the exact document and page. Built solo, from scratch, with agentic development.

statusprivate betastackNext.js, Claudedemoeudoxic.ai
# closed source, demo on requestvisit
METHODEXHIBIT 05

How I work

You are not just buying what I shipped. You are buying how I operate when the model is confidently wrong in production.

05.1

Eval-first

If there is no metric for the change, it does not ship. Golden sets land before code.

05.2

Demos weekly

Working software in front of stakeholders every Friday. Surprises die in the demo, not in production.

05.3

Boring infra

Postgres, Docker, FastAPI, Lambda. The clever part stays in the model layer, not the plumbing.

05.4

Async-first

Real EST overlap, plus progress while your team sleeps. Written decisions over meetings, so the record stays the source of truth.

RECORDEXHIBIT 06

Eight years building production data and AI systems

May 2024 - now
Assured HealthAssociate Engineering Manager

Lead engineering for a production healthcare credentialing platform: Django backend, serverless verification across 50 states, and Assured Copilot, the agentic RAG product I built end to end.

2024
Xebia IT ArchitectsSenior Consultant

Automation architecture and data engineering for enterprise clients.

2019 - 2024
Forage AISenior Data Automation Engineer

Built anti-detection extraction at scale, 1M+ profiles/day, defeating Cloudflare and bot-detection.

2019
Headfield SolutionsTechnical Lead

Led 8 engineers building an NLP-driven real-time social listening platform.

2017 - 2019
Quickcompany, TrizTeck, CollegeduniaDeveloper / Analyst

Legal-court, USPTO patent, and automotive data pipelines. Where the scraping craft started.

B.Tech, Electronics & Communication, Bharati Vidyapeeth College of Engineering, New Delhi, 2013-2017

GET IN TOUCH

Let's talk

If you are building agentic AI and the cost of being confidently wrong is high, that is exactly my work. I reply within 24 hours.

Based in New Delhi, US-remote, EST overlap, available for contract and consulting, no visa required.