Currently building — open to internship opportunities

I build backend systems for AI infrastructure.

I'm Aditya, a Purdue CS student focused on retrieval pipelines, agent infrastructure, verification systems, and production-grade backend tooling. I build the parts that make AI systems reliable after the demo: APIs, storage layers, evaluation loops, orchestration, and guardrails.

Purdue CS · Machine Intelligence + Security · Seeking Summer 2026 Backend SWE / AI Infrastructure internships

About

Purdue CS. Machine Intelligence + Security.

I'm Aditya — a Purdue CS student focused on backend systems and AI infrastructure. I work on retrieval pipelines, agent runtimes, verification systems, and developer tooling: the parts that make AI reliable after the demo.

Outside projects, I lead education at BoilerBlockchain, where I design technical workshops and mentor students through smart contracts, backend infrastructure, and full-stack systems. We've certified 100+ students through a 10-week blockchain engineering curriculum.

100+ students certified
4 production systems
3 research positions
Currently
  • Building backend/AI systems
  • Leading education at BoilerBlockchain
  • Seeking Backend SWE / AI Infra roles
Selected systems

Infrastructure for intelligent and distributed systems.

Memory systems, evaluation pipelines, verification infrastructure, and agent runtimes — across production deployments, research, and active builds.

flagship

Memora

Encrypted memory infrastructure for autonomous AI agents.

Each write is encrypted with AES-256-GCM, content-addressed to IPFS, and sequenced via a tamper-evident consensus log — producing a verifiable record of what an agent stored and when. Permissioned reads are gated by a challenge-response key broker that verifies on-chain owner/delegate state before releasing a decryption key. The Supabase index is a rebuildable query cache; the authoritative record is on-chain.

Status: Active · deployed
Runtime: Railway · Supabase · Hedera HCS · IPFS
Stack: TypeScript · Node.js · Hedera HCS · Solidity · IPFS · Supabase · Express · pnpm workspaces
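
A minimal sketch of the tamper-evident part of the write path described above, not Memora's actual code. It assumes a SHA-256 digest of the ciphertext as a stand-in for an IPFS content ID and an in-memory hash chain as a stand-in for the consensus log; encryption is out of scope here, so the payload is treated as opaque ciphertext bytes. All names (`LogEntry`, `append`, `verify`) are hypothetical.

```python
import hashlib
import json
from dataclasses import dataclass

GENESIS = "0" * 64  # placeholder "previous hash" for the first entry


@dataclass(frozen=True)
class LogEntry:
    sequence: int
    content_id: str  # SHA-256 of the ciphertext (stand-in for an IPFS CID)
    prev_hash: str   # hash of the previous entry, which links the chain
    entry_hash: str  # hash over this entry's own fields


def content_address(ciphertext: bytes) -> str:
    """Derive an identifier from the bytes themselves."""
    return hashlib.sha256(ciphertext).hexdigest()


def _hash_entry(sequence: int, content_id: str, prev_hash: str) -> str:
    payload = json.dumps(
        {"sequence": sequence, "content_id": content_id, "prev_hash": prev_hash},
        sort_keys=True,
    ).encode("utf-8")
    return hashlib.sha256(payload).hexdigest()


def append(log: list[LogEntry], ciphertext: bytes) -> LogEntry:
    """Append a new entry that commits to the ciphertext and to the prior entry."""
    prev_hash = log[-1].entry_hash if log else GENESIS
    sequence = len(log)
    cid = content_address(ciphertext)
    entry = LogEntry(sequence, cid, prev_hash, _hash_entry(sequence, cid, prev_hash))
    log.append(entry)
    return entry


def verify(log: list[LogEntry]) -> bool:
    """Recompute every hash; any edited, reordered, or dropped entry fails."""
    prev_hash = GENESIS
    for index, entry in enumerate(log):
        expected = _hash_entry(entry.sequence, entry.content_id, entry.prev_hash)
        if (
            entry.sequence != index
            or entry.prev_hash != prev_hash
            or entry.entry_hash != expected
        ):
            return False
        prev_hash = entry.entry_hash
    return True
```

Because each entry commits to the one before it, rewriting an old record changes every later hash, which is what makes the history checkable by a third party.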

TrustGraph

Deterministic trust-boundary analyser for Solidity contracts.

A four-predicate model (E ∧ P ∧ V ∧ ¬G) drives an 8-node LangGraph pipeline that generates executable Foundry PoC exploit tests, emits Markdown/JSON reports, and surfaces diagnostics in a VS Code extension. Gemini optionally enriches trust-assumption explanations but cannot create or suppress a finding.

Status: Shipped
Stack: Python · LangGraph · Foundry · Solidity · Gemini · VS Code extension
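
A small sketch of the decision rule above, under stated assumptions. The page does not define the four predicates, so the fields `e`, `p`, `v`, and `g` only mirror the symbols in E ∧ P ∧ V ∧ ¬G. `Candidate`, `scan`, and the `enrich` callback are hypothetical names, and `enrich` stands in for an optional LLM step.

```python
from dataclasses import dataclass, replace
from typing import Callable, Optional


@dataclass(frozen=True)
class Candidate:
    location: str
    e: bool
    p: bool
    v: bool
    g: bool
    explanation: str = ""


def is_finding(c: Candidate) -> bool:
    """Deterministic rule: E and P and V and not G."""
    return c.e and c.p and c.v and not c.g


def scan(
    candidates: list[Candidate],
    enrich: Optional[Callable[[Candidate], str]] = None,
) -> list[Candidate]:
    # The finding set is fixed before enrichment runs.
    findings = [c for c in candidates if is_finding(c)]
    if enrich is None:
        return findings
    # Enrichment may only fill in the explanation text; it cannot add
    # or drop findings because it never sees the filtering step.
    return [replace(c, explanation=enrich(c)) for c in findings]
```

Keeping the filter ahead of the enrichment call is one way to make "the LLM cannot create or suppress a finding" a structural property instead of a convention.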

Retrieval Adaptation Pipeline

Config-driven ML evaluation pipeline for procedural adaptation under disruption.

Built a 3,292-example procedural disruption benchmark from YouCook2 and WikiHow — ingredient-removal and unavailability scenarios with ground-truth adaptation annotations. Ran a config-driven evaluation pipeline covering BM25, dense retrieval, reranking, and VLM-grounded strategies, with persisted retrieval traces for reproducible multi-method comparison.

Status: Research · Purdue CS490
Stack: Python · PyTorch · FAISS · Transformers · Hugging Face · vLLM
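
A toy sketch of what "config-driven with persisted traces" can look like, not the project's code. The two retrievers are deliberately trivial stand-ins for BM25, dense retrieval, and the other strategies named above. The config keys, `REGISTRY`, and the trace fields are assumptions made for illustration.

```python
import json
from typing import Callable

# A retriever maps (query, corpus, k) to the indices of the top-k documents.
Retriever = Callable[[str, list[str], int], list[int]]


def keyword_overlap(query: str, corpus: list[str], k: int) -> list[int]:
    """Rank documents by shared lowercase tokens (toy lexical baseline)."""
    q = set(query.lower().split())
    order = sorted(
        range(len(corpus)),
        key=lambda i: (-len(q & set(corpus[i].lower().split())), i),
    )
    return order[:k]


def first_k(query: str, corpus: list[str], k: int) -> list[int]:
    """Ignore the query entirely (sanity-check baseline)."""
    return list(range(min(k, len(corpus))))


REGISTRY: dict[str, Retriever] = {
    "keyword_overlap": keyword_overlap,
    "first_k": first_k,
}


def run(config: dict, corpus: list[str], examples: list[dict]) -> list[dict]:
    """Run every configured method on every example and record a trace."""
    traces = []
    for method in config["methods"]:
        retrieve = REGISTRY[method]
        for ex in examples:
            ranked = retrieve(ex["query"], corpus, config["top_k"])
            traces.append(
                {
                    "method": method,
                    "query": ex["query"],
                    "ranked": ranked,
                    "hit": ex["gold"] in ranked,
                }
            )
    return traces


def save_traces(traces: list[dict], path: str) -> None:
    """Persist one JSON object per line so runs can be diffed and re-scored."""
    with open(path, "w", encoding="utf-8") as fh:
        for trace in traces:
            fh.write(json.dumps(trace) + "\n")
```

Adding a method then means registering one function and naming it in the config; the saved traces let different methods be compared later without re-running retrieval.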

Mano

AI deal coordinator for commercial real estate acquisitions.

End-to-end diligence automation for institutional CRE buyers: document requests, estoppel collection, title review, lender coordination, and red-flag detection. Issues every diligence request in parallel the moment an LOI is signed, then surfaces lease-vs-rent-roll discrepancies and title exceptions on day three.

Status: Active build
Stack: Next.js · TypeScript · Supabase · OpenAI · Resend · Tailwind

AgentOS

Local-first runtime infrastructure for supervising coding-agent workflows.

React/Tauri frontend, Rust desktop bridge, isolated git worktrees, command allowlists with risk classification, diff inspection, provider diagnostics, and Ollama integration. Browser demo mode available; desktop runtime handles real worktree operations.

Status: Active build
Stack: React · Tauri · Rust · TypeScript · Ollama
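
One way a command allowlist with risk classification could be sketched, written in Python for brevity and not taken from AgentOS. The three risk tiers, the allowlisted commands, and the rule that any shell metacharacter blocks the command are all assumptions chosen for illustration.

```python
import shlex
from enum import Enum


class Risk(Enum):
    SAFE = "safe"        # run without asking
    REVIEW = "review"    # needs human approval first
    BLOCKED = "blocked"  # never run


# Hypothetical policy tables; a real policy would be configurable.
SAFE_COMMANDS = {"ls", "cat", "pwd"}
SAFE_GIT_SUBCOMMANDS = {"status", "diff", "log"}
REVIEW_GIT_SUBCOMMANDS = {"add", "commit", "checkout"}
SHELL_METACHARS = set(";|&><`$")


def classify(command: str) -> Risk:
    """Default-deny classification of a single command line."""
    # Reject chaining, piping, redirection, and substitution outright.
    if any(ch in SHELL_METACHARS for ch in command):
        return Risk.BLOCKED
    try:
        argv = shlex.split(command)
    except ValueError:  # e.g. unbalanced quotes
        return Risk.BLOCKED
    if not argv:
        return Risk.BLOCKED
    if argv[0] in SAFE_COMMANDS:
        return Risk.SAFE
    if argv[0] == "git" and len(argv) > 1:
        if argv[1] in SAFE_GIT_SUBCOMMANDS:
            return Risk.SAFE
        if argv[1] in REVIEW_GIT_SUBCOMMANDS:
            return Risk.REVIEW
    # Anything not explicitly listed is blocked.
    return Risk.BLOCKED
```

The design choice worth noting is the default: unknown commands fall through to `BLOCKED`, so the allowlist has to grow deliberately.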
Applied systems
Shipped · Solsten 2025

SmartInvoiceExtractor

Regex-first PDF extraction pipeline for semi-structured medical invoices.

98.2% field accuracy · 65% lower inference cost · 80% manual correction reduction
Python · pdfplumber · Gemini 2.5 Flash · Vertex AI · DSPy · pytest
Archived · Kong Labs 2025

Clinical Speech ML Pipeline

End-to-end speech analysis and federated training pipeline for clinical audio.

200+ samples · 1.5× throughput · 15% accuracy gain under distribution shift
Python · PyTorch · Whisper · Librosa · Docker · Federated Learning
Collaboration · MOI Labs

Sageo

Trust and discovery layer for AI agents on MOI Chain.

A2A interaction proofs · on-chain agent identity registry
TypeScript · Node.js · MOI Chain · Google A2A Protocol · Coco contracts
Systems sprints
Agent Safe (ETH Denver 26)
Sandboxed execution environment for coding agents — scoped filesystem access, syscall filtering, and blast-radius containment for untrusted tool use.
hackathon
CosmosPool (ETH Denver 25)
Cosmos SDK staking pool with delegator analytics and automated reward compounding via a governance-gated rebalancer.
hackathon
EntreLink
Founder-to-founder matching platform — semantic search over founder profiles to surface co-founders, investors, and early hires.
EstateWise
Modular AI employee system for real estate transaction automation — MCP servers handling document requests, title review, and deal coordination.
FormulaOS
AI-native spreadsheet desktop app — formula engine, GPT-4 copilot, command palette, and Tauri desktop packaging.
Clio (MBC 25)
Base L2 protocol for artist bonding-curve tokens — Solidity contracts with a social discovery layer for early fan-to-artist engagement.
hackathon
Experience

Where this thinking came from.

Jul 2025 – Aug 2025 · Solsten
Software Engineering Intern

Medical invoice parsing pipeline with hybrid extraction and LLM fallback.

Built an automated parsing pipeline for handwritten and printed medical receipts and invoices. Combined regex-based extraction with Gemini 2.5 Flash for missing or low-confidence fields, reducing inference costs by ≈65%. Refined extraction logic with DSPy evaluation and pytest validation, reaching ≈98.2% field-level accuracy and reducing manual correction workload by ≈80%.

Python · Vertex AI · Gemini 2.5 Flash · DSPy · pytest
Sep 2025 – Dec 2025 · MOI Labs × BoilerBlockchain
Software Engineer / Project Lead

Agent identity and interaction verification layer on MOI Chain.

Led SDK development for Sageo — built TypeScript wrappers for outgoing A2A clients and server-side request handlers that inject trace metadata and log hashed request/response proofs on MOI Coco contracts. Designed ephemeral-state fallbacks for devnet edge cases, wrote E2E workflow scripts to verify on-chain proof integrity, and co-authored the technical specification.

TypeScript · Node.js · MOI Chain · Google A2A Protocol · Coco
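
A language-agnostic sketch, written in Python, of how a hashed request/response proof can be built so that only digests are logged. It is not the Sageo SDK: the proof fields, the canonical-JSON encoding, and the function names are assumptions, and the on-chain write is left out.

```python
import hashlib
import json
import uuid
from typing import Optional


def canonical_hash(payload: dict) -> str:
    """Hash a JSON payload in a key-order-independent way."""
    encoded = json.dumps(payload, sort_keys=True, separators=(",", ":")).encode("utf-8")
    return hashlib.sha256(encoded).hexdigest()


def build_proof(request: dict, response: dict, trace_id: Optional[str] = None) -> dict:
    """Return a record holding only digests and a trace ID, never raw payloads."""
    return {
        "trace_id": trace_id or uuid.uuid4().hex,
        "request_hash": canonical_hash(request),
        "response_hash": canonical_hash(response),
    }


def matches(proof: dict, request: dict, response: dict) -> bool:
    """Check that a claimed request/response pair is the one that was logged."""
    return (
        proof["request_hash"] == canonical_hash(request)
        and proof["response_hash"] == canonical_hash(response)
    )
```

Either party can later reveal the original payloads and have them checked against the logged digests, while the log itself exposes nothing about their content.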
Feb 2025 – Aug 2025 · Kong Labs
Undergraduate Research Assistant

Clinical speech preprocessing and federated training pipeline.

Built an end-to-end clinical speech ML pipeline for scalable preprocessing and federated training across distributed hospital datasets. Implemented low-latency feature extraction with Whisper and Librosa, processed 200+ clinical audio samples, Dockerized ingestion and preprocessing, improved dataset throughput by 1.5×, and designed federated workflows that improved classification accuracy by 15% under distribution shift.

Python · PyTorch · Whisper · Librosa · Docker · Federated Learning
Jan 2025 – present · BoilerBlockchain, Purdue
Head of Education

10-week blockchain engineering curriculum — 100+ students certified.

Designed and delivered a 10-week blockchain engineering curriculum covering EVM, Solidity, ERC standards, DAOs, DEX mechanics, and L2 scaling. Taught and certified 100+ students, improved course completion rate by 15%, and expanded the active developer team by 3×.

Solidity · Foundry · EVM · TypeScript
May 2025 – present · BoilerBite · Purdue SPARK
Co-founder / Software Engineer

Built backend infrastructure for a student startup through Purdue's SPARK accelerator.

Co-founded BoilerBite through Purdue's SPARK entrepreneurship program. Designed and shipped the backend from scratch — Node.js/Express REST API, PostgreSQL with role-based access control, Clerk authentication, input validation with Zod, and production deployment on Purdue's Linux infrastructure via NGINX and PM2. Replaced an initial Firebase prototype with a robust, auditable API layer.

Node.js · Express · PostgreSQL · Clerk · Zod · NGINX · PM2
Stack

What I actually reach for.

Grouped by function, not by category. Not a checklist — the things below are the ones I've shipped with, broken, and would pick again.

Backend & infra
Python · TypeScript · Node.js · FastAPI · Postgres · Redis · Docker · Supabase · Railway
AI systems
PyTorch · vLLM · DSPy · Hugging Face · BGE · sentence-transformers · Vertex AI · Gemini
Runtime & tooling
OpenTelemetry · pytest · Git worktrees · command allowlists · CI/CD · Foundry · Linux
Verification & chain
Solidity · Foundry · EVM · Hedera HCS · IPFS · MOI Chain · Hardhat
Frontend & product
React · Next.js · Tailwind · TypeScript · Figma · WordPress
Habits
writing design docs · eval-first · small commits · read the source · observability before features
System Notes

Architecture notes and build logs.

From systems I've shipped, tested, or broken open — write paths, tradeoffs, and what I actually learned.

production · python
5 min

SmartInvoiceExtractor

Regex extracts high-confidence fields from pdfplumber-parsed PDFs; Gemini fills gaps when critical fields are absent. A quality gate, JSON normalisation layer, and regex suggestion loop keep the LLM call rate low and the patterns improving. Production metrics are from the proprietary Solsten deployment — the public repo is an earlier local prototype.
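
A minimal sketch of the regex-first, fallback-second flow described above. It assumes the PDF text has already been extracted. The patterns, field names, and the `fallback` callable (a stand-in for the LLM call) are hypothetical and are not the Solsten implementation.

```python
import re
from typing import Callable, Optional

# Hypothetical patterns for a simple invoice layout.
PATTERNS = {
    "invoice_number": re.compile(r"Invoice\s*(?:No\.?|#)\s*:?\s*([A-Z0-9-]+)", re.IGNORECASE),
    "total": re.compile(r"Total\s*:?\s*\$?\s*([0-9]+(?:\.[0-9]{2})?)", re.IGNORECASE),
    "date": re.compile(r"Date\s*:?\s*([0-9]{4}-[0-9]{2}-[0-9]{2})", re.IGNORECASE),
}
CRITICAL = ("invoice_number", "total")  # fields that justify a fallback call

Fallback = Callable[[str, list[str]], dict]


def extract(text: str, fallback: Optional[Fallback] = None) -> tuple[dict, bool]:
    """Return (fields, used_fallback). Regex runs first on every field."""
    fields: dict = {}
    for name, pattern in PATTERNS.items():
        match = pattern.search(text)
        fields[name] = match.group(1) if match else None

    missing = [name for name in CRITICAL if fields[name] is None]
    if not missing or fallback is None:
        return fields, False

    # Only call the expensive path when a critical field is absent, and
    # only accept values for the fields that were actually requested.
    filled = fallback(text, missing)
    for name in missing:
        if filled.get(name) is not None:
            fields[name] = filled[name]
    return fields, True
```

Returning the `used_fallback` flag makes the fallback call rate easy to track over time, which is the number a regex-first design is trying to keep low.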

distributed · typescript
6 min

Memora

Encrypted payload storage, consensus-ordered event log, and challenge-response permissioned key release — a tamper-evident memory architecture for agents that need to prove what they remembered and when.

research · ml
5 min

Retrieval Adaptation Pipeline

A 3,292-example procedural disruption benchmark from YouCook2 and WikiHow, and a config-driven evaluation pipeline covering retrieval augmentation, reranking, and VLM strategies — with persisted retrieval and generation traces for reproducible multi-method comparison.

security · solidity
4 min

TrustGraph

Four-predicate deterministic scanning for trust-boundary vulnerabilities — executable Foundry PoC exploit tests, Markdown/JSON reports, and VS Code diagnostics. The LLM enriches explanations; it never creates or suppresses a finding.

agents · typescript
4 min

AgentOS

Isolated git worktrees, command allowlists, diff inspection, and a Rust-backed desktop runtime — bounded coding-agent execution with human review controls. A browser demo shows the UI and event flow; the desktop Tauri runtime handles real worktree operations.

agents · identity
3 min

Sageo

A verifiable identity and interaction layer for A2A-protocol agents — Coco smart contracts on MOI register agent identities and store hashed interaction proofs; a TypeScript SDK wraps A2A clients and handlers to inject trace metadata without storing raw payloads.

ml · python
3 min

Clinical Speech ML Pipeline

End-to-end speech analysis pipeline: Expo mobile capture, FastAPI ingestion, FFmpeg normalisation, Whisper transcription, Librosa acoustic metrics, and Postgres persistence with an analytics endpoint. Prototype-stage — no clinical validation.

Field notes

Working theories, half-formed opinions, things I keep relearning. Mostly about infrastructure for systems that try to think. Read at your own risk.

2026 · 05 · 18 · infra

Most AI systems don't fail because the model is weak. They fail because nobody can see what happened between retrieval, prompting, tool execution, and state updates.

2026 · 05 · 02 · agents

Memory is becoming more important than raw model size. A smaller model with stable retrieval and consistent state management is usually more useful than a larger one that forgets why it made a decision three steps ago.

2026 · 04 · 19 · coordination

Most multi-agent systems eventually rediscover distributed systems problems: retries, consensus, partial failure, stale state, and conflicting writes. The hard part is coordination, not generation.

2026 · 03 · 29 · verification

A surprising amount of 'trust' in AI systems is just good UI. If verification depends on the interface instead of independently reproducible traces, you're still relying on vibes.

2026 · 03 · 08 · retrieval

Retrieval quality changes with model scale. In our smaller VLM experiments, retrieval did most of the work. As models improved, visual grounding started mattering more than the retrieved examples themselves.

2026 · 02 · 21 · chain

Most on-chain AI projects don't actually need a blockchain. The interesting use cases start when multiple parties need shared state, coordination, or verification without trusting a single executor.

2026 · 02 · 11 · runtime

Most coding-agent demos assume the environment behaves perfectly. Real systems need retries, bounded execution, isolation, logging, and ways to recover when the agent inevitably does something stupid.

Active investigations

Open questions I keep returning to. Some turn into systems. Some stay open.

What should an agent remember?

Experimenting with TTL, episodic memory, and retrieval policies. Long-term memory becomes noisy surprisingly fast once workflows branch and tools start mutating shared state.

active
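
For the memory question above, a minimal sketch of the simplest policy in the list, TTL expiry. `TTLMemory` and its methods are hypothetical names. The injectable clock is an assumption made so that expiry can be tested without sleeping.

```python
import time
from typing import Callable, Optional


class TTLMemory:
    """Key-value memory whose entries expire a fixed time after being written."""

    def __init__(self, ttl_s: float, clock: Callable[[], float] = time.monotonic):
        self._ttl_s = ttl_s
        self._clock = clock
        self._items: dict[str, tuple[float, str]] = {}

    def remember(self, key: str, value: str) -> None:
        # Rewriting a key refreshes its expiry time.
        self._items[key] = (self._clock() + self._ttl_s, value)

    def recall(self, key: str) -> Optional[str]:
        item = self._items.get(key)
        if item is None:
            return None
        expires_at, value = item
        if self._clock() >= expires_at:
            # Expired entries are dropped lazily, on first read after expiry.
            del self._items[key]
            return None
        return value
```

A plain TTL treats every memory as equally perishable, which is exactly the limitation that episodic memory and retrieval policies try to address.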
How much isolation do coding agents actually need?

Comparing worktrees, command allowlists, containers, and microVM approaches. Trying to understand where the security/ergonomics tradeoff stops being worth it.

active
Retrieval evaluation beyond MRR

Most retrieval benchmarks reward overlap, not usefulness. Interested in evaluation methods that survive paraphrase, ambiguity, and adversarial rewrite.

writing
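
For reference on the retrieval-evaluation question above, a short sketch of how mean reciprocal rank is computed. The function names and the document IDs in the usage note are illustrative.

```python
def reciprocal_rank(ranked: list[str], relevant: set[str]) -> float:
    """1 / rank of the first relevant item, or 0.0 if none was retrieved."""
    for rank, doc_id in enumerate(ranked, start=1):
        if doc_id in relevant:
            return 1.0 / rank
    return 0.0


def mean_reciprocal_rank(runs: list[tuple[list[str], set[str]]]) -> float:
    """Average the reciprocal rank over (ranked list, relevant set) pairs."""
    if not runs:
        return 0.0
    return sum(reciprocal_rank(ranked, relevant) for ranked, relevant in runs) / len(runs)
```

For three queries whose first relevant hits land at rank 2, at rank 1, and nowhere, the score is (0.5 + 1 + 0) / 3 = 0.5. The metric only looks at the position of the first item labelled relevant, so it says nothing about whether that item was actually useful or how the rest of the list behaves.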
Compact execution traces

Exploring how much workflow history a verifier actually needs. Full traces become expensive quickly, but oversimplified summaries destroy reproducibility.

active
Shared state for autonomous systems

Looking at where distributed coordination actually helps agent systems: permissions, replayability, attribution, and conflict resolution across multiple executors.

active
Local-first agent workflows

Trying to understand which parts of autonomous workflows benefit from the cloud and which parts only ended up there because GPUs used to be inaccessible.

parked
Trust boundaries in agent systems

Who is trusted to do what? Most agent frameworks blur the boundary between orchestration, execution, memory, and verification far more than they should.

writing
Get in touch
Open to: Backend SWE · AI Infrastructure · Platform Engineering

Building something load-bearing?

I'm currently looking for Backend SWE and AI infrastructure internships and full-time opportunities — particularly teams working on retrieval, developer infrastructure, agent systems, distributed systems, or applied AI platforms. If you're working on something that needs to survive past the demo stage, get in touch.

Fastest response via email or LinkedIn DM. Usually under 24h.