Nikunj Khitha
Full-Stack GenAI Engineer
I build full-stack GenAI platforms where Spring Boot APIs, retrieval systems, MCP servers, LLM gateways, security, observability, and product UX work together in production.
About
Nikunj
Full-Stack GenAI Engineer focused on backend architecture, full-stack product delivery, AI security, and reliable agent systems.
I build backend-heavy AI products and platform systems where APIs, reliability, security, observability, retrieval quality, and user experience all matter at the same time.
I work best on products and platforms where backend systems have to be reliable, AI has to be useful, and the user experience has to earn trust in production. My sweet spot is owning the system end to end: designing product surfaces, building Java/Spring Boot and Node.js backends, shaping MCP access layers and LLM gateways, wiring observability and security controls, and turning retrieval-heavy model behavior into software teams can trust.
Recognition
AI Ninja Award at ArmorCode
First person to receive ArmorCode's AI Ninja Award, and the youngest person to receive an award at the company.
Built ArmorCode's central Knowledge Graph RAG brain over 1,000,000+ entities from Zendesk, QMetry, Jira, Chorus, and codebase sources, improving retrieval accuracy by 40%.
Designed tenant-scoped MCP access so platform and internal agents can retrieve context and execute authorized tools without cross-tenant leakage.
Shipped 6 production MCP servers and an internal AI runtime stack across Knowledge Graph RAG, LiteLLM, OpenCode Server, n8n, and agent workflows.
Built AI Exposure Management product surfaces for AI visibility, ownership, governance, shadow AI discovery, and auditable risk tracking.
Delivered 30+ automations and 25+ AI workflows across 20+ systems, eliminating 200+ manual hours monthly and earning ArmorCode's AI Ninja Award.
Built CodeNex, a distributed AI code generation SaaS with Spring Boot, Spring AI, SSE streaming, Kubernetes previews, MinIO/NFS persistence, RBAC, and Stripe subscriptions.
Professional
Experience
I’ve grown from building public-sector software into shipping backend systems, full-stack platform features, gateways, observability, and AI products where reliability, delivery speed, retrieval quality, and usability all have to work together.
And a new adventure ahead
Software Development Intern
Central Electricity Authority, Government of IndiaExperienceMay 2023 - July 2023- Built public-sector software that improved data reliability, internal operations, and workflow speed across government systems.
- Integrated National Power Portal data into a national renewable energy dashboard serving 150+ power stations and improved reporting accuracy by 30%.
- Built a secure Java/PostgreSQL file management system with role-based access control that improved retrieval efficiency by 25% across 5,000+ files.
- Developed a MERN conference room booking system that cut booking time by 60% and reduced scheduling errors by 40%.
SDE Intern (Backend/AI)
Xansr Media (Aiko)ExperienceJun 2024 – Dec 2024- Shipped full-stack, backend, GenAI, and data systems for AIKO and Fantasy GPT, powering personalized sports experiences, voice AI, and retrieval-backed cricket intelligence.
- Built Node.js and FastAPI microservices, improving API performance by 40% and reducing deployment time by 42% with Docker and GitHub Actions.
- Engineered Fantasy GPT with RAG, LangGraph, backend APIs, agents, and DeepEval quality checks to resolve 98% of complex sports queries.
- Built Python-based ETL pipelines to collect sports data from multiple sources and ingest it into MS SQL for Fantasy GPT SQL RAG workflows.
- Worked across AIKO, a voice-based sports companion using Azure Speech SDK for text-to-speech and speech-to-text, user-level personalization, and live AI-generated commentary in 20+ languages.
- Built AIKO personalization features for on-the-fly highlight reels, where AI agents stitched sports moments based on each user's profile and interests for a product presented at IBC 2024 in Amsterdam.
Software Development Engineer Intern
ArmorCodeExperienceJan 2025 - Nov 2025- Built backend integrations, reusable backend infrastructure, AI-assisted scaffolding, and internal LLM platform components across ArmorCode's AppSec platform.
- Owned backend integrations for 5+ security tools, including Black Duck, Snyk, and Checkmarx, on an AppSec platform with 130+ connectors for consolidated security finding management.
- Created AI-assisted code generation utilities with template engines and AST parsing to automate new integration scaffolding, reducing per-integration boilerplate setup time by 30%.
- Built a shared backend service-orchestration library with parallel container startup for MySQL, Elasticsearch, Redis, Kafka, MongoDB, and LocalStack across backend services.
- Built an OpenAI-compatible proxy in Go for Gemini CLI, Codex, and Claude Code auth with multi-account load balancing, provider fallbacks, and Redis-backed LLM caching, cutting LLM indexing costs by 70% and saving $15,000+ annually.
- Shipped a reusable HTTP client using Resilience4j for failure isolation, configurable backoff, timeout handling, and SSRF protection adopted across multiple services.
Associate Engineer (Full-Stack GenAI)
ArmorCodeExperienceDec 2025 – Present- Own end-to-end backend, full-stack, and GenAI platform work across ArmorCode's platform agent memory, AI Exposure Management, MCP servers, gateways, automation, observability, and production AI infrastructure.
- Architected ArmorCode's central Knowledge Graph RAG brain, ingesting 1,000,000+ entities from Zendesk, QMetry, Jira, Chorus, and codebase sources into Neo4j and pgvector, improving retrieval accuracy by 40%.
- Built the tenant-scoped MCP access layer around the graph brain, enabling platform and internal agents to retrieve tenant-specific context and execute authorized tools without cross-tenant leakage.
- Deployed and maintained ArmorCode's internal AI runtime stack with Knowledge Graph RAG, LiteLLM, OpenCode Server, n8n, and 6 production MCP servers for retrieval, model access, agent execution, and workflow orchestration.
- Shipped the ArmorCode platform agent memory layer with Graphiti temporal knowledge graphs, combining session-scoped context with tenant and person-level long-term Knowledge Graph recall for multi-step reasoning workflows.
- Contributed to ArmorCode AI Exposure Management (AIEM), building frontend and backend workflows for AI visibility, ownership, governance, shadow AI discovery, and auditable risk tracking.
- Built the OpenCode gateway in front of OpenCode Server using Go and Gin with observability, load balancing, task queueing, main-server routing, and scheduled and PR-triggered workflow orchestration.
- Delivered 30+ automations and 25+ AI workflows across 20+ systems with fallback paths, approval gates, and Slack/Jira handoffs, eliminating 200+ manual hours monthly and earning ArmorCode's AI Ninja Award in the second month after FTE conversion.
And a new adventure ahead
Featured Work
Backend systems, full-stack products, AI applications, and platform tooling spanning APIs, gateways, UX, automation, retrieval, observability, and production infrastructure.
CodeNex: AI Builder
Built an AI codegen SaaS that turns natural-language prompts into full React applications using Spring Boot and Spring AI, with SSE streaming, MinIO/NFS persistence, and Kubernetes preview pods.
CodeNex AI API Proxy
Built an OpenAI-compatible AI gateway in Go and Gin with provider abstraction, multi-account load balancing, health-aware fallbacks, Redis-backed response caching, streaming support, and operational controls behind a single API surface.
Serenify
Built an open-source AI wellness product that combines empathetic Gemini-powered chat, mood tracking, journaling, guided sessions, crisis-help flows, privacy-aware analytics, and pgvector-backed personalization.
Resume Fit — CodeNex
Built an AI resume analysis and optimization tool with ATS-style scoring, keyword extraction, guided refinements, and visual feedback for iterative resume improvement.
CodeNex Images
Built an AI image generation and editing product around Gemini models, Auth0, and a polished workspace that emphasizes creation flow instead of exposing raw model controls.
LLaMa MCP Streamlit
Built an interactive assistant that combines NVIDIA NIM-hosted LLaMA 3.3 70B with MCP to show how LLM interfaces can move beyond chat into real-time tool execution.
Technical
Stack
The technologies I reach for most often when building backend systems, full-stack products, gateways, production infrastructure, and AI-powered workflows.
Learning fast.
Backend & Product Engineering
19 technologies
The languages, frameworks, and application-layer tools I use to build production products, APIs, and internal platforms end to end.
GenAI, Agents & Retrieval
22 technologies
The AI, orchestration, and retrieval stack I use to build production-grade GenAI systems and agent workflows.
Data & Search Infrastructure
10 technologies
The storage, indexing, vector, and search technologies I use to make AI systems accurate, scalable, and cost-aware.
Platform, DevOps & Delivery
22 technologies
The infrastructure and operational tooling I use to deploy, route, observe, secure, and scale products reliably.
Product Tooling & UX
13 technologies
Supporting tools I use to ship full-stack product surfaces, admin workflows, charts, authentication, and polished UX.
Talk to my
AI Twin
Checking whether the live assistant is available right now. You can still open the chat panel while the status loads.
Best for recruiter summaries, architecture deep-dives, backend and gateway discussions, AIEM context, platform agent memory work, and guided project walkthroughs.
Get in Touch
Open to backend, full-stack, platform, AI security, observability, and GenAI product engineering opportunities, as well as thoughtful collaborations.
Let's Build Something
If you're hiring for backend, full-stack, platform, AI security, observability, or GenAI product engineering work, I'd love to talk. I'm also open to thoughtful collaborations and open-source conversations.

