Skip to content
Available for New Opportunities

Nikunj Khitha

Full-Stack GenAI Engineer

I build full-stack GenAI platforms where Spring Boot APIs, retrieval systems, MCP servers, LLM gateways, security, observability, and product UX work together in production.

0M+
KG Entities
0
Production MCP Servers
0+
Automations
0%
Retrieval Lift
Profile

About

Nikunj

Full-Stack GenAI Engineer focused on backend architecture, full-stack product delivery, AI security, and reliable agent systems.

Backend systemsFull-stack platformsProduction GenAI

I build backend-heavy AI products and platform systems where APIs, reliability, security, observability, retrieval quality, and user experience all matter at the same time.

I work best on products and platforms where backend systems have to be reliable, AI has to be useful, and the user experience has to earn trust in production. My sweet spot is owning the system end to end: designing product surfaces, building Java/Spring Boot and Node.js backends, shaping MCP access layers and LLM gateways, wiring observability and security controls, and turning retrieval-heavy model behavior into software teams can trust.

Recognition

AI Ninja Award at ArmorCode

First person to receive ArmorCode's AI Ninja Award, and the youngest person to receive an award at the company.

Built ArmorCode's central Knowledge Graph RAG brain over 1,000,000+ entities from Zendesk, QMetry, Jira, Chorus, and codebase sources, improving retrieval accuracy by 40%.

Designed tenant-scoped MCP access so platform and internal agents can retrieve context and execute authorized tools without cross-tenant leakage.

Shipped 6 production MCP servers and an internal AI runtime stack across Knowledge Graph RAG, LiteLLM, OpenCode Server, n8n, and agent workflows.

Built AI Exposure Management product surfaces for AI visibility, ownership, governance, shadow AI discovery, and auditable risk tracking.

Delivered 30+ automations and 25+ AI workflows across 20+ systems, eliminating 200+ manual hours monthly and earning ArmorCode's AI Ninja Award.

Built CodeNex, a distributed AI code generation SaaS with Spring Boot, Spring AI, SSE streaming, Kubernetes previews, MinIO/NFS persistence, RBAC, and Stripe subscriptions.

Journey

Professional Experience

I’ve grown from building public-sector software into shipping backend systems, full-stack platform features, gateways, observability, and AI products where reliability, delivery speed, retrieval quality, and usability all have to work together.

  • Software Development Intern

    Central Electricity Authority, Government of India
    ExperienceMay 2023 - July 2023
    • Built public-sector software that improved data reliability, internal operations, and workflow speed across government systems.
    • Integrated National Power Portal data into a national renewable energy dashboard serving 150+ power stations and improved reporting accuracy by 30%.
    • Built a secure Java/PostgreSQL file management system with role-based access control that improved retrieval efficiency by 25% across 5,000+ files.
    • Developed a MERN conference room booking system that cut booking time by 60% and reduced scheduling errors by 40%.
  • SDE Intern (Backend/AI)

    Xansr Media (Aiko)
    ExperienceJun 2024 – Dec 2024
    • Shipped full-stack, backend, GenAI, and data systems for AIKO and Fantasy GPT, powering personalized sports experiences, voice AI, and retrieval-backed cricket intelligence.
    • Built Node.js and FastAPI microservices, improving API performance by 40% and reducing deployment time by 42% with Docker and GitHub Actions.
    • Engineered Fantasy GPT with RAG, LangGraph, backend APIs, agents, and DeepEval quality checks to resolve 98% of complex sports queries.
    • Built Python-based ETL pipelines to collect sports data from multiple sources and ingest it into MS SQL for Fantasy GPT SQL RAG workflows.
    • Worked across AIKO, a voice-based sports companion using Azure Speech SDK for text-to-speech and speech-to-text, user-level personalization, and live AI-generated commentary in 20+ languages.
    • Built AIKO personalization features for on-the-fly highlight reels, where AI agents stitched sports moments based on each user's profile and interests for a product presented at IBC 2024 in Amsterdam.
  • Software Development Engineer Intern

    ArmorCode
    ExperienceJan 2025 - Nov 2025
    • Built backend integrations, reusable backend infrastructure, AI-assisted scaffolding, and internal LLM platform components across ArmorCode's AppSec platform.
    • Owned backend integrations for 5+ security tools, including Black Duck, Snyk, and Checkmarx, on an AppSec platform with 130+ connectors for consolidated security finding management.
    • Created AI-assisted code generation utilities with template engines and AST parsing to automate new integration scaffolding, reducing per-integration boilerplate setup time by 30%.
    • Built a shared backend service-orchestration library with parallel container startup for MySQL, Elasticsearch, Redis, Kafka, MongoDB, and LocalStack across backend services.
    • Built an OpenAI-compatible proxy in Go for Gemini CLI, Codex, and Claude Code auth with multi-account load balancing, provider fallbacks, and Redis-backed LLM caching, cutting LLM indexing costs by 70% and saving $15,000+ annually.
    • Shipped a reusable HTTP client using Resilience4j for failure isolation, configurable backoff, timeout handling, and SSRF protection adopted across multiple services.
  • Associate Engineer (Full-Stack GenAI)

    ArmorCode
    ExperienceDec 2025 – Present
    • Own end-to-end backend, full-stack, and GenAI platform work across ArmorCode's platform agent memory, AI Exposure Management, MCP servers, gateways, automation, observability, and production AI infrastructure.
    • Architected ArmorCode's central Knowledge Graph RAG brain, ingesting 1,000,000+ entities from Zendesk, QMetry, Jira, Chorus, and codebase sources into Neo4j and pgvector, improving retrieval accuracy by 40%.
    • Built the tenant-scoped MCP access layer around the graph brain, enabling platform and internal agents to retrieve tenant-specific context and execute authorized tools without cross-tenant leakage.
    • Deployed and maintained ArmorCode's internal AI runtime stack with Knowledge Graph RAG, LiteLLM, OpenCode Server, n8n, and 6 production MCP servers for retrieval, model access, agent execution, and workflow orchestration.
    • Shipped the ArmorCode platform agent memory layer with Graphiti temporal knowledge graphs, combining session-scoped context with tenant and person-level long-term Knowledge Graph recall for multi-step reasoning workflows.
    • Contributed to ArmorCode AI Exposure Management (AIEM), building frontend and backend workflows for AI visibility, ownership, governance, shadow AI discovery, and auditable risk tracking.
    • Built the OpenCode gateway in front of OpenCode Server using Go and Gin with observability, load balancing, task queueing, main-server routing, and scheduled and PR-triggered workflow orchestration.
    • Delivered 30+ automations and 25+ AI workflows across 20+ systems with fallback paths, approval gates, and Slack/Jira handoffs, eliminating 200+ manual hours monthly and earning ArmorCode's AI Ninja Award in the second month after FTE conversion.

And a new adventure ahead

Portfolio

Featured Work

Backend systems, full-stack products, AI applications, and platform tooling spanning APIs, gateways, UX, automation, retrieval, observability, and production infrastructure.

Full-Stack AI SaaS

CodeNex: AI Builder

Built an AI codegen SaaS that turns natural-language prompts into full React applications using Spring Boot and Spring AI, with SSE streaming, MinIO/NFS persistence, and Kubernetes preview pods.

JavaSpring BootSpring AIReactTypeScriptSSEKubernetesMinIOStripe
AI Gateway & Infra

CodeNex AI API Proxy

Built an OpenAI-compatible AI gateway in Go and Gin with provider abstraction, multi-account load balancing, health-aware fallbacks, Redis-backed response caching, streaming support, and operational controls behind a single API surface.

GoGinRedisPostgreSQLReactOpenAI-compatible APIs
Full-Stack AI Product

Serenify

Built an open-source AI wellness product that combines empathetic Gemini-powered chat, mood tracking, journaling, guided sessions, crisis-help flows, privacy-aware analytics, and pgvector-backed personalization.

ReactTypeScriptSupabasepgvectorGemini AIVercel
AI Product

Resume Fit — CodeNex

Built an AI resume analysis and optimization tool with ATS-style scoring, keyword extraction, guided refinements, and visual feedback for iterative resume improvement.

ReactTypeScriptGemini AIVercel AI SDKRecharts
Full-Stack AI Product

CodeNex Images

Built an AI image generation and editing product around Gemini models, Auth0, and a polished workspace that emphasizes creation flow instead of exposing raw model controls.

ReactTypeScriptViteAuth0Gemini AINode.jsMongoDB
LLM Tooling / MCP

LLaMa MCP Streamlit

Built an interactive assistant that combines NVIDIA NIM-hosted LLaMA 3.3 70B with MCP to show how LLM interfaces can move beyond chat into real-time tool execution.

PythonStreamlitMCPLLaMANVIDIA NIM
Expertise

Technical Stack

The technologies I reach for most often when building backend systems, full-stack products, gateways, production infrastructure, and AI-powered workflows.

Learning fast.

Backend & Product Engineering

19 technologies

The languages, frameworks, and application-layer tools I use to build production products, APIs, and internal platforms end to end.

Java 21Spring Boot 3Spring AITypeScriptNode.jsGoGinPythonFastAPISQLAlchemyNext.jsReactREST APIsMicroservicesServer-Sent EventsRBACJWTOAuth2Stripe

GenAI, Agents & Retrieval

22 technologies

The AI, orchestration, and retrieval stack I use to build production-grade GenAI systems and agent workflows.

RAGGraphRAGKnowledge Graph RAGLightRAGGraphitiTemporal Graph MemoryAgentic AIMulti-Step Reasoning AgentsMulti-Agent SystemsLangGraphLangChain4jSpring AICrewAIDeepEvalMCPTenant-Scoped ToolsPrompt EngineeringClaudeGemini AIOpenAILLaMALLM Proxying

Data & Search Infrastructure

10 technologies

The storage, indexing, vector, and search technologies I use to make AI systems accurate, scalable, and cost-aware.

PostgreSQLpgvectorNeo4jMongoDBElasticsearchRedisMSSQLSupabaseMinIONFS

Platform, DevOps & Delivery

22 technologies

The infrastructure and operational tooling I use to deploy, route, observe, secure, and scale products reliably.

Resilience4jDockerKubernetesKubernetes AutoscalingFabric8Kubernetes IngressKafkaLocalStackGitHub ActionsJenkinsAWSAzureVercelLiteLLMGrafanaNginxLoad BalancingAPI GatewaysAI ObservabilityAI Securityn8nOpenCode Server

Product Tooling & UX

13 technologies

Supporting tools I use to ship full-stack product surfaces, admin workflows, charts, authentication, and polished UX.

Auth0Tailwind CSSFramer Motionshadcn/uinext-themesRechartsStreamlitSwaggerPostmanViteKiroClaude CodeCodex
Checking availability

Talk to my AI Twin

Checking whether the live assistant is available right now. You can still open the chat panel while the status loads.

Best for recruiter summaries, architecture deep-dives, backend and gateway discussions, AIEM context, platform agent memory work, and guided project walkthroughs.

Connect

Get in Touch

Open to backend, full-stack, platform, AI security, observability, and GenAI product engineering opportunities, as well as thoughtful collaborations.

Let's Build Something

If you're hiring for backend, full-stack, platform, AI security, observability, or GenAI product engineering work, I'd love to talk. I'm also open to thoughtful collaborations and open-source conversations.

Download Resume
Available for new opportunities

Send a Message

Share the role, team, problem space, or product idea and I'll reply with context that's actually useful.

Messages are sent securely to njkhitha2003@gmail.com with your email set as the reply-to.