Generative Discovery · Agentic Retrieval · Search at Scale
Architecting retrieval systems driven by intent, context, and cognition, bringing fifteen years of search, ranking, and ML infrastructure to the next generation of generative AI.
Staff AI engineer focused on Generative Discovery: the discipline of building retrieval that understands what users mean, not just what they type.
Fifteen-plus years across information retrieval, search, ranking, recommendations, deep learning, and (most recently) agentic and multimodal systems. The same problems keep surfacing in new forms: relevance, context, evaluation, latency, scale. The tools change; the discipline does not.
Currently shipping multimodal generative discovery at marketplace scale: 500M assets, two million contributors, twenty-five languages. Speaker, mentor, and occasional contributor to the search projects underneath it all: Lucene, Solr, and OpenSearch.
Architecting the retrieval layer behind RAG-style GenAI applications across one of the largest creative marketplaces in the world: 500M+ stock assets, two million contributors, twenty-five language surfaces, sustained production SLOs. Dense vector retrieval, hybrid search, and metadata grounding running across hundreds of nodes. Multimodal embedding pipelines for image + text semantic search, with offline and online evaluation frameworks driving relevance quality. Led the migration of all 500M assets from Solr to OpenSearch, enabling cloud-native scale and ML-powered ranking, including Learning-to-Rank. Mentor and tech-lead to L2 / L3 engineers on the team for three years running.
Search Consultant / Lucidworks India · Senior Software Engineer / Target.com India · Sr. Java Programmer / Happiest Minds · Search Consultant / Sony India · Software Engineer / PointCross Life Sciences · Search Intern / Glassdoor USA
Research: Efficient and effective search for large textual collections using machine learning techniques.
Instrumentation & Technology.
Production-grade generative AI applications on AWS.
RAG, agents, LLMOps, transformers, evaluation, fine-tuning.
Occasional contributor to the foundational projects of the search and retrieval ecosystem.
Built and presented prototypes for agentic search and generative discovery patterns.
Open to staff and principal AI engineering roles in New York. Let's talk about retrieval, agents, or whatever you're building.