Rajani Maski
Staff Software Engineer · AI · New York City

Rajani
Maski

Generative Discovery · Agentic Retrieval · Search at Scale

Architecting retrieval systems driven by intent, context, and cognition, bringing fifteen years of search, ranking, and ML infrastructure to the next generation of generative AI.

Open to staff & principal AI roles
NYC · Hybrid or on-site
01 · About

A throughline from search to agents.

Profile

Staff AI engineer focused on Generative Discovery: the discipline of building retrieval that understands what users mean, not just what they type.

Fifteen-plus years across information retrieval, search, ranking, recommendations, deep learning, and (most recently) agentic and multimodal systems. The same problems keep surfacing in new forms: relevance, context, evaluation, latency, scale. The tools change; the discipline does not.

Currently shipping multimodal generative discovery at marketplace scale: 500M assets, two million contributors, twenty-five languages. Speaker, mentor, and occasional contributor to the search projects underneath it all: Lucene, Solr, and OpenSearch.

02 · Now

What I'm building.

Currently
In production

Multimodal Generative Discovery

Shutterstock · May 2022 to present

Architecting the retrieval layer behind RAG-style GenAI applications across one of the largest creative marketplaces in the world: 500M+ stock assets, two million contributors, twenty-five language surfaces, sustained production SLOs. Dense vector retrieval, hybrid search, and metadata grounding running across hundreds of nodes. Multimodal embedding pipelines for image + text semantic search, with offline and online evaluation frameworks driving relevance quality. Led the migration of all 500M assets from Solr to OpenSearch, enabling cloud-native scale and ML-powered ranking, including Learning-to-Rank. Mentor and tech-lead to L2 / L3 engineers on the team for three years running.

500M+
Stock assets indexed & served
$1M/yr
Infrastructure cost reduction
2,000RPS
Sustained production throughput
25×
Languages supported in retrieval
03 · Work

Selected experience.

Timeline
2022 · Present
Shutterstock
Staff Software Engineer, AI · Multimodal Generative Discovery
  • Architected Generative Discovery infrastructure powering RAG-style GenAI applications across 500M+ stock assets from 2M+ contributors, combining dense vector retrieval, hybrid search, and metadata grounding at 2,000+ RPS across hundreds of nodes, with retrieval surfaces in 25 languages.
  • Drove infrastructure cost reduction of ~$1M annual alongside 25% latency improvement, while simultaneously improving relevance metrics, decoupling scale from quality tradeoffs.
  • Designed and shipped a production agentic query understanding system: LLM-driven intent extraction, query rewriting and expansion, and multi-step retrieval planning integrated into the discovery stack at marketplace scale.
  • Built multimodal embedding pipelines supporting image + text semantic search, with MLflow and SageMaker for feature generation and model lifecycle management.
  • Designed offline and online evaluation frameworks for retrieval quality using precision/recall, relevance judgments, and human-in-the-loop feedback loops, supporting strict production SLOs.
  • Led the Solr → OpenSearch migration of all 500M assets, redesigning the retrieval layer for cloud-native scale and unlocking ML-powered ranking pipelines including Learning-to-Rank.
  • Performance work spanning FAISS, vLLM, vector index tuning, and JVM / Lucene profiling, closing the loop between retrieval architecture and the underlying systems.
  • Mentor and tech-lead to L2 / L3 engineers on the team for three consecutive years, reviewing designs, driving technical direction on retrieval and ranking workstreams, and growing the bench.
2020 to 2022
Redis
Professional Services Consulting Engineer
  • Designed and deployed vector similarity search systems using Redis; advised enterprise customers on semantic search and retrieval architectures for ML workloads.
  • Tuned latency-sensitive distributed systems for production environments with strict SLA requirements at scale.
  • Led developer and admin workshops on Redis data structures, RediSearch, and graph workloads.
2018 to 2020
Lucidworks
Senior Search Consultant
  • Architected enterprise search platforms using Lucidworks Fusion, Solr, and Spark; implemented Learning-to-Rank and ML-based relevance pipelines.
  • Designed full-stack search applications for digital workplace and commerce verticals.
  • Implemented large-scale search analytics using Apache Spark and Solr.
2009 to 2017
Earlier roles
Foundations in search, retrieval, & large-scale Java systems

Search Consultant / Lucidworks India · Senior Software Engineer / Target.com India · Sr. Java Programmer / Happiest Minds · Search Consultant / Sony India · Software Engineer / PointCross Life Sciences · Search Intern / Glassdoor USA

04 · Speaking

On stage.

Talks
2026 Speaker

Agentic Search Patterns

Optimized AI Conference
2026 Upcoming

Generative Discovery: Intent, Context & Cognition

OpenSearchCon India
05 · Stack

Tools of the trade.

Capabilities
AI & Retrieval
RAG · Vector Search · Hybrid Retrieval · Multimodal AI · Agentic Systems · Embeddings · LLMOps · Prompt Engineering
Search Platforms
OpenSearch · Elasticsearch · Apache Solr · Apache Lucene · Learning-to-Rank · RediSearch
Performance & Systems
FAISS · vLLM · Vector Index Tuning · JVM & Lucene Profiling · Distributed Systems Perf · Latency-Sensitive Production SLOs
Languages
Java · Python · Scala · C++ · Bash
Cloud & Infra
AWS Bedrock · AWS SageMaker · Kubernetes · Terraform · MLflow · Apache Spark · CI/CD
06 · Credentials

Education & certifications.

Background
Education

M.S. Computer Science

San Francisco State University · 2016 to 2018

Research: Efficient and effective search for large textual collections using machine learning techniques.

Education

B.E. Engineering

Visvesvaraya Technological University · 2005 to 2009

Instrumentation & Technology.

Certification

AWS Certified GenAI Developer Professional

Amazon Web Services

Production-grade generative AI applications on AWS.

Continuous Learning

DeepLearning.AI · 40+ courses

Andrew Ng & collaborators

RAG, agents, LLMOps, transformers, evaluation, fine-tuning.

Open Source

Apache Lucene · Solr · OpenSearch

Issue reports & pull requests

Occasional contributor to the foundational projects of the search and retrieval ecosystem.

Recognition

Hackathons & internal demos

Multiple events

Built and presented prototypes for agentic search and generative discovery patterns.

Open to staff and principal AI engineering roles in New York. Let's talk about retrieval, agents, or whatever you're building.

Based
New York, NY