Staff Software Engineer · AI · New York City

Rajani
Maski

Generative Discovery · Agentic Retrieval · Search at Scale

Architecting retrieval systems driven by intent, context, and cognition, bringing fifteen years of search, ranking, and ML infrastructure to the next generation of generative AI.

Open to staff & principal AI roles
NYC · Hybrid or on-site

01 · About

A throughline from search to agents.

Profile

Staff AI engineer focused on Generative Discovery: the discipline of building retrieval that understands what users mean, not just what they type.

Fifteen-plus years across information retrieval, search, ranking, recommendations, deep learning, and (most recently) agentic and multimodal systems. The same problems keep surfacing in new forms: relevance, context, evaluation, latency, scale. The tools change; the discipline does not.

Currently shipping multimodal generative discovery at marketplace scale: 500M assets, two million contributors, twenty-five languages. Speaker, mentor, and occasional contributor to the search projects underneath it all: Lucene, Solr, and OpenSearch.

02 · Now

What I'm building.

Currently

Multimodal Generative Discovery

Shutterstock · May 2022 to present

Architecting the retrieval layer behind RAG-style GenAI applications across one of the largest creative marketplaces in the world: 500M+ stock assets, two million contributors, twenty-five language surfaces, sustained production SLOs. Dense vector retrieval, hybrid search, and metadata grounding running across hundreds of nodes. Multimodal embedding pipelines for image + text semantic search, with offline and online evaluation frameworks driving relevance quality. Led the migration of all 500M assets from Solr to OpenSearch, enabling cloud-native scale and ML-powered ranking, including Learning-to-Rank. Mentor and tech-lead to L2 / L3 engineers on the team for three years running.

500M+

Stock assets indexed & served

$1M/yr

Infrastructure cost reduction

2,000RPS

Sustained production throughput

25×

Languages supported in retrieval

03 · Work

Selected experience.

2022 to Present

Shutterstock

Staff Software Engineer, AI · Multimodal Generative Discovery

Architected Generative Discovery infrastructure powering RAG-style GenAI applications across 500M+ stock assets from 2M+ contributors, combining dense vector retrieval, hybrid search, and metadata grounding at 2,000+ RPS across hundreds of nodes, with retrieval surfaces in 25 languages.
Drove infrastructure cost reduction of ~$1M annual alongside 25% latency improvement, while simultaneously improving relevance metrics, decoupling scale from quality tradeoffs.
Designed and shipped a production agentic query understanding system: LLM-driven intent extraction, query rewriting and expansion, and multi-step retrieval planning integrated into the discovery stack at marketplace scale.
Built multimodal embedding pipelines supporting image + text semantic search, with MLflow and SageMaker for feature generation and model lifecycle management.
Designed offline and online evaluation frameworks for retrieval quality using precision/recall, relevance judgments, and human-in-the-loop feedback loops, supporting strict production SLOs.
Led the Solr → OpenSearch migration of all 500M assets, redesigning the retrieval layer for cloud-native scale and unlocking ML-powered ranking pipelines including Learning-to-Rank.
Performance work spanning FAISS, vLLM, vector index tuning, and JVM / Lucene profiling, closing the loop between retrieval architecture and the underlying systems.
Mentor and tech-lead to L2 / L3 engineers on the team for three consecutive years, reviewing designs, driving technical direction on retrieval and ranking workstreams, and growing the bench.

2020 to 2022

Redis

Professional Services Consulting Engineer

Designed and deployed vector similarity search systems using Redis; advised enterprise customers on semantic search and retrieval architectures for ML workloads.
Tuned latency-sensitive distributed systems for production environments with strict SLA requirements at scale.
Led developer and admin workshops on Redis data structures, RediSearch, and graph workloads.

2018 to 2020

Lucidworks

Senior Search Consultant

Architected enterprise search platforms using Lucidworks Fusion, Solr, and Spark; implemented Learning-to-Rank and ML-based relevance pipelines.
Designed full-stack search applications for digital workplace and commerce verticals.
Implemented large-scale search analytics using Apache Spark and Solr.

2009 to 2017

Earlier roles

Foundations in search, retrieval, & large-scale Java systems

Search Consultant / Lucidworks India · Senior Software Engineer / Target.com India · Sr. Java Programmer / Happiest Minds · Search Consultant / Sony India · Software Engineer / PointCross Life Sciences · Search Intern / Glassdoor USA

04 · Speaking

On stage.

Talks

2026 Speaker

Agentic Search Patterns

Optimized AI Conference

2026 Upcoming

Generative Discovery: Intent, Context & Cognition

OpenSearchCon India

05 · Stack

Tools of the trade.

Capabilities

AI & Retrieval

RAG · Vector Search · Hybrid Retrieval · Multimodal AI · Agentic Systems · Embeddings · LLMOps · Prompt Engineering

Search Platforms

OpenSearch · Elasticsearch · Apache Solr · Apache Lucene · Learning-to-Rank · RediSearch

Performance & Systems

FAISS · vLLM · Vector Index Tuning · JVM & Lucene Profiling · Distributed Systems Perf · Latency-Sensitive Production SLOs

Languages

Java · Python · Scala · C++ · Bash

Cloud & Infra

AWS Bedrock · AWS SageMaker · Kubernetes · Terraform · MLflow · Apache Spark · CI/CD

06 · Credentials

Education & certifications.

Background

Education

M.S. Computer Science

San Francisco State University · 2016 to 2018

Research: Efficient and effective search for large textual collections using machine learning techniques.

Education

B.E. Engineering

Visvesvaraya Technological University · 2005 to 2009

Instrumentation & Technology.

Certification

AWS Certified GenAI Developer Professional

Amazon Web Services · verified on Credly

Production-grade generative AI applications on AWS.

Continuous Learning

DeepLearning.AI · 40+ courses

Andrew Ng & collaborators

RAG, agents, LLMOps, transformers, evaluation, fine-tuning.

Open Source

Apache Lucene · Solr · OpenSearch

Issue reports & pull requests

Occasional contributor to the foundational projects of the search and retrieval ecosystem.

Recognition

Hackathons & internal demos

Spark Hack NYC · YouTube channel

Recent participant in Spark Hack NYC; ongoing video series on agentic search and generative discovery prototypes.

Open to staff and principal AI engineering roles in New York. Let's talk about retrieval, agents, or whatever you're building.

rajani.maski@gmail.com

Phone

+1 (516) 309-0398

linkedin.com/in/rajanimaski

Based

New York, NY

Rajani Maski