“I worked with Jairaj for close to 3 years in the data science team at Swiggy. I was reporting directly to him and got first hand experience of working with him. One thing that immediately comes out while working with him is that he understands the business space and the company's vision clearly and trickles it down to his team which gives the team a clear idea. As a manager - he is helpful, kind, flexible, open to discussions and understanding of one's situation. He nudges one to grow while giving them flexibility to identify ways to do that. He gained my trust as a manager/ leader and always had my back in non-trivial situations while mentoring me if my stance towards a situation could be improved. He won't push decisions on you but nudges you to pick the right ones. As a data scientist, he guided me in all aspects of data science - data wrangling, model building, presenting results and helping the model reach production. His technical depth and business understanding allowed the team to work on interesting and impactful problems. He understands end-to-end of one's project and will help in resolving blockers. The expectations set are reasonable and mutually discussed. He makes sure that one is growing both professionally and personally on the job. He gives due consideration to any out-of-the-box ideas and gives the freedom to explore those. He encourages the team to publish papers/ blogs, attend conferences, present outside and will be open to any other ideas one may have. He creates a collaborative, friendly environment in the team. There were regular check-in, both 1:1 and with the team, more so during remote work. I enjoyed working with Jairaj, learning from his experiences and growing under his leadership. Would highly recommend anyone to work with Jairaj. I wish him the very best.”
Jairaj Sathyanarayana
Bengaluru, Karnataka, India
25K followers
500+ connections
About
All things data at Tata Digital.
I am an accomplished data leader with 23+ years…
Activity
-
🚀 Today marks an extraordinary milestone in Netskope’s journey: we have officially become a public company. 💡 When I started Netskope, it was…
🚀 Today marks an extraordinary milestone in Netskope’s journey: we have officially become a public company. 💡 When I started Netskope, it was…
Liked by Jairaj Sathyanarayana
Experience
Education
Publications
-
Improving search relevance in hyperlocal food delivery using language models
CODS-COMAD '24: Proceedings of the 7th Joint International Conference on Data Science & Management of Data (11th ACM IKDD CODS and 29th COMAD)
The ability to accurately understand and serve customer search queries is critical to the success of e-commerce platforms. This need is amplified in food delivery platforms operating in India due to the wide variety of languages, cuisines and tastes. Our platform alone offers millions of items from hundreds of thousands of restaurants across India. Not only do Indian dish names have a tremendous amount of regional variety, international cuisine served in India is typically customized to Indian…
The ability to accurately understand and serve customer search queries is critical to the success of e-commerce platforms. This need is amplified in food delivery platforms operating in India due to the wide variety of languages, cuisines and tastes. Our platform alone offers millions of items from hundreds of thousands of restaurants across India. Not only do Indian dish names have a tremendous amount of regional variety, international cuisine served in India is typically customized to Indian palate. This variety is reflected in search queries which can be combinations of dish names, dish categories, cuisine, preparation style, occasions, dietary preferences, just to name a few. Additionally, queries can also be generic and conversational in nature when the customer is trying to explore. Given these complexities, language models can play a significant role in intelligently embedding the query context to retrieve relevant items. In this paper, we start with an open source language model almpnet-base-v2 seeking a balance of quality and latency requirements for our use-case. We propose a two-stage fine-tuning approach. In the first stage we use unsupervised training on historical search and ordering data to adapt the model to the food domain. The second stage uses supervised training on a strategically curated dataset containing a diverse set of search queries and items relevant to those queries. We conducted offline evaluation on a curated test dataset of complex, conversation-style natural language queries in two different geographical areas, one with a sparse supply of restaurants and another with dense supply. Our model showed an improvement of 14.94% in MAP and 19.22% in Precision@1 in the dense area and an improvement of 7.04% in mean average precision and 10.27% in Precision@1 in the sparse area, versus our baseline Search.
Other authorsSee publication -
DyCOD- Determining cash-on-delivery limits for real-time e-commerce transactions via constrained optimisation modelling
Machine Learning and Knowledge Discovery in Databases. Applied Data Science Track. ECML PKDD 2023
Paying for deliveries using cash after the delivery is made is a popular mode of payment for customers transacting online for the first time or those that prefer more control, especially in emerging economies like India. While the cash (or pay)-on-delivery (COD or POD) option is a win-win for customers and e-commerce platforms, it also opens up surface area for fraud and abuse. A common risk mitigation strategy is to impose a limit on the order value that can be paid using COD. In our…
Paying for deliveries using cash after the delivery is made is a popular mode of payment for customers transacting online for the first time or those that prefer more control, especially in emerging economies like India. While the cash (or pay)-on-delivery (COD or POD) option is a win-win for customers and e-commerce platforms, it also opens up surface area for fraud and abuse. A common risk mitigation strategy is to impose a limit on the order value that can be paid using COD. In our experience and literature survey, these limits are typically blunt (a single limit for a city or zip code) and set by business teams using heuristics and primarily from a risk-management-backwards view. This one-size-fits-all approach means we leave money on the table on customer groups where the limits are too strict and lose money on groups where they are lax. We need to concurrently balance the risk-management and the customer-preference angles. Note that this is different from a typical credit-scoring approach due to at least two major reasons - 1) the information available in e-commerce, especially online food delivery, is much sparser, 2) the limit needs to be calculated dynamically in real-time depending on the transaction value, restaurant and marketplace constraints and network effects. To this end, we present a framework called DyCOD that maps this to a non-linear constrained optimisation problem. To the best of our knowledge there are no published results in this area and our work is the first. We solve this using both heuristic and model driven approaches and run large-scale A/B experiments. Our approaches delivered a 2.1% lift in margin per order vs. the baseline while not breaching any risk check-metrics, which is highly significant at our scale.
Other authors -
A multi-modal approach to mining intent from code-mixed Hindi-English calls in the hyperlocal-delivery domain
24th International Conference on Speech and Computer (SPECOM'22)
Abstract: In this work we outline an approach to mine insights from calls between delivery partners (DP) and customers involved in hyperlocal food delivery in India. Incorrect addresses/ locations or other impediments prompt the DPs to call customers leading to suboptimal experiences like breaches in the promised arrival-time, cancellation, fraud, etc. We demonstrate an end-to-end system that utilizes a multi- modal approach where we combine data across speech, text and geospatial domains to…
Abstract: In this work we outline an approach to mine insights from calls between delivery partners (DP) and customers involved in hyperlocal food delivery in India. Incorrect addresses/ locations or other impediments prompt the DPs to call customers leading to suboptimal experiences like breaches in the promised arrival-time, cancellation, fraud, etc. We demonstrate an end-to-end system that utilizes a multi- modal approach where we combine data across speech, text and geospatial domains to extract the intent behind these calls. To transcribe calls to text, we develop an Automatic Speech Recognition (ASR) engine that works in the Indian context where the calls are typically highly code-mixed (in our case Hindi and English) along with variations in dialects and pronunciations. Additionally in the hyperlocal delivery space, the calls are also corrupted by high levels of background noise due to the nature of the business. Starting with Wav2Vec2.0 as the base we carried out a series of data and model based experiments to progressively reduce the WER from 85.30% to 31.17%. The transcripts from the ASR engine are encoded into embeddings by adapting an IndicBERT based model. Features extracted from the geospatial markers of calls are concatenated with the embeddings and passed through an XGBoost classification head to classify calls into one of three intents. Through ablation studies we show incremental improvements attributable to signals from different modalities. The winning multi-modal model has a macro average precision of 68.33% which is a 29.3pp lift over the baseline not utilizing all the modalities.
Other authorsSee publication -
Identifying fraud rings using domain-aware weighted community detection
International IFIP Cross Domain (CD) Conference for Machine Learning & Knowledge Extraction (MAKE) CD-MAKE 2022
With the increase in online platforms, the surface area of malicious activities has increased manifold. Bad actors abuse policies and services like claims, coupons, payouts, etc. to gain material benefits. These fraudsters often work collusively (rings) and it is difficult to identify underlying relationships between them when analyzing individual actors. Fraud rings’ identification can be modeled as a community detection problem on graphs where nodes are the actors and the edges represent…
With the increase in online platforms, the surface area of malicious activities has increased manifold. Bad actors abuse policies and services like claims, coupons, payouts, etc. to gain material benefits. These fraudsters often work collusively (rings) and it is difficult to identify underlying relationships between them when analyzing individual actors. Fraud rings’ identification can be modeled as a community detection problem on graphs where nodes are the actors and the edges represent common attributes between them. However the challenge lies in incorporating the attributes’ domain-informed importance and hierarchy in coming up with edge weights. Treating all edge types as equal (and binary) can be fairly naive; we show that using domain knowledge considerably outperforms other methods. For community detection itself, while the weight information is expected to be learned automatically in deep learning based methods like GNNs, it is explicitly provided in traditional methods. In this paper, we propose a scalable and extensible end-to-end framework based on domain-aware weighted community detection, to detect fraud rings. We first convert a multi- edge weighted graph into a homogenous weighted graph and perform domain aware edge-weight optimization to maximize modularity, using the Leiden community detection algorithm. We then use features of communities and nodes to classify both community and a node as fraud or not.
We show that our methods achieve up to 9.92% lift in F1-score on internal data, which is significant at our scale, and up to 4.81% F1-score lift on two open datasets (Amazon, Yelp) vs. an XGBoost based baseline.Other authorsSee publication -
FoodNet: Simplifying Online Food Ordering with Contextual Food Combos
CODS-COMAD 2022: 5th Joint International Conference on Data Science & Management of Data (9th ACM IKDD CODS and 27th COMAD), Bangalore, India, January 2022
Bundling complementary dishes into easy-to-order food combos is vital to providing a seamless food ordering experience. Manually curating combos across several thousands of restaurants and millions of dishes is neither scalable nor can be personalized. We propose FoodNet, an attention-based deep learning architecture with a monotonically decreasing constraint of diversity, to recommend personalized two-item combos from across different restaurants. In a large-scale evaluation involving 200…
Bundling complementary dishes into easy-to-order food combos is vital to providing a seamless food ordering experience. Manually curating combos across several thousands of restaurants and millions of dishes is neither scalable nor can be personalized. We propose FoodNet, an attention-based deep learning architecture with a monotonically decreasing constraint of diversity, to recommend personalized two-item combos from across different restaurants. In a large-scale evaluation involving 200 million candidate combos, we show that FoodNet outperforms the Transformer based model by 1.3%, the Siamese network based model by 13.6%, and the traditional Apriori baseline by 18.8% in terms of NDCG, which are significant improvements at our scale. We also present qualitative results to show the importance of attention and lattice layers in the proposed architecture.
We won the 'best paper award' in the Applied DS track for this paper.Other authorsSee publication -
DeFraudNet: An end-to-end weak supervision framework to detect fraud in online food delivery
Machine Learning and Knowledge Discovery in Databases. Applied Data Science Track. ECML PKDD 2021
Detecting abusive and fraudulent claims is one of the key challenges in online food delivery. This is further aggravated by the fact that it is not practical to do reverse-logistics on food unlike in e-commerce. This makes the already-hard problem of harvesting labels for fraud even harder because we cannot confirm if the claim was legitimate by inspecting the item(s). Using manual effort to analyze transactions to generate labels is often expensive and time-consuming. On the other hand…
Detecting abusive and fraudulent claims is one of the key challenges in online food delivery. This is further aggravated by the fact that it is not practical to do reverse-logistics on food unlike in e-commerce. This makes the already-hard problem of harvesting labels for fraud even harder because we cannot confirm if the claim was legitimate by inspecting the item(s). Using manual effort to analyze transactions to generate labels is often expensive and time-consuming. On the other hand, typically, there is a wealth of ‘noisy’ information about what constitutes fraud, in the form of customer service interactions, weak and hard rules derived from data analytics, business intuition and domain understanding.
In this paper, we present a novel end-to-end framework for detecting fraudulent transactions based on large-scale label generation using weak supervision. We directly use Stanford AI Lab’s (SAIL) Snorkel and tree based methods to do manual and automated discovery of labeling functions, to generate weak labels. We follow this up with an auto-encoder reconstruction-error based method to reduce label noise. The final step is a discriminator model which is an ensemble of an MLP and an LSTM. In addition to cross-sectional and longitudinal features around customer history, transactions, we also harvest customer embeddings from a Graph Convolution Network (GCN) on a customer-customer relationship graph, to capture collusive behavior. The final score is thresholded and used in decision making.
This solution is currently deployed for real-time serving and has yielded a 16% points’ improvement in recall at a given precision level. These results are against a baseline MLP model based on manually labeled data and are highly significant at our scale. Our approach can easily scale to additional fraud scenarios or to use-cases where ‘strong’ labels are hard to get but weak labels are prevalent.Other authorsSee publication -
Semantic Embeddings for Food Search Using Siamese Networks
Proceedings of 4th International Conference on Natural Language Processing and Information Retrieval (NLPIR), Seoul, S. Korea
Efficient and effective search is a key driver of business in e-commerce. Functionally, most search systems consist of retrieval and ranking phases. While the use of methods like Learning to Rank (LTR) for (re)ranking has been studied widely, most retrieval systems in the industry are still predominantly based on variants of text matching. Because text matching cannot capture the semantic intent of the query, most out-of-vocabulary (OOV) queries are either not handled at all or poorly handled…
Efficient and effective search is a key driver of business in e-commerce. Functionally, most search systems consist of retrieval and ranking phases. While the use of methods like Learning to Rank (LTR) for (re)ranking has been studied widely, most retrieval systems in the industry are still predominantly based on variants of text matching. Because text matching cannot capture the semantic intent of the query, most out-of-vocabulary (OOV) queries are either not handled at all or poorly handled by matching to similarly-spelled entities. For niche e-commerce like food delivery apps operating on phonetically spelled, non-Western dish names, this problem is even more acute. Pre-trained word embedding models are of limited help because the majority of dish names are words that occur rarely or not at all in most openly available vocabularies. In this work, we present experiments and efficient Siamese network based models to learn dish embeddings from scratch. Compared to current baselines, we demonstrate that these models lead to a 3-5% improvement in Mean Reciprocal Rank (MRR) and Recall@k. We also quantify, using a combination of in-house Food Taxonomy and the Davies-Bouldin (DB) index, that the new embeddings capture semantic information with an improvement of up to 20% over baseline.
Other authorsSee publication -
Sample-Rank: Weak Multi-Objective Recommendations Using Rejection Sampling
Workshop on Utility-Driven Mining and Learning at the 20th IEEE International Conference on Data Mining (ICDM 2020)
Online food ordering marketplaces are multi-stakeholder systems where recommendations impact the experience and growth of each participant in the system. A recommender system in this setting has to encapsulate the objectives and constraints of different stakeholders in order to find utility of an item for recommendation. Constrained-optimization based approaches to this problem typically involve complex formulations and have high computational complexity in production settings involving…
Online food ordering marketplaces are multi-stakeholder systems where recommendations impact the experience and growth of each participant in the system. A recommender system in this setting has to encapsulate the objectives and constraints of different stakeholders in order to find utility of an item for recommendation. Constrained-optimization based approaches to this problem typically involve complex formulations and have high computational complexity in production settings involving millions of entities. Simplifications and relaxation techniques (for example, scalarization) help but introduce sub-optimality and can be time-consuming due to the amount of tuning needed. In this paper, we introduce a method involving multi-goal sampling followed by ranking for user-relevance (Sample-Rank), to nudge recommendations towards multi-objective (MO) goals of the marketplace. The proposed method's novelty is that it reduces the MO recommendation problem to sampling from a desired multi-goal distribution then using it to build a production-friendly learning-to-rank (LTR) model. In offline experiments we show that we are able to bias recommendations towards MO criteria with acceptable trade-offs in metrics like AUC and NDCG. We also show results from a large-scale online A/B experiment where this approach gave a statistically significant lift of 2.64% in average revenue per order (RPO) (objective #1) with no drop in conversion rate (CR) (objective #2) while holding the average last-mile traversed flat (objective #3), vs. the baseline ranking method. This method also significantly reduces time to model development and deployment in MO settings and allows for trivial extensions to more objectives and other types of LTR models.
Other authorsSee publication
Honors & Awards
-
Microsoft High Potential Leader Bench Program FY2017
Microsoft
Top 1% employees
-
Microsoft High Potential Leader Bench Program FY2014
Microsoft
Top 1% employees
-
Microsoft High Potential Leader Bench Program FY2013
Microsoft
Microsoft's 'High-Potential' program that selects & mentors high performers for accelerated career growth and impact.
Languages
-
English
Native or bilingual proficiency
-
Kannada
Native or bilingual proficiency
-
Tamil
Professional working proficiency
-
Hindi
Professional working proficiency
Recommendations received
8 people have recommended Jairaj
Join now to viewMore activity by Jairaj
-
After an incredible 6.5-year journey, I've just closed my chapter at DoorDash to prioritize my health. Please check out my latest Medium article…
After an incredible 6.5-year journey, I've just closed my chapter at DoorDash to prioritize my health. Please check out my latest Medium article…
Liked by Jairaj Sathyanarayana
-
Excited to have Jairaj Sathyanarayana, SVP and Chief Data & AI Officer at Tata Digital, join Encore 2025. He will speak on how data science and…
Excited to have Jairaj Sathyanarayana, SVP and Chief Data & AI Officer at Tata Digital, join Encore 2025. He will speak on how data science and…
Liked by Jairaj Sathyanarayana
-
The Rainmatter-Zerodha culture. Work hard at office and harder outside it. This is our team on a Sunday morning after a 2hr workout along with our…
The Rainmatter-Zerodha culture. Work hard at office and harder outside it. This is our team on a Sunday morning after a 2hr workout along with our…
Liked by Jairaj Sathyanarayana
-
After an enriching 12-year tenure at Amazon Advertising, I am embarking on a new chapter in my career journey. My time at Amazon presented me with…
After an enriching 12-year tenure at Amazon Advertising, I am embarking on a new chapter in my career journey. My time at Amazon presented me with…
Liked by Jairaj Sathyanarayana
-
At Amazon, if the payments system went down, everyone knew right away. Graphs showed revenue loss by the minute, executives got alerts, and entire…
At Amazon, if the payments system went down, everyone knew right away. Graphs showed revenue loss by the minute, executives got alerts, and entire…
Liked by Jairaj Sathyanarayana
-
I think I need a career break. 9 months back, on a train to Florence, I suddenly blurted this out to my wife. She heard me, she knew this idea was…
I think I need a career break. 9 months back, on a train to Florence, I suddenly blurted this out to my wife. She heard me, she knew this idea was…
Liked by Jairaj Sathyanarayana
-
Nothing beats in-person/ classroom-setting training to understand gen AI fundamentals. Glad to have partnered with Abhijith Neerkaje and Ajay Shenoy…
Nothing beats in-person/ classroom-setting training to understand gen AI fundamentals. Glad to have partnered with Abhijith Neerkaje and Ajay Shenoy…
Posted by Jairaj Sathyanarayana
-
Happy to be upgraded to Professor at Indian Institute of Management Bangalore!
Happy to be upgraded to Professor at Indian Institute of Management Bangalore!
Liked by Jairaj Sathyanarayana
-
One of the strangest lessons I learned at Amazon was that you can’t always trust your metrics—because your customers might have already adapted to…
One of the strangest lessons I learned at Amazon was that you can’t always trust your metrics—because your customers might have already adapted to…
Liked by Jairaj Sathyanarayana
-
It feels like just yesterday I walked into the Swiggy office, and now, 7 years later, I've just logged off for the last time. What an absolutely…
It feels like just yesterday I walked into the Swiggy office, and now, 7 years later, I've just logged off for the last time. What an absolutely…
Liked by Jairaj Sathyanarayana
-
2025 has been a whirlwind for me personally. January officially marked more than half a decade (!!) since I started working on the idea that would…
2025 has been a whirlwind for me personally. January officially marked more than half a decade (!!) since I started working on the idea that would…
Liked by Jairaj Sathyanarayana
-
Thrilled to share some exciting news! 🎉 After an amazing journey working part-time with Danucore, I’m officially joining the team full-time as an…
Thrilled to share some exciting news! 🎉 After an amazing journey working part-time with Danucore, I’m officially joining the team full-time as an…
Liked by Jairaj Sathyanarayana
Other similar profiles
Explore collaborative articles
We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.
Explore More