Using AI to Enhance
the Quality of Retail
Product Metadata
By increasing the transparency of product information
metadata, retailers can help consumers make more informed
purchase decisions – and compete more effectively with digital
pure-plays. Here’s how retailers can accomplish this goal, using
machine learning and deep learning techniques.
March 2018
DIGITAL SYSTEMS & TECHNOLOGY
22
Digital Systems & Technology
| Using AI to Enhance Retail Product Metadata Quality
EXECUTIVE SUMMARY
With online sales growing faster than ever,1
traditional retailers are increasing their
investments in omnichannel strategies and redoubling their efforts to meet online consumer
demands. One of the most effective ways to keep pace with the giants of e-commerce is to
offer superior product discovery and selection capabilities, which requires detailed product
information and critical product-specific attributes, coupled with semantic search.
To enhance online product discovery, retailers must maintain and provide digital images
and videos, catalog descriptions, category-specific metadata (e.g., nutrition information
for food products), stock availability, product matrices (e.g., size ranges), company/brand
logos, product ratings and reviews, pricing, and promotions information for all physical
stock keeping units (SKU). Acquiring this information from suppliers is a time-consuming
task, requiring various methods and a significant amount of manual activity.
Concurrently, many retailers face tremendous product data management challenges as
product data is stored in different locations and formats. Another challenge is duplicate
data. As a result, many retailers have incomplete and inaccurate product information on
their websites and in their systems, with little adherence to data standards and controls,
which undermines their competitiveness.
To alleviate this problem, we have built a system that extracts product attributes from
food product label images, using computer vision, natural language processing (NLP),
optical code recognition (OCR) and machine learning/deep learning techniques. Using
these technologies, the system can extract product metadata such as product title, product
description, volume/weight, nutrition facts, company/product logos and barcode.
Test results in our labs show 95% accuracy for attribute extraction from high-quality
product images featuring machine-printed characters with contrasting backgrounds. This
white paper offers perspective on how retailers can take advantage of this solution to get
their product metadata house in order.
Product Information Quality and Completeness Impacts Product Sales
3Using AI to Enhance Retail Product Metadata Quality |
Digital Systems & Technology
THE PRODUCT METADATA DILEMMA
In an increasingly e-commerce-driven retail environment, retailers need quality metadata and powerful
search platforms to entice customers and help them make effective purchase decisions. However, their
inability to easily deliver complete and accurate product information puts them at a severe disadvantage.
Retailers typically rely on suppliers to provide product images and metadata through various methods
(electronic data interchange, printed or digital catalogs) and various formats (text, Excel, PDF, XML).
Different suppliers often supply inconsistent content for the same products, and few share usable
images. Retailers also don’t have a simple way of validating the metadata and images before storing
the content on their respective systems.
Further, retailers often purchase product information from third-party providers and online Universal
Product Code (UPC) databases. UPC product information is used as an input to databases that val-
idate available product metadata. However, online metadata databases are not always accurate; in
fact, the data sometimes differs from one UPC database to another.
And yet, next to price, high-quality product images and product metadata (nutrition/ingredients, any
special warning messages about the product) are a primary driver for consumer purchases. According
to Retailer Brand Services, 97.7% of shoppers expect retailers to show comprehensive product data.2
In fact, there is an indisputable link between the quality and completeness of online product content
and sales (see Figure 1).
Item number Low-res image Product space Product description High-res image 360 image Video
-80%
Base
+35%
ConversionRate
Content Completeness and Quality
Source: Shotfarm, 2016
Figure 1
Product Information Influences E-Commerce Returns
Source: IHL Group/Order Dynamics
Figure 2
4
Digital Systems & Technology
| Using AI to Enhance Retail Product Metadata Quality
Retailers also face several technology challenges when trying to extract content from retail product
label images, including region segmentation, diverse product backgrounds, natural settings, typo-
graphic and font usage, cursive/handwritten text, lighting conditions, camera artifacts and low-quality
images. Other obstacles include:
•	 Product size: Variations in product size limit the product details that the camera can capture and
determine the camera that should be used.
•	 Dispersed information: Product metadata may either exist on the external packaging (top, bottom
or side) or on the product itself, which can only be seen when the product is unpacked.
•	 Information alignment: Text may be aligned at different angles, posing a challenge for easy extraction.
THE IMPACT OF POOR QUALITY DATA
In the ever-evolving online marketplace, there is still much to be studied and discovered about which
types of content best influence shopper purchasing decisions. There are no definitive rules to be
followed regarding the optimal number of product images or videos to be displayed by category or
sector, or about the preferred character length of product descriptions. However, potential complexi-
ties notwithstanding, a solid formula for online success is:
Incremental improvement in the accuracy and completeness of online product content = incremental
increase in sales performance
Inaccurate or incomplete product content is damaging to both shoppers’ perception of and trust in
brands and retailers. In fact, a 2015 study calculated that consumers return $642.6 billion in goods
each year, or an estimated 4.4% of $14.5 trillion in global retail sales.3
While the biggest reason for
returns is poor-quality products or the purchase of the wrong item, a big reason for e-commerce
returns specifically is that the item didn’t match the description (see Figure 2).
0%
Store E-commerce
10%
20%
30%
40%
50%
60%
70%
80%
90%
100%
W
rongsize
Late
delivery
DefectiveDidn’tm
atch
the
description
W
rongitem
PoorqualityBuyer’srem
orse
Gift
Fraud
Price
Other
Input Product Labels Images
Output Display Attributes using UI
Database
Image Preprocessing
Region of Interest Detection
Attribute Extraction
OCR NLP ML
A Conceptual Architecture for Extracting Metadata from Retail Product Images
Figure 3
5Using AI to Enhance Retail Product Metadata Quality |
EFFECTIVE EXTRACTION OF PRODUCT METADATA
We have built a solution to ease the extraction of retail product metadata from product label images.
As depicted in Figure 3, our solution photographs a retail product carton from all sides. The captured
images are fed into an algorithm that performs data extraction. Image pre-processing techniques are
then applied to identify various regions of interest.
For each of these regions of interest, the text attributes are extracted using OCR, and are then
improved using machine learning and natural language processing (NLP) techniques before being
saved to a database. Similarly, brand and food certification logo detection is conducted using com-
puter vision and machine learning techniques. The attributes extracted include brand name, product
name, logo, food certification logo, net weight, nutrition facts and bar code. (See Figure 4 , next page,
for a visual summary of these extraction steps.)
While the biggest reason for returns is poor-
quality products or the purchase of the
wrong item, a big reason for e-commerce
returns specifically is that the item
didn’t match the description
Input Image
• Image
• Video
Output
• Display attributes
using UI
Background Removal
• Illumination correction
• Global threshold
• Projection
Image Quality
Check
• OCR
• Metrics
• Classification
Attribute Extraction
• Brand/logo name
• Product name
• Food certificate
• Net weight
• Nutrition facts
• Barcode
1 2
3
4
5
Metadata Extraction: A Five-Step Process
6
Digital Systems & Technology
| Using AI to Enhance Retail Product Metadata Quality
Background Removal
A background removal subsystem removes background color information from different product label
images. This is done to improve character recognition accuracy and product label image acceptance.
Product label images, with different gradient, solid colors and complex natural scenes, undergo image
preprocessing techniques such as background removal to identify and extract the regions of interest.
The illumination correction is calculated using morphological operation on gray-scale images.
Edge information is obtained using global threshold techniques. To get the region of interest image,
horizontal and vertical projection is applied on the extracted images.
Image Quality Check
The product label images are then put through a document acceptance check subsystem to filter and
classify images, which ensures the product attributes are extracted reliably. The subsystem puts the
documents into three buckets: “accept,” “needs manual intervention” and “reject.” Accepted docu-
ments can be automatically processed, with no manual intervention.
Because traditional OCR systems don’t support cursive/typographic font text, a detection module is
used to identify such text regions.
Text region detection is performed by combining maximally stable extremal region (MSER)4
and
Niblack algorithms,5
which results in low rates of false text detection. Stroke width transform,6
Euler’s number7
and neighborhood connected-component methods8
are used to validate character/
word regions, and the area of text detection is calculated. Text is extracted using the Nuance OCR
engine.9
The area of OCR character, the mean character height in pixels, and the mean character con-
fidence score are calculated based on text portions occupied by OCR text.
Figure 4
AI-Enabled Attribute Extraction
7Using AI to Enhance Retail Product Metadata Quality |
Digital Systems & Technology
Our proposed system uses the following document acceptance check features as metrics:
•	
•	 Mean character height.
•	 Mean character confidence score.
Classification is carried out by applying a threshold on metrics that result in an automation quality
test. Once completed, the process can proceed.
Regions of interest of barcodes, logos/brand names and certification logos are passed to another logo
detection and recognition module to detect the brand name and food certifications. Template sup-
port is provided to extract different formats of structured text information. Tabular text, volumetric
information and other information each pose their own challenges, for which the extraction process
is detailed below.
Attribute Extraction
Attributes such as brand name, product title, net weight, food certification, net weight/volume and
barcode are extracted from input images with the help of various image processing techniques and
the application of AI and NLP.
Detection Percentage = Area of OCR character
Area of text detection
Registered Trademark
Product Name:
Pumpkin Spice Flax
Food Certification:
Non GMO
Net Weight:
8.4 oz (240g)
Figure 5
Note: Kashi is a registered trademark of Kashi Co.
Extracting Nutrition Facts
8
Digital Systems & Technology
| Using AI to Enhance Retail Product Metadata Quality
Brand Name Detection (Logo Detection)
The OCR text output is sent to an NLP engine to receive entity identification. If the logo is not
detected, then the process moves to image processing, based on brand name detection. Local and
global feature extraction processes are applied to the input image. Features are then transformed to
a bag-of-words model10
using the K-Nearest Neighbors (K-NN) algorithm.11
The Euclidean distance is
calculated from the input image and bag-of-words model to recognize the correct brand name.
Product Title Detection
The product title is found by using NLP-based dictionary management and text similarity.
Standard Certification Detection
Food certification labels such as “USDA” or “gluten-free” are then identified. From the region of inter-
est images, local features are detected. A feature is a part of an image with some special properties
that can be used to perform certain calculations, such as tracking and matching. Here, standard food
certification matching is applied.
In this example, we rely on local features that describe a part of an image, rather than global features, which
describe the image as a whole. Once trained data set features and query image features are found, the pro-
cess moves to the matching phase. This is performed similarly to how brand name detection is conducted.
Nutrition Facts:
Serving Size: 2 bars (40g)
Servings Per Container: 6
Calories: 170
Ingredients: Whole grain oats, dried
cane syrup, rolled whole grain
blend, expeller pressed canola oil,
soy protein
Distributed by
Nutrition Grams % Daily Values
Total Fat
Saturated Fat
...
6
0.5
...
9
3
...
Figure 6
Note: Kashi is a registered trademark of Kashi Co.
Barcode: UPC -018627030126 100% Recycled Food Type: Vegetarian
Package: Multi-layer Wrapper
Package: Paper Box
Reading Barcodes
9Using AI to Enhance Retail Product Metadata Quality |
Digital Systems & Technology
Net Weight Detection/Extraction
To extract net weight/volume/quantity, regular expression techniques have been used on text extracted
using OCR. Regular expressions have been built using key words related to net weight, quantity and volume.
Nutrition Facts Detection and Extraction
As nutrition facts follow a tabular format, morphological operations are used to detect the horizontal and
vertical lines. With horizontal lines reference, each text subregion is cropped, and text is extracted using the
Nuance OCR. Extracted text is corrected with a predefined vocabulary. A rule-based approach is used on
corrected text to extract nutrition data. Figure 6 (previous page) reveals the rule used to extract the nutrition
facts from the text.
Barcode Detection
A third-party tool from a standalone library is applied to detect and recognize barcodes. These library
functions handle UPC-A format (as revealed in Figure 7).
Figure 7
10
Digital Systems & Technology
| Using AI to Enhance Retail Product Metadata Quality
KEY ADVANTAGES OF THIS APPROACH
Product label images are a trusted source of metadata information. Naturally, the process should
improve the quality of product metadata and data consistency. Our solution reduces the burden of
validating product data provided by various vendors, and provides additional information critical for
consumer product discovery, such as brand and certification logos information.
The Results
To assess the performance of our proposed solution, we evaluated it using a real dataset with 352 food
products, encompassing 53 brands containing 955 images (including front-, back-, side-view product
images). Background removal was used to improve OCR accuracy. We tested with product images with
and without background removal, and evaluated character confidence scores and conducted an auto-
mation quality check. (See Figure 8 for the results as applied to one particular product.)
The Accuracy of Image-Based Product Metadata Extraction
Figure 8
Attributes	 No of	 Accuracy
	product
Product name	 83	95.18%
Net weight/	
83	 98.79%volume
Barcode	 83	100.00%
Logo extraction	 38	 98.79%
Standard 	
83	100.00%certification
Nutrition facts	
83	98.10%
extraction
Note: Arrowhead Mills is a registered trademark of Hain Celestial Group.
11Using AI to Enhance Retail Product Metadata Quality |
Digital Systems & Technology
THE ROAD AHEAD
Computer vision and AI-based methods show clear potential for applying process automation to
reduce data inconsistencies and improve metadata data quality, thereby improving the retail indus-
try’s product data capture and metadata extraction processes.
Our proposed computer vision-based approach can be extended to other product categories, such as
health, beauty, books, toys and video games. It can also be extended to improve the extraction pro-
cess for product images with diverse backgrounds, cylindrical and can image labels.
Deep learning techniques can and will be explored to enhance image and text region segmentation
accuracy, as well as support cursive and typographic font-based text extraction. Machine learning and
NLP techniques can be explored to improve text attribute extraction accuracy.
All company names, trade names, trademarks, trade dress, designs/logos, copyrights, images and
products referenced in this white paper are the property of their respective owners. No company ref-
erenced in this white paper sponsored this white paper or the contents thereof.
All materials published herein are protected by copyright laws and international copyright treaty pro-
visions. © 2018, Cognizant Technology Solutions, All Rights Reserved.
Cognizant Technology Solutions and Cognizant Digital Systems and Technology are trademarks owned
by Cognizant Technology Solutions.
12
Digital Systems & Technology
| Using AI to Enhance Retail Product Metadata Quality
FOOTNOTES
1	 Amy Gesenhues, “Report: E-commerce Accounted for 11.7% of Total Retail Sales in 2016, Up 15.6% Over 2015,” Marketing
Land, Feb. 20, 2017, https://marketingland.com/report-e-commerce-accounted-11-7-total-retail-sales-2016-15-6-2015-207088.
2	 “High-Quality, Complete E-commerce Product Content Increases Sales,” BrandView, Sept. 7, 2016, http://www.brandview.
com/2016/09/high-quality-complete-ecommerce-product-content-increases-sales/.
3	 Andria Cheng, “Consumers Return $642.6 Billion in Goods Each Year,” Marketwatch, June 18, 2015, https://www.market-
watch.com/story/consumers-return-6426-billion-in-goods-each-year-2015-06-18.
4	 Tutorial on MSER, VLFeat.org, http://www.vlfeat.org/overview/mser.html.
5	 “Niblack and Sauvola Thresholding,” SciKit-Image, http://scikit-image.org/docs/dev/auto_examples/segmentation/plot_
niblack_sauvola.html.
6	 “Detecting Text in Natural Scenes with Stroke Width Transform,” IEEE, http://ieeexplore.ieee.org/document/5540041/.
7	 Wikipedia entry on Euler’s number: https://en.wikipedia.org/wiki/E_(mathematical_constant).
8	 Wikipedia entry on connected-component labeling: https://en.wikipedia.org/wiki/Connected-component_labeling.
9	 Nuance website: https://www.nuance.com/print-capture-and-pdf-solutions/pdf-and-document-conversion.html.
10	 Jason Brownlee, “A Gentle Introduction to the Bag-of-Words Model,” Machine Learning Mastery, Oct. 9, 2017, https://
machinelearningmastery.com/gentle-introduction-bag-words-model/.
11	 Adi Brohnshtein, “A Quick Introduction to K-Nearest Neighbors Algorithm,” Medium, April 11, 2017, https://medium.com/@adi.
bronshtein/a-quick-introduction-to-k-nearest-neighbors-algorithm-62214cea29c7
ACKNOWLEDGMENTS
The authors would like to thank Mahesh Balaji, Senior Director within Cognizant’s Global Technology Office, for his support and
guidance on this report.
13Using AI to Enhance Retail Product Metadata Quality |
Digital Systems & Technology
14
Digital Systems & Technology
| Using AI to Enhance Retail Product Metadata Quality
Gundimeda
Venugopal
Lead, Cognitive
Computing & Data
Sciences Lab
Gundimeda Venugopal leads the Cognitive Computing & Data Sciences
Lab’s research team within Cognizant’s Global Technology Office. He has
more than 23 years of IT industry experience in the areas of enterprise
architecture, large-scale application and framework development,
artificial intelligence, machine learning, NLP, web spidering, information
extraction, computer vision, speech processing, biometrics, enterprise
search, object-oriented design, web development, middleware,
databases/LDAP, performance tuning, embedded systems, networking,
protocol design and in-support systems. Venu was the solution architect
for multiple large-scale application development projects. He received
a B.Tech. in electrical and electronics engineering from J.N.T.U College
of Engineering, Kakinada, and an M.Tech. in computer science from
Jawaharlal Nehru University, New Delhi. He has filed two patents, written
articles for three research publications and won two innovation awards.
Venu delivered guest lectures in the areas of digital communications
(IIIT, Gwalior) and e-governance (IIM, Bangalore). He can be reached at
Venugopal.Gundimeda@cognizant.com | LinkedIn: https://www.linkedin.
com/in/venugopalgundimeda/.
ABOUT THE AUTHORS
Ramakrishnan
Viswanathan
Manager, Business
Development, Cognitive
Computing & Data
Sciences Lab
Ramakrishnan Viswanathan is a Manager of Business Development
in Cognizant’s Cognitive Computing & Data Sciences Lab within the
company’s Global Technology Office, focusing on AI ML services,
cognitive technology and emerging technologies. In addition to his
time spent in GTO and Cognizant’s Application Value Management
Practice, he has over 13 years of experience in pre-sales, strategic
partnerships, business development, client relations and business
analysis. Ram has an executive program in sales and marketing
(EPSM) degree from Indian Institute of Management, Calcutta (IIMC).
He can be reached at Ramakrishnan.Viswanathan3@cognizant.com |
LinkedIn: www. linkedin.com/in/ramakrishnanviswanathan.
15Using AI to Enhance Retail Product Metadata Quality |
Digital Systems & Technology
Rajkumar Joseph
Architect, Cognitive
Computing & Data
Sciences Lab
Naresh Babu N T
Technology Specialist,
Cognitive Computing &
Data Sciences Lab
Rajkumar Joseph is an Architect within Cognizant’s Cognitive Com-
puting & Data Sciences Lab within the company’s Global Technology
Office. He has over nine years of experience in innovation, research and
product development in the field of computer vision, artificial intelli-
gence, data science, mobile computing and IoT. He received an M.Tech
in industrial mathematics and scientific computing from Indian Insti-
tute of Technology Madras (IITM) and completed an executive program
in business analytics (EPBA) from Indian Institute of Management, Cal-
cutta (IIMC). He can be reached at Rajkumar.Joseph@cognizant.com |
LinkedIn: https://www.linkedin.com/in/rajkumar-j/.
Naresh Babu N T is a Technology Specialist within Cognizant’s Cog-
nitive Computing & Data Sciences Lab within Cognizant’s Global
Technology Office. He has eight years of experience in research
and software development, embedded platform and data analy-
sis. In addition to image and signal processing, his areas of interest
include computer vision, machine learning, pattern recognition and
soft computing. Naresh received an M. S (By Research) from Anna
University (MIT Campus), where he specialized in signal and image
processing. He can be reached at Nareshbabu.nt@cognizant.com |
LinkedIn: https://www.linkedin.com/in/naresh-babu-n-t-79086419/.
COGNIZANT’S COGNITIVE COMPUTING & DATA SCIENCES LAB
The Global Technology Office (GTO) is the core technology organization of Cognizant, with a mission to power and accelerate our capa-
bilities to harness transformative technologies that enable our people, customers and processes to navigate the shift in the work ahead.
As part of GTO, the Cognitive Computing & Data Sciences (CDS) Lab’s vision is to explore emerging and cognitive technology areas in
artificial intelligence, machine learning, natural language processing, voice/speech recognition and computer vision. The CDS lab builds
innovative industry-specific cognitive platforms and solutions for digital business transformation.
ABOUT COGNIZANT
Cognizant (NASDAQ-100: CTSH) is one of the world’s leading professional services companies, transforming clients’ business, operating and
technology models for the digital era. Our unique industry-based, consultative approach helps clients envision, build and run more innova-
tive and efficient businesses. Headquartered in the U.S., Cognizant is ranked 205 on the Fortune 500 and is consistently listed among the
most admired companies in the world. Learn how Cognizant helps clients lead with digital at www.cognizant.com or follow us @Cognizant.
© Copyright 2018, Cognizant. All rights reserved. No part of this document may be reproduced, stored in a retrieval system, transmitted in any form or by any means,electronic,
mechanical, photocopying, recording, or otherwise, without the express written permission from Cognizant. The information contained herein is subject to change without notice. All
other trademarks mentioned herein are the property of their respective owners.
TL Codex 3263
World Headquarters
500 Frank W. Burr Blvd.
Teaneck, NJ 07666 USA
Phone: +1 201 801 0233
Fax: +1 201 801 0243
Toll Free: +1 888 937 3277
European Headquarters
1 Kingdom Street
Paddington Central
London W2 6BD England
Phone: +44 (0) 20 7297 7600
Fax: +44 (0) 20 7121 0102
India Operations Headquarters
#5/535 Old Mahabalipuram Road
Okkiyam Pettai, Thoraipakkam
Chennai, 600 096 India
Phone: +91 (0) 44 4209 6000
Fax: +91 (0) 44 4209 6060

More Related Content

PDF
Business Intelligence for FMCG Business
PPTX
Equitec Case - Consumer Behavior
PPTX
Equitec consumer dynamics mba case study
PPTX
Instant Experts: DATUM, Powerful Product Information to Empower Your Sales En...
PPTX
Power Up Competitive Price Intelligence with Web Data
PDF
STEP (Stibo Enterprise Platform) Trailblazer
PPTX
E marketing research
PPTX
IBM Commerce for Retail
Business Intelligence for FMCG Business
Equitec Case - Consumer Behavior
Equitec consumer dynamics mba case study
Instant Experts: DATUM, Powerful Product Information to Empower Your Sales En...
Power Up Competitive Price Intelligence with Web Data
STEP (Stibo Enterprise Platform) Trailblazer
E marketing research
IBM Commerce for Retail

What's hot (20)

PDF
MR3 READINESS CHEAT SHEET
PDF
Product Data Must Haves
PDF
Power Up Your Competitive Price Intelligence With Web Data
PDF
How Product Information Management Solves Common Problems with Your Clients' ...
PDF
Product Information Management (PIM) system for all types of product - Right ...
PPTX
Product Information Management: Everything you wanted to know but were afraid...
PPTX
Retailing and CRM
PDF
SellPoints Webinar: The Convergence of Ecommerce and Media in Retail
PPTX
E commerce and strategic implication
PDF
What is Product Information Management?
PDF
Creating Business Value - Use Cases in CPG/Retail
PDF
Supply Chain in eCommerce (2011-12)
PPTX
Market analysis
PPTX
Supply chain management IN E COMMERCE
PDF
Ventana Research Product Information Management e-books
PDF
Best Practices for Implementing a Product Information Management System
ODP
Topic 7 Emarketing management
PPTX
What are the benefits of a Product Information Management (PIM) system?
DOCX
Boosting conversion rates on ecommerce using deep learning algorithms
PPT
Role of computer in marketing
MR3 READINESS CHEAT SHEET
Product Data Must Haves
Power Up Your Competitive Price Intelligence With Web Data
How Product Information Management Solves Common Problems with Your Clients' ...
Product Information Management (PIM) system for all types of product - Right ...
Product Information Management: Everything you wanted to know but were afraid...
Retailing and CRM
SellPoints Webinar: The Convergence of Ecommerce and Media in Retail
E commerce and strategic implication
What is Product Information Management?
Creating Business Value - Use Cases in CPG/Retail
Supply Chain in eCommerce (2011-12)
Market analysis
Supply chain management IN E COMMERCE
Ventana Research Product Information Management e-books
Best Practices for Implementing a Product Information Management System
Topic 7 Emarketing management
What are the benefits of a Product Information Management (PIM) system?
Boosting conversion rates on ecommerce using deep learning algorithms
Role of computer in marketing
Ad

Similar to Using AI to Enhance the Quality of Retail Product Metadata (20)

PDF
Optimise Shopper Experiences with a Strong Data Estate.pdf
PDF
How AI Solutions for E-commerce are upgrading the retail Industry 1.pdf
PDF
"AI use cases in retail and e‑commerce "
PPTX
Product Visibility- What Is Seen First, Will ppt.pptx
PDF
Product Information is Key to Winning the Customer Experience Race
PDF
Revolutionizing the Retail Landscape with Cutting-Edge Artificial Intelligenc...
PDF
Data Con LA 2019 - E-Commerce Product Categorization at Scale by Jesse Steinw...
PPTX
Driving-Innovation-and-Technology-Advancements.pptx
PDF
15 ways artificial intelligence is helping e commerce marketers
PPTX
Revolutionizing Retail: The Impact of Artificial Intelligence
PDF
How is AI Technology Changing the E-commerce Game?
PPTX
Franchise India presentation
PPTX
KSB_Global_case_study_competition_final_round_Team_Credible_Creditors.pptx
PPTX
Computer Vision: Coming to a Store Near You
PDF
Reducing Returns to Increase Margin Through Better Product Data
DOCX
Become an AI Product Manager Executive and Lead the Future Act Now
PDF
“Enabling Smart Retail with Visual AI,” a Presentation from 365 Retail Markets
PPTX
Fresh Tech: What grocers need to win in fresh
 
PDF
Computer Vision: Coming to a Store Near You - Brent Biddulph
DOCX
The role of ai in indian banking and retail
Optimise Shopper Experiences with a Strong Data Estate.pdf
How AI Solutions for E-commerce are upgrading the retail Industry 1.pdf
"AI use cases in retail and e‑commerce "
Product Visibility- What Is Seen First, Will ppt.pptx
Product Information is Key to Winning the Customer Experience Race
Revolutionizing the Retail Landscape with Cutting-Edge Artificial Intelligenc...
Data Con LA 2019 - E-Commerce Product Categorization at Scale by Jesse Steinw...
Driving-Innovation-and-Technology-Advancements.pptx
15 ways artificial intelligence is helping e commerce marketers
Revolutionizing Retail: The Impact of Artificial Intelligence
How is AI Technology Changing the E-commerce Game?
Franchise India presentation
KSB_Global_case_study_competition_final_round_Team_Credible_Creditors.pptx
Computer Vision: Coming to a Store Near You
Reducing Returns to Increase Margin Through Better Product Data
Become an AI Product Manager Executive and Lead the Future Act Now
“Enabling Smart Retail with Visual AI,” a Presentation from 365 Retail Markets
Fresh Tech: What grocers need to win in fresh
 
Computer Vision: Coming to a Store Near You - Brent Biddulph
The role of ai in indian banking and retail
Ad

More from Cognizant (20)

PDF
Using Adaptive Scrum to Tame Process Reverse Engineering in Data Analytics Pr...
PDF
Data Modernization: Breaking the AI Vicious Cycle for Superior Decision-making
PDF
It Takes an Ecosystem: How Technology Companies Deliver Exceptional Experiences
PDF
Intuition Engineered
PDF
The Work Ahead: Transportation and Logistics Delivering on the Digital-Physic...
PDF
Enhancing Desirability: Five Considerations for Winning Digital Initiatives
PDF
The Work Ahead in Manufacturing: Fulfilling the Agility Mandate
PDF
The Work Ahead in Higher Education: Repaving the Road for the Employees of To...
PDF
Engineering the Next-Gen Digital Claims Organisation for Australian General I...
PDF
Profitability in the Direct-to-Consumer Marketplace: A Playbook for Media and...
PDF
Green Rush: The Economic Imperative for Sustainability
PDF
Policy Administration Modernization: Four Paths for Insurers
PDF
The Work Ahead in Utilities: Powering a Sustainable Future with Digital
PDF
AI in Media & Entertainment: Starting the Journey to Value
PDF
Operations Workforce Management: A Data-Informed, Digital-First Approach
PDF
Five Priorities for Quality Engineering When Taking Banking to the Cloud
PDF
Getting Ahead With AI: How APAC Companies Replicate Success by Remaining Focused
PDF
Crafting the Utility of the Future
PDF
Utilities Can Ramp Up CX with a Customer Data Platform
PDF
The Work Ahead in Intelligent Automation: Coping with Complexity in a Post-Pa...
Using Adaptive Scrum to Tame Process Reverse Engineering in Data Analytics Pr...
Data Modernization: Breaking the AI Vicious Cycle for Superior Decision-making
It Takes an Ecosystem: How Technology Companies Deliver Exceptional Experiences
Intuition Engineered
The Work Ahead: Transportation and Logistics Delivering on the Digital-Physic...
Enhancing Desirability: Five Considerations for Winning Digital Initiatives
The Work Ahead in Manufacturing: Fulfilling the Agility Mandate
The Work Ahead in Higher Education: Repaving the Road for the Employees of To...
Engineering the Next-Gen Digital Claims Organisation for Australian General I...
Profitability in the Direct-to-Consumer Marketplace: A Playbook for Media and...
Green Rush: The Economic Imperative for Sustainability
Policy Administration Modernization: Four Paths for Insurers
The Work Ahead in Utilities: Powering a Sustainable Future with Digital
AI in Media & Entertainment: Starting the Journey to Value
Operations Workforce Management: A Data-Informed, Digital-First Approach
Five Priorities for Quality Engineering When Taking Banking to the Cloud
Getting Ahead With AI: How APAC Companies Replicate Success by Remaining Focused
Crafting the Utility of the Future
Utilities Can Ramp Up CX with a Customer Data Platform
The Work Ahead in Intelligent Automation: Coping with Complexity in a Post-Pa...

Using AI to Enhance the Quality of Retail Product Metadata

  • 1. Using AI to Enhance the Quality of Retail Product Metadata By increasing the transparency of product information metadata, retailers can help consumers make more informed purchase decisions – and compete more effectively with digital pure-plays. Here’s how retailers can accomplish this goal, using machine learning and deep learning techniques. March 2018 DIGITAL SYSTEMS & TECHNOLOGY
  • 2. 22 Digital Systems & Technology | Using AI to Enhance Retail Product Metadata Quality EXECUTIVE SUMMARY With online sales growing faster than ever,1 traditional retailers are increasing their investments in omnichannel strategies and redoubling their efforts to meet online consumer demands. One of the most effective ways to keep pace with the giants of e-commerce is to offer superior product discovery and selection capabilities, which requires detailed product information and critical product-specific attributes, coupled with semantic search. To enhance online product discovery, retailers must maintain and provide digital images and videos, catalog descriptions, category-specific metadata (e.g., nutrition information for food products), stock availability, product matrices (e.g., size ranges), company/brand logos, product ratings and reviews, pricing, and promotions information for all physical stock keeping units (SKU). Acquiring this information from suppliers is a time-consuming task, requiring various methods and a significant amount of manual activity. Concurrently, many retailers face tremendous product data management challenges as product data is stored in different locations and formats. Another challenge is duplicate data. As a result, many retailers have incomplete and inaccurate product information on their websites and in their systems, with little adherence to data standards and controls, which undermines their competitiveness. To alleviate this problem, we have built a system that extracts product attributes from food product label images, using computer vision, natural language processing (NLP), optical code recognition (OCR) and machine learning/deep learning techniques. Using these technologies, the system can extract product metadata such as product title, product description, volume/weight, nutrition facts, company/product logos and barcode. Test results in our labs show 95% accuracy for attribute extraction from high-quality product images featuring machine-printed characters with contrasting backgrounds. This white paper offers perspective on how retailers can take advantage of this solution to get their product metadata house in order.
  • 3. Product Information Quality and Completeness Impacts Product Sales 3Using AI to Enhance Retail Product Metadata Quality | Digital Systems & Technology THE PRODUCT METADATA DILEMMA In an increasingly e-commerce-driven retail environment, retailers need quality metadata and powerful search platforms to entice customers and help them make effective purchase decisions. However, their inability to easily deliver complete and accurate product information puts them at a severe disadvantage. Retailers typically rely on suppliers to provide product images and metadata through various methods (electronic data interchange, printed or digital catalogs) and various formats (text, Excel, PDF, XML). Different suppliers often supply inconsistent content for the same products, and few share usable images. Retailers also don’t have a simple way of validating the metadata and images before storing the content on their respective systems. Further, retailers often purchase product information from third-party providers and online Universal Product Code (UPC) databases. UPC product information is used as an input to databases that val- idate available product metadata. However, online metadata databases are not always accurate; in fact, the data sometimes differs from one UPC database to another. And yet, next to price, high-quality product images and product metadata (nutrition/ingredients, any special warning messages about the product) are a primary driver for consumer purchases. According to Retailer Brand Services, 97.7% of shoppers expect retailers to show comprehensive product data.2 In fact, there is an indisputable link between the quality and completeness of online product content and sales (see Figure 1). Item number Low-res image Product space Product description High-res image 360 image Video -80% Base +35% ConversionRate Content Completeness and Quality Source: Shotfarm, 2016 Figure 1
  • 4. Product Information Influences E-Commerce Returns Source: IHL Group/Order Dynamics Figure 2 4 Digital Systems & Technology | Using AI to Enhance Retail Product Metadata Quality Retailers also face several technology challenges when trying to extract content from retail product label images, including region segmentation, diverse product backgrounds, natural settings, typo- graphic and font usage, cursive/handwritten text, lighting conditions, camera artifacts and low-quality images. Other obstacles include: • Product size: Variations in product size limit the product details that the camera can capture and determine the camera that should be used. • Dispersed information: Product metadata may either exist on the external packaging (top, bottom or side) or on the product itself, which can only be seen when the product is unpacked. • Information alignment: Text may be aligned at different angles, posing a challenge for easy extraction. THE IMPACT OF POOR QUALITY DATA In the ever-evolving online marketplace, there is still much to be studied and discovered about which types of content best influence shopper purchasing decisions. There are no definitive rules to be followed regarding the optimal number of product images or videos to be displayed by category or sector, or about the preferred character length of product descriptions. However, potential complexi- ties notwithstanding, a solid formula for online success is: Incremental improvement in the accuracy and completeness of online product content = incremental increase in sales performance Inaccurate or incomplete product content is damaging to both shoppers’ perception of and trust in brands and retailers. In fact, a 2015 study calculated that consumers return $642.6 billion in goods each year, or an estimated 4.4% of $14.5 trillion in global retail sales.3 While the biggest reason for returns is poor-quality products or the purchase of the wrong item, a big reason for e-commerce returns specifically is that the item didn’t match the description (see Figure 2). 0% Store E-commerce 10% 20% 30% 40% 50% 60% 70% 80% 90% 100% W rongsize Late delivery DefectiveDidn’tm atch the description W rongitem PoorqualityBuyer’srem orse Gift Fraud Price Other
  • 5. Input Product Labels Images Output Display Attributes using UI Database Image Preprocessing Region of Interest Detection Attribute Extraction OCR NLP ML A Conceptual Architecture for Extracting Metadata from Retail Product Images Figure 3 5Using AI to Enhance Retail Product Metadata Quality | EFFECTIVE EXTRACTION OF PRODUCT METADATA We have built a solution to ease the extraction of retail product metadata from product label images. As depicted in Figure 3, our solution photographs a retail product carton from all sides. The captured images are fed into an algorithm that performs data extraction. Image pre-processing techniques are then applied to identify various regions of interest. For each of these regions of interest, the text attributes are extracted using OCR, and are then improved using machine learning and natural language processing (NLP) techniques before being saved to a database. Similarly, brand and food certification logo detection is conducted using com- puter vision and machine learning techniques. The attributes extracted include brand name, product name, logo, food certification logo, net weight, nutrition facts and bar code. (See Figure 4 , next page, for a visual summary of these extraction steps.) While the biggest reason for returns is poor- quality products or the purchase of the wrong item, a big reason for e-commerce returns specifically is that the item didn’t match the description
  • 6. Input Image • Image • Video Output • Display attributes using UI Background Removal • Illumination correction • Global threshold • Projection Image Quality Check • OCR • Metrics • Classification Attribute Extraction • Brand/logo name • Product name • Food certificate • Net weight • Nutrition facts • Barcode 1 2 3 4 5 Metadata Extraction: A Five-Step Process 6 Digital Systems & Technology | Using AI to Enhance Retail Product Metadata Quality Background Removal A background removal subsystem removes background color information from different product label images. This is done to improve character recognition accuracy and product label image acceptance. Product label images, with different gradient, solid colors and complex natural scenes, undergo image preprocessing techniques such as background removal to identify and extract the regions of interest. The illumination correction is calculated using morphological operation on gray-scale images. Edge information is obtained using global threshold techniques. To get the region of interest image, horizontal and vertical projection is applied on the extracted images. Image Quality Check The product label images are then put through a document acceptance check subsystem to filter and classify images, which ensures the product attributes are extracted reliably. The subsystem puts the documents into three buckets: “accept,” “needs manual intervention” and “reject.” Accepted docu- ments can be automatically processed, with no manual intervention. Because traditional OCR systems don’t support cursive/typographic font text, a detection module is used to identify such text regions. Text region detection is performed by combining maximally stable extremal region (MSER)4 and Niblack algorithms,5 which results in low rates of false text detection. Stroke width transform,6 Euler’s number7 and neighborhood connected-component methods8 are used to validate character/ word regions, and the area of text detection is calculated. Text is extracted using the Nuance OCR engine.9 The area of OCR character, the mean character height in pixels, and the mean character con- fidence score are calculated based on text portions occupied by OCR text. Figure 4
  • 7. AI-Enabled Attribute Extraction 7Using AI to Enhance Retail Product Metadata Quality | Digital Systems & Technology Our proposed system uses the following document acceptance check features as metrics: • • Mean character height. • Mean character confidence score. Classification is carried out by applying a threshold on metrics that result in an automation quality test. Once completed, the process can proceed. Regions of interest of barcodes, logos/brand names and certification logos are passed to another logo detection and recognition module to detect the brand name and food certifications. Template sup- port is provided to extract different formats of structured text information. Tabular text, volumetric information and other information each pose their own challenges, for which the extraction process is detailed below. Attribute Extraction Attributes such as brand name, product title, net weight, food certification, net weight/volume and barcode are extracted from input images with the help of various image processing techniques and the application of AI and NLP. Detection Percentage = Area of OCR character Area of text detection Registered Trademark Product Name: Pumpkin Spice Flax Food Certification: Non GMO Net Weight: 8.4 oz (240g) Figure 5 Note: Kashi is a registered trademark of Kashi Co.
  • 8. Extracting Nutrition Facts 8 Digital Systems & Technology | Using AI to Enhance Retail Product Metadata Quality Brand Name Detection (Logo Detection) The OCR text output is sent to an NLP engine to receive entity identification. If the logo is not detected, then the process moves to image processing, based on brand name detection. Local and global feature extraction processes are applied to the input image. Features are then transformed to a bag-of-words model10 using the K-Nearest Neighbors (K-NN) algorithm.11 The Euclidean distance is calculated from the input image and bag-of-words model to recognize the correct brand name. Product Title Detection The product title is found by using NLP-based dictionary management and text similarity. Standard Certification Detection Food certification labels such as “USDA” or “gluten-free” are then identified. From the region of inter- est images, local features are detected. A feature is a part of an image with some special properties that can be used to perform certain calculations, such as tracking and matching. Here, standard food certification matching is applied. In this example, we rely on local features that describe a part of an image, rather than global features, which describe the image as a whole. Once trained data set features and query image features are found, the pro- cess moves to the matching phase. This is performed similarly to how brand name detection is conducted. Nutrition Facts: Serving Size: 2 bars (40g) Servings Per Container: 6 Calories: 170 Ingredients: Whole grain oats, dried cane syrup, rolled whole grain blend, expeller pressed canola oil, soy protein Distributed by Nutrition Grams % Daily Values Total Fat Saturated Fat ... 6 0.5 ... 9 3 ... Figure 6 Note: Kashi is a registered trademark of Kashi Co.
  • 9. Barcode: UPC -018627030126 100% Recycled Food Type: Vegetarian Package: Multi-layer Wrapper Package: Paper Box Reading Barcodes 9Using AI to Enhance Retail Product Metadata Quality | Digital Systems & Technology Net Weight Detection/Extraction To extract net weight/volume/quantity, regular expression techniques have been used on text extracted using OCR. Regular expressions have been built using key words related to net weight, quantity and volume. Nutrition Facts Detection and Extraction As nutrition facts follow a tabular format, morphological operations are used to detect the horizontal and vertical lines. With horizontal lines reference, each text subregion is cropped, and text is extracted using the Nuance OCR. Extracted text is corrected with a predefined vocabulary. A rule-based approach is used on corrected text to extract nutrition data. Figure 6 (previous page) reveals the rule used to extract the nutrition facts from the text. Barcode Detection A third-party tool from a standalone library is applied to detect and recognize barcodes. These library functions handle UPC-A format (as revealed in Figure 7). Figure 7
  • 10. 10 Digital Systems & Technology | Using AI to Enhance Retail Product Metadata Quality KEY ADVANTAGES OF THIS APPROACH Product label images are a trusted source of metadata information. Naturally, the process should improve the quality of product metadata and data consistency. Our solution reduces the burden of validating product data provided by various vendors, and provides additional information critical for consumer product discovery, such as brand and certification logos information. The Results To assess the performance of our proposed solution, we evaluated it using a real dataset with 352 food products, encompassing 53 brands containing 955 images (including front-, back-, side-view product images). Background removal was used to improve OCR accuracy. We tested with product images with and without background removal, and evaluated character confidence scores and conducted an auto- mation quality check. (See Figure 8 for the results as applied to one particular product.) The Accuracy of Image-Based Product Metadata Extraction Figure 8 Attributes No of Accuracy product Product name 83 95.18% Net weight/ 83 98.79%volume Barcode 83 100.00% Logo extraction 38 98.79% Standard 83 100.00%certification Nutrition facts 83 98.10% extraction Note: Arrowhead Mills is a registered trademark of Hain Celestial Group.
  • 11. 11Using AI to Enhance Retail Product Metadata Quality | Digital Systems & Technology THE ROAD AHEAD Computer vision and AI-based methods show clear potential for applying process automation to reduce data inconsistencies and improve metadata data quality, thereby improving the retail indus- try’s product data capture and metadata extraction processes. Our proposed computer vision-based approach can be extended to other product categories, such as health, beauty, books, toys and video games. It can also be extended to improve the extraction pro- cess for product images with diverse backgrounds, cylindrical and can image labels. Deep learning techniques can and will be explored to enhance image and text region segmentation accuracy, as well as support cursive and typographic font-based text extraction. Machine learning and NLP techniques can be explored to improve text attribute extraction accuracy. All company names, trade names, trademarks, trade dress, designs/logos, copyrights, images and products referenced in this white paper are the property of their respective owners. No company ref- erenced in this white paper sponsored this white paper or the contents thereof. All materials published herein are protected by copyright laws and international copyright treaty pro- visions. © 2018, Cognizant Technology Solutions, All Rights Reserved. Cognizant Technology Solutions and Cognizant Digital Systems and Technology are trademarks owned by Cognizant Technology Solutions.
  • 12. 12 Digital Systems & Technology | Using AI to Enhance Retail Product Metadata Quality FOOTNOTES 1 Amy Gesenhues, “Report: E-commerce Accounted for 11.7% of Total Retail Sales in 2016, Up 15.6% Over 2015,” Marketing Land, Feb. 20, 2017, https://marketingland.com/report-e-commerce-accounted-11-7-total-retail-sales-2016-15-6-2015-207088. 2 “High-Quality, Complete E-commerce Product Content Increases Sales,” BrandView, Sept. 7, 2016, http://www.brandview. com/2016/09/high-quality-complete-ecommerce-product-content-increases-sales/. 3 Andria Cheng, “Consumers Return $642.6 Billion in Goods Each Year,” Marketwatch, June 18, 2015, https://www.market- watch.com/story/consumers-return-6426-billion-in-goods-each-year-2015-06-18. 4 Tutorial on MSER, VLFeat.org, http://www.vlfeat.org/overview/mser.html. 5 “Niblack and Sauvola Thresholding,” SciKit-Image, http://scikit-image.org/docs/dev/auto_examples/segmentation/plot_ niblack_sauvola.html. 6 “Detecting Text in Natural Scenes with Stroke Width Transform,” IEEE, http://ieeexplore.ieee.org/document/5540041/. 7 Wikipedia entry on Euler’s number: https://en.wikipedia.org/wiki/E_(mathematical_constant). 8 Wikipedia entry on connected-component labeling: https://en.wikipedia.org/wiki/Connected-component_labeling. 9 Nuance website: https://www.nuance.com/print-capture-and-pdf-solutions/pdf-and-document-conversion.html. 10 Jason Brownlee, “A Gentle Introduction to the Bag-of-Words Model,” Machine Learning Mastery, Oct. 9, 2017, https:// machinelearningmastery.com/gentle-introduction-bag-words-model/. 11 Adi Brohnshtein, “A Quick Introduction to K-Nearest Neighbors Algorithm,” Medium, April 11, 2017, https://medium.com/@adi. bronshtein/a-quick-introduction-to-k-nearest-neighbors-algorithm-62214cea29c7 ACKNOWLEDGMENTS The authors would like to thank Mahesh Balaji, Senior Director within Cognizant’s Global Technology Office, for his support and guidance on this report.
  • 13. 13Using AI to Enhance Retail Product Metadata Quality | Digital Systems & Technology
  • 14. 14 Digital Systems & Technology | Using AI to Enhance Retail Product Metadata Quality Gundimeda Venugopal Lead, Cognitive Computing & Data Sciences Lab Gundimeda Venugopal leads the Cognitive Computing & Data Sciences Lab’s research team within Cognizant’s Global Technology Office. He has more than 23 years of IT industry experience in the areas of enterprise architecture, large-scale application and framework development, artificial intelligence, machine learning, NLP, web spidering, information extraction, computer vision, speech processing, biometrics, enterprise search, object-oriented design, web development, middleware, databases/LDAP, performance tuning, embedded systems, networking, protocol design and in-support systems. Venu was the solution architect for multiple large-scale application development projects. He received a B.Tech. in electrical and electronics engineering from J.N.T.U College of Engineering, Kakinada, and an M.Tech. in computer science from Jawaharlal Nehru University, New Delhi. He has filed two patents, written articles for three research publications and won two innovation awards. Venu delivered guest lectures in the areas of digital communications (IIIT, Gwalior) and e-governance (IIM, Bangalore). He can be reached at Venugopal.Gundimeda@cognizant.com | LinkedIn: https://www.linkedin. com/in/venugopalgundimeda/. ABOUT THE AUTHORS Ramakrishnan Viswanathan Manager, Business Development, Cognitive Computing & Data Sciences Lab Ramakrishnan Viswanathan is a Manager of Business Development in Cognizant’s Cognitive Computing & Data Sciences Lab within the company’s Global Technology Office, focusing on AI ML services, cognitive technology and emerging technologies. In addition to his time spent in GTO and Cognizant’s Application Value Management Practice, he has over 13 years of experience in pre-sales, strategic partnerships, business development, client relations and business analysis. Ram has an executive program in sales and marketing (EPSM) degree from Indian Institute of Management, Calcutta (IIMC). He can be reached at Ramakrishnan.Viswanathan3@cognizant.com | LinkedIn: www. linkedin.com/in/ramakrishnanviswanathan.
  • 15. 15Using AI to Enhance Retail Product Metadata Quality | Digital Systems & Technology Rajkumar Joseph Architect, Cognitive Computing & Data Sciences Lab Naresh Babu N T Technology Specialist, Cognitive Computing & Data Sciences Lab Rajkumar Joseph is an Architect within Cognizant’s Cognitive Com- puting & Data Sciences Lab within the company’s Global Technology Office. He has over nine years of experience in innovation, research and product development in the field of computer vision, artificial intelli- gence, data science, mobile computing and IoT. He received an M.Tech in industrial mathematics and scientific computing from Indian Insti- tute of Technology Madras (IITM) and completed an executive program in business analytics (EPBA) from Indian Institute of Management, Cal- cutta (IIMC). He can be reached at Rajkumar.Joseph@cognizant.com | LinkedIn: https://www.linkedin.com/in/rajkumar-j/. Naresh Babu N T is a Technology Specialist within Cognizant’s Cog- nitive Computing & Data Sciences Lab within Cognizant’s Global Technology Office. He has eight years of experience in research and software development, embedded platform and data analy- sis. In addition to image and signal processing, his areas of interest include computer vision, machine learning, pattern recognition and soft computing. Naresh received an M. S (By Research) from Anna University (MIT Campus), where he specialized in signal and image processing. He can be reached at Nareshbabu.nt@cognizant.com | LinkedIn: https://www.linkedin.com/in/naresh-babu-n-t-79086419/.
  • 16. COGNIZANT’S COGNITIVE COMPUTING & DATA SCIENCES LAB The Global Technology Office (GTO) is the core technology organization of Cognizant, with a mission to power and accelerate our capa- bilities to harness transformative technologies that enable our people, customers and processes to navigate the shift in the work ahead. As part of GTO, the Cognitive Computing & Data Sciences (CDS) Lab’s vision is to explore emerging and cognitive technology areas in artificial intelligence, machine learning, natural language processing, voice/speech recognition and computer vision. The CDS lab builds innovative industry-specific cognitive platforms and solutions for digital business transformation. ABOUT COGNIZANT Cognizant (NASDAQ-100: CTSH) is one of the world’s leading professional services companies, transforming clients’ business, operating and technology models for the digital era. Our unique industry-based, consultative approach helps clients envision, build and run more innova- tive and efficient businesses. Headquartered in the U.S., Cognizant is ranked 205 on the Fortune 500 and is consistently listed among the most admired companies in the world. Learn how Cognizant helps clients lead with digital at www.cognizant.com or follow us @Cognizant. © Copyright 2018, Cognizant. All rights reserved. No part of this document may be reproduced, stored in a retrieval system, transmitted in any form or by any means,electronic, mechanical, photocopying, recording, or otherwise, without the express written permission from Cognizant. The information contained herein is subject to change without notice. All other trademarks mentioned herein are the property of their respective owners. TL Codex 3263 World Headquarters 500 Frank W. Burr Blvd. Teaneck, NJ 07666 USA Phone: +1 201 801 0233 Fax: +1 201 801 0243 Toll Free: +1 888 937 3277 European Headquarters 1 Kingdom Street Paddington Central London W2 6BD England Phone: +44 (0) 20 7297 7600 Fax: +44 (0) 20 7121 0102 India Operations Headquarters #5/535 Old Mahabalipuram Road Okkiyam Pettai, Thoraipakkam Chennai, 600 096 India Phone: +91 (0) 44 4209 6000 Fax: +91 (0) 44 4209 6060