SlideShare a Scribd company logo
Machine Teaching for workflow
automation
Ariadna Kramkovska
Muntis Rudzitis
RIGA COMM 2020
Agenda
• Introductions
• What is machine teaching?
• Benefits, our unique tool, use cases
• Change of Paradigm
• Demo 1: trained model
• Machine teaching quick wins
• Automation, document extraction
• Demo 2: document classification
Ariadna Kramkovska
Machine Learning Developer
at Emergn
Muntis Rudzitis
Lead Data Scientist
at Emergn
Presenters
Emergn’s Machine
Learning Lab
Our team's core competencies:
• Business understanding of how to profit
from ML
• Machine learning development
• Deep learning and reinforcement learning
• Data visualization
• Algorithms and ML techniques
• Data processing, cleaning and preparation
OUR TECHNOLOGY STACK
We have established and run the largest ML community
in the Baltics with 1260 field experts as members.
All these activities currently makes our company the
No. 1 Choice for young and experience ML talents.
Machine learning models
Tools and programming languages:
• Python
• TensorFlow, TensorBoard
• R ,OpenCV, Caffe2
• KNIME
• Azure Machine Learning
Studio
• C, C++
Deployment techniques
Platforms and environments:
• Stand alone models
• SAP Hana2
• Microsoft Azure (Cortana
intelligences suite)
• SQL Server
We are partnering with GDEXA to help enable the young
generation with highly demanded skills like applied
AI/ML, Big Data Analytics and Cloud Applications.
What is machine teaching?
Why use it?
Business automation
challenge
From automation with replacement of
humans to augmentation and
empowerment of subject matter experts.
We predict that companies who use
augmented automation technologies to
empower their environments and educate
their people on how to use them for better,
more predictable outcomes, will win by
providing the best service and building
better products.
• Wheels for the mind
• Find a comfortable level of automation
Why are we looking for
machine learning
(ML) alternatives?
ML for automation and
workflows should:
• Be transparent and interactive for
business users
• Include natural language-based
solutions where humans have better
comprehension
• Understand context and learning
from smaller data sets
• ML models should be verified and
monitored
AI director at
Facebook
YANN LECUN
ATARI GAME
Self-driving cars need
millions of hours of
training to reach
human level trained in
about 20 hours.
In 80 hours machine
will reach human level
aquired by 15 minutes.
Change of paradigm
Key differences
• Role of subject matter expert (SME) changes – using our tool, SME trains/provides the logic to improve the model.
• SME is integral to the success, needs to be empowered and have the tools to do their work better.
• The "one and done" approach is not flexible/doesn't allow for market changes nor incremental knowledge.
MACHINE TEACHING
MACHINE LEARNING
Iterative machine teaching process
For use cases such as:
• Email/text/document
classification
• Email/text/document
anonymization
• Entity extraction
SME
DATA SCIENTIST
Initial
model
training
Models could be
regularly monitored
by SME
AUTOMATION
WORKFLOW
CLASSIFICATION
MODEL
Tools for model quality inspection
Machine Teaching
administration tools
help business users
and ML Power Users
control classification
model quality.
Demo 1
Monitoring of the trained model and quality check
Finding quick wins
ROUTINE OPERATIONS
• RPA Robots
• Machine Learning
• OCR + Data Extraction
REPETITIVE
COGNITIVE TASKS
COLLABORATION
WORKFLOWS
• Machine Learning
• Rule Engines • Interactive applications
• Workflows
• Data Enrichment
DECISION MAKING
AND SUPPORT
• AI
• Process Mining
• Document classification
• Automation translation
• Collaboration apps
• Approvals
• Case management
• Prioritization of work
• Analysis
• Data Extraction
• Copy data
• Enter data
• Sort documents
FREE UP PEOPLE TIME SUPPORT SHIFT TO DIGITAL OPERATING MODEL
TASK AUTOMATION WORKFLOW AUTOMATION DECISION AUTOMATION
• Machine Learning
Natural
Language
based
use-cases
Automation of the document flow
DOCUMENT LIFE-CYCLE
RECEIVE
DOCUMENT
PROCESS
DOCUMENT
ARCHIVING
• Highly manual
• Need decision making, involving knowledge worker to do manual tasks
• Text comprehension (SME)
• Fraught with errors
• Difficult to research/go back, to find things
• Time consuming
• Not possible when scale is large
Automation of the document flow
DOCUMENT LIFE-CYCLE
RECEIVE
DOCUMENT
SAVE
DOCUMENT
EXTRA
META DATA
Document and
form recognition
using OCR
• Be physical
document or
email or video
any format can
be input
DOCUMENT
ROUTING
DOCUMENT
PUBLISHING
ARCHIVING
Meta-data
extraction
• Key fields
• Key words
• In text
• e.g. PO #
• Subject
• Amount
Document
classification
and routing e.g.
• Classification
• Invoice routing
• Approval,
level, direct
forward to
finance
Document
anonymization
and masking e.g.
• Bank and
personal info
Document
archiving by
classification e.g.
• Archiving based
on classification: in
cloud or on-site
HOT WARM COLD
Information extraction from documents
CLIENT RECORD
CLIENT RELATED
DOCUMENTS AND KEYWORDS DATA
EXTRACTION
MODEL
*Client information is
automatically enriched.
Company: BNP Paribas*
Title:
Name: Tom Barnes*
Other information:
Meta data
Retrieving information from text and documents helps you to obtain data
that can and must supplement the information of certain accounting cards.
DATA
EXTRACTION
CONFIG
Document classification
Email
• By title
• By object
• By action
Mail
• A
• B
• C
Contract
Application
Invoice
CLASS CATALOG
The document class is
determined by the
organizational
documents/text
classification catalog
Training of document class catalogues and document
classification algorithm by business user – specialist
The classification of documents results in the identification of the document class and can use it to
further process the document, usually routing and storing it in an appropriate document process.
DOCUMENT
CLASSIFICATION
MODEL
DOCUMENT
CLASS
Document anonymization
The purpose of anonymizing documents/text is to cover all or part
of sensitive information prior to publication of the document/text.
Person 1, social security number,
living, Address 1, closed contract
Contact Number, with Company
ABC for Contract Title
DATA
MASKING
George Bennett,
Social security number
989384843*****, address,
***** street *
DOCUMENT
ANONYMIZATION
ANONYMIZATED
DOCUMENT
ANONYMIZATED
TEXT
ANONYMIZATION
CONFIGURATION
ANONYMIZATED
DOCUMENT
Machine
Teaching (MT)
Tool
Machine
Learning (ML)
Solution
Technical architecture
SET UP AS AN INTERNAL TOOL
ON-PREMISE SERVICE
MODULAR STRUCTURE
SME/IT:
• MT Tool with UI for training & analytics
• Custom integrations possible
• Endpoints for production scenario provided
Data Scientist:
• ML model exchangeable - has standardized endpoints
• Data and integration – as needed
PYTHON
CAN BE INTEGRATED INTO EXISTING WORKFLOW MODEL
MT DATABASE
MODEL DATA
ML SOLUTION
MT SERVICE
CUSTOM
INTEGRATION &
PRODUCTION
MT TOOL
Machine teaching process
PERS LOC ORG DATE PROD
LABELING
CHANGES
VERIFICATION
TRAINING/
RETRAINING
Demo 2
Text classification example
How to get most value from machine
teaching for Natural Language
Processing (NLP) and Non-NLP models
IN NLP MODELS
• Objective – Create
Optimal data set for best
known optimal models
• Use for data labeling
and data preparation
IN NON-NLP MODELS
• Objective – Find
optimal/best model with
existing data
• Use for model monitoring
and verification
Machine Learning is the subfield of
computer science that, according to Arthur
Samuel in 1959, gives “computers the ability
to learn without being explicitly programmed.”
NLP
Artificial
Intelligence
Machine
Learning
Statistics
Let's stay connected
Contact us at info@emergn.com
for an individual demo.
Follow us: @emergn
MUNTIS RUDZITIS
Lead Data Scientist
at Emergn
https://www.linkedin.com/in/muntis-rudzitis/
ARIADNA KRAMKOVSKA
Machine Learning Developer
at Emergn
https://www.linkedin.com/in/ariadnakramkovska/

More Related Content

PDF
Machine learning specialist ver#4
PPTX
Step by step AI Day 3: AI Technologies
PPTX
DU Series - Day 4.pptx
PDF
Productionising Machine Learning Models
PPTX
Machine learning at scale - Webinar By zekeLabs
PDF
Building a Scalable and reliable open source ML Platform with MLFlow
DOC
Padmini parmar
DOC
Padmini Parmar
Machine learning specialist ver#4
Step by step AI Day 3: AI Technologies
DU Series - Day 4.pptx
Productionising Machine Learning Models
Machine learning at scale - Webinar By zekeLabs
Building a Scalable and reliable open source ML Platform with MLFlow
Padmini parmar
Padmini Parmar

Similar to Machine Teaching for workflow automation RIGA COMM 2020 (20)

PPTX
Design Like a Pro: Machine Learning Basics
PPTX
AI and Automation Implementation Roadmap.pptx
PPTX
Python for Machine Learning_ A Comprehensive Overview.pptx
PPTX
Global M365 Developer Bootcamp 2020 Hyderabad: KEYNOTE
PPTX
Choosing the right IDP Solution
PDF
TAUS Machine Translation Showcase, The Simplified Guide to Getting Started in...
PDF
Challenges of Operationalising Data Science in Production
PPTX
(Recent) technology trends and bridges to gap in the localization industry
PDF
EIS-Webinar-Agent-Approaches-2024-08-21.pdf
PPTX
Open, Secure & Transparent AI Pipelines
DOC
PPTX
Airtel-BML
PPTX
Design Like a Pro: Machine Learning Basics
PPTX
Machine Learning - Intro from Microsoft Partner University
DOC
Ipsita_Informatica_9Year
PDF
How I became ML Engineer
PPTX
GenerativeAI and Automation - IEEE ACSOS 2023.pptx
PPT
Digitization solutions
PPTX
An AI Maturity Roadmap for Becoming a Data-Driven Organization
PPTX
Practical Applications of AI: Real World Examples
Design Like a Pro: Machine Learning Basics
AI and Automation Implementation Roadmap.pptx
Python for Machine Learning_ A Comprehensive Overview.pptx
Global M365 Developer Bootcamp 2020 Hyderabad: KEYNOTE
Choosing the right IDP Solution
TAUS Machine Translation Showcase, The Simplified Guide to Getting Started in...
Challenges of Operationalising Data Science in Production
(Recent) technology trends and bridges to gap in the localization industry
EIS-Webinar-Agent-Approaches-2024-08-21.pdf
Open, Secure & Transparent AI Pipelines
Airtel-BML
Design Like a Pro: Machine Learning Basics
Machine Learning - Intro from Microsoft Partner University
Ipsita_Informatica_9Year
How I became ML Engineer
GenerativeAI and Automation - IEEE ACSOS 2023.pptx
Digitization solutions
An AI Maturity Roadmap for Becoming a Data-Driven Organization
Practical Applications of AI: Real World Examples

Recently uploaded (20)

PDF
CapCut Video Editor 6.8.1 Crack for PC Latest Download (Fully Activated) 2025
PPTX
AMADEUS TRAVEL AGENT SOFTWARE | AMADEUS TICKETING SYSTEM
PDF
iTop VPN Crack Latest Version Full Key 2025
PDF
Navsoft: AI-Powered Business Solutions & Custom Software Development
PDF
Adobe Illustrator 28.6 Crack My Vision of Vector Design
PDF
Adobe Premiere Pro 2025 (v24.5.0.057) Crack free
PPTX
Embracing Complexity in Serverless! GOTO Serverless Bengaluru
PDF
Tally Prime Crack Download New Version 5.1 [2025] (License Key Free
PPTX
Advanced SystemCare Ultimate Crack + Portable (2025)
PPTX
Monitoring Stack: Grafana, Loki & Promtail
PDF
Download FL Studio Crack Latest version 2025 ?
PDF
AutoCAD Professional Crack 2025 With License Key
PPTX
Reimagine Home Health with the Power of Agentic AI​
PPTX
CHAPTER 2 - PM Management and IT Context
PDF
Designing Intelligence for the Shop Floor.pdf
PDF
Complete Guide to Website Development in Malaysia for SMEs
PPTX
history of c programming in notes for students .pptx
PDF
AI-Powered Threat Modeling: The Future of Cybersecurity by Arun Kumar Elengov...
DOCX
Greta — No-Code AI for Building Full-Stack Web & Mobile Apps
PPTX
Log360_SIEM_Solutions Overview PPT_Feb 2020.pptx
CapCut Video Editor 6.8.1 Crack for PC Latest Download (Fully Activated) 2025
AMADEUS TRAVEL AGENT SOFTWARE | AMADEUS TICKETING SYSTEM
iTop VPN Crack Latest Version Full Key 2025
Navsoft: AI-Powered Business Solutions & Custom Software Development
Adobe Illustrator 28.6 Crack My Vision of Vector Design
Adobe Premiere Pro 2025 (v24.5.0.057) Crack free
Embracing Complexity in Serverless! GOTO Serverless Bengaluru
Tally Prime Crack Download New Version 5.1 [2025] (License Key Free
Advanced SystemCare Ultimate Crack + Portable (2025)
Monitoring Stack: Grafana, Loki & Promtail
Download FL Studio Crack Latest version 2025 ?
AutoCAD Professional Crack 2025 With License Key
Reimagine Home Health with the Power of Agentic AI​
CHAPTER 2 - PM Management and IT Context
Designing Intelligence for the Shop Floor.pdf
Complete Guide to Website Development in Malaysia for SMEs
history of c programming in notes for students .pptx
AI-Powered Threat Modeling: The Future of Cybersecurity by Arun Kumar Elengov...
Greta — No-Code AI for Building Full-Stack Web & Mobile Apps
Log360_SIEM_Solutions Overview PPT_Feb 2020.pptx

Machine Teaching for workflow automation RIGA COMM 2020

  • 1. Machine Teaching for workflow automation Ariadna Kramkovska Muntis Rudzitis RIGA COMM 2020
  • 2. Agenda • Introductions • What is machine teaching? • Benefits, our unique tool, use cases • Change of Paradigm • Demo 1: trained model • Machine teaching quick wins • Automation, document extraction • Demo 2: document classification
  • 3. Ariadna Kramkovska Machine Learning Developer at Emergn Muntis Rudzitis Lead Data Scientist at Emergn Presenters
  • 4. Emergn’s Machine Learning Lab Our team's core competencies: • Business understanding of how to profit from ML • Machine learning development • Deep learning and reinforcement learning • Data visualization • Algorithms and ML techniques • Data processing, cleaning and preparation OUR TECHNOLOGY STACK We have established and run the largest ML community in the Baltics with 1260 field experts as members. All these activities currently makes our company the No. 1 Choice for young and experience ML talents. Machine learning models Tools and programming languages: • Python • TensorFlow, TensorBoard • R ,OpenCV, Caffe2 • KNIME • Azure Machine Learning Studio • C, C++ Deployment techniques Platforms and environments: • Stand alone models • SAP Hana2 • Microsoft Azure (Cortana intelligences suite) • SQL Server We are partnering with GDEXA to help enable the young generation with highly demanded skills like applied AI/ML, Big Data Analytics and Cloud Applications.
  • 5. What is machine teaching? Why use it?
  • 6. Business automation challenge From automation with replacement of humans to augmentation and empowerment of subject matter experts. We predict that companies who use augmented automation technologies to empower their environments and educate their people on how to use them for better, more predictable outcomes, will win by providing the best service and building better products. • Wheels for the mind • Find a comfortable level of automation
  • 7. Why are we looking for machine learning (ML) alternatives? ML for automation and workflows should: • Be transparent and interactive for business users • Include natural language-based solutions where humans have better comprehension • Understand context and learning from smaller data sets • ML models should be verified and monitored AI director at Facebook YANN LECUN ATARI GAME Self-driving cars need millions of hours of training to reach human level trained in about 20 hours. In 80 hours machine will reach human level aquired by 15 minutes.
  • 8. Change of paradigm Key differences • Role of subject matter expert (SME) changes – using our tool, SME trains/provides the logic to improve the model. • SME is integral to the success, needs to be empowered and have the tools to do their work better. • The "one and done" approach is not flexible/doesn't allow for market changes nor incremental knowledge. MACHINE TEACHING MACHINE LEARNING
  • 9. Iterative machine teaching process For use cases such as: • Email/text/document classification • Email/text/document anonymization • Entity extraction SME DATA SCIENTIST Initial model training Models could be regularly monitored by SME AUTOMATION WORKFLOW CLASSIFICATION MODEL
  • 10. Tools for model quality inspection Machine Teaching administration tools help business users and ML Power Users control classification model quality.
  • 11. Demo 1 Monitoring of the trained model and quality check
  • 12. Finding quick wins ROUTINE OPERATIONS • RPA Robots • Machine Learning • OCR + Data Extraction REPETITIVE COGNITIVE TASKS COLLABORATION WORKFLOWS • Machine Learning • Rule Engines • Interactive applications • Workflows • Data Enrichment DECISION MAKING AND SUPPORT • AI • Process Mining • Document classification • Automation translation • Collaboration apps • Approvals • Case management • Prioritization of work • Analysis • Data Extraction • Copy data • Enter data • Sort documents FREE UP PEOPLE TIME SUPPORT SHIFT TO DIGITAL OPERATING MODEL TASK AUTOMATION WORKFLOW AUTOMATION DECISION AUTOMATION • Machine Learning Natural Language based use-cases
  • 13. Automation of the document flow DOCUMENT LIFE-CYCLE RECEIVE DOCUMENT PROCESS DOCUMENT ARCHIVING • Highly manual • Need decision making, involving knowledge worker to do manual tasks • Text comprehension (SME) • Fraught with errors • Difficult to research/go back, to find things • Time consuming • Not possible when scale is large
  • 14. Automation of the document flow DOCUMENT LIFE-CYCLE RECEIVE DOCUMENT SAVE DOCUMENT EXTRA META DATA Document and form recognition using OCR • Be physical document or email or video any format can be input DOCUMENT ROUTING DOCUMENT PUBLISHING ARCHIVING Meta-data extraction • Key fields • Key words • In text • e.g. PO # • Subject • Amount Document classification and routing e.g. • Classification • Invoice routing • Approval, level, direct forward to finance Document anonymization and masking e.g. • Bank and personal info Document archiving by classification e.g. • Archiving based on classification: in cloud or on-site HOT WARM COLD
  • 15. Information extraction from documents CLIENT RECORD CLIENT RELATED DOCUMENTS AND KEYWORDS DATA EXTRACTION MODEL *Client information is automatically enriched. Company: BNP Paribas* Title: Name: Tom Barnes* Other information: Meta data Retrieving information from text and documents helps you to obtain data that can and must supplement the information of certain accounting cards. DATA EXTRACTION CONFIG
  • 16. Document classification Email • By title • By object • By action Mail • A • B • C Contract Application Invoice CLASS CATALOG The document class is determined by the organizational documents/text classification catalog Training of document class catalogues and document classification algorithm by business user – specialist The classification of documents results in the identification of the document class and can use it to further process the document, usually routing and storing it in an appropriate document process. DOCUMENT CLASSIFICATION MODEL DOCUMENT CLASS
  • 17. Document anonymization The purpose of anonymizing documents/text is to cover all or part of sensitive information prior to publication of the document/text. Person 1, social security number, living, Address 1, closed contract Contact Number, with Company ABC for Contract Title DATA MASKING George Bennett, Social security number 989384843*****, address, ***** street * DOCUMENT ANONYMIZATION ANONYMIZATED DOCUMENT ANONYMIZATED TEXT ANONYMIZATION CONFIGURATION ANONYMIZATED DOCUMENT
  • 18. Machine Teaching (MT) Tool Machine Learning (ML) Solution Technical architecture SET UP AS AN INTERNAL TOOL ON-PREMISE SERVICE MODULAR STRUCTURE SME/IT: • MT Tool with UI for training & analytics • Custom integrations possible • Endpoints for production scenario provided Data Scientist: • ML model exchangeable - has standardized endpoints • Data and integration – as needed PYTHON CAN BE INTEGRATED INTO EXISTING WORKFLOW MODEL MT DATABASE MODEL DATA ML SOLUTION MT SERVICE CUSTOM INTEGRATION & PRODUCTION MT TOOL
  • 19. Machine teaching process PERS LOC ORG DATE PROD LABELING CHANGES VERIFICATION TRAINING/ RETRAINING
  • 21. How to get most value from machine teaching for Natural Language Processing (NLP) and Non-NLP models IN NLP MODELS • Objective – Create Optimal data set for best known optimal models • Use for data labeling and data preparation IN NON-NLP MODELS • Objective – Find optimal/best model with existing data • Use for model monitoring and verification Machine Learning is the subfield of computer science that, according to Arthur Samuel in 1959, gives “computers the ability to learn without being explicitly programmed.” NLP Artificial Intelligence Machine Learning Statistics
  • 22. Let's stay connected Contact us at info@emergn.com for an individual demo. Follow us: @emergn MUNTIS RUDZITIS Lead Data Scientist at Emergn https://www.linkedin.com/in/muntis-rudzitis/ ARIADNA KRAMKOVSKA Machine Learning Developer at Emergn https://www.linkedin.com/in/ariadnakramkovska/