Computer Vision
Using Computer Vision Activities in UiPath
Computer Vision
• The AI Computer Vision pack contains refactored versions of
fundamental UIAutomation activities such as Click, Type Into, and Get
Text.
• The main difference between the CV activities and their classic
counterparts is that they use the Computer Vision neural network
developed in-house by the UiPath Machine Learning department.
Computer Vision - Neural Networks
• A neural network is a series of algorithms that tries
to recognize underlying relationships in a set of data through
a process that mimics the way the human brain operates.
• The CV activities send images of the window you are
automating to the neural network, where they are analyzed and all UI
elements are identified and labeled according to their type.
• The neural network is able to identify UI elements such as buttons,
text input fields, or check boxes without the use of selectors.
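Selector-free targeting can be sketched in a few lines of Python. This is only an illustration of the idea, not UiPath's implementation or API: the `Detection` record, the label names, and the `find_element` helper are all hypothetical, and the detections are hard-coded where a real system would receive them from the neural network.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Detection:
    """One UI element reported by a (hypothetical) vision model."""
    label: str   # element type, e.g. "button", "input", "checkbox"
    text: str    # text read on or near the element, may be empty
    x: int       # left edge of the bounding box, in pixels
    y: int       # top edge of the bounding box, in pixels
    width: int
    height: int


def find_element(detections: List[Detection], label: str, text: str) -> Optional[Detection]:
    """Return the first detection with the given label and matching text.

    The text comparison ignores case and surrounding whitespace.
    """
    wanted = text.strip().lower()
    for det in detections:
        if det.label == label and det.text.strip().lower() == wanted:
            return det
    return None


# Hard-coded stand-in for the model output on a login window (assumed data).
screen = [
    Detection("input", "User name", 40, 60, 200, 24),
    Detection("input", "Password", 40, 100, 200, 24),
    Detection("checkbox", "Remember me", 40, 140, 16, 16),
    Detection("button", "Sign in", 40, 180, 80, 28),
]

# The element is described by what it is and what it says, not by a selector.
target = find_element(screen, "button", "sign in")
print(target)
```

The lookup relies only on the element type and its visible text, which is why this style of targeting still works when the application exposes no usable selectors.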
Where Are CV Activities Used?
• Created mainly for automation in virtual desktop environments, such
as Citrix machines and other VDIs.
• They bypass the issue of nonexistent or unreliable selectors.
Prerequisites
• The UiPath.UIAutomation.Activities package must be installed.
• UiPath v18.3 or higher is required to use the activities.
Computer Vision Activities
• CV Screen Scope
• CV Click
• CV Type Into
• CV Get Text
• CV Element Exists
• CV Highlight
• CV Hover
• CV Refresh
CV Screen Scope
• Initializes the UiPath Computer Vision neural network, analyzes the
indicated window, and provides a scope for all
subsequent Computer Vision activities.
• Lets you select which OCR engine to use for scraping
the text in the target application.
• The default OCR engine used for this activity is Microsoft OCR.
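The scope idea (analyze the window once, then let every nested activity work against that result) can be sketched with a Python context manager. This is a conceptual model under stated assumptions, not how UiPath implements the activity: `ScreenScope`, `screen_scope`, and `fake_analyze` are hypothetical names, and the analyzer returns canned data instead of calling a neural network.

```python
from contextlib import contextmanager
from typing import Callable, Dict, Iterator, List

Element = Dict[str, str]


class ScreenScope:
    """Holds the analysis result shared by every activity inside the scope."""

    def __init__(self, analyze: Callable[[], List[Element]]) -> None:
        self._analyze = analyze
        self.analysis_count = 0
        self.elements: List[Element] = []
        self.refresh()  # initial analysis when the scope opens

    def refresh(self) -> None:
        """Re-run the analysis, for example after the screen has changed."""
        self.elements = self._analyze()
        self.analysis_count += 1

    def element_exists(self, label: str, text: str) -> bool:
        """Check the cached analysis; this does not trigger a new one."""
        return any(e["label"] == label and e["text"] == text for e in self.elements)


@contextmanager
def screen_scope(analyze: Callable[[], List[Element]]) -> Iterator[ScreenScope]:
    """Open a scope; nested code reuses the same analysis until refresh()."""
    yield ScreenScope(analyze)


# Fake analyzer (assumed data): first a login form, then a home page.
_frames = iter([
    [{"label": "button", "text": "Sign in"}],
    [{"label": "button", "text": "Log out"}],
])


def fake_analyze() -> List[Element]:
    return next(_frames)


with screen_scope(fake_analyze) as scope:
    before = scope.element_exists("button", "Sign in")  # uses the initial analysis
    stale = scope.element_exists("button", "Log out")   # still the old analysis
    scope.refresh()                                      # screen changed: analyze again
    after = scope.element_exists("button", "Log out")

print(before, stale, after, scope.analysis_count)
```

In this model the explicit `refresh()` call plays the role that CV Refresh plays in the activity list: nothing inside the scope sees a changed screen until a new analysis is requested.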
CV Screen Scope - Key Properties
• API Key: The API key used for authenticating to the Computer Vision
server.
• URL: The URL of the server that runs the Computer Vision service.
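To make the roles of the two properties concrete, here is a rough sketch of how a client could combine a server URL and an API key into an authenticated request. Everything about the wire format is an assumption made for illustration: the `Authorization: Bearer` header, the JSON body with a base64 `image` field, and the `https://cv.example.com/analyze` address are hypothetical and do not describe the actual UiPath Computer Vision protocol. The request is only built, never sent.

```python
import base64
import json
from urllib.parse import urlparse
from urllib.request import Request


def build_analysis_request(url: str, api_key: str, screenshot_png: bytes) -> Request:
    """Build (but do not send) a POST request carrying a screenshot.

    Header name and payload shape are assumptions for illustration only.
    """
    parsed = urlparse(url)
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError("URL must be an absolute http(s) address")
    if not api_key:
        raise ValueError("API key must not be empty")

    payload = {"image": base64.b64encode(screenshot_png).decode("ascii")}
    request = Request(url, data=json.dumps(payload).encode("utf-8"), method="POST")
    request.add_header("Authorization", f"Bearer {api_key}")  # hypothetical scheme
    request.add_header("Content-Type", "application/json")
    return request


# Placeholder values; a real workflow would read these from the scope properties.
req = build_analysis_request("https://cv.example.com/analyze", "demo-key", b"\x89PNG")
print(req.get_method(), req.full_url)
```

Validating both values before any network call gives an early, readable error when either property is missing or malformed.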
Example
CV Click
• Clicks a specified UI element, which is targeted by using
the UiPath Computer Vision neural network.
• Similar to the basic UiPath Click activity.
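Once an element has been located as a bounding box, a click reduces to choosing a point inside that box. The sketch below shows one plausible way to do this, clicking the center with an optional offset. It is an assumption made for illustration, not a description of how CV Click picks its point; `click_point` and the `(x, y, width, height)` box layout are hypothetical.

```python
from typing import Tuple

Box = Tuple[int, int, int, int]  # (x, y, width, height) in screen pixels


def click_point(box: Box, offset: Tuple[int, int] = (0, 0)) -> Tuple[int, int]:
    """Return the center of the box, shifted by an optional (dx, dy) offset."""
    x, y, width, height = box
    return (x + width // 2 + offset[0], y + height // 2 + offset[1])


# An 80x28 button whose top-left corner is at (40, 180).
print(click_point((40, 180, 80, 28)))
print(click_point((40, 180, 80, 28), offset=(5, -3)))
```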
Example
References
• https://docs.uipath.com/activities/docs/computer-vision-cv
• https://www.youtube.com/watch?v=DRjvbtsdbdM
THANK YOU
