Working with Data
Eng Teong Cheah
Microsoft MVP
Working with Datastores
In Azure Machine Learning, datastores are abstractions for cloud data sources. They
encapsulate the information required to connect to data sources. You can access
datastores directly in code by using the Azure Machine Learning SDK.
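As a rough illustration of accessing datastores in code, the sketch below lists the datastores attached to a workspace and reports which one is the default. It assumes the Python SDK v1 (the azureml-core package), which the slides do not name explicitly; the helper name summarize_datastores is hypothetical.

```python
def summarize_datastores(ws):
    """Return (default_datastore_name, {datastore_name: datastore_type}).

    `ws` is assumed to be an azureml.core.Workspace (SDK v1). Only the
    `datastores` dictionary and `get_default_datastore()` are used.
    """
    default_name = ws.get_default_datastore().name
    types_by_name = {name: ds.datastore_type for name, ds in ws.datastores.items()}
    return default_name, types_by_name


# Usage (requires azureml-core and a workspace config.json on disk):
#   from azureml.core import Workspace
#   ws = Workspace.from_config()
#   print(summarize_datastores(ws))
```

The helper takes the workspace as a parameter so that the connection details stay in the workspace configuration rather than in the script.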
Types of Datastore
Azure Machine Learning supports the creation of datastores for multiple kinds of Azure
data source, including:
- Azure Storage (blob and file containers)
- Azure Data Lake stores
- Azure SQL Database
- Azure Databricks file system (DBFS)
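To make the first item in the list concrete, here is a minimal sketch of registering an Azure Storage blob container as a datastore. It assumes the Python SDK v1 (azureml-core), where the other source types have sibling methods on the same Datastore class. The helper name and the two environment variable names are hypothetical choices made for this example.

```python
import os


def register_blob_datastore(ws, datastore_name, container_name):
    """Register a blob container as a datastore in workspace `ws`.

    Assumes SDK v1 (azureml-core). The storage account name and key are
    read from environment variables; the variable names are hypothetical.
    """
    # Imported here so that the module loads even where azureml-core is absent.
    from azureml.core import Datastore

    return Datastore.register_azure_blob_container(
        workspace=ws,
        datastore_name=datastore_name,
        container_name=container_name,
        account_name=os.environ["STORAGE_ACCOUNT_NAME"],
        account_key=os.environ["STORAGE_ACCOUNT_KEY"],
    )
```

Reading the key from the environment keeps the secret out of source control; the datastore then stores the connection information so later code only needs the datastore name.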
Working with Datasets
Datasets are versioned, packaged data objects that can be easily consumed in
experiments and pipelines. Datasets are the recommended way to work with data, and are
the primary mechanism for advanced Azure Machine Learning capabilities like data
labeling and data drift monitoring.
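The versioning mentioned above can be sketched as follows, assuming the Python SDK v1 (azureml-core): registering a dataset under an existing name with create_new_version=True produces a new version, and a specific version can be retrieved later by name. The two helper names are hypothetical.

```python
def register_new_version(ws, dataset, name):
    """Register `dataset` under `name` and return the resulting version number.

    `dataset` is assumed to be an SDK v1 TabularDataset or FileDataset.
    If `name` is already registered, a new version is created.
    """
    registered = dataset.register(workspace=ws, name=name, create_new_version=True)
    return registered.version


def get_dataset_version(ws, name, version="latest"):
    """Fetch a registered dataset by name; `version` is 'latest' or an integer."""
    # Imported here so that the module loads even where azureml-core is absent.
    from azureml.core import Dataset

    return Dataset.get_by_name(ws, name=name, version=version)
```

Pinning an explicit version number in an experiment makes a run reproducible even after newer versions of the dataset are registered.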
Types of Datasets
Datasets are typically based on files in a datastore, though they can also be based on URLs and
other sources. You can create the following types of datasets:
- Tabular
The data is read from the dataset as a table. You should use this type of dataset when
your data is consistently structured and you want to work with it in common tabular
data structures, such as Pandas dataframes.
- File
The dataset presents a list of file paths that can be read as though from the file system.
Use this type of dataset when your data is unstructured, or when you need to process
the data at the file level (for example, to train a convolutional neural network from a set
of image files).
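The two dataset types can be sketched side by side, assuming the Python SDK v1 (azureml-core). The helper names and the path patterns in the usage comments are hypothetical; the datastore argument is any datastore object obtained from the workspace.

```python
def make_tabular_dataset(datastore, pattern):
    """Create a tabular dataset from delimited files (for example CSV) in a datastore."""
    # Imported here so that the module loads even where azureml-core is absent.
    from azureml.core import Dataset

    return Dataset.Tabular.from_delimited_files(path=(datastore, pattern))


def make_file_dataset(datastore, pattern):
    """Create a file dataset that exposes the paths of the matching files."""
    from azureml.core import Dataset

    return Dataset.File.from_files(path=(datastore, pattern))


# Usage (requires azureml-core and a workspace; the patterns are examples only):
#   ds = ws.get_default_datastore()
#   tab_ds = make_tabular_dataset(ds, "data/*.csv")
#   df = tab_ds.to_pandas_dataframe()    # structured data as a Pandas dataframe
#   file_ds = make_file_dataset(ds, "images/*.jpg")
#   paths = file_ds.to_path()            # list of file paths in the dataset
```

The tabular form suits consistently structured data that will be loaded into a dataframe, while the file form leaves parsing to the training code, as in the image example above.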
Demo
Work with Data
References
Microsoft Docs
