Using Azure Cognitive Search to Dive into the CIA Archives
Azure AI + Data
Intelligent Apps + Agents
• Azure Bot Service
• Azure Cognitive Services
Machine Learning
• Azure Databricks
• Azure Machine Learning
Knowledge Mining
• Azure Cognitive Search
Knowledge mining + Azure Cognitive Search
Knowledge Mining
• The intelligent act of creating actionable, structured information from large volumes of unstructured data such as text, media and images
• Use of AI to help automate processes, ensure safety or improve search
Azure Cognitive Search
• The AI features for knowledge mining in Azure Search
• Enrichment of content in the indexing pipeline through skills such as image analysis, OCR and text analysis
How Azure Cognitive Search works
1 Ingest
• Data source: Blob Storage, Table Storage, Cosmos DB, SQL DB
• Document formats: CSV, Office, EML, HTML, JSON, PDF, RTF, TXT, XML, ZIP
• Indexer: cracked documents
2 Enrich
• Skills: Built-in, Custom
• Enriched documents
3 Explore
• Index: Searchable, Orderable, Filterable, Facetable
• Query: Azure Search SDK, Azure Search API
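The index attributes listed above correspond to per-field flags in an index definition. The following is a minimal sketch in Python that builds such a definition as JSON. The index name `cia-archives`, the `id` key field and the choice of flags per field are hypothetical; the remaining field names are taken from the President's Daily Reports example in this deck. The slide says "Orderable", while the REST index schema calls the same attribute `sortable`.

```python
import json

# Attributes an index field can carry; the slide's "Orderable" is "sortable" here.
FLAGS = ("searchable", "filterable", "sortable", "facetable")


def field(name, type_="Edm.String", key=False, **flags):
    """Build one field definition; any flag not passed defaults to False."""
    unknown = set(flags) - set(FLAGS)
    if unknown:
        raise ValueError(f"unknown field attributes: {sorted(unknown)}")
    definition = {"name": name, "type": type_, "key": key}
    for flag in FLAGS:
        definition[flag] = bool(flags.get(flag, False))
    return definition


index = {
    "name": "cia-archives",  # hypothetical index name
    "fields": [
        field("id", key=True),  # hypothetical document key
        field("metadata_storage_path", filterable=True, sortable=True),
        field("merged_text", searchable=True),
        # Entity fields are collections, so they are faceted and filtered, not sorted.
        field("persons", "Collection(Edm.String)",
              searchable=True, filterable=True, facetable=True),
        field("locations", "Collection(Edm.String)",
              searchable=True, filterable=True, facetable=True),
        field("organizations", "Collection(Edm.String)",
              searchable=True, filterable=True, facetable=True),
    ],
}

if __name__ == "__main__":
    print(json.dumps(index, indent=2))
```

Defaulting every flag to False keeps the definition explicit: each field states exactly which query features it supports.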
Built-in skills
For preset models
Cognitive skills
• Image analysis
• Printed OCR
• Handwritten OCR
• Language detection
• Entity linking
• Named entity recognition
• Key phrase analysis
• Sentiment analysis
Utility skills
• Merger
• Splitter
• Shaper
Custom skills
For bring-your-own APIs
Request:
POST /api/count HTTP/1.1
Host: <my-api-app>.azurewebsites.net
Content-Type: application/json
x-functions-key: <my-api-key>

{
  "values": [
    {
      "recordId": "0",
      "data": {
        "text": "Este es un contrato en Inglés",
        "language": "es",
        "countOf": "words"
      }
    }
  ]
}
Response:
HTTP/1.1 200 OK
Content-Length: 153
Content-Type: application/json

{
  "values": [
    {
      "recordId": "0",
      "data": {
        "count": 6
      },
      "errors": [],
      "warnings": []
    }
  ]
}
Notes:
• POST or PUT, HTTPS only
• The batch size
• Any HTTP headers
• Data can be shaped
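The request and response above can be served by a small handler. Below is a minimal sketch in Python of the body-processing part only, assuming the `values` / `recordId` / `data` / `errors` / `warnings` envelope shown on the slide. The function name `handle` and the error wording are hypothetical, and the HTTP hosting (for example an Azure Function) is left out.

```python
import json


def handle(body):
    """Process one custom-skill batch and return one result per input record.

    Each output record echoes the input "recordId" so the caller can match
    results to inputs, even when several records arrive in one batch.
    """
    results = []
    for record in body.get("values", []):
        out = {
            "recordId": record.get("recordId"),
            "data": {},
            "errors": [],
            "warnings": [],
        }
        data = record.get("data") or {}
        text = data.get("text")
        count_of = data.get("countOf", "words")

        if not isinstance(text, str):
            out["errors"].append({"message": "Input 'text' is missing or not a string."})
        elif count_of != "words":
            # Only word counting is sketched here; other modes are reported as errors.
            out["errors"].append({"message": f"Unsupported countOf value: {count_of}"})
        else:
            # Whitespace tokenisation is a simplification; "language" is ignored.
            out["data"]["count"] = len(text.split())
        results.append(out)
    return {"values": results}


if __name__ == "__main__":
    request = {
        "values": [
            {
                "recordId": "0",
                "data": {
                    "text": "Este es un contrato en Inglés",
                    "language": "es",
                    "countOf": "words",
                },
            }
        ]
    }
    print(json.dumps(handle(request), ensure_ascii=False, indent=2))
```

Reporting a failure inside the record's `errors` list, instead of failing the whole HTTP call, lets the other records in the same batch still succeed.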
Example: President’s Daily Reports
Source: PDF documents, PDF pages, one image (.TIFF) per page
Custom skill: PDF pages (PDF, JPEG, hOCR)
Stages: Cracked documents, OCR skill, Entity Recognition skill, Custom skill, Enriched documents, Index, Query
OCR skill:
{
  "algorithm": "printed"
}
Entity Recognition skill:
{
  "categories": [
    "locations",
    "organizations",
    "persons"
  ]
}
Enriched document fields:
• metadata_storage_path
• content
• merged_text
• hocr_data
• locations
• organizations
• persons
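The stages in this example could be wired together in a skillset definition along the following lines. This is a rough sketch in Python, not the deck's actual configuration. The `@odata.type` identifiers and input/output names follow the REST skillset format as commonly documented and should be checked against the current reference, since the slide shows preview-era parameters. The skillset name, the custom skill URI, its `ocrData` input and the inclusion of a merge step are assumptions made for illustration.

```python
import json


def skill(odata_type, context, inputs, outputs, **params):
    """Build one skill entry: inputs map name -> source path, outputs map name -> targetName."""
    return {
        "@odata.type": odata_type,
        "context": context,
        "inputs": [{"name": n, "source": s} for n, s in inputs.items()],
        "outputs": [{"name": n, "targetName": t} for n, t in outputs.items()],
        **params,
    }


skillset = {
    "name": "daily-reports-skillset",  # hypothetical name
    "skills": [
        # OCR over each page image extracted from the document.
        skill("#Microsoft.Skills.Vision.OcrSkill",
              "/document/normalized_images/*",
              {"image": "/document/normalized_images/*"},
              {"text": "text", "layoutText": "layoutText"}),
        # Merge OCR text back into the document text (assumed step, yields merged_text).
        skill("#Microsoft.Skills.Text.MergeSkill",
              "/document",
              {"text": "/document/content",
               "itemsToInsert": "/document/normalized_images/*/text",
               "offsets": "/document/normalized_images/*/contentOffset"},
              {"mergedText": "merged_text"}),
        # Entity recognition over the merged text.
        skill("#Microsoft.Skills.Text.EntityRecognitionSkill",
              "/document",
              {"text": "/document/merged_text"},
              {"persons": "persons",
               "locations": "locations",
               "organizations": "organizations"},
              categories=["Person", "Location", "Organization"]),
        # Custom Web API skill producing hOCR; URI and input name are hypothetical.
        skill("#Microsoft.Skills.Custom.WebApiSkill",
              "/document",
              {"ocrData": "/document/normalized_images/*/layoutText"},
              {"hocr_data": "hocr_data"},
              uri="https://<my-api-app>.azurewebsites.net/api/hocr",
              httpMethod="POST",
              batchSize=1,
              httpHeaders={"x-functions-key": "<my-api-key>"}),
    ],
}

# Names the skills add to the enriched document, in pipeline order.
produced = [o["targetName"] for s in skillset["skills"] for o in s["outputs"]]

if __name__ == "__main__":
    print(json.dumps(skillset, indent=2))
    print(produced)
```

Each skill's `targetName` becomes a node in the enriched document, which is how later skills and the index field mappings can refer to `merged_text`, `hocr_data` and the entity lists.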
Pricing
• A preview feature of the Azure Search service in 15 regions including Australia East
• Free for a small workload
• Can attach to an Azure Cognitive Services resource for a larger workload
• Built-in skills are charged at the Azure Cognitive Search pay-as-you-go price
Resources
Demos
• JFK Files
https://www.ailab.microsoft.com/experiments/jfk-files
Labs
• Knowledge Mining Bootcamp
https://azure.github.io/LearnAI-KnowledgeMiningBootcamp/
Videos
• AI and Azure Search for knowledge mining
https://azure.microsoft.com/en-gb/resources/videos/azuresearchcognitivesearchmicrosoftignite2018/