SlideShare a Scribd company logo
©All rights reserved. Not for reproduction, distribution or sale.
Improving Patent Analytics Using
Semantic and Machine Learning
Technologies
Anthony Trippe
Managing Director
Patinformatics, LLC
ii-SDV
October 5th, 2021
©All rights reserved. Not for reproduction, distribution or sale.
PATENT ANALYTICS USING
SEMANTIC AND MACHINE
LEARNING TECHNOLOGIES
www.patinformatics.com
©All rights reserved. Not for reproduction, distribution or sale.
3
• Taken from: Guidelines for Preparing Patent Landscape Reports
• http://www.wipo.int/edocs/pubdocs/en/wipo_pub_946.pdf
• Building a collection
• Conducting a patent search
• Determining relevance
• TIDYing the collection
• Patent family reduction
• Deciding on type of year to use
• Standardizing key fields
• Creating categories
• Steps in preparing a patent landscape report
www.patinformatics.com
©All rights reserved. Not for reproduction, distribution or sale.
4
• Analyzing the collection
• Building the models
• Looking for trends
• Visualizing the collection
• Charts & Graphs
• Network diagrams
• Spatial concept maps
• Sharing conclusions
• Steps in preparing a patent landscape report
www.patinformatics.com
©All rights reserved. Not for reproduction, distribution or sale.
5
• In searching during the preparation of a PLR, information retrieval methods usually look at precision
and recall simultaneously and measure their effectiveness looking at both elements
• Even though this is the case, precision and recall are normally opposed to one another such that
with an increase in recall there is usually a subsequent drop in the level of precision
• In generating collections for PLRs it might be more productive to begin with creating sets using
methods that produce high recall exclusive of precision
• Once an initial collection with high recall is built different methods can be used to increase the
precision of the collection by determining relevance of the families
• From a practical perspective, if the level of recall can be established at higher than 90%, while the
precision kept above 70%, then the likelihood of finding statistically relevant, but conceptually
irrelevant items in the subsequent analysis steps is reasonably small
• Patent Search/Determine Relevance
www.patinformatics.com
©All rights reserved. Not for reproduction, distribution or sale.
6
• Improving recall
• Cosine similarity
• Latent semantic analysis
• More like this / Practical scoring function
• https://cloudblog-withgoogle-com.cdn.ampproject.org/c/s/cloudblog.withgoogle.com/products/data-analytics/expanding-your-patent-
set-with-ml-and-bigquery/amp/
• Others?
• How does this really compare to Boolean/traditional patent queries? –will be discussed at EPOPIC
• Determining relevance – https://github.com/swh/classification-gold-standard
• Binary classification
• Support vector machines / vector space models
• Neural networks
• Others?
• Semantic Tools for Patent Search/Determine Relevance
www.patinformatics.com
©All rights reserved. Not for reproduction, distribution or sale.
7
• Field Cleanup
• Patent Assignee Standardization – fuzzy logic, rules-based
• disambiguation
• Family or Invention Reduction
• Reconciling Forward Citations
• Based on family reduction method
• Determining Reporting Year
• Using earliest publication year
• TIDYing the Collection
www.patinformatics.com
©All rights reserved. Not for reproduction, distribution or sale.
8
• Binary classification provides a means for categorizing large collections of patent documents into
the references that are likely to be of highest interest to the information professional, and those
that are likely not related, but were still retrieved in a broad search
• A training set will be made up of references that are highly relevant to the interests of the analyst
• In training the classifier, the analyst will need to identify documents that are off-topic as well, so the
classifier can establish a hyperplane that will distinguish between the two categories
• Technology categories are sometimes identified using the patent data itself, for instance, with
classification codes, but ideally, they should be generated based on input from a subject-matter
expert based on an industry standard view on how approaches are categorized
• Using a market or industry-based approach to creating categories will make it easier for the clients
of the PLR to identify with the technology
• Creating Categories for the Collection
©All rights reserved. Not for reproduction, distribution or sale.
SAMPLE PATENT VISUALIZATIONS
INVOLVING MACHINE LEARNING
www.patinformatics.com
©All rights reserved. Not for reproduction, distribution or sale.
• Spatial Concept Maps
10
• Patents that are similar to one another
based on their language are organized
close to one another on the map
• Relative distance between different
technical subjects show which concepts
are related to one another
• Labels are added to identify the sub-
sections within a technology field
• Colored dots are used to provide
comparisons within the context of the
map
• In this case, the dots provide a
means to compare different
companies in the space
• Colored dots can also be used for
distributions over time
www.patinformatics.com
©All rights reserved. Not for reproduction, distribution or sale.
• Patent Citation Network MapsTM
11
• When a patent is referenced in a future
patent it’s called a forward citation
• Patents that have a large number of
forward citations, especially from other
organizations can be considered
influential
• These citations can be aggregated by the
organizations that own the patents in
question
• Using a network diagram the connections
between organizations can be found, and
the most influential groups identified
• In this case, the HP and Northrup
portfolios are not as large as the ones for
D-Wave, and IBM, but they are very
influential
©All rights reserved. Not for reproduction, distribution or sale.
INTRODUCING
ML4PATENTS.COM
www.patinformatics.com
©All rights reserved. Not for reproduction, distribution or sale.
• Supporting and Additional Vendors
13
• Vendor pages
• 37 unique machine
learning for IP tools
from 31 vendors
• Detailed
information
provided on each
tool
• Ten primary and 21
secondary
categories index for
ease of data
retrieval on the
tools and vendors
www.patinformatics.com
©All rights reserved. Not for reproduction, distribution or sale.
• Blog Post Selection: Can an AI Get a Patent?
14
• From ML4Patents.com — 219 posts currently on the site
• When is There an “Actual Invention” Involving Computers?
• 2021: An AI Inventor Odyssey
• USPTO Releases Public Comments on AI
• The Time is Now: Opportunities to Advise the E.D. Va. or EPO as to Whether to Prohibit,
Permit, or Require Listing an AI Algorithm as an Inventor
• AI Magazine: What Happens When Artificial Intelligence Invents: Is the Invention
Patentable?
• Legal challenges have occurred in Australia, Europe, the United
Kingdom and the US
• In all cases patentability requires a human inventor and thus an AI system can’t be an
inventor
www.patinformatics.com
©All rights reserved. Not for reproduction, distribution or sale.
• Blog Post Selection: Is AI Patentable?
15
• From ML4Patents.com
• How to Safeguard AI Technology: Patents versus Trade Secrets
• Brazilian PTO publishes Guidelines for Examining Patent Applications Involving
Computer-Implemented Inventions
• European Patent Office Artificial Intelligence Page
• Public Views on AI and IP Policy from USPTO
• AI Initiatives at the JPO
• Patenting AI in the EPO Guidelines - The Best Practice Podcast
• UK Government Mulls Legislative Changes to Safeguard AI Inventions
• Absolutely — most countries are making special provisions
• Software patents used to mostly be strictly forbidden as abstract and not allowable
subject matter, but as the importance of machine learning has increased more and
more countries are posting guidelines on how to protect these inventions
www.patinformatics.com
©All rights reserved. Not for reproduction, distribution or sale.
• Webinar Series
16
• 4th episode is on Wednesday October 6th with Intellar from
Perception Partners
• Learn more and register at: https://www.ml4patents.com/webinars/episode-4-w-
perception-partners
• 1st three episodes have been a resounding success
• Relativity — providing a collection of methods to chose from
• Cipher — deep indexing in selected verticals with rich hierarchies
• Amplified — provided an excellent primer on modern ML for text retrieval methods
• Archive available at: https://www.ml4patents.com/webinars
www.patinformatics.com
©All rights reserved. Not for reproduction, distribution or sale.
Contact Us
+1.614.787.5237 info@patinformatics.com www.patinformatics.com
17

More Related Content

PDF
AI-SDV 2021: Nils Newmann - AI – Who is in control and why is that important?
PDF
AI-SDV 2020: Using Transformer technology to build an AI based personal News ...
PDF
AI-SDV 2021: Stefan Geissler - AI support for creating and maintaining vocabu...
PDF
AI-SDV 2020: Special Hypertext Information Treatment in is Special Hypertext ...
PDF
AI-SDV 2020: Delivering AIM™ Patent Landscapes for Competitive Intelligence –...
PDF
AI-SDV 2021 - Klaus Kater - The secret of successful CI: precise targeting + ...
PDF
AI-SDV 2021: Lucy Antunes; Muriel Bourgeois Tassanary - Project Management Ch...
PDF
AI-SDV 2021 - VantagePoint / Search Technolgy
AI-SDV 2021: Nils Newmann - AI – Who is in control and why is that important?
AI-SDV 2020: Using Transformer technology to build an AI based personal News ...
AI-SDV 2021: Stefan Geissler - AI support for creating and maintaining vocabu...
AI-SDV 2020: Special Hypertext Information Treatment in is Special Hypertext ...
AI-SDV 2020: Delivering AIM™ Patent Landscapes for Competitive Intelligence –...
AI-SDV 2021 - Klaus Kater - The secret of successful CI: precise targeting + ...
AI-SDV 2021: Lucy Antunes; Muriel Bourgeois Tassanary - Project Management Ch...
AI-SDV 2021 - VantagePoint / Search Technolgy

What's hot (20)

PDF
AI-SDV 2021 Biomax
PDF
ICIC 2013 Conference Proceedings Tony Trippe Patinformatics
PDF
AI-SDV 2021 - Deep SEARCH 9
PPTX
Utilising Open Source and Communities to Drive Innovation in a Cost-Effective...
PDF
AI-SDV 2021 - Marjorie Hlava - Semantic Search and Content Management – Case...
PDF
AI-SDV 2021: Angela Bauch - AILANI for clinical competitive landscaping
PDF
ICIC 2017: The Use of Patent Information for Innovation and Competitive Intel...
PDF
AI-SDV 2021 - Harald Jenny - Integrated Artificiel Intelligence – A Factory P...
PDF
II-SDV 2014 Automated Relevancy Check of Patents and Scientific Literature (K...
PDF
IC-SDV 2018: Aleksandar Kapisoda (Boehringer) Using Machine Learning for Auto...
PDF
New Product Introductions - LexisNexis
PDF
AI-SDV 2021: Linus Wretblad - Best practice on new intelligent tools in IP ma...
PDF
AI-SDV 2021: Dolcera
PDF
AI-SDV 2020: Can There Be Profitable Revenue from an AI Deployment? The Upsid...
PDF
From Science to Data: Following a principled path to Data Science
PDF
II-SDV 2017: Centredoc
PDF
IC-SDV 2018: Averbis
PDF
IC-SDV 2018: Search Technology / VanatagePoint
PDF
II-SDV 2017: Approaches of Web Information Analysis in a Day to Day Work Envi...
PDF
Data Science Application in Business Portfolio & Risk Management
AI-SDV 2021 Biomax
ICIC 2013 Conference Proceedings Tony Trippe Patinformatics
AI-SDV 2021 - Deep SEARCH 9
Utilising Open Source and Communities to Drive Innovation in a Cost-Effective...
AI-SDV 2021 - Marjorie Hlava - Semantic Search and Content Management – Case...
AI-SDV 2021: Angela Bauch - AILANI for clinical competitive landscaping
ICIC 2017: The Use of Patent Information for Innovation and Competitive Intel...
AI-SDV 2021 - Harald Jenny - Integrated Artificiel Intelligence – A Factory P...
II-SDV 2014 Automated Relevancy Check of Patents and Scientific Literature (K...
IC-SDV 2018: Aleksandar Kapisoda (Boehringer) Using Machine Learning for Auto...
New Product Introductions - LexisNexis
AI-SDV 2021: Linus Wretblad - Best practice on new intelligent tools in IP ma...
AI-SDV 2021: Dolcera
AI-SDV 2020: Can There Be Profitable Revenue from an AI Deployment? The Upsid...
From Science to Data: Following a principled path to Data Science
II-SDV 2017: Centredoc
IC-SDV 2018: Averbis
IC-SDV 2018: Search Technology / VanatagePoint
II-SDV 2017: Approaches of Web Information Analysis in a Day to Day Work Envi...
Data Science Application in Business Portfolio & Risk Management
Ad

Similar to AI-SDV 2021 - Tony Trippe - The Current State of Machine Learning for Patent Searching and Analytics: Practical Perspectives from ML4Patents.com (20)

PPT
Entrepreneurship 101 - Intellectual Property
PDF
A Sea Change is Coming to Patent Analytics - Brought to You by Big Data
PPT
NORCAT Entrepreneurship 101 2014/15 – “Intellectual Property” featuring Antho...
PDF
Open Innovation in the UK - Digital Catapult - Michele Nati
PDF
Digital Catapult, Senzations15
PDF
A Sea Change is Coming to Patent Analytics - Brought to You by Big Data
PPTX
Patent Trollls gonna kill VRM?
PPTX
State of the art introduction
PPTX
Silicon Halton Meetup 108 - Is Your AI Invention Protectable?
PPT
Intellectual Property Strategy
PDF
Software Licensing Simplified
PPTX
IPTC Rights Statements For News
PDF
Anthony Trippe Harnessing the Power of Patent Analytics A Policy Maker's Pers...
PPT
ENT101 Season 3 - IP Management - Norton Rose Fulbright
PDF
II-SDV 2016 GRIDLOGICS
PDF
Using Patent Analytics to Engage Senior Management in Corporate Strategy
PDF
How to Perform Patent Landscape Analysis & Search Report – A Complete Guide
PDF
Technology Transfer for Knowledge Translation Practitioners
PPTX
Algorithm Marketplace and the new "Algorithm Economy"
PDF
Strategic Planning for Capturing and Protecting Intellectual Property - Knobb...
Entrepreneurship 101 - Intellectual Property
A Sea Change is Coming to Patent Analytics - Brought to You by Big Data
NORCAT Entrepreneurship 101 2014/15 – “Intellectual Property” featuring Antho...
Open Innovation in the UK - Digital Catapult - Michele Nati
Digital Catapult, Senzations15
A Sea Change is Coming to Patent Analytics - Brought to You by Big Data
Patent Trollls gonna kill VRM?
State of the art introduction
Silicon Halton Meetup 108 - Is Your AI Invention Protectable?
Intellectual Property Strategy
Software Licensing Simplified
IPTC Rights Statements For News
Anthony Trippe Harnessing the Power of Patent Analytics A Policy Maker's Pers...
ENT101 Season 3 - IP Management - Norton Rose Fulbright
II-SDV 2016 GRIDLOGICS
Using Patent Analytics to Engage Senior Management in Corporate Strategy
How to Perform Patent Landscape Analysis & Search Report – A Complete Guide
Technology Transfer for Knowledge Translation Practitioners
Algorithm Marketplace and the new "Algorithm Economy"
Strategic Planning for Capturing and Protecting Intellectual Property - Knobb...
Ad

More from Dr. Haxel Consult (20)

PDF
AI-SDV 2022: Henry Chang Patent Intelligence and Engineering Management
PDF
AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...
PDF
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...
PDF
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
PDF
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...
PDF
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...
PDF
AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...
PDF
AI-SDV 2022: Machine learning based patent categorization: A success story in...
PDF
AI-SDV 2022: Machine learning based patent categorization: A success story in...
PDF
AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...
PDF
AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...
PDF
AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...
PDF
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...
PDF
AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...
PDF
AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...
PDF
AI-SDV 2022: Copyright Clearance Center
PDF
AI-SDV 2022: Lighthouse IP
PDF
AI-SDV 2022: New Product Introductions: CENTREDOC
PDF
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...
PDF
AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...
AI-SDV 2022: Henry Chang Patent Intelligence and Engineering Management
AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...
AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...
AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...
AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...
AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...
AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...
AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...
AI-SDV 2022: Copyright Clearance Center
AI-SDV 2022: Lighthouse IP
AI-SDV 2022: New Product Introductions: CENTREDOC
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...
AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...

Recently uploaded (20)

PDF
Introduction to the IoT system, how the IoT system works
PDF
APNIC Update, presented at PHNOG 2025 by Shane Hermoso
PPTX
presentation_pfe-universite-molay-seltan.pptx
PPTX
Introuction about ICD -10 and ICD-11 PPT.pptx
PPTX
artificial intelligence overview of it and more
PDF
RPKI Status Update, presented by Makito Lay at IDNOG 10
PDF
An introduction to the IFRS (ISSB) Stndards.pdf
PDF
The Internet -By the Numbers, Sri Lanka Edition
PPTX
Slides PPTX World Game (s) Eco Economic Epochs.pptx
PPTX
SAP Ariba Sourcing PPT for learning material
PDF
💰 𝐔𝐊𝐓𝐈 𝐊𝐄𝐌𝐄𝐍𝐀𝐍𝐆𝐀𝐍 𝐊𝐈𝐏𝐄𝐑𝟒𝐃 𝐇𝐀𝐑𝐈 𝐈𝐍𝐈 𝟐𝟎𝟐𝟓 💰
PPT
tcp ip networks nd ip layering assotred slides
PDF
The New Creative Director: How AI Tools for Social Media Content Creation Are...
PPTX
Job_Card_System_Styled_lorem_ipsum_.pptx
PDF
Best Practices for Testing and Debugging Shopify Third-Party API Integrations...
PDF
Unit-1 introduction to cyber security discuss about how to secure a system
PDF
Slides PDF The World Game (s) Eco Economic Epochs.pdf
PPTX
522797556-Unit-2-Temperature-measurement-1-1.pptx
PPTX
innovation process that make everything different.pptx
PDF
SASE Traffic Flow - ZTNA Connector-1.pdf
Introduction to the IoT system, how the IoT system works
APNIC Update, presented at PHNOG 2025 by Shane Hermoso
presentation_pfe-universite-molay-seltan.pptx
Introuction about ICD -10 and ICD-11 PPT.pptx
artificial intelligence overview of it and more
RPKI Status Update, presented by Makito Lay at IDNOG 10
An introduction to the IFRS (ISSB) Stndards.pdf
The Internet -By the Numbers, Sri Lanka Edition
Slides PPTX World Game (s) Eco Economic Epochs.pptx
SAP Ariba Sourcing PPT for learning material
💰 𝐔𝐊𝐓𝐈 𝐊𝐄𝐌𝐄𝐍𝐀𝐍𝐆𝐀𝐍 𝐊𝐈𝐏𝐄𝐑𝟒𝐃 𝐇𝐀𝐑𝐈 𝐈𝐍𝐈 𝟐𝟎𝟐𝟓 💰
tcp ip networks nd ip layering assotred slides
The New Creative Director: How AI Tools for Social Media Content Creation Are...
Job_Card_System_Styled_lorem_ipsum_.pptx
Best Practices for Testing and Debugging Shopify Third-Party API Integrations...
Unit-1 introduction to cyber security discuss about how to secure a system
Slides PDF The World Game (s) Eco Economic Epochs.pdf
522797556-Unit-2-Temperature-measurement-1-1.pptx
innovation process that make everything different.pptx
SASE Traffic Flow - ZTNA Connector-1.pdf

AI-SDV 2021 - Tony Trippe - The Current State of Machine Learning for Patent Searching and Analytics: Practical Perspectives from ML4Patents.com

  • 1. ©All rights reserved. Not for reproduction, distribution or sale. Improving Patent Analytics Using Semantic and Machine Learning Technologies Anthony Trippe Managing Director Patinformatics, LLC ii-SDV October 5th, 2021
  • 2. ©All rights reserved. Not for reproduction, distribution or sale. PATENT ANALYTICS USING SEMANTIC AND MACHINE LEARNING TECHNOLOGIES
  • 3. www.patinformatics.com ©All rights reserved. Not for reproduction, distribution or sale. 3 • Taken from: Guidelines for Preparing Patent Landscape Reports • http://www.wipo.int/edocs/pubdocs/en/wipo_pub_946.pdf • Building a collection • Conducting a patent search • Determining relevance • TIDYing the collection • Patent family reduction • Deciding on type of year to use • Standardizing key fields • Creating categories • Steps in preparing a patent landscape report
  • 4. www.patinformatics.com ©All rights reserved. Not for reproduction, distribution or sale. 4 • Analyzing the collection • Building the models • Looking for trends • Visualizing the collection • Charts & Graphs • Network diagrams • Spatial concept maps • Sharing conclusions • Steps in preparing a patent landscape report
  • 5. www.patinformatics.com ©All rights reserved. Not for reproduction, distribution or sale. 5 • In searching during the preparation of a PLR, information retrieval methods usually look at precision and recall simultaneously and measure their effectiveness looking at both elements • Even though this is the case, precision and recall are normally opposed to one another such that with an increase in recall there is usually a subsequent drop in the level of precision • In generating collections for PLRs it might be more productive to begin with creating sets using methods that produce high recall exclusive of precision • Once an initial collection with high recall is built different methods can be used to increase the precision of the collection by determining relevance of the families • From a practical perspective, if the level of recall can be established at higher than 90%, while the precision kept above 70%, then the likelihood of finding statistically relevant, but conceptually irrelevant items in the subsequent analysis steps is reasonably small • Patent Search/Determine Relevance
  • 6. www.patinformatics.com ©All rights reserved. Not for reproduction, distribution or sale. 6 • Improving recall • Cosine similarity • Latent semantic analysis • More like this / Practical scoring function • https://cloudblog-withgoogle-com.cdn.ampproject.org/c/s/cloudblog.withgoogle.com/products/data-analytics/expanding-your-patent- set-with-ml-and-bigquery/amp/ • Others? • How does this really compare to Boolean/traditional patent queries? –will be discussed at EPOPIC • Determining relevance – https://github.com/swh/classification-gold-standard • Binary classification • Support vector machines / vector space models • Neural networks • Others? • Semantic Tools for Patent Search/Determine Relevance
  • 7. www.patinformatics.com ©All rights reserved. Not for reproduction, distribution or sale. 7 • Field Cleanup • Patent Assignee Standardization – fuzzy logic, rules-based • disambiguation • Family or Invention Reduction • Reconciling Forward Citations • Based on family reduction method • Determining Reporting Year • Using earliest publication year • TIDYing the Collection
  • 8. www.patinformatics.com ©All rights reserved. Not for reproduction, distribution or sale. 8 • Binary classification provides a means for categorizing large collections of patent documents into the references that are likely to be of highest interest to the information professional, and those that are likely not related, but were still retrieved in a broad search • A training set will be made up of references that are highly relevant to the interests of the analyst • In training the classifier, the analyst will need to identify documents that are off-topic as well, so the classifier can establish a hyperplane that will distinguish between the two categories • Technology categories are sometimes identified using the patent data itself, for instance, with classification codes, but ideally, they should be generated based on input from a subject-matter expert based on an industry standard view on how approaches are categorized • Using a market or industry-based approach to creating categories will make it easier for the clients of the PLR to identify with the technology • Creating Categories for the Collection
  • 9. ©All rights reserved. Not for reproduction, distribution or sale. SAMPLE PATENT VISUALIZATIONS INVOLVING MACHINE LEARNING
  • 10. www.patinformatics.com ©All rights reserved. Not for reproduction, distribution or sale. • Spatial Concept Maps 10 • Patents that are similar to one another based on their language are organized close to one another on the map • Relative distance between different technical subjects show which concepts are related to one another • Labels are added to identify the sub- sections within a technology field • Colored dots are used to provide comparisons within the context of the map • In this case, the dots provide a means to compare different companies in the space • Colored dots can also be used for distributions over time
  • 11. www.patinformatics.com ©All rights reserved. Not for reproduction, distribution or sale. • Patent Citation Network MapsTM 11 • When a patent is referenced in a future patent it’s called a forward citation • Patents that have a large number of forward citations, especially from other organizations can be considered influential • These citations can be aggregated by the organizations that own the patents in question • Using a network diagram the connections between organizations can be found, and the most influential groups identified • In this case, the HP and Northrup portfolios are not as large as the ones for D-Wave, and IBM, but they are very influential
  • 12. ©All rights reserved. Not for reproduction, distribution or sale. INTRODUCING ML4PATENTS.COM
  • 13. www.patinformatics.com ©All rights reserved. Not for reproduction, distribution or sale. • Supporting and Additional Vendors 13 • Vendor pages • 37 unique machine learning for IP tools from 31 vendors • Detailed information provided on each tool • Ten primary and 21 secondary categories index for ease of data retrieval on the tools and vendors
  • 14. www.patinformatics.com ©All rights reserved. Not for reproduction, distribution or sale. • Blog Post Selection: Can an AI Get a Patent? 14 • From ML4Patents.com — 219 posts currently on the site • When is There an “Actual Invention” Involving Computers? • 2021: An AI Inventor Odyssey • USPTO Releases Public Comments on AI • The Time is Now: Opportunities to Advise the E.D. Va. or EPO as to Whether to Prohibit, Permit, or Require Listing an AI Algorithm as an Inventor • AI Magazine: What Happens When Artificial Intelligence Invents: Is the Invention Patentable? • Legal challenges have occurred in Australia, Europe, the United Kingdom and the US • In all cases patentability requires a human inventor and thus an AI system can’t be an inventor
  • 15. www.patinformatics.com ©All rights reserved. Not for reproduction, distribution or sale. • Blog Post Selection: Is AI Patentable? 15 • From ML4Patents.com • How to Safeguard AI Technology: Patents versus Trade Secrets • Brazilian PTO publishes Guidelines for Examining Patent Applications Involving Computer-Implemented Inventions • European Patent Office Artificial Intelligence Page • Public Views on AI and IP Policy from USPTO • AI Initiatives at the JPO • Patenting AI in the EPO Guidelines - The Best Practice Podcast • UK Government Mulls Legislative Changes to Safeguard AI Inventions • Absolutely — most countries are making special provisions • Software patents used to mostly be strictly forbidden as abstract and not allowable subject matter, but as the importance of machine learning has increased more and more countries are posting guidelines on how to protect these inventions
  • 16. www.patinformatics.com ©All rights reserved. Not for reproduction, distribution or sale. • Webinar Series 16 • 4th episode is on Wednesday October 6th with Intellar from Perception Partners • Learn more and register at: https://www.ml4patents.com/webinars/episode-4-w- perception-partners • 1st three episodes have been a resounding success • Relativity — providing a collection of methods to chose from • Cipher — deep indexing in selected verticals with rich hierarchies • Amplified — provided an excellent primer on modern ML for text retrieval methods • Archive available at: https://www.ml4patents.com/webinars
  • 17. www.patinformatics.com ©All rights reserved. Not for reproduction, distribution or sale. Contact Us +1.614.787.5237 info@patinformatics.com www.patinformatics.com 17