SlideShare a Scribd company logo
Natural Language Understanding of Systems
Engineering Artifacts
INCOSE International Symposium 2022
Detroit
2022 June 25-30
Authors: Géza Kulcsár (IncQuery Labs), Gaëtan Pruvost (Thales Global Services),
Olivier Constant (TGS), István Ráth (IQL), Máté Füzesi (IQL), Dénes Harmath (IQL)
Presenter: Ákos Horváth (IQL)
akos.horvath@incquerylabs.com
The Y-Model of Digital Systems Engineering
Systems Model Understanding:
Beware, Experts Only
Systems
Engineering
Artifacts
Querying
Query Result Set
(Relevant
Patterns)
Processing
Information
Reports
MODELING EXPERTISE DOMAIN EXPERTISE
Analysis
Vision: Accessible System Model Analysis
Multi-Domain
Knowledge Graph
Systems
Engineering
Artifacts
Querying
Query Result Set
(Relevant
Patterns)
MODELING EXPERTISE DOMAIN EXPERTISE
Traces
Traces
Contribution 1: Model-Self Descriptions
Model Self-
Descriptions
(MSD)
Systems
Engineering
Artifacts
Querying
Query Result Set
(Relevant
Patterns)
MODELING EXPERTISE
Language
Model
Representation
Model to Text
DOMAIN EXPERTISE
Traces
Traces are available
What is Natural Language Understanding?
Source: “A Comprehensive Guide to Natural Language Generation.” Medium, Sciforce. 2019
What is Natural Language Understanding?
• The ongoing revolution in Natural Language
Understanding (NLU) results in better solutions for
automatically interpreting “general” texts
• That is, if the nature of our input text corresponds to
“typical” documents (as found, e.g., on the internet),
then we have good chances to have quality results for
NLU tasks
• However, sometimes, there is a specific meaning to a
text, (meant to be) understood by experts only
• Model-self descriptions are an intermediate step between NLU
algorithms and domain-specific artifacts – here, system models
https://www.kdnuggets.com/2019/03/deconstructing-bert-part-2-visualizing-inner-workings-attention.html
Model-Self Descriptions: Models to text
• Current approach: a query- and template-based information extractor and text
generator
• (Potential for future work: utilizing neural networks for text production)
• Sample queries
• VQL (Viatra Query Language)
“look for SysML blocks in a given package”
• Gremlin (a tool-independent graph query language for the Apache Tinkerpop framework)
“get Capella elements with Logical Component or Physical Component as domain type”
Example MSD Sentences
• Example for this presentation: The Hybrid SUV SysML model provided by OMG
• When turning the results of such queries into English sentences based on pre-
defined templates (only presupposing knowledge of SysML but not of the source
model or the domain it comes from), we get sentences like:
• The Environment block does not have any value properties.
• The Environment block has a weather part property with the type of Weather.
• The Weather block is in the HSUVModel package.
• The Transmission block contains 2 part properties, which are the fuelReturn property with
the type of Fuel and the fuelSupply property with the type of Fuel.
Contribution 2: Natural Search in Models
Model Self-
Descriptions
(MSD)
Systems
Engineering
Artifacts
Querying
Query Result Set
(Relevant
Patterns)
MODELING EXPERTISE
Language
Model
Representation
NLP
Natural
search
Model to Text
DOMAIN EXPERTISE
Traces
Traces are available
Language Models
• Generally, a language model is a representation of natural-language texts in
numerical form, allowing for machine interpretations
• However, their complexity ranges from simple, fixed word-to-vector mappings to
cutting-edge neural networks
• Without getting too much into technical details, we did experiments using two
established language models:
• TF-IDF (term frequency – inverse document frequency)
• A rather simple approach, with vector representations of words only slightly process: a term gets
higher weights if it is frequent in the input document we study, but rare in the overall language
corpus. Intuitively, this model could support domain-specific search scenarios such as ours.
• BERT by Google (a deep learning model with context awareness features)
• Pre-trained on Wikipedia and other generic, large-volume English-language sources, BERT can be
downloaded by anyone and used for NLP/NLU scenarios. It promises advanced, human-like
understanding in certain contexts by its context-sensitive vector encodings.
Experiment: Semantic Search in the HSUV Model
• In this experiment, we use an intuitive set of five search terms, representing
various semantic facets of the input model (and its domain), as well as divers
search strategies from single-word searches to actual questions:
• environment
• environmental awareness
• fuel
• What properties do we store about fuel?
• subsystem of HybridSUV
Results: TF-IDF
Results: BERT
Qualitative discussion
• The “environment” search term reveals that in a model-based setting, using
deeper natural-language semantics does not obviously bring benefits: result
with TF-IDF are much cleaner, while BERT results also include nonsense
• With a slightly more complicated term, “environmental awareness”, TF-IDF
seems to produce erroneous behavior, while the results of BERT look smart
• Results for “fuel”, again, show that simplicity also has its merits: BERT’s findings
feel totally irrelevant
• However, interestingly, the power of BERT is somewhat increased if we look for
the same thing in a natural-language questioning scenario: “What properties do
we store about fuel?”
Conclusion and Future Work
• We have provided a conceptual framework for bridging state-of-the-art NLU
techniques and systems model comprehension, along with an experimental
prototype
• First, preliminary results show that domain-specific semantic comprehension
most likely requires a combination of simplistic search mechanisms and
advanced semantic approaches
• Obviously, there is need for further evidence with more search terms, diverse
corpora and more engineering domains
• There is a yet unexploited potential in fine-tuning (training) neural networks like
BERT on actual system models (MSDs) to increase search quality
akos.horvath@incquerylabs.com
incquery.io
+36 30 5511164
Thank you!
@IncQueryLabs

More Related Content

PDF
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
PDF
Nlp research presentation
PDF
Should we be afraid of Transformers?
PDF
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
PDF
Revolutionizing Industry 4.0: GPT-Enabled Real-Time Support
PDF
IRJET- Factoid Question and Answering System
PDF
Emily Pitler - Representations from Natural Language Data: Successes and Chal...
PPTX
From Natural Language Processing to Artificial Intelligence
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Nlp research presentation
Should we be afraid of Transformers?
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Revolutionizing Industry 4.0: GPT-Enabled Real-Time Support
IRJET- Factoid Question and Answering System
Emily Pitler - Representations from Natural Language Data: Successes and Chal...
From Natural Language Processing to Artificial Intelligence

Similar to Natural Language Understanding of Systems Engineering Artifacts (20)

PDF
Deep Learning in NLP (BERT, ERNIE and REFORMER)
PDF
BERT Explained_ State of the art language model for NLP.pdf
PPTX
BERT QnA System for Airplane Flight Manual
PDF
NLP@DATEV: Setting up a domain specific language model, Dr. Jonas Rende & Tho...
PPTX
Data Day Seattle, From NLP to AI
PDF
Interpretation of Pretrained Language Models Chenyan Xiong 11-667
PDF
Andrea gatto meetup_dli_18_feb_2020
PDF
Let Android dream electric sheep: Making emotion model for chat-bot with Pyth...
PDF
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
PDF
DataChat_FinalPaper
PDF
Mini seminar presentation on context-based NED optimization
PDF
IRJET - Mobile Chatbot for Information Search
PPTX
Deep natural language processing in search systems
PPTX
Natural language processing and transformer models
PDF
Triantafyllia Voulibasi
PDF
May 2024 - Top10 Cited Articles in Natural Language Computing
PDF
Deep Learning for Information Retrieval: Models, Progress, & Opportunities
PDF
Conversational AI with Rasa - PyData Workshop
PDF
Human-in-the-loop: a design pattern for managing teams that leverage ML
PPTX
Deep Learning in NLP (BERT, ERNIE and REFORMER)
BERT Explained_ State of the art language model for NLP.pdf
BERT QnA System for Airplane Flight Manual
NLP@DATEV: Setting up a domain specific language model, Dr. Jonas Rende & Tho...
Data Day Seattle, From NLP to AI
Interpretation of Pretrained Language Models Chenyan Xiong 11-667
Andrea gatto meetup_dli_18_feb_2020
Let Android dream electric sheep: Making emotion model for chat-bot with Pyth...
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
DataChat_FinalPaper
Mini seminar presentation on context-based NED optimization
IRJET - Mobile Chatbot for Information Search
Deep natural language processing in search systems
Natural language processing and transformer models
Triantafyllia Voulibasi
May 2024 - Top10 Cited Articles in Natural Language Computing
Deep Learning for Information Retrieval: Models, Progress, & Opportunities
Conversational AI with Rasa - PyData Workshop
Human-in-the-loop: a design pattern for managing teams that leverage ML
Ad

More from Ákos Horváth (20)

PDF
Bridging the Validation Gap: Next-Level Model Governance in Digital-Twin Engi...
PDF
Next-Generation Completeness and Consistency Management in the Digital Threa...
PDF
IoT Meetup Budapest - The Open-CPS approach
PDF
Multi-disciplinary simulation of Cyber-Physical Systems – The OpenCPS approach
PPTX
V for visualization: VIATRA finally goes graphical thanks to Sirius!
PPTX
DemoCamp Budapest 2016 - Introdcution
PPTX
Incremental model compiler for executable UML
PPTX
MoDeS3 - Model-based Demonstrator for Smart and Safe Systems
PPTX
Incremental Queries and Transformations for Engineering Critical Systems
PPTX
EMF-IncQuery: Blazing-fast reaction time even for very large diagrams (Sirius...
PPTX
Local search-based pattern matching features in EMF-IncQuery
PPTX
VIATRA 3: A reactive model transformation platform
PPTX
Model visualization made easy: Incremental query-driven views in modeling tools
PPT
CPS(M): Constraint Satisfaction Problem over Models (a.k.a rule based design ...
PPTX
Szoftverfejlesztés a repülőgépiparban
PPTX
Guaranteed Component Assembly with Round Trip Analysis for Energy Efficient H...
PPTX
Software Development for Safety Critical Systems
PPTX
Incremental Model Queries for Model-Dirven Software Engineering
PPT
Model-Driven Development of ARINC 653 Configuration tables
PPTX
Hardware-Software allocation specification of IMA systems for early simulation
Bridging the Validation Gap: Next-Level Model Governance in Digital-Twin Engi...
Next-Generation Completeness and Consistency Management in the Digital Threa...
IoT Meetup Budapest - The Open-CPS approach
Multi-disciplinary simulation of Cyber-Physical Systems – The OpenCPS approach
V for visualization: VIATRA finally goes graphical thanks to Sirius!
DemoCamp Budapest 2016 - Introdcution
Incremental model compiler for executable UML
MoDeS3 - Model-based Demonstrator for Smart and Safe Systems
Incremental Queries and Transformations for Engineering Critical Systems
EMF-IncQuery: Blazing-fast reaction time even for very large diagrams (Sirius...
Local search-based pattern matching features in EMF-IncQuery
VIATRA 3: A reactive model transformation platform
Model visualization made easy: Incremental query-driven views in modeling tools
CPS(M): Constraint Satisfaction Problem over Models (a.k.a rule based design ...
Szoftverfejlesztés a repülőgépiparban
Guaranteed Component Assembly with Round Trip Analysis for Energy Efficient H...
Software Development for Safety Critical Systems
Incremental Model Queries for Model-Dirven Software Engineering
Model-Driven Development of ARINC 653 Configuration tables
Hardware-Software allocation specification of IMA systems for early simulation
Ad

Recently uploaded (20)

PDF
System and Network Administraation Chapter 3
PDF
AI in Product Development-omnex systems
PDF
wealthsignaloriginal-com-DS-text-... (1).pdf
PDF
Design an Analysis of Algorithms II-SECS-1021-03
PDF
Adobe Premiere Pro 2025 (v24.5.0.057) Crack free
PDF
Understanding Forklifts - TECH EHS Solution
PPTX
Operating system designcfffgfgggggggvggggggggg
PDF
PTS Company Brochure 2025 (1).pdf.......
PPTX
Agentic AI Use Case- Contract Lifecycle Management (CLM).pptx
PDF
2025 Textile ERP Trends: SAP, Odoo & Oracle
PPTX
CHAPTER 2 - PM Management and IT Context
PDF
SAP S4 Hana Brochure 3 (PTS SYSTEMS AND SOLUTIONS)
PDF
Upgrade and Innovation Strategies for SAP ERP Customers
PDF
Internet Downloader Manager (IDM) Crack 6.42 Build 42 Updates Latest 2025
PPTX
Introduction to Artificial Intelligence
PDF
How to Choose the Right IT Partner for Your Business in Malaysia
PPTX
Agentic AI : A Practical Guide. Undersating, Implementing and Scaling Autono...
PDF
Claude Code: Everyone is a 10x Developer - A Comprehensive AI-Powered CLI Tool
PPTX
Lecture 3: Operating Systems Introduction to Computer Hardware Systems
PPTX
Odoo POS Development Services by CandidRoot Solutions
System and Network Administraation Chapter 3
AI in Product Development-omnex systems
wealthsignaloriginal-com-DS-text-... (1).pdf
Design an Analysis of Algorithms II-SECS-1021-03
Adobe Premiere Pro 2025 (v24.5.0.057) Crack free
Understanding Forklifts - TECH EHS Solution
Operating system designcfffgfgggggggvggggggggg
PTS Company Brochure 2025 (1).pdf.......
Agentic AI Use Case- Contract Lifecycle Management (CLM).pptx
2025 Textile ERP Trends: SAP, Odoo & Oracle
CHAPTER 2 - PM Management and IT Context
SAP S4 Hana Brochure 3 (PTS SYSTEMS AND SOLUTIONS)
Upgrade and Innovation Strategies for SAP ERP Customers
Internet Downloader Manager (IDM) Crack 6.42 Build 42 Updates Latest 2025
Introduction to Artificial Intelligence
How to Choose the Right IT Partner for Your Business in Malaysia
Agentic AI : A Practical Guide. Undersating, Implementing and Scaling Autono...
Claude Code: Everyone is a 10x Developer - A Comprehensive AI-Powered CLI Tool
Lecture 3: Operating Systems Introduction to Computer Hardware Systems
Odoo POS Development Services by CandidRoot Solutions

Natural Language Understanding of Systems Engineering Artifacts

  • 1. Natural Language Understanding of Systems Engineering Artifacts INCOSE International Symposium 2022 Detroit 2022 June 25-30 Authors: Géza Kulcsár (IncQuery Labs), Gaëtan Pruvost (Thales Global Services), Olivier Constant (TGS), István Ráth (IQL), Máté Füzesi (IQL), Dénes Harmath (IQL) Presenter: Ákos Horváth (IQL) akos.horvath@incquerylabs.com
  • 2. The Y-Model of Digital Systems Engineering
  • 3. Systems Model Understanding: Beware, Experts Only Systems Engineering Artifacts Querying Query Result Set (Relevant Patterns) Processing Information Reports MODELING EXPERTISE DOMAIN EXPERTISE Analysis
  • 4. Vision: Accessible System Model Analysis Multi-Domain Knowledge Graph Systems Engineering Artifacts Querying Query Result Set (Relevant Patterns) MODELING EXPERTISE DOMAIN EXPERTISE Traces Traces
  • 5. Contribution 1: Model-Self Descriptions Model Self- Descriptions (MSD) Systems Engineering Artifacts Querying Query Result Set (Relevant Patterns) MODELING EXPERTISE Language Model Representation Model to Text DOMAIN EXPERTISE Traces Traces are available
  • 6. What is Natural Language Understanding? Source: “A Comprehensive Guide to Natural Language Generation.” Medium, Sciforce. 2019
  • 7. What is Natural Language Understanding? • The ongoing revolution in Natural Language Understanding (NLU) results in better solutions for automatically interpreting “general” texts • That is, if the nature of our input text corresponds to “typical” documents (as found, e.g., on the internet), then we have good chances to have quality results for NLU tasks • However, sometimes, there is a specific meaning to a text, (meant to be) understood by experts only • Model-self descriptions are an intermediate step between NLU algorithms and domain-specific artifacts – here, system models https://www.kdnuggets.com/2019/03/deconstructing-bert-part-2-visualizing-inner-workings-attention.html
  • 8. Model-Self Descriptions: Models to text • Current approach: a query- and template-based information extractor and text generator • (Potential for future work: utilizing neural networks for text production) • Sample queries • VQL (Viatra Query Language) “look for SysML blocks in a given package” • Gremlin (a tool-independent graph query language for the Apache Tinkerpop framework) “get Capella elements with Logical Component or Physical Component as domain type”
  • 9. Example MSD Sentences • Example for this presentation: The Hybrid SUV SysML model provided by OMG • When turning the results of such queries into English sentences based on pre- defined templates (only presupposing knowledge of SysML but not of the source model or the domain it comes from), we get sentences like: • The Environment block does not have any value properties. • The Environment block has a weather part property with the type of Weather. • The Weather block is in the HSUVModel package. • The Transmission block contains 2 part properties, which are the fuelReturn property with the type of Fuel and the fuelSupply property with the type of Fuel.
  • 10. Contribution 2: Natural Search in Models Model Self- Descriptions (MSD) Systems Engineering Artifacts Querying Query Result Set (Relevant Patterns) MODELING EXPERTISE Language Model Representation NLP Natural search Model to Text DOMAIN EXPERTISE Traces Traces are available
  • 11. Language Models • Generally, a language model is a representation of natural-language texts in numerical form, allowing for machine interpretations • However, their complexity ranges from simple, fixed word-to-vector mappings to cutting-edge neural networks • Without getting too much into technical details, we did experiments using two established language models: • TF-IDF (term frequency – inverse document frequency) • A rather simple approach, with vector representations of words only slightly process: a term gets higher weights if it is frequent in the input document we study, but rare in the overall language corpus. Intuitively, this model could support domain-specific search scenarios such as ours. • BERT by Google (a deep learning model with context awareness features) • Pre-trained on Wikipedia and other generic, large-volume English-language sources, BERT can be downloaded by anyone and used for NLP/NLU scenarios. It promises advanced, human-like understanding in certain contexts by its context-sensitive vector encodings.
  • 12. Experiment: Semantic Search in the HSUV Model • In this experiment, we use an intuitive set of five search terms, representing various semantic facets of the input model (and its domain), as well as divers search strategies from single-word searches to actual questions: • environment • environmental awareness • fuel • What properties do we store about fuel? • subsystem of HybridSUV
  • 15. Qualitative discussion • The “environment” search term reveals that in a model-based setting, using deeper natural-language semantics does not obviously bring benefits: result with TF-IDF are much cleaner, while BERT results also include nonsense • With a slightly more complicated term, “environmental awareness”, TF-IDF seems to produce erroneous behavior, while the results of BERT look smart • Results for “fuel”, again, show that simplicity also has its merits: BERT’s findings feel totally irrelevant • However, interestingly, the power of BERT is somewhat increased if we look for the same thing in a natural-language questioning scenario: “What properties do we store about fuel?”
  • 16. Conclusion and Future Work • We have provided a conceptual framework for bridging state-of-the-art NLU techniques and systems model comprehension, along with an experimental prototype • First, preliminary results show that domain-specific semantic comprehension most likely requires a combination of simplistic search mechanisms and advanced semantic approaches • Obviously, there is need for further evidence with more search terms, diverse corpora and more engineering domains • There is a yet unexploited potential in fine-tuning (training) neural networks like BERT on actual system models (MSDs) to increase search quality