SlideShare a Scribd company logo
The effect of the number
of concepts on the
readability
of schemas
hello!
I am Your Name
I am here because I love to give presentations.
You can find me at @username
1.
Introduction
An Empirical Study with Data Models
“A Conceptual Model is a representation of a system, made of the composition of
concepts which are used to help people know, understand, or simulate a subject
the model represents. Conceptual model schemas are used as,
(a) A method of either informally or formally documenting end-user
requirements.
(b) A method of optimally designing the subsequent IS.
A commonly used example of both (a) and (b) is the use of The Entity-
relationship Model (ERM) to capture end-user requirements for constructing
a relational database application.
Dependent variables in earlier empirical work
A survey of the literature on the evaluation of modeling methods reveals several desirable
attributes for conceptual modeling methods, which have been used as dependent
variables in past empirical studies. These include,
a) The adequacy or completeness of the modeling method in being able to represent
the underlying reality.
b) The readability of the modeling method’s schemas.
c) How easy it is to use the modeling method to Represent Requirements.
Independent variables in earlier empirical work
a) The first independent variable is the level of experience and familiarity of the subjects
with the conceptual model used.
b) Past studies have attempted to control for the level of familiarity with the domain by
utilizing domains that are reasonably familiar to all subjects, and further by randomly
allocating subjects across treatment levels.
c) A third variable is the underlying complexity of the requirements for a particular
situation, where a more complex set of requirements is harder to reconstruct than a
simpler set.
Illustrative past work on the readability of conceptual models
Study Independent Variables Measures Results
Brosey and
Schneiderman [9]
(a) Hierarchical vs.
relational models and
(b) User experience
Questions on domain
Hierarchical schemas were easier
to read by novice users
Juhn and
Naumann [17]
Semantic vs.
non-semantic models
Questions on domain
Semantic models subjects
identified
relationships and cardinalities
better
Palvia et al. [27] O–O vs. non O–O Questions on domain O–O subjects performed better
Shoval and
Frummerman [35]
EER vs. OO
True/false questions
on domain
EER subjects interpreted ternary
relationships more correctly
Hardgrave and
Dalal [16]
EER vs. OMT
Ability to understand and
time to understand
OO subjects were significantly
faster at
answering questions than EER
subjects
Peleg and Dori [29] OPM/T vs. OMT/T
True/false questions on
domain
OPM/T subjects better at
comprehension
2.
Operationalization of
variables and research
model
Operationalization of Variables
The independent variable in this work is the
NOC in a model, which we define to be a
numerical count of the number of distinct
syntactic constructs in a model. Increasing
the NOC in a model makes it harder to
create model schemas from an analyst
standpoint, but enables it to capture more
elements of the underlying domain.
NOC
Number of Concepts
REF
Readability Effectiveness
REN
Readability Efficiency
RLN (REF)
Slope of Effectiveness
RLN (REN)
Slope of Efficiency
Dimensions of Readability
Readability Effectiveness (REF)
The percentage of correct
answers given when questions
are asked about the domain.
No. of Correct Answers x 100
REF =
Total No. of Questions
Readability Efficiency (REN)
The inverse of the time it takes
to answer questions regarding
schemas.
1
REN =
t
Readability Learnability (RLN)
The improvement in the
dimensions of effectiveness
and efficiency of readability
over successive tasks.
d(REF) d(REF)
RLN(REF) = =
dxRLN(REF) dx
Hypothesis
1) Hypothesis 1 (H1):
A higher NOC will lead to a higher REF.
2) Hypothesis 2 (H2):
A higher NOC will lead to a lower REN.
3) Hypothesis 3 (H3)
H3(a):
A higher NOC will lead to a lower RLN(REF).
H3(b):
A higher NOC will lead to a lower RLN(REN).
3.
Experimental Study
Subject Selection
The subjects for this study were MIS masters level students in a university based in
northeastern USA. All the subjects were in the age range 22–30, and had one year of
experience (two courses) in using conceptual data models, with no previous usage of
conceptual data models in the work place. As such, the subjects in this study represent
beginner professional level systems analysts.
Independent & dependent variables
in the experiment
In order to maintain internal validity, two versions of the ERM are selected. The sets of
concepts in each version are shown below.
✘ ERM base-level-NOC = {entity sets, relationship sets, attributes of entity sets, attributes of
relationship sets, primary keys of entity sets}
✘ ERM higher-level-NOC = {entity sets, relation sets, attributes of entity sets, attributes of
relationship sets, primary keys of entity sets, cardinalities of relationship sets,
inheritance, optionality of relationship sets, weak entity sets}
Baseline NOC Model Library Schema
Higher Level NOC Model Library Schema
Experimental Process
✘ Subjects were first refreshed regarding the concepts behind the ERM that they would
be using. For each model, the instruction was stopped after all the subjects indicated
that they were comfortable with the model.
✘ After the instruction, the subjects were told they would be given four schemas, one
after the other. As an incentive, the subjects were offered $20 to participate in the
study, an additional $8 if they had a correct score of over 90%, and $2 if they finished
the study in less than 60% of the maximum time that was allocated.
✘ The entire protocol was pilot tested and found to work adequately.
Data Analysis
✘ Table 2 shows the raw REF and REN scores for each
experimental group, across the four domains.
Data Analysis
✘ A 2-tailed t-test analysis of the difference between
the mean REF and REN scores of the two groups
across the four domains is shown in Table 3.
✘ Table 4 shows the raw slope scores for the REF and
REN for the two models.
✘ A 2-tailed t-test analysis of the difference between
the mean REF and REN slopes of the two groups is
shown in Table 5.
Conclusion
Let’s review some concepts
1
The support for H1 implies that models
with more concepts will lead to more
accurate conceptualizations of the
underlying domain.
2
The significant support for H2 implies
that adding more concepts to a model
increases the amount of time it takes
to map back to the problem.
3
The support for the contra of H3(a) at
the 10% level indicates that the learn
ability of models with a larger NOC is
faster than those with a fewer number.
4
The lack of support for H3(b) indicates,
the reduction in the time required is not
affected by the NOC in the model. And
no significant difference is found.
5
This Study takes a step towards
empirical validation of the theory
regarding the readability of conceptual
models.
6
The Study extends the
operationalization of the readability
construct to incorporate learn ability,
effectiveness, and efficiency.
thanks!
Any questions?
You can find me at
@username
user@mailme

More Related Content

PPTX
Language Models for Information Retrieval
PDF
E43022023
PDF
Information retrieval as statistical translation
PPT
Probabilistic Models of Novel Document Rankings for Faceted Topic Retrieval
PDF
Sentence Validation by Statistical Language Modeling and Semantic Relations
PDF
Taxonomy extraction from automotive natural language requirements using unsup...
PDF
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
PDF
Similarity Measures for Semantic Relation Extraction
Language Models for Information Retrieval
E43022023
Information retrieval as statistical translation
Probabilistic Models of Novel Document Rankings for Faceted Topic Retrieval
Sentence Validation by Statistical Language Modeling and Semantic Relations
Taxonomy extraction from automotive natural language requirements using unsup...
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
Similarity Measures for Semantic Relation Extraction

What's hot (20)

PPTX
[Paper Reading] Supervised Learning of Universal Sentence Representations fro...
PDF
PPTX
NLP Project Presentation
PDF
Machine Learning Techniques with Ontology for Subjective Answer Evaluation
PDF
IRJET- An Analysis of Recent Advancements on the Dependency Parser
PDF
Semantic Similarity Measures for Semantic Relation Extraction
PPTX
Neural Models for Document Ranking
PDF
French machine reading for question answering
PDF
L1803058388
PDF
Nlp presentation
PPTX
Dual Embedding Space Model (DESM)
PDF
A SURVEY PAPER ON EXTRACTION OF OPINION WORD AND OPINION TARGET FROM ONLINE R...
PDF
Nlp research presentation
PDF
Cc35451454
PDF
Automatic vs. human question answering over multimedia meeting recordings
PPTX
5 Lessons Learned from Designing Neural Models for Information Retrieval
PDF
Topic Modeling for Information Retrieval and Word Sense Disambiguation tasks
PDF
Conceptual similarity measurement algorithm for domain specific ontology[
PPTX
Topic Extraction on Domain Ontology
PPTX
Adversarial and reinforcement learning-based approaches to information retrieval
[Paper Reading] Supervised Learning of Universal Sentence Representations fro...
NLP Project Presentation
Machine Learning Techniques with Ontology for Subjective Answer Evaluation
IRJET- An Analysis of Recent Advancements on the Dependency Parser
Semantic Similarity Measures for Semantic Relation Extraction
Neural Models for Document Ranking
French machine reading for question answering
L1803058388
Nlp presentation
Dual Embedding Space Model (DESM)
A SURVEY PAPER ON EXTRACTION OF OPINION WORD AND OPINION TARGET FROM ONLINE R...
Nlp research presentation
Cc35451454
Automatic vs. human question answering over multimedia meeting recordings
5 Lessons Learned from Designing Neural Models for Information Retrieval
Topic Modeling for Information Retrieval and Word Sense Disambiguation tasks
Conceptual similarity measurement algorithm for domain specific ontology[
Topic Extraction on Domain Ontology
Adversarial and reinforcement learning-based approaches to information retrieval
Ad

Similar to The effect of number of concepts on readability of schemas 2 (20)

PDF
Relevance feature discovery for text mining
PPT
Recommender system
PDF
Empirical Analysis of the Bias-Variance Tradeoff Across Machine Learning Models
PDF
Empirical Analysis of the Bias-Variance Tradeoff Across Machine Learning Models
PDF
A hybrid composite features based sentence level sentiment analyzer
PDF
An Automatic Question Paper Generation : Using Bloom's Taxonomy
PDF
Babelfish_Report
PDF
Marshall hm poster_vra2015
DOCX
ASSIGNMENT 2 - Research Proposal Weighting 30 tow.docx
PDF
TOPIC EXTRACTION OF CRAWLED DOCUMENTS COLLECTION USING CORRELATED TOPIC MODEL...
PDF
Topicmodels
DOCX
DIRECTIONS READ THE FOLLOWING STUDENT POST AND RESPOND EVALUATE I.docx
PDF
TOPIC EXTRACTION OF CRAWLED DOCUMENTS COLLECTION USING CORRELATED TOPIC MODEL...
PDF
May 2024 - Top10 Cited Articles in Natural Language Computing
PPT
A Longitudinal Perspective on the Relationship between Hypermedia Structure...
PPTX
Hierarchical clustering and topology for psychometric validation
PPTX
topic modelling through LDA and bertopic model
PDF
A Domain-Invariant Transfer Learning by Bert for Cross-Domain Sentiment Analysis
PPT
Slides sem on pls-complete
PPTX
Writing a scientific manuscript
Relevance feature discovery for text mining
Recommender system
Empirical Analysis of the Bias-Variance Tradeoff Across Machine Learning Models
Empirical Analysis of the Bias-Variance Tradeoff Across Machine Learning Models
A hybrid composite features based sentence level sentiment analyzer
An Automatic Question Paper Generation : Using Bloom's Taxonomy
Babelfish_Report
Marshall hm poster_vra2015
ASSIGNMENT 2 - Research Proposal Weighting 30 tow.docx
TOPIC EXTRACTION OF CRAWLED DOCUMENTS COLLECTION USING CORRELATED TOPIC MODEL...
Topicmodels
DIRECTIONS READ THE FOLLOWING STUDENT POST AND RESPOND EVALUATE I.docx
TOPIC EXTRACTION OF CRAWLED DOCUMENTS COLLECTION USING CORRELATED TOPIC MODEL...
May 2024 - Top10 Cited Articles in Natural Language Computing
A Longitudinal Perspective on the Relationship between Hypermedia Structure...
Hierarchical clustering and topology for psychometric validation
topic modelling through LDA and bertopic model
A Domain-Invariant Transfer Learning by Bert for Cross-Domain Sentiment Analysis
Slides sem on pls-complete
Writing a scientific manuscript
Ad

More from Saman Sara (17)

PPSX
Hub savers
PPTX
Transgender employees
PPTX
The need for security
PPTX
The future of learning
PPT
The Apple Experience
PPTX
Technology is a double edged weapon
PPTX
Strategy and Tactics
PPTX
Social media dangers
PPT
Purpose Driven Marketing
PPTX
Nachfrist for late performance
PPTX
Mental Health & Technical Possibilities
PPTX
Failed technologies and investment under uncertainty
PPT
Cyber crime in the digital age
PPTX
Change Agility
PPTX
Colorism and socioeconomic status
PPTX
She crate
PDF
My Portfolio
Hub savers
Transgender employees
The need for security
The future of learning
The Apple Experience
Technology is a double edged weapon
Strategy and Tactics
Social media dangers
Purpose Driven Marketing
Nachfrist for late performance
Mental Health & Technical Possibilities
Failed technologies and investment under uncertainty
Cyber crime in the digital age
Change Agility
Colorism and socioeconomic status
She crate
My Portfolio

Recently uploaded (20)

PDF
STATICS OF THE RIGID BODIES Hibbelers.pdf
PPTX
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
PDF
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
PPTX
Institutional Correction lecture only . . .
PPTX
Final Presentation General Medicine 03-08-2024.pptx
PDF
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
PPTX
Introduction_to_Human_Anatomy_and_Physiology_for_B.Pharm.pptx
PDF
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx
PDF
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
PDF
01-Introduction-to-Information-Management.pdf
PDF
O7-L3 Supply Chain Operations - ICLT Program
PDF
Microbial disease of the cardiovascular and lymphatic systems
PPTX
human mycosis Human fungal infections are called human mycosis..pptx
PDF
Abdominal Access Techniques with Prof. Dr. R K Mishra
PPTX
master seminar digital applications in india
PPTX
Pharma ospi slides which help in ospi learning
PPTX
Cell Types and Its function , kingdom of life
PDF
Sports Quiz easy sports quiz sports quiz
PPTX
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
PDF
Computing-Curriculum for Schools in Ghana
STATICS OF THE RIGID BODIES Hibbelers.pdf
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
Institutional Correction lecture only . . .
Final Presentation General Medicine 03-08-2024.pptx
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
Introduction_to_Human_Anatomy_and_Physiology_for_B.Pharm.pptx
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
01-Introduction-to-Information-Management.pdf
O7-L3 Supply Chain Operations - ICLT Program
Microbial disease of the cardiovascular and lymphatic systems
human mycosis Human fungal infections are called human mycosis..pptx
Abdominal Access Techniques with Prof. Dr. R K Mishra
master seminar digital applications in india
Pharma ospi slides which help in ospi learning
Cell Types and Its function , kingdom of life
Sports Quiz easy sports quiz sports quiz
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
Computing-Curriculum for Schools in Ghana

The effect of number of concepts on readability of schemas 2

  • 1. The effect of the number of concepts on the readability of schemas
  • 2. hello! I am Your Name I am here because I love to give presentations. You can find me at @username
  • 4. “A Conceptual Model is a representation of a system, made of the composition of concepts which are used to help people know, understand, or simulate a subject the model represents. Conceptual model schemas are used as, (a) A method of either informally or formally documenting end-user requirements. (b) A method of optimally designing the subsequent IS. A commonly used example of both (a) and (b) is the use of The Entity- relationship Model (ERM) to capture end-user requirements for constructing a relational database application.
  • 5. Dependent variables in earlier empirical work A survey of the literature on the evaluation of modeling methods reveals several desirable attributes for conceptual modeling methods, which have been used as dependent variables in past empirical studies. These include, a) The adequacy or completeness of the modeling method in being able to represent the underlying reality. b) The readability of the modeling method’s schemas. c) How easy it is to use the modeling method to Represent Requirements.
  • 6. Independent variables in earlier empirical work a) The first independent variable is the level of experience and familiarity of the subjects with the conceptual model used. b) Past studies have attempted to control for the level of familiarity with the domain by utilizing domains that are reasonably familiar to all subjects, and further by randomly allocating subjects across treatment levels. c) A third variable is the underlying complexity of the requirements for a particular situation, where a more complex set of requirements is harder to reconstruct than a simpler set.
  • 7. Illustrative past work on the readability of conceptual models Study Independent Variables Measures Results Brosey and Schneiderman [9] (a) Hierarchical vs. relational models and (b) User experience Questions on domain Hierarchical schemas were easier to read by novice users Juhn and Naumann [17] Semantic vs. non-semantic models Questions on domain Semantic models subjects identified relationships and cardinalities better Palvia et al. [27] O–O vs. non O–O Questions on domain O–O subjects performed better Shoval and Frummerman [35] EER vs. OO True/false questions on domain EER subjects interpreted ternary relationships more correctly Hardgrave and Dalal [16] EER vs. OMT Ability to understand and time to understand OO subjects were significantly faster at answering questions than EER subjects Peleg and Dori [29] OPM/T vs. OMT/T True/false questions on domain OPM/T subjects better at comprehension
  • 9. Operationalization of Variables The independent variable in this work is the NOC in a model, which we define to be a numerical count of the number of distinct syntactic constructs in a model. Increasing the NOC in a model makes it harder to create model schemas from an analyst standpoint, but enables it to capture more elements of the underlying domain. NOC Number of Concepts REF Readability Effectiveness REN Readability Efficiency RLN (REF) Slope of Effectiveness RLN (REN) Slope of Efficiency
  • 10. Dimensions of Readability Readability Effectiveness (REF) The percentage of correct answers given when questions are asked about the domain. No. of Correct Answers x 100 REF = Total No. of Questions Readability Efficiency (REN) The inverse of the time it takes to answer questions regarding schemas. 1 REN = t Readability Learnability (RLN) The improvement in the dimensions of effectiveness and efficiency of readability over successive tasks. d(REF) d(REF) RLN(REF) = = dxRLN(REF) dx
  • 11. Hypothesis 1) Hypothesis 1 (H1): A higher NOC will lead to a higher REF. 2) Hypothesis 2 (H2): A higher NOC will lead to a lower REN. 3) Hypothesis 3 (H3) H3(a): A higher NOC will lead to a lower RLN(REF). H3(b): A higher NOC will lead to a lower RLN(REN).
  • 13. Subject Selection The subjects for this study were MIS masters level students in a university based in northeastern USA. All the subjects were in the age range 22–30, and had one year of experience (two courses) in using conceptual data models, with no previous usage of conceptual data models in the work place. As such, the subjects in this study represent beginner professional level systems analysts.
  • 14. Independent & dependent variables in the experiment In order to maintain internal validity, two versions of the ERM are selected. The sets of concepts in each version are shown below. ✘ ERM base-level-NOC = {entity sets, relationship sets, attributes of entity sets, attributes of relationship sets, primary keys of entity sets} ✘ ERM higher-level-NOC = {entity sets, relation sets, attributes of entity sets, attributes of relationship sets, primary keys of entity sets, cardinalities of relationship sets, inheritance, optionality of relationship sets, weak entity sets}
  • 15. Baseline NOC Model Library Schema
  • 16. Higher Level NOC Model Library Schema
  • 17. Experimental Process ✘ Subjects were first refreshed regarding the concepts behind the ERM that they would be using. For each model, the instruction was stopped after all the subjects indicated that they were comfortable with the model. ✘ After the instruction, the subjects were told they would be given four schemas, one after the other. As an incentive, the subjects were offered $20 to participate in the study, an additional $8 if they had a correct score of over 90%, and $2 if they finished the study in less than 60% of the maximum time that was allocated. ✘ The entire protocol was pilot tested and found to work adequately.
  • 18. Data Analysis ✘ Table 2 shows the raw REF and REN scores for each experimental group, across the four domains.
  • 19. Data Analysis ✘ A 2-tailed t-test analysis of the difference between the mean REF and REN scores of the two groups across the four domains is shown in Table 3. ✘ Table 4 shows the raw slope scores for the REF and REN for the two models. ✘ A 2-tailed t-test analysis of the difference between the mean REF and REN slopes of the two groups is shown in Table 5.
  • 21. Let’s review some concepts 1 The support for H1 implies that models with more concepts will lead to more accurate conceptualizations of the underlying domain. 2 The significant support for H2 implies that adding more concepts to a model increases the amount of time it takes to map back to the problem. 3 The support for the contra of H3(a) at the 10% level indicates that the learn ability of models with a larger NOC is faster than those with a fewer number. 4 The lack of support for H3(b) indicates, the reduction in the time required is not affected by the NOC in the model. And no significant difference is found. 5 This Study takes a step towards empirical validation of the theory regarding the readability of conceptual models. 6 The Study extends the operationalization of the readability construct to incorporate learn ability, effectiveness, and efficiency.
  • 22. thanks! Any questions? You can find me at @username user@mailme