SlideShare a Scribd company logo
Identifying semantics characteristics of user’s
interactions datasets through an application of a data
analysis
Fernando de Assis RODRIGUES, Ph. D.
Pedro Henrique Santos BISI
Ricardo César Gonçalves SANT’ANA, Ph. D.
Graduate Program of Information Science
UNESP - São Paulo State University (Brazil)
2
Using data as part of
the decision-making
process is a reality in
professional
environments.
We need to
decide
[something].
Sure! Let’s use
data to support.
Data Data Data Data Data Data
The analyzed fact need to receive inputs from
multiple data sources – structuring, integrating,
storing, and processing the collected data into an
output that supports a better understanding of the
fact from data, allowing new dimensions or
perspectives of analysis
3
4
The use of data as part of the decision-making process in several areas.
Science
Decision-
mankingData
D D D D D D
Education
Decision-
mankingData
D D D D D D
Industry
Decision-
mankingData
D D D D D D
Management
Decision-
mankingData
D D D D D D
Services
Decision-
mankingData
D D D D D D
...
Decision-
mankingData
D D D D D D
Data warehouse
5
DataDataDataDataDataDataDataDataData
Data Analytics
DataDataNew
Data
Look! After the data
analysis, new data
became available! Sounds good!
But don’t forget: from
data sources, all things
depend.
The <entity, attribute, and value> condition implies
a use of aggregated information on these elements to
assure a minimal semantic to understand what is
available, notably related with steps of obtaining data
collections from data sources
6
Goal
7
To identify the semantics characteristics of data attributes
at the moment of collecting, from dataset's structures found
on data export interfaces on user’s interactions analysis tools,
on Internet communication channels, and on web analytics
data tools involved in a scientific journal management,
through an application of a process of data analysis and data
modeling techniques.
Methodology and Method
8
About the observed phenomenon
Investigation of datasets, entities, and attributes related to
the interaction between users and communications
channels from a scientific journal.
Nature
Qualitative-quantitative research.
Purpose
Exploratory analysis to identify characteristics of how data
are available and structured on these data resources.
The Path adopted on investigation
An exploratory research of data export interfaces to collect
information about available data and metadata.
About methods
(i) Data extraction and spreadsheet data handling with
Python 3 programming language.
(ii) Applying of the Entity-Relationship model in
generated data from analytics.
(iii) Use of data structures oriented to making-decision
processing (OLAP).
Methodology and Method
9
Data universe
User’s interactions data from:
→ Internal data sources
→ External data sources
Methodology and Method
10
Data Sample
User’s interactions data* from RECoDAF - Electronic
Journal Digital Skills for Family Farming
→ Internal data sources: Open Journal Systems
→ External data sources: Facebook Insights, Google
Analytics, Google Search Console, and Twitter Analytics
Methodology and Method
11
* Data collected on September 2017. Available at
http://dadosabertos.info/data/collection_recodaf_2017
Results | Discussion
12
255 exportable datasets, using 5 file formats.
Results | Discussion
13
Find on data sources
information about:
→Services
→Resources
→Datasets
→Attributes
→File Formats
→Data types
Diagram of Entity-Relationship model
developed for data collecting
Data warehouse
Dat
a
Dat
aData
Results | Discussion
14
An entity (ex) may have two attributes (ax and ay) sharing
same semantics (S), even when both attributes shows distinct
text labels on data collecting
Results | Discussion
15
ex = Page data, from Facebook Insights
ax = “Lifetime Likes by City - Tupã, SP, Brazil”
ay = “Lifetime Likes by City - Bauru, SP, Brazil”
Example
The attributes “Lifetime Likes by City - Tupã, SP,
Brazil” and “Lifetime Likes by City - Bauru, SP,
Brazil” have different text labels, but share same
semantics.
Results | Discussion
16
The absence of semantics with an exception for the
availability of text labels do not ensure that attributes of two
distinct entities (ex and ey) that shares equal labels (ax),
consequently, are sharing the same formal semantics (S) on
data collecting process by external agents.
Results | Discussion
17
ax = “Impressions”
ex = Search Analytics, from Google Search Console
ey = Tweet Activity, from Twitter Analytics
Example
Both attributes share equal text labels, but this coincidence
does not ensure that the attributes have the same semantics. In
this example, each entity applied different data types to these
attributes, resulting in a mismatch on their values.
It’s a
average!
It’s a total
amount!
Final considerations
18
The data analysis
→ To identify the critical points of descriptive elements on
those datasets.
→The lack of descriptive elements in data collection process
when triggered through the available export interfaces.
Final considerations
19
To reduce this dissonance between attributes, export interfaces
can bring more semantic information bound to datasets.
→ Important information to interpret data available from
different sources.
For example text labeling rules, controlled vocabularies, and restriction
clauses.
The semantic dissonances on these entities may interfere with
the development process of relationships between attributes
from different datasets, decreasing the potential of
interoperability.
References
20
Berg, O. (2015). Collaborating in a social era: ideas, insights and models
that inspire new ways of thinking about collaboration. Göteborg:
Intranätverk.
Cornell, P. (2005). A complete guide to PivotTables: a visual approach.
Berkeley, CA : New York: Apress ; Distributed to the Book trade in the
United States by Springer-Verlag.
Date, C. J. (2016). The new relational database dictionary: a
comprehensive glossary of concepts arising in connection with the
relational model of data, with definitions and illustrative examples:
[terms, concepts, and examples]. Sebastopol, CA: O´Reilly.
Goodwin, P., & Wright, G. (2014). Decision analysis for management
judgment (5th Edition). Hoboken, New Jersey: Wiley.
Gray, J., Bosworth, A., Lyaman, A., & Pirahesh, H. (1996). Data cube: a
relational aggregation operator generalizing GROUP-BY, CROSS-TAB,
and SUB-TOTALS (pp. 152–159). IEEE Comput. Soc. Press.
https://doi.org/10.1109/ICDE.1996.492099
Ikemoto, G. S., & Marsh, J. A. (2007). Cutting Through the “Data-
Driven” Mantra: Different Conceptions of Data-Driven Decision Making.
Yearbook of the National Society for the Study of Education, 106(1),
105–131. https://doi.org/10.1111/j.1744-7984.2007.00099.x
Inmon, W. H. (2005). Building the data warehouse (4th ed). Indianapolis,
Ind: Wiley.
Kimball, R., & Ross, M. (2011). The Data Warehouse Toolkit The
Complete Guide to Dimensional Modeling. New York, United States of
America: John Wiley & Sons. Retrieved from http://nbn-
resolving.de/urn:nbn:de:101:1-2014122311140
Lebo, T., & Williams, G. T. (2010). Converting governmental datasets
into linked data. In Proceedings of the 6th International Conference on
Semantic Systems. Graz, Austria: ACM Press.
https://doi.org/10.1145/1839707.1839755
Rathod, A. (2006). A messaging system to handle semantic dissonance
(Thesis). Rochester Institute of Technology, New York. Retrieved from
http://scholarworks.rit.edu/cgi/viewcontent.cgi?article=1668&context=the
ses
Reddy, G. S., Srinivasu, R., Rao, M. P. C., & Rikkula, S. R. (2010). Data
warehousing, data mining, OLAP, OLTP technologies are essential
elements to support decision-making process in industries. International
Journal on Computer Science and Engineering, 2(9), 2865–2873.
Ross Parry, Nick Poole, & Jon Pratty. (2008). Semantic Dissonance: Do
We Need (And Do We Understand) The Semantic Web? In Toronto:
Archives & Museum Informatics. Retrieved from
http://www.archimuse.com/mw2008/papers/ parry/parry.html
Sant’Ana, R. C. G. (2016). Ciclo de vida dos dados: uma perspectiva a
partir da ciência da informação. Informação & Informação, 21(2), 116.
https://doi.org/10.5433/1981-8920.2016v21n2p116
Santos, P. L. V. A. da C., & Sant’Ana, R. C. G. (2015). Dado e
Granularidade na perspectiva da Informação e Tecnologia: uma
interpretação pela Ciência da Informação. Ciência da Informação, 42(2),
11.
References
21
Shafranovich, Y. (2005). Common Format and MIME Type for Comma-
Separated Values (CSV) Files. The Internet Society. Retrieved from
https://tools.ietf.org/html/rfc4180
Silberschatz, A., Korth, H. F., & Sudarshan, S. (2011). Database system
concepts (6th edition). New York: McGraw-Hill.
Tennison, J., Kellogg, G., & Herman, I. (2015, December 17). Model for
Tabular Data and Metadata on the Web. (J. Tennison & G. Kellogg,
Eds.). World Wide Web Consortium. Retrieved from
https://www.w3.org/TR/tabular-data-model/
Turban, E., Aronson, J. E., & Liang, T.-P. (2004). Decision Support
Systems and Intelligent Systems (7th Edition). Upper Saddle River, NJ,
USA: Prentice-Hall, Inc.
fernando (at) rodrigues.pro.br
phbisi (at) gmail.com
ricardosantana (at) marilia.unesp.br
http://dadosabertos.info

More Related Content

PDF
Knowledge Representation on the Web
PDF
An Ecosystem for Linked Humanities Data
PDF
Knowledge Graph Maintenance
PPTX
Thinking About the Making of Data
PDF
Provenance and Reuse of Open Data (PILOD 2.0 June 2014)
PPTX
The Challenge of Deeper Knowledge Graphs for Science
PDF
Knowledge Graph Maintenance
PDF
Massive Data Analysis- Challenges and Applications
Knowledge Representation on the Web
An Ecosystem for Linked Humanities Data
Knowledge Graph Maintenance
Thinking About the Making of Data
Provenance and Reuse of Open Data (PILOD 2.0 June 2014)
The Challenge of Deeper Knowledge Graphs for Science
Knowledge Graph Maintenance
Massive Data Analysis- Challenges and Applications

What's hot (20)

PDF
Prov-O-Viz: Interactive Provenance Visualization
PDF
Frequent Item set Mining of Big Data for Social Media
PDF
Massive scale analytics with Stratosphere using R
PPTX
Content + Signals: The value of the entire data estate for machine learning
PPTX
Thoughts on Knowledge Graphs & Deeper Provenance
PDF
International Collaboration Networks in the Emerging (Big) Data Science
PDF
Managing Metadata for Science and Technology Studies: the RISIS case
PPTX
From Data Search to Data Showcasing
PPTX
Data Mining
PPTX
Data Communities - reusable data in and outside your organization.
PDF
Lecture3 business intelligence
PPTX
Lect 1 introduction
PDF
Preprocessing
PPTX
Towards Knowledge Graph based Representation, Augmentation and Exploration of...
PDF
From Web Data to Knowledge: on the Complementarity of Human and Artificial In...
PPTX
Self adaptive based natural language interface for disambiguation of
PDF
11.challenging issues of spatio temporal data mining
PDF
Query Optimization Techniques in Graph Databases
PPT
18231979 Data Mining
PDF
Knowledge Graphs - The Power of Graph-Based Search
Prov-O-Viz: Interactive Provenance Visualization
Frequent Item set Mining of Big Data for Social Media
Massive scale analytics with Stratosphere using R
Content + Signals: The value of the entire data estate for machine learning
Thoughts on Knowledge Graphs & Deeper Provenance
International Collaboration Networks in the Emerging (Big) Data Science
Managing Metadata for Science and Technology Studies: the RISIS case
From Data Search to Data Showcasing
Data Mining
Data Communities - reusable data in and outside your organization.
Lecture3 business intelligence
Lect 1 introduction
Preprocessing
Towards Knowledge Graph based Representation, Augmentation and Exploration of...
From Web Data to Knowledge: on the Complementarity of Human and Artificial In...
Self adaptive based natural language interface for disambiguation of
11.challenging issues of spatio temporal data mining
Query Optimization Techniques in Graph Databases
18231979 Data Mining
Knowledge Graphs - The Power of Graph-Based Search
Ad

Similar to Identifying semantics characteristics of user’s interactions datasets through an application of a data analysis (20)

PDF
Module 2 Data Collection and Management.pdf
PDF
Big Data - Semantic expressiveness as a function of data complexity levels - ...
PDF
Big Data - Semantic expressiveness as a function of data complexity levels - ...
PPTX
Big data analyti data analytical life cycle
PPTX
Data Analytics in Industry Verticals, Data Analytics Lifecycle, Challenges of...
PPTX
Data Science presentation for explanation of numpy and pandas
PDF
Flying Blind On A Rocket Cycle: Customer-centered Product Strategy for Machin...
PDF
Creating Effective Data Visualizations in Excel 2016: Some Basics
PDF
Frameworks provide structure. The core objective of the Big Data Framework is...
PDF
Lecture2 big data life cycle
PDF
Semantic 'Radar' Steers Users to Insights in the Data Lake
PPTX
Webinar: If Your Data Could Talk, What Story Would it Tell? Would it Be a Doc...
PDF
Data Science Provenance: From Drug Discovery to Fake Fans
PDF
Introduction to Data Analytics, AKTU - UNIT-1
PDF
Getting Started with Unstructured Data
PPTX
Hetrogeneous Data handling in Big Data Analysis
DOCX
2014 11-17 crichton institute talk on open data
PDF
Moving forward data centric sciences weaving AI, Big Data & HPC
PPTX
Introduction to Big Data Analytics
PDF
Semantic 'Radar' Steers Users to Insights in the Data Lake
Module 2 Data Collection and Management.pdf
Big Data - Semantic expressiveness as a function of data complexity levels - ...
Big Data - Semantic expressiveness as a function of data complexity levels - ...
Big data analyti data analytical life cycle
Data Analytics in Industry Verticals, Data Analytics Lifecycle, Challenges of...
Data Science presentation for explanation of numpy and pandas
Flying Blind On A Rocket Cycle: Customer-centered Product Strategy for Machin...
Creating Effective Data Visualizations in Excel 2016: Some Basics
Frameworks provide structure. The core objective of the Big Data Framework is...
Lecture2 big data life cycle
Semantic 'Radar' Steers Users to Insights in the Data Lake
Webinar: If Your Data Could Talk, What Story Would it Tell? Would it Be a Doc...
Data Science Provenance: From Drug Discovery to Fake Fans
Introduction to Data Analytics, AKTU - UNIT-1
Getting Started with Unstructured Data
Hetrogeneous Data handling in Big Data Analysis
2014 11-17 crichton institute talk on open data
Moving forward data centric sciences weaving AI, Big Data & HPC
Introduction to Big Data Analytics
Semantic 'Radar' Steers Users to Insights in the Data Lake
Ad

More from Fernando de Assis Rodrigues (20)

ODP
Perspectivas e impasses na salvaguarda e preservação documental pós Medida Pr...
PPTX
Serviços de Redes Sociais On-line e a Comunicação Científica: visibilidade de...
ODP
Ficção Científica e Realidade da Coleta de Dados em Redes Sociais Online: aná...
PDF
Interseções entre Coleta de Dados e Redes Sociais Online
ODP
Ficção Científica e Realidade da Coleta de Dados em Redes Sociais Online: aná...
PPTX
2018 uel-apresentacao-coleta redes-sociais_online
PPTX
Processo de Acesso a Dados e suas fases
PPTX
Fundamentos teóricos para coleta de dados de redes sociais online
ODP
Open Source e Open Platform: potenciais catalizadores para uso de Internet da...
PDF
Coleta de Dados em Redes Sociais
PPTX
Metadados em objetos digitais: conceitos e indexação na Web
PPTX
Metadados e Interoperabilidade
PPTX
Aplicações da Teoria dos Grafos em coletas de dados
ODP
Raspagem de dados em websites governamentais
PPTX
Contextualização de conceitos teóricos no processo de coleta de dados de Rede...
PDF
Pontos de contato entre a Esfera Pública e Instituições: reflexões sobre pote...
PDF
Categorização de elementos de privacidade identificados nos termos de uso de ...
PDF
ANÁLISE DA COLETA DE DADOS EM REDES SOCIAIS: aspectos de privacidade de dados...
PPTX
ACESSO ÀS INFORMAÇÕES SOBRE AGRICULTURA FAMILIAR NA WEB
PDF
O USO DE DADOS PÚBLICOS PARA O ACOMPANHAMENTO DA ATIVIDADE PARLAMENTAR
Perspectivas e impasses na salvaguarda e preservação documental pós Medida Pr...
Serviços de Redes Sociais On-line e a Comunicação Científica: visibilidade de...
Ficção Científica e Realidade da Coleta de Dados em Redes Sociais Online: aná...
Interseções entre Coleta de Dados e Redes Sociais Online
Ficção Científica e Realidade da Coleta de Dados em Redes Sociais Online: aná...
2018 uel-apresentacao-coleta redes-sociais_online
Processo de Acesso a Dados e suas fases
Fundamentos teóricos para coleta de dados de redes sociais online
Open Source e Open Platform: potenciais catalizadores para uso de Internet da...
Coleta de Dados em Redes Sociais
Metadados em objetos digitais: conceitos e indexação na Web
Metadados e Interoperabilidade
Aplicações da Teoria dos Grafos em coletas de dados
Raspagem de dados em websites governamentais
Contextualização de conceitos teóricos no processo de coleta de dados de Rede...
Pontos de contato entre a Esfera Pública e Instituições: reflexões sobre pote...
Categorização de elementos de privacidade identificados nos termos de uso de ...
ANÁLISE DA COLETA DE DADOS EM REDES SOCIAIS: aspectos de privacidade de dados...
ACESSO ÀS INFORMAÇÕES SOBRE AGRICULTURA FAMILIAR NA WEB
O USO DE DADOS PÚBLICOS PARA O ACOMPANHAMENTO DA ATIVIDADE PARLAMENTAR

Recently uploaded (20)

PPTX
Vitamins & Minerals: Complete Guide to Functions, Food Sources, Deficiency Si...
PPTX
INTRODUCTION TO EVS | Concept of sustainability
DOCX
Q1_LE_Mathematics 8_Lesson 5_Week 5.docx
PPTX
SCIENCE10 Q1 5 WK8 Evidence Supporting Plate Movement.pptx
PPTX
Classification Systems_TAXONOMY_SCIENCE8.pptx
PDF
MIRIDeepImagingSurvey(MIDIS)oftheHubbleUltraDeepField
PDF
AlphaEarth Foundations and the Satellite Embedding dataset
PPTX
EPIDURAL ANESTHESIA ANATOMY AND PHYSIOLOGY.pptx
PPTX
microscope-Lecturecjchchchchcuvuvhc.pptx
PDF
Unveiling a 36 billion solar mass black hole at the centre of the Cosmic Hors...
PPTX
ECG_Course_Presentation د.محمد صقران ppt
PPTX
ANEMIA WITH LEUKOPENIA MDS 07_25.pptx htggtftgt fredrctvg
PDF
. Radiology Case Scenariosssssssssssssss
PPTX
Introduction to Fisheries Biotechnology_Lesson 1.pptx
PDF
IFIT3 RNA-binding activity primores influenza A viruz infection and translati...
PPT
Chemical bonding and molecular structure
PPT
protein biochemistry.ppt for university classes
PDF
ELS_Q1_Module-11_Formation-of-Rock-Layers_v2.pdf
PPTX
GEN. BIO 1 - CELL TYPES & CELL MODIFICATIONS
PPTX
neck nodes and dissection types and lymph nodes levels
Vitamins & Minerals: Complete Guide to Functions, Food Sources, Deficiency Si...
INTRODUCTION TO EVS | Concept of sustainability
Q1_LE_Mathematics 8_Lesson 5_Week 5.docx
SCIENCE10 Q1 5 WK8 Evidence Supporting Plate Movement.pptx
Classification Systems_TAXONOMY_SCIENCE8.pptx
MIRIDeepImagingSurvey(MIDIS)oftheHubbleUltraDeepField
AlphaEarth Foundations and the Satellite Embedding dataset
EPIDURAL ANESTHESIA ANATOMY AND PHYSIOLOGY.pptx
microscope-Lecturecjchchchchcuvuvhc.pptx
Unveiling a 36 billion solar mass black hole at the centre of the Cosmic Hors...
ECG_Course_Presentation د.محمد صقران ppt
ANEMIA WITH LEUKOPENIA MDS 07_25.pptx htggtftgt fredrctvg
. Radiology Case Scenariosssssssssssssss
Introduction to Fisheries Biotechnology_Lesson 1.pptx
IFIT3 RNA-binding activity primores influenza A viruz infection and translati...
Chemical bonding and molecular structure
protein biochemistry.ppt for university classes
ELS_Q1_Module-11_Formation-of-Rock-Layers_v2.pdf
GEN. BIO 1 - CELL TYPES & CELL MODIFICATIONS
neck nodes and dissection types and lymph nodes levels

Identifying semantics characteristics of user’s interactions datasets through an application of a data analysis

  • 1. Identifying semantics characteristics of user’s interactions datasets through an application of a data analysis Fernando de Assis RODRIGUES, Ph. D. Pedro Henrique Santos BISI Ricardo César Gonçalves SANT’ANA, Ph. D. Graduate Program of Information Science UNESP - São Paulo State University (Brazil)
  • 2. 2 Using data as part of the decision-making process is a reality in professional environments. We need to decide [something]. Sure! Let’s use data to support. Data Data Data Data Data Data
  • 3. The analyzed fact need to receive inputs from multiple data sources – structuring, integrating, storing, and processing the collected data into an output that supports a better understanding of the fact from data, allowing new dimensions or perspectives of analysis 3
  • 4. 4 The use of data as part of the decision-making process in several areas. Science Decision- mankingData D D D D D D Education Decision- mankingData D D D D D D Industry Decision- mankingData D D D D D D Management Decision- mankingData D D D D D D Services Decision- mankingData D D D D D D ... Decision- mankingData D D D D D D
  • 5. Data warehouse 5 DataDataDataDataDataDataDataDataData Data Analytics DataDataNew Data Look! After the data analysis, new data became available! Sounds good! But don’t forget: from data sources, all things depend.
  • 6. The <entity, attribute, and value> condition implies a use of aggregated information on these elements to assure a minimal semantic to understand what is available, notably related with steps of obtaining data collections from data sources 6
  • 7. Goal 7 To identify the semantics characteristics of data attributes at the moment of collecting, from dataset's structures found on data export interfaces on user’s interactions analysis tools, on Internet communication channels, and on web analytics data tools involved in a scientific journal management, through an application of a process of data analysis and data modeling techniques.
  • 8. Methodology and Method 8 About the observed phenomenon Investigation of datasets, entities, and attributes related to the interaction between users and communications channels from a scientific journal. Nature Qualitative-quantitative research. Purpose Exploratory analysis to identify characteristics of how data are available and structured on these data resources.
  • 9. The Path adopted on investigation An exploratory research of data export interfaces to collect information about available data and metadata. About methods (i) Data extraction and spreadsheet data handling with Python 3 programming language. (ii) Applying of the Entity-Relationship model in generated data from analytics. (iii) Use of data structures oriented to making-decision processing (OLAP). Methodology and Method 9
  • 10. Data universe User’s interactions data from: → Internal data sources → External data sources Methodology and Method 10
  • 11. Data Sample User’s interactions data* from RECoDAF - Electronic Journal Digital Skills for Family Farming → Internal data sources: Open Journal Systems → External data sources: Facebook Insights, Google Analytics, Google Search Console, and Twitter Analytics Methodology and Method 11 * Data collected on September 2017. Available at http://dadosabertos.info/data/collection_recodaf_2017
  • 12. Results | Discussion 12 255 exportable datasets, using 5 file formats.
  • 13. Results | Discussion 13 Find on data sources information about: →Services →Resources →Datasets →Attributes →File Formats →Data types Diagram of Entity-Relationship model developed for data collecting Data warehouse Dat a Dat aData
  • 14. Results | Discussion 14 An entity (ex) may have two attributes (ax and ay) sharing same semantics (S), even when both attributes shows distinct text labels on data collecting
  • 15. Results | Discussion 15 ex = Page data, from Facebook Insights ax = “Lifetime Likes by City - Tupã, SP, Brazil” ay = “Lifetime Likes by City - Bauru, SP, Brazil” Example The attributes “Lifetime Likes by City - Tupã, SP, Brazil” and “Lifetime Likes by City - Bauru, SP, Brazil” have different text labels, but share same semantics.
  • 16. Results | Discussion 16 The absence of semantics with an exception for the availability of text labels do not ensure that attributes of two distinct entities (ex and ey) that shares equal labels (ax), consequently, are sharing the same formal semantics (S) on data collecting process by external agents.
  • 17. Results | Discussion 17 ax = “Impressions” ex = Search Analytics, from Google Search Console ey = Tweet Activity, from Twitter Analytics Example Both attributes share equal text labels, but this coincidence does not ensure that the attributes have the same semantics. In this example, each entity applied different data types to these attributes, resulting in a mismatch on their values. It’s a average! It’s a total amount!
  • 18. Final considerations 18 The data analysis → To identify the critical points of descriptive elements on those datasets. →The lack of descriptive elements in data collection process when triggered through the available export interfaces.
  • 19. Final considerations 19 To reduce this dissonance between attributes, export interfaces can bring more semantic information bound to datasets. → Important information to interpret data available from different sources. For example text labeling rules, controlled vocabularies, and restriction clauses. The semantic dissonances on these entities may interfere with the development process of relationships between attributes from different datasets, decreasing the potential of interoperability.
  • 20. References 20 Berg, O. (2015). Collaborating in a social era: ideas, insights and models that inspire new ways of thinking about collaboration. Göteborg: Intranätverk. Cornell, P. (2005). A complete guide to PivotTables: a visual approach. Berkeley, CA : New York: Apress ; Distributed to the Book trade in the United States by Springer-Verlag. Date, C. J. (2016). The new relational database dictionary: a comprehensive glossary of concepts arising in connection with the relational model of data, with definitions and illustrative examples: [terms, concepts, and examples]. Sebastopol, CA: O´Reilly. Goodwin, P., & Wright, G. (2014). Decision analysis for management judgment (5th Edition). Hoboken, New Jersey: Wiley. Gray, J., Bosworth, A., Lyaman, A., & Pirahesh, H. (1996). Data cube: a relational aggregation operator generalizing GROUP-BY, CROSS-TAB, and SUB-TOTALS (pp. 152–159). IEEE Comput. Soc. Press. https://doi.org/10.1109/ICDE.1996.492099 Ikemoto, G. S., & Marsh, J. A. (2007). Cutting Through the “Data- Driven” Mantra: Different Conceptions of Data-Driven Decision Making. Yearbook of the National Society for the Study of Education, 106(1), 105–131. https://doi.org/10.1111/j.1744-7984.2007.00099.x Inmon, W. H. (2005). Building the data warehouse (4th ed). Indianapolis, Ind: Wiley. Kimball, R., & Ross, M. (2011). The Data Warehouse Toolkit The Complete Guide to Dimensional Modeling. New York, United States of America: John Wiley & Sons. Retrieved from http://nbn- resolving.de/urn:nbn:de:101:1-2014122311140 Lebo, T., & Williams, G. T. (2010). Converting governmental datasets into linked data. In Proceedings of the 6th International Conference on Semantic Systems. Graz, Austria: ACM Press. https://doi.org/10.1145/1839707.1839755 Rathod, A. (2006). A messaging system to handle semantic dissonance (Thesis). Rochester Institute of Technology, New York. Retrieved from http://scholarworks.rit.edu/cgi/viewcontent.cgi?article=1668&context=the ses Reddy, G. S., Srinivasu, R., Rao, M. P. C., & Rikkula, S. R. (2010). Data warehousing, data mining, OLAP, OLTP technologies are essential elements to support decision-making process in industries. International Journal on Computer Science and Engineering, 2(9), 2865–2873. Ross Parry, Nick Poole, & Jon Pratty. (2008). Semantic Dissonance: Do We Need (And Do We Understand) The Semantic Web? In Toronto: Archives & Museum Informatics. Retrieved from http://www.archimuse.com/mw2008/papers/ parry/parry.html Sant’Ana, R. C. G. (2016). Ciclo de vida dos dados: uma perspectiva a partir da ciência da informação. Informação & Informação, 21(2), 116. https://doi.org/10.5433/1981-8920.2016v21n2p116 Santos, P. L. V. A. da C., & Sant’Ana, R. C. G. (2015). Dado e Granularidade na perspectiva da Informação e Tecnologia: uma interpretação pela Ciência da Informação. Ciência da Informação, 42(2), 11.
  • 21. References 21 Shafranovich, Y. (2005). Common Format and MIME Type for Comma- Separated Values (CSV) Files. The Internet Society. Retrieved from https://tools.ietf.org/html/rfc4180 Silberschatz, A., Korth, H. F., & Sudarshan, S. (2011). Database system concepts (6th edition). New York: McGraw-Hill. Tennison, J., Kellogg, G., & Herman, I. (2015, December 17). Model for Tabular Data and Metadata on the Web. (J. Tennison & G. Kellogg, Eds.). World Wide Web Consortium. Retrieved from https://www.w3.org/TR/tabular-data-model/ Turban, E., Aronson, J. E., & Liang, T.-P. (2004). Decision Support Systems and Intelligent Systems (7th Edition). Upper Saddle River, NJ, USA: Prentice-Hall, Inc. fernando (at) rodrigues.pro.br phbisi (at) gmail.com ricardosantana (at) marilia.unesp.br http://dadosabertos.info