BIG DATA
Lesson 1
Study: Jean-Antoine Moreau (Engineer - Lecturer)
© Jean-Antoine Moreau 
copying and reproduction prohibited 
Managing my copyright ADAGP.
"Information is the oil of the 21st century, and 
analytics is the combustion engine." 
Peter Sondergaard 
Contact http://www.jean-antoine-moreau.fr.nf JAM 2
• Choose;
• Transform;
• Convert;
• Scrutinize;
• Analyze;
• acquire knowledge by type (kind) of data;
• acquire knowledge by type (kind) of content;
• acquire knowledge by type (kind) of user;
More information for
a global business
• Data Management 
• Data Storage 
• Data Analysis 
• Data Efficiency 
• Data Performance
• Data Processing 
are all impacted by the large volume of data
Programming Model 
Standardization 
Data efficiency
Data has its own productivity
Efficiency 
Cost 
Process 
DATA 
Input > SYSTEM > Output
binary join 
multidimensional join 
recursive join 
between objects of a database
and also between databases. 
joining cycle 
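The three kinds of join named above can be sketched in Python as follows. This is only an illustration: the tables (customers, orders, products), the category links and all field names are hypothetical, and "multidimensional join" is read here as a multi-way join built by chaining binary joins, which is an assumption.

```python
# Hypothetical in-memory tables; every name and value is illustrative only.
customers = [{"cid": 1, "name": "Ana"}, {"cid": 2, "name": "Ben"}]
orders = [{"oid": 10, "cid": 1, "pid": 100}, {"oid": 11, "cid": 2, "pid": 101}]
products = [{"pid": 100, "label": "disk"}, {"pid": 101, "label": "cpu"}]
# Parent links of a category tree, used by the recursive join.
parent_of = {"ssd": "disk", "disk": "hardware", "hardware": None}


def binary_join(left, right, key):
    """Hash join of two tables on a shared key."""
    index = {}
    for rrow in right:
        index.setdefault(rrow[key], []).append(rrow)
    return [{**lrow, **rrow} for lrow in left for rrow in index.get(lrow[key], [])]


def multiway_join(tables_and_keys):
    """Join several tables by chaining binary joins: [(table, key), ...].

    The key of the first entry is ignored (there is nothing to join it to).
    """
    result, _ = tables_and_keys[0]
    for table, key in tables_and_keys[1:]:
        result = binary_join(result, table, key)
    return result


def ancestors(node, parents):
    """Recursive join of a table with itself: follow parent links to the root."""
    parent = parents.get(node)
    return [] if parent is None else [parent] + ancestors(parent, parents)
```

For example, multiway_join([(customers, None), (orders, "cid"), (products, "pid")]) returns one row per order, enriched with the customer name and the product label.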
Data Processing 
Cycle 
Data Processing 
Joining cycle 
Diagram: level optimization across levels N, N+1, N+2, N+3; accuracy; meaning, direction
• Data levels are nested;
• Data is included in a database, and the database
is included in a management system;
• Management systems are themselves nested within
one another;
• Axes:
– Data;
– Database;
– Management system;
Each with different levels.
Toolbox for Parallel Computing 
parallel computing 
distributed arrays 
Algorithms 
transmission functions 
communication > result 
• Using a virtual matrix built from data stored across a cluster.
• Using:
– Parallel computation;
– Aggregated data sets, optimizing file I/O performance;
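A minimal sketch of parallel aggregation, assuming the data set can be split into independent chunks whose partial results are then combined. The function names and the chunk size are illustrative and not from the lesson. A thread pool from the Python standard library is used to keep the example short; a process pool or a cluster framework would be the usual choice for heavy computation.

```python
from concurrent.futures import ThreadPoolExecutor


def chunked(values, size):
    """Split a list into consecutive chunks of at most `size` items."""
    return [values[i:i + size] for i in range(0, len(values), size)]


def partial_aggregate(chunk):
    """Aggregate one chunk into a (sum, count) pair."""
    return sum(chunk), len(chunk)


def parallel_mean(values, chunk_size=4, workers=2):
    """Aggregate the chunks concurrently, then combine the partial results."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        partials = list(pool.map(partial_aggregate, chunked(values, chunk_size)))
    total = sum(s for s, _ in partials)
    count = sum(c for _, c in partials)
    if count == 0:
        raise ValueError("cannot aggregate an empty data set")
    return total / count
```

Only the small (sum, count) pairs travel between workers, not the raw chunks, which is the point of aggregating before combining.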
Data volume in relation to data processing
• Scenario; 
• Optimization; 
• Both models; 
• Algorithms; 
Complex operations 
Complex calculations 
Dimensions Simulation 
Process 
Focus 
Efficiency 
Goal 
Optimize the existing 
Reduce the running time 
Simulation 
Goal 
Scaling simulation 
Functional simulation
Optimization 
SQL database
New algorithm to query 
Method 
Modelling 
Algorithm 
Simulation 
Simulation process 
Simulation effects 
simulation Data 
n 
n-1 
result 
• Using: 
– Branch (data tree, software tree); 
– Node; 
– Aggregate; 
– Specific node; 
– Parameters; 
– Historical data;
• data tree; 
• decomposition tree; 
• general category; 
• Subcategory; 
• critical category; 
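A data tree with categories and subcategories can be sketched as below. This is an illustration under assumptions: the Node class, its fields and the example categories are hypothetical, and summation is only one possible aggregation.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    """One node of a data tree: a category with a value and subcategories."""
    name: str
    value: float = 0.0
    children: list = field(default_factory=list)

    def aggregate(self):
        """Total of this node's own value plus all of its descendants."""
        return self.value + sum(child.aggregate() for child in self.children)

    def find(self, name):
        """Depth-first search for a node by name; returns None if absent."""
        if self.name == name:
            return self
        for child in self.children:
            hit = child.find(name)
            if hit is not None:
                return hit
        return None


# Hypothetical decomposition: general category > subcategories.
tree = Node("sales", children=[
    Node("europe", children=[Node("france", 3.0), Node("spain", 2.0)]),
    Node("asia", 5.0),
])
```

Calling tree.aggregate() totals the whole tree, while tree.find("europe").aggregate() totals a single branch.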
• Structuring data approach; 
• Historical order of the information; 
• Using:
– Aggregate data;
– Hierarchical data tree;
– Taking different rates into account;
• Simulation 
• Process 
• Delays
• Time 
Aggregating historical data
• Including:
– Active conditions;
– Behaviours;
• Running:
– Multiple times;
– Different rates for the same period;
Simulation output 
Described in terms 
Profit Value 
Specification 
• Simulation 
• Requirement 
• Specification 
methodological process
Inheritance 
Level 
Node 
Optimization 
Optimization
– Database;
– SQL query;
– Database connection;
– Using the database in memory:
• Cache memory;
• Virtual memory;
• Physical (internal / external) memory;
– Data architecture:
• Physical;
• Virtual;
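A minimal sketch of three of the optimization points above (database held in memory, one reused connection, parameterized SQL query), using SQLite from the Python standard library. The table name, column names and sample values are hypothetical.

```python
import sqlite3


def build_database():
    """Create an in-memory database with a small indexed table."""
    conn = sqlite3.connect(":memory:")  # the whole database lives in RAM
    conn.execute("CREATE TABLE measures (node TEXT, period INTEGER, value REAL)")
    conn.executemany(
        "INSERT INTO measures VALUES (?, ?, ?)",
        [("a", 1, 1.5), ("a", 2, 2.5), ("b", 1, 4.0)],
    )
    # The index lets lookups by node avoid a full table scan.
    conn.execute("CREATE INDEX idx_measures_node ON measures (node)")
    conn.commit()
    return conn


def total_for(conn, node):
    """Sum the values of one node through the same open connection."""
    row = conn.execute(
        "SELECT SUM(value) FROM measures WHERE node = ?", (node,)
    ).fetchone()
    return row[0]  # None when the node has no rows
```

Reusing the connection returned by build_database() for every query avoids paying the connection cost repeatedly.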
Diagram: physical database, database stored objects, database tables, memory; time and accuracy of data processing
Simulation process 
Volume 
IT architecture capacity 
Data sources 
Define 
The data tree structure 
The scaling axis 
The scrolling conditions 
inputs Data Processing outputs 
node 
Time - Period 
financial resources
Resources - Capacity
Impact 
Algorithm 
Decision Tree 
Decision Matrix 
Choice of the aggregation type 
aggregation 
Data Database Management System 
Database 
• Technical constraints:
– Aggregated data;
– Linked data;
– Joined data;
•Traceability; 
•Coherence; 
•Accuracy; 
•Consistency; 
• Technical Constraint 
• Optimized 
– Models; 
– Algorithms; 
Using Data Science
Customer Model Scaling Simulation Model 
• Simulations suffer from high time complexity;
• Data is retrieved from the database and the memory cache;
• The same data is redundantly calculated and
aggregated several times;
Goal: avoid re-aggregation and recalculation;
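One common way to avoid re-aggregation and recalculation is to cache each aggregate the first time it is computed. The sketch below uses memoization from the Python standard library. The node names, values and call counter are hypothetical, and the lesson does not state which mechanism the author has in mind.

```python
from functools import lru_cache

# Hypothetical structure: "a2" is shared by two parents, so a naive
# traversal would aggregate it twice.
CHILDREN = {"root": ("a", "b"), "a": ("a1", "a2"), "b": ("a2",)}
LEAF_VALUES = {"a1": 1.0, "a2": 2.0}
calls = {"count": 0}  # counts real computations, for illustration only


@lru_cache(maxsize=None)
def aggregate(node):
    """Sum of the leaf values under `node`; each node is computed once."""
    calls["count"] += 1
    if node in LEAF_VALUES:
        return LEAF_VALUES[node]
    return sum(aggregate(child) for child in CHILDREN[node])
```

Once aggregate("root") has run, asking again for any node is answered from the cache, so calls["count"] stops growing. The cache must be cleared with aggregate.cache_clear() whenever the underlying data changes.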
Data Simulation 
tagging 
History decision-making 
mechanism 
Prepare the 
simulation 
Specification 
• Data
– Variable;
– Constant;
– Magnitude;
– Description
• Tree path;
• Historical;
• Access method;
• Condition;
• Dependent;
• …
• Algorithm
Prepare the
simulation
Specification
Business
Data 
indexing 
Business needs
Process Cycle Time 
Your Model 
Use for Simulation
Data Base / Query 
I/O physical logical virtualizing 
Data System Management 
data Data Base 
Level
Data collection strategy 
Business Case 
Use Case 
Data analysis 
Business Case 
Predictive Model 
Data collection strategy 
Data mining, inference, statistical model
Business Case 
Data Strategy Predictive node Data Exploring Process 
IT Infrastructure 
Business Case 
Open Data Meta Data Linked Data; 
Aggregated Data; 
Web Data; 
Business Case
Database, Data Mining
• Data System Database;
• Data Mart;
• Data Mashup Software;
• Business case (Use Case) 
– Approach : 
• Community; 
• Dynamic; 
• Static; 
• geolocated; 
• contextual; 
• time; 
• trades; 
• Using the right information at the right time.
• The right information drives customer satisfaction.
Internet 
e-commerce 
computer 
application 
mail 
e-health 
e-finance 
mobile 
social network
Data Science 
scalability availability performance 
The right information at the right time
• Properties (ACID):
– Atomicity;
– Consistency;
– Isolation;
– Durability;
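The first of these transaction properties, atomicity, can be illustrated with SQLite from the Python standard library: either both updates of a transfer are applied, or neither is. The accounts table, the account names and the amounts are hypothetical.

```python
import sqlite3


def make_accounts():
    """Create an in-memory database with two hypothetical accounts."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE accounts (name TEXT PRIMARY KEY, balance REAL)")
    conn.executemany("INSERT INTO accounts VALUES (?, ?)", [("a", 100.0), ("b", 0.0)])
    conn.commit()  # make the initial state permanent before any transfer
    return conn


def balance(conn, name):
    """Current balance of one account."""
    return conn.execute(
        "SELECT balance FROM accounts WHERE name = ?", (name,)
    ).fetchone()[0]


def transfer(conn, source, target, amount):
    """Move `amount` between accounts atomically: both updates or neither."""
    try:
        with conn:  # commits on success, rolls back if an exception escapes
            conn.execute(
                "UPDATE accounts SET balance = balance - ? WHERE name = ?",
                (amount, source),
            )
            conn.execute(
                "UPDATE accounts SET balance = balance + ? WHERE name = ?",
                (amount, target),
            )
            if balance(conn, source) < 0:
                raise ValueError("insufficient funds")
        return True
    except ValueError:
        return False
```

A transfer that would overdraw the source account returns False and leaves both balances exactly as they were.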
Using 
Real time 
with contextualization 
Online Transaction Processing (OLTP) 
• Currently
– Database management system (DBMS):
• structured data;
• transactional applications;
• SQL interface;
• security;
standardization
Object Data Management Group
• OPEN DATA : 
– unstructured data; 
– real-time data; 
– questioning and investigation by machine; 
– collaborative sharing; 
volume 
new algorithm 
sharing 
analysis 
virtualization 
research 
capture 
volume 
new data administration process. 
open data; 
accessible data; 
reusable data; 
Open Data 
new control mechanisms of coherence 
• Define a scale of data quality:
– Unfiltered, unformatted data sets;
– Data provided in a structured format;
– Data free to be used and exploited without a license;
– Data identified by a URL (Uniform Resource Locator);
– Data related to actors that are themselves associated with a context;
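Such a scale can be represented as an ordered enumeration, so that data sets can be compared and filtered by level. The sketch assumes the five levels are ordered from lowest to highest as listed; the level names, the helper function and the example data sets are hypothetical.

```python
from enum import IntEnum


class DataQuality(IntEnum):
    """Quality levels, assumed to increase in the order they are listed."""
    UNFILTERED = 1      # unfiltered, unformatted data sets
    STRUCTURED = 2      # provided in a structured format
    LICENSE_FREE = 3    # free to use and exploit without a license
    URL_IDENTIFIED = 4  # identified by a URL
    CONTEXT_LINKED = 5  # related to actors associated with a context


def usable(datasets, minimum):
    """Names of the data sets at or above `minimum`, best level first."""
    kept = [(level, name) for name, level in datasets.items() if level >= minimum]
    return [name for level, name in sorted(kept, key=lambda p: (-p[0], p[1]))]


# Hypothetical catalogue of data sets and their assessed level.
catalogue = {
    "logs": DataQuality.UNFILTERED,
    "census": DataQuality.URL_IDENTIFIED,
    "prices": DataQuality.STRUCTURED,
}
```

With this catalogue, usable(catalogue, DataQuality.STRUCTURED) keeps "census" and "prices" and drops "logs".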
The use of open data (Web) cannot be controlled by
the producer.
Using open data
Data warehousing
Using open data
Data warehouse
Applying the management rules appropriate to each
business line.
Data update using a SPARQL service
(SPARQL Protocol and RDF Query Language)
Using the Semantic Web
(Web of Data)
Study and research in the classic mode:
• Problem statement;
• Intuition - deduction;
• Validation:
• experiment;
• simulation;
• computation;
Study and research in a Big Data context:
• Computer analysis:
• Identification;
• Correlation;
• Proximity map (geolocation);
• Generation of hypotheses;
• Options;
• Emergence of new solutions;
• Experimentation;
• Highlighting correlations;
• Bringing out aggregations;
• Searching for an explanatory model:
– Implementation;
– Contextual model;
• Direction of the work;
• Using open formats:
– PDF;
– CSV;
– HTML;
– XML;
– RDF;
– RSS;
– Atom;
– JSON (JavaScript Object Notation);
– … 
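As an illustration of why open formats matter, data published in one of them can be converted to another with standard tools alone. The sketch below turns CSV text into JSON using only the Python standard library; the function name and the sample data are hypothetical.

```python
import csv
import io
import json


def csv_to_json(csv_text):
    """Parse CSV text with a header row and return the rows as a JSON array."""
    rows = list(csv.DictReader(io.StringIO(csv_text)))
    return json.dumps(rows, ensure_ascii=False)


# Hypothetical sample: one header row, one data row.
sample = "city,population\nParis,2100000\n"
```

Calling csv_to_json(sample) yields a JSON array with one object per CSV row. All values stay strings, because CSV carries no type information.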
Big Data Lesson 1 Conference Jean-Antoine Moreau

  • 1. BIG DATA Lesson 1 Study: Jean-Antoine Moreau (Engineer - Lecturer) © Jean-Antoine Moreau copying and reproduction prohibited Managing my copyright ADAGP.
  • 2. "Information is the oil of the 21st century, and analytics is the combustion engine." Peter Sondergaard Contact http://www.jean-antoine-moreau.fr.nf
  • 3. Big Data • Choose; • Transform; • Convert; • Scrutinize; • Analyze;
  • 4. Big Data • acquire knowledge by type (kind) of data; • acquire knowledge by type (kind) of content; • acquire knowledge by type (kind) of user;
  • 5. Big Data More information for a Global Business
  • 6. Big Data • Data Management
  • 7. Big Data • Data Storage
  • 8. Big Data • Data Analysis
  • 9. Big Data • Data Efficiency
  • 10. Big Data • Data Performance
  • 11. Big Data • Data Processing
  • 12. Big Data are impacted by the big volume
  • 13. Big Data Programming Model Standardization
  • 14. Big Data The data's efficiency: the data have a productivity
  • 15. Big Data Efficiency Cost Process DATA
  • 16. Big Data Input Output SYSTEM
  • 17. Big Data binary join, multidimensional join, recursive join of database objects, and also between databases.
  • 18. Big Data binary join, multidimensional join, recursive join of database objects, and also between databases. joining cycle
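The slides name binary and recursive joins without showing one. The following is a minimal sketch, assuming a relational store reachable through Python's standard `sqlite3` module; the tables `customer`, `purchase` and `category` and all their rows are hypothetical and are not taken from the lesson. The binary join matches two tables on a key, and the recursive join walks a parent/child table with a `WITH RECURSIVE` query.

```python
import sqlite3

# Hypothetical in-memory schema, used only to illustrate the two join kinds.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE customer (id INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE purchase (customer_id INTEGER, amount REAL);
    CREATE TABLE category (id INTEGER PRIMARY KEY, parent_id INTEGER, label TEXT);
    INSERT INTO customer VALUES (1, 'Ada'), (2, 'Bob');
    INSERT INTO purchase VALUES (1, 10.0), (1, 5.0), (2, 7.5);
    INSERT INTO category VALUES (1, NULL, 'root'), (2, 1, 'child'), (3, 2, 'grandchild');
""")

# Binary join: two tables matched on a key, then aggregated per customer.
binary_join = conn.execute("""
    SELECT c.name, SUM(p.amount)
    FROM customer AS c JOIN purchase AS p ON p.customer_id = c.id
    GROUP BY c.name ORDER BY c.name
""").fetchall()

# Recursive join: the category table is joined to itself level by level,
# starting from the rows that have no parent.
recursive_join = conn.execute("""
    WITH RECURSIVE descendants(id, label, depth) AS (
        SELECT id, label, 0 FROM category WHERE parent_id IS NULL
        UNION ALL
        SELECT c.id, c.label, d.depth + 1
        FROM category AS c JOIN descendants AS d ON c.parent_id = d.id
    )
    SELECT label, depth FROM descendants ORDER BY depth
""").fetchall()

print(binary_join)
print(recursive_join)
```

A join between databases would follow the same pattern once both sources are attached to one connection; that step is left out of this sketch.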
  • 19. Big Data Data Processing Cycle
  • 20. Big Data Data Processing Joining cycle
  • 21. Big Data Levels: N, N+1, N+2, N+3 Accuracy, meaning, direction Level optimization
  • 22. Big Data • Data levels are included in one another; • Data are included in a database, and the database is included in a management system; • Management systems are included in one another;
  • 23. Big Data • Axis of: – Data; – Database; – Management system; Each with different levels.
  • 24. Big Data Toolbox for Parallel Computing parallel computing distributed arrays Algorithms transmission functions communication > result
  • 25. Big Data • Using a virtual matrix from data storage across a cluster.
  • 26. Big Data • Using: – Performing parallel computation; • aggregate data set; optimizing file I/O performance
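As a rough illustration of parallel computation over an aggregated data set, the sketch below splits a list into chunks, aggregates each chunk in a worker, and combines the partial results. It is an assumption-laden toy: a thread pool on one machine stands in for a cluster, and the helper names `split_into_chunks`, `partial_aggregate` and `parallel_mean` are hypothetical.

```python
from concurrent.futures import ThreadPoolExecutor


def split_into_chunks(values, n_chunks):
    """Split a list into at most n_chunks contiguous pieces."""
    size = max(1, -(-len(values) // n_chunks))  # ceiling division
    return [values[i:i + size] for i in range(0, len(values), size)]


def partial_aggregate(chunk):
    """Aggregate one chunk independently: return (sum, count)."""
    return (sum(chunk), len(chunk))


def parallel_mean(values, n_workers=4):
    """Mean of a non-empty list, computed from per-chunk partial aggregates."""
    chunks = split_into_chunks(values, n_workers)
    with ThreadPoolExecutor(max_workers=n_workers) as pool:
        partials = list(pool.map(partial_aggregate, chunks))
    total = sum(s for s, _ in partials)
    count = sum(n for _, n in partials)
    return total / count


data = list(range(1, 101))
print(parallel_mean(data))
```

Only the small `(sum, count)` pairs travel back to the caller, which is the same idea that keeps communication cheap when the chunks live on different nodes.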
  • 27. Big Data Data volume relation Data Processing
  • 28. Big Data • Scenario; • Optimization; • Both models; • Algorithms;
  • 29. Big Data Complex operations Complex calculations Dimensions Simulation Process Focus Efficiency
  • 30. Big Data Goal Optimize the existing Reduce the running time Simulation
  • 31. Big Data Goal Scaling simulation Functional simulation
  • 32. Big Data Optimization SQL Database New algorithm to query
  • 33. Big Data Method Modelling Algorithm
  • 34. Big Data Simulation Simulation process Simulation effects
  • 35. Big Data simulation Data n n-1 result
  • 36. Big Data • Using: – Branch (data tree, software tree); – Node; – Aggregate; – Specific node; – Parameters; – Historical data;
  • 37. Big Data • data tree; • decomposition tree; • general category; • subcategory; • critical category;
  • 38. Big Data • Structuring data approach; • Historical order of the information;
  • 39. Big Data • Structuring data approach; • Historical order of the information; • Using: – Aggregate data; – Hierarchical data tree; – Taking into account different rates;
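One way to picture aggregation over a hierarchical data tree is a recursive sum from the leaves up to a general category. This is a minimal sketch under stated assumptions: the `Node` class, the category names and the values are all hypothetical, chosen only to show branch, node and aggregate together.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    """One node of a data tree: its own value plus child nodes (subcategories)."""
    name: str
    value: float = 0.0
    children: list = field(default_factory=list)


def aggregate(node):
    """Total of a branch: the node's own value plus the totals of its children."""
    return node.value + sum(aggregate(child) for child in node.children)


# Hypothetical tree: a general category with two branches.
tree = Node("all", children=[
    Node("sales", children=[Node("online", 120.0), Node("retail", 80.0)]),
    Node("support", 30.0),
])

print(aggregate(tree))              # total for the general category
print(aggregate(tree.children[0]))  # total for the "sales" branch only
```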
  • 40. Big Data • Simulation • Process • Delays • Time
  • 41. Big Data Aggregating historical data • Including: – Active condition; – Behaviours; • Running: – Multiple times; – Different rates for the same period;
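Running a simulation several times with different rates for the same period can be sketched as a loop over rates. The slide does not say what kind of rate is meant, so the sketch assumes a simple per-period growth rate; the function `simulate`, the starting value and the rates are hypothetical.

```python
def simulate(initial, rate, periods):
    """Apply a constant per-period growth rate (an assumed model) and keep the history."""
    value = initial
    history = [value]
    for _ in range(periods):
        value = value * (1 + rate)
        history.append(value)
    return history


# Same starting point and same period, several candidate rates.
rates = [0.01, 0.02, 0.05]
results = {rate: simulate(100.0, rate, periods=3)[-1] for rate in rates}

for rate, final_value in results.items():
    print(f"rate={rate:.2f} final={final_value:.4f}")
```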
  • 42. Big Data Simulation output Described in terms of: Profit Value Specification
  • 43. Big Data • Simulation • Requirement • Specification methodological process
  • 44. Big Data Inheritance Level Node
  • 45. Big Data Inheritance Level Node Optimization
  • 46. Big Data Optimization – Database; – SQL query; – Database connection; – Using the database in memory; • Cache memory; • Virtual memory; • Physical (internal / external) memory;
  • 47. Big Data Optimization – Database; – SQL query; – Database connection; – Using the database in memory; • Cache memory; • Virtual memory; • Physical (internal / external) memory; – Data architecture; • Physical; • Virtual;
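Two of the optimization levers listed here, keeping the database in memory and tuning the SQL query, can be tried out with the standard `sqlite3` module. The sketch below is illustrative only: the `event` table, its generated rows and the index name `idx_event_user` are hypothetical, and the exact wording of the query plan depends on the SQLite version.

```python
import sqlite3

# ":memory:" keeps the whole database in RAM for the lifetime of the connection.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE event (user_id INTEGER, amount REAL)")
conn.executemany(
    "INSERT INTO event VALUES (?, ?)",
    [(i % 100, float(i)) for i in range(10_000)],
)

# An index on the filtered column lets the engine search instead of scanning.
conn.execute("CREATE INDEX idx_event_user ON event (user_id)")

query = "SELECT COUNT(*), SUM(amount) FROM event WHERE user_id = ?"

# EXPLAIN QUERY PLAN reports how SQLite intends to run the query;
# the human-readable detail is the last column of each row.
plan = conn.execute("EXPLAIN QUERY PLAN " + query, (7,)).fetchall()
plan_text = " ".join(row[-1] for row in plan)

count, total = conn.execute(query, (7,)).fetchone()
print(plan_text)
print(count, total)
```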
  • 48. Big Data Physical Database Database stored Objects Database Tables Database time accuracy of data processing memory
  • 49. Big Data Simulation process Volume IT architecture capacity Data sources
  • 50. Big Data Define The data tree structure The scaling axis The scrolling conditions
  • 51. Big Data inputs Data Processing outputs node Time - Period financial resources
  • 52. Big Data Resources - Capacity Impact Algorithm Decision Tree Decision Matrix Choice of the aggregation type
  • 53. Big Data Aggregation Data Database Management System Database
  • 54. Big Data Aggregation Data Database Management System Database
  • 55. Big Data Aggregation Data Database Management System Database
  • 56. Big Data • Technical Constraint: – Aggregated data; – Linked data; – Joined data;
  • 57. Big Data Technical Constraint – Aggregated data; – Linked data; – Joined data; • Traceability; • Coherence; • Accuracy; • Consistency;
  • 58. Big Data • Technical Constraint • Optimized – Models; – Algorithms;
  • 59. Big Data Using the Data Science Customer Model Scaling Simulation Model
  • 60. Big Data • Simulations suffer from high time complexity; • Data retrieval from the database and the memory cache; • The same data is redundantly calculated and aggregated several times; • The aim is to avoid reaggregation and recalculation;
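Avoiding reaggregation across simulation runs is commonly done by caching the aggregate the first time it is computed. The sketch below uses `functools.lru_cache` from the standard library as one possible mechanism; it is not presented as the lesson's own method, and the `RAW` data, `aggregate_region` and `run_simulation` are hypothetical.

```python
from functools import lru_cache

# Hypothetical raw data per region; in practice this would come from a database.
RAW = {"north": (3, 4, 5), "south": (10, 20)}


@lru_cache(maxsize=None)
def aggregate_region(region):
    """Aggregate one region; the result is cached after the first call."""
    return sum(RAW[region])


def run_simulation(rate):
    """One simulation run: reuse the cached aggregates, then apply a rate."""
    return sum(aggregate_region(region) for region in RAW) * (1 + rate)


outputs = [run_simulation(rate) for rate in (0.0, 0.1, 0.2)]
info = aggregate_region.cache_info()

print(outputs)
print(info.misses, info.hits)  # each region is aggregated once, then served from cache
```

The cache is only valid while the underlying data does not change; after an update, `aggregate_region.cache_clear()` forces the next run to recompute.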
  • 61. Big Data Data Simulation tagging History decision-making mechanism
  • 62. Big Data Prepare the simulation Specification
  • 63. Big Data • Data – Variable; – Constant; – Magnitude; – Description • Tree path; • Historical; • Access method; • Condition; • Dependent; • … • Algorithm Prepare the simulation Specification Business
  • 64. Big Data • Data – Variable; – Constant; – Magnitude; – Description Prepare the simulation Specification Business • Tree path; • Historical; • Access method; • Condition; • Dependent; • … • Algorithm
  • 65. Big Data Data indexing Business needs
  • 66. Big Data Process Cycle Time Your Model Use for Simulation
  • 67. Big Data Database / Query I/O physical logical virtualizing Data System Management data Database Level
  • 68. Big Data Data collection strategy Business Case Use Case Data analysis
  • 69. Big Data Business Case Predictive Model Data collection strategy Data mining Inference Statistical Model
  • 70. Big Data Business Case Data Strategy Predictive node Data Exploring Process IT Infrastructure
  • 71. Big Data Business Case Open Data Meta Data Linked Data; Aggregated Data; Web Data;
  • 72. Big Data Business Case Database Data Mining • Data System Database; • Data Mart; • Data Mashup Software;
  • 73. Big Data • Business case (Use Case) – Approach: • Community; • Dynamic; • Static; • geolocated; • contextual; • time; • trades;
  • 74. Big Data • Using the right information at the right time. • The right information makes satisfaction efficient.
  • 75. Big Data Internet e-commerce computer application mail e-health e-finance mobile social network
  • 76. Big Data Data Science scalability availability performance The right information at the right time
  • 77. Big Data • Properties: – atomicity; – consistency; – isolation; – durability;
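Atomicity, the first of these properties, means that a transaction either applies completely or not at all. A small sketch with the standard `sqlite3` module shows the effect; the `account` table, the balances and the `transfer` helper are hypothetical. Using the connection as a context manager commits on success and rolls back if an exception is raised.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE account (name TEXT PRIMARY KEY, "
    "balance INTEGER CHECK (balance >= 0))"
)
conn.executemany("INSERT INTO account VALUES (?, ?)", [("a", 100), ("b", 0)])
conn.commit()


def transfer(conn, src, dst, amount):
    """Move amount from src to dst; both updates apply, or neither does."""
    try:
        with conn:  # commits on success, rolls back on exception
            conn.execute(
                "UPDATE account SET balance = balance + ? WHERE name = ?",
                (amount, dst),
            )
            conn.execute(
                "UPDATE account SET balance = balance - ? WHERE name = ?",
                (amount, src),
            )
        return True
    except sqlite3.IntegrityError:
        # The CHECK constraint rejected a negative balance; the credit is undone too.
        return False


def balances(conn):
    return dict(conn.execute("SELECT name, balance FROM account ORDER BY name"))


failed = transfer(conn, "a", "b", 150)   # would overdraw account "a"
after_failed = balances(conn)
succeeded = transfer(conn, "a", "b", 40)
after_success = balances(conn)
print(failed, after_failed, succeeded, after_success)
```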
  • 78. Big Data Using Real time with contextualization Online Transaction Processing (OLTP)
  • 79. • Currently – Database Management System: • structured data; • transactional applications; • SQL interface; • security; standardization Object Data Management Group
  • 80. Big Data • OPEN DATA: – unstructured data; – real-time data; – questioning and investigation by machine; – collaborative sharing;
  • 81. Big Data volume new algorithm sharing analysis virtualization research capture
  • 82. Big Data volume new data administration process.
  • 83. Big Data open data; accessible data; reusable data; Open Data new control mechanisms of coherence
  • 84. Big Data • Define a scale of data quality – Unfiltered, unformatted data sets; – Data provided in a structured format; – Free data that can be used and exploited without a license; – Data identified by a URL (Uniform Resource Locator); – Data related to actors that are themselves associated with a context;
  • 85. Big Data The use of open data (Web) cannot be controlled by the producer.
  • 86. Big Data Use of open data Data warehousing
  • 87. Big Data Use of open data Data warehouse Using the management rules appropriate for each business line.
  • 88. Big Data Use of open data Data warehouse Data update using a SPARQL service (SPARQL Protocol and RDF Query Language)
  • 89. Big Data Use of open data Data warehouse Using the Semantic Web (Web of Data)
  • 90. Big Data • Study and research in the classic mode: • Problem statement; • Intuition - deduction; • Validation: • experiment; • simulation; • computation;
  • 91. Big Data Study and research in a Big Data context: • Computer analysis: • Identification; • Correlation; • Proximity map (geolocation); • Generation of hypotheses; • Options; • Emergence of new solutions; • Experimentation;
  • 92. Big Data • Highlighting correlations; • Bringing aggregations into relief; • Search for an explanatory model: – Implementation; – Contextual model; • Direction of the work;
  • 93. Big Data • Using open formats – PDF; – CSV; – HTML; – XML; – RDF; – RSS; – Atom; – JSON (JavaScript Object Notation); – …
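Three of the open formats listed (CSV, JSON, XML) can be read with Python's standard library alone. The sketch below loads the same record from each format into one common shape; the sample record (a city name and a population figure) is invented for the example, and the XML layout is an assumption, since XML has no single canonical way to express a table.

```python
import csv
import io
import json
import xml.etree.ElementTree as ET

# The same hypothetical record expressed in three open formats.
csv_text = "city,population\nExample City,1000\n"
json_text = '[{"city": "Example City", "population": 1000}]'
xml_text = '<records><record city="Example City" population="1000"/></records>'


def from_csv(text):
    """CSV values arrive as strings, so the number is converted explicitly."""
    reader = csv.DictReader(io.StringIO(text))
    return [{"city": r["city"], "population": int(r["population"])} for r in reader]


def from_json(text):
    """JSON already carries the number type."""
    return json.loads(text)


def from_xml(text):
    """Assumed layout: one <record> element per row, fields as attributes."""
    root = ET.fromstring(text)
    return [
        {"city": rec.get("city"), "population": int(rec.get("population"))}
        for rec in root.findall("record")
    ]


records = [from_csv(csv_text), from_json(json_text), from_xml(xml_text)]
print(records[0] == records[1] == records[2])
```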