INTRODUCTION TO DATA ENGINEERING AND VISUALIZATION
Data Engineering
In a Big Data environment, data volumes are on the order of
terabytes, and the data is both plentiful and varied.
Structured Data: Text files that are ready to be processed by
code (e.g. JSON format).
Unstructured Data: Other files, which must first be converted
into text files.
Data Engineering is about collecting structured data and
converting unstructured data so that it can be processed by
code. After processing, the data can be modeled by a Data
Scientist to extract insights.
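The two steps above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names (`load_structured`, `to_text`, `collect`) are hypothetical, the structured input is assumed to be JSON Lines text, and decoding raw bytes stands in for whatever conversion a real unstructured source would need.

```python
import json


def load_structured(text: str) -> list[dict]:
    """Parse JSON Lines text: one JSON object per non-empty line."""
    return [json.loads(line) for line in text.splitlines() if line.strip()]


def to_text(raw: bytes, encoding: str = "utf-8") -> str:
    """Convert raw bytes to text, replacing bytes that cannot be decoded.

    Assumption: this is a stand-in for a real conversion step.
    """
    return raw.decode(encoding, errors="replace")


def collect(structured_text: str, raw_blobs: list[bytes]) -> list[dict]:
    """Gather structured records and converted unstructured data together."""
    records = load_structured(structured_text)
    for blob in raw_blobs:
        # Wrap each converted blob in a record so every item has the same shape.
        records.append({"text": to_text(blob)})
    return records


if __name__ == "__main__":
    for record in collect('{"id": 1}\n{"id": 2}\n', [b"free-form note"]):
        print(record)
```

Once everything is in one list of records, it can be handed to the modeling stage in a uniform shape.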
Data Visualization
Contextual: In sync with the data and the goal.
Organised: Structured rather than random, so the viewer can
follow the line of thought.
Imaginative: Intuitive, so that many people can understand it.
Journalistic: The data behind the visualization has full
integrity.
Critical: Answers previously unknown questions, so that the
insights are useful.

