SlideShare a Scribd company logo
- Data -
Science and Engineering
HELLO!
I am Hendri Karisma
Principal R&D and Data Lead at Akar Inti Data
✘ Research and Development for Advanced Data
Analytics Platform.
✘ Data marketplace at nusadata.ai .
You can find me at :
✘ telegram @siganteng
✘ Instagram @karism4_
✘ X @infoHendri
2
Data
What is data ?
Data are Facts
4
5
Data marketplace
6
Open Finance
Telco
eKYC
Market Place
Single
platform
Consumer
Data Provider
7
Science vs Engineering
8
Science
The goal of a scientist is to
answer questions and
discover information about
their chosen field of study.
What is the difference?
Engineering
An engineer might produce
physical item or blueprint
for a new process.
9
Data roles
✘ Data Scientist
✘ Data Analyst
✘ Business Intelligence
✘ Data Engineer
✘ AI/ML Engineer
There is a Scientist role and 2 engineer roles.
10
Data Scientist
The term “data scientist” was coined as recently as 2008 when
companies realized the need for data professionals who are skilled in
organizing and analyzing massive amounts of data.1 In a 2009
McKinsey&Company article, Hal Varian, Google’s chief economist and UC
Berkeley professor of information sciences, business, and economics,
predicted the importance of adapting to technology’s influence and
reconfiguration of different industries.
Data-driven individuals with high-level technical skills who are capable
of building complex quantitative algorithms to organize and
synthesize large amounts of information used to answer questions and
drive strategy in their organization.
11
Data Engineer
Data engineers work in a variety of settings to build systems that collect,
manage, and convert raw data into usable information for data scientists and
business analysts to interpret. Their ultimate goal is to make data accessible
so that organizations can use it to evaluate and optimize their performance.
These are some common tasks you might perform when working with data:
✘ Acquire datasets that align with business needs
✘ Develop algorithms to transform data into useful, actionable
information
✘ Build, test, and maintain database pipeline architectures
✘ Collaborate with management to understand company objectives
✘ Create new data validation methods and data analysis tools
✘ Ensure compliance with data governance and security policies
12
AI Engineer
Artificial intelligence (AI) engineers are responsible for developing,
programming and training the complex networks of algorithms that make up
AI so that they can function like a human brain. This role requires combined
expertise in software development, programming, data science and data
engineering. Though this roleis related to data engineering, AI engineers are
rarely required to write the code that develops scalable data sharing.
13
AI Engineer Responsibilities
✘ Build AI models from the ground up and explain results to product
managers and stakeholders
✘ Develop, test, and deploy AI models
✘ Convert machine learning models into APIs so other applications can
utilize it
✘ Build data ingestion and data transformation infrastructure
✘ Work alongside data and business analysts
✘ Execute statistical analysis and tune results to extract better insights
✘ Automate infrastructure used by the data science team
✘ Create and manage AI development and production infrastructure
14
Data Science Experiments
✘ Do the data understanding
✘ Data exploration
✘ Feature Engineering
✘ Method/Model Experiments
15
Data flow
16
Zoom in
17
18
ML Ops
19
Tech Stack
20
21
22
AI Engineering Not Only About The Tech Stack
✘ SOLID, Design Pattern, Clean Architecture, etcs
✘ Software Engineering
✘ Tech Tools Characteristics, Behaviour
✘ Basic statistics, probilistics and random variable,
basic machine learning.
✘ High performance computing
✘ Security and Regulation
✘ ML Ops
23
The Benefit of Python
✘ Could run AI Algorithm as service
✘ Experiments for research phase
✘ Having a lot of Math (statistics, probabilistics)
and machine learning libraries
✘ Data pipeline
✘ Easy to use and easy to learn
✘ Could implement OOP or FP
24
Example of The Engine
✘ We need to run ML service
✗ Sometimes for specific cases
✗ The classification process is about 500ms
✘ We run the process (preprocess to main
analytics process) as a batch process or need to
process data with Large Volume
✘ The model is lightweight
25
ML Service
26
Model
Data batch processing
27
Could handling several cases
28
Model
✘ We often use pickle or serializable file as model
output
✘ The size bigger than it should be
✘ Impacted to the memory or the space
complexity
29
Sample Model
30
Sample Model
31
THANKS!
Any questions?
You can find me at
✘ X @infoHendri
✘ Telegram @siganteng
✘ Email:
situkangsayur@gmail.com
hendri.karisma@akarintidata.ai
32

More Related Content

PDF
Data Analytics Today - Data, Tech, and Regulation.pdf
PDF
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...
PDF
The Role of Data Engineers vs Data Scientists | IABAC
PDF
Key Roles In Data-Driven Organisation
PDF
Key Roles In Data-Driven Organisation
PPTX
New professional careers in data
PPTX
Lesson 3 ai in the enterprise
PDF
Data Science at Scale - The DevOps Approach
Data Analytics Today - Data, Tech, and Regulation.pdf
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...
The Role of Data Engineers vs Data Scientists | IABAC
Key Roles In Data-Driven Organisation
Key Roles In Data-Driven Organisation
New professional careers in data
Lesson 3 ai in the enterprise
Data Science at Scale - The DevOps Approach

Similar to Data - Science and Engineering slide at Bandungpy Sharing Session (20)

PDF
From Rocket Science to Data Science
PDF
Building successful data science teams
PPTX
Comprehensive-Data-Science-AI-and-Machine-Learning-Course.pptx
PPTX
05. Core-Responsibilities-of-a-Data-Engineer.pptx
PPTX
semana1.pptx
PDF
JOBS BASED OR RELATED ON ARTIFICIAL INTELLEGENCE
PDF
Data Engineering.pdf
PPTX
Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...
PPTX
Career Paths for Software Professionals
PDF
Data Scientist By: Professor Lili Saghafi
PDF
Top 10 areas of expertise in data science
PPTX
introductiontodatascience-230122140841-b90a0856 (1).pptx
PDF
#Datacaeer - AI Guild workshop on data roles in industry with Adam Green
PDF
How to become a data scientist
PPTX
Career in Python and data science
PDF
Future Jobs in AI.pdf
PPTX
Ch1IntroductiontoDataScience.pptx
PPTX
Career_Jobs_in_Data_Science.pptx
PDF
Data science a practitioner's perspective
PPTX
Data Engineer vs Data Scientist vs Data Analyst.pptx
From Rocket Science to Data Science
Building successful data science teams
Comprehensive-Data-Science-AI-and-Machine-Learning-Course.pptx
05. Core-Responsibilities-of-a-Data-Engineer.pptx
semana1.pptx
JOBS BASED OR RELATED ON ARTIFICIAL INTELLEGENCE
Data Engineering.pdf
Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...
Career Paths for Software Professionals
Data Scientist By: Professor Lili Saghafi
Top 10 areas of expertise in data science
introductiontodatascience-230122140841-b90a0856 (1).pptx
#Datacaeer - AI Guild workshop on data roles in industry with Adam Green
How to become a data scientist
Career in Python and data science
Future Jobs in AI.pdf
Ch1IntroductiontoDataScience.pptx
Career_Jobs_in_Data_Science.pptx
Data science a practitioner's perspective
Data Engineer vs Data Scientist vs Data Analyst.pptx
Ad

More from Hendri Karisma (18)

PDF
ML Abstraciton for Keras to Serve Several Cases
PDF
Python 101 - Indonesia AI Society.pdf
PDF
Slide DevSecOps Microservices
PDF
Machine Learning: an Introduction and cases
PDF
Python, Data science, and Unsupervised learning
PDF
Machine Learning Research in blibli
PDF
Comparison Study of Neural Network and Deep Neural Network on Repricing GAP P...
PDF
Fraud Detection System using Deep Neural Networks
PDF
Artificial Intelligence and The Complexity
PDF
Software Engineering: Today in The Betlefield
PDF
Introduction to Topological Data Analysis
PDF
Sharing-akka-pub
PDF
Presentasi cca it now and tomorow
PDF
Bayes Belief Network
PDF
Slide Presentasi EM Algorithm (Play Tennis & Brain Tissue Segmentation)
PDF
Slide Presentasi Kelompok E bagian Sistem Rekognisi
PDF
Slide Presentasi Kelompok Keilmuan E
PDF
Slide Seminar Open Source (CodeLabs UNIKOM Bandung)
ML Abstraciton for Keras to Serve Several Cases
Python 101 - Indonesia AI Society.pdf
Slide DevSecOps Microservices
Machine Learning: an Introduction and cases
Python, Data science, and Unsupervised learning
Machine Learning Research in blibli
Comparison Study of Neural Network and Deep Neural Network on Repricing GAP P...
Fraud Detection System using Deep Neural Networks
Artificial Intelligence and The Complexity
Software Engineering: Today in The Betlefield
Introduction to Topological Data Analysis
Sharing-akka-pub
Presentasi cca it now and tomorow
Bayes Belief Network
Slide Presentasi EM Algorithm (Play Tennis & Brain Tissue Segmentation)
Slide Presentasi Kelompok E bagian Sistem Rekognisi
Slide Presentasi Kelompok Keilmuan E
Slide Seminar Open Source (CodeLabs UNIKOM Bandung)
Ad

Recently uploaded (20)

PPTX
Business Acumen Training GuidePresentation.pptx
PDF
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
PDF
.pdf is not working space design for the following data for the following dat...
PDF
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
PPTX
IBA_Chapter_11_Slides_Final_Accessible.pptx
PDF
Recruitment and Placement PPT.pdfbjfibjdfbjfobj
PPTX
Introduction to machine learning and Linear Models
PPTX
Computer network topology notes for revision
PDF
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
PPTX
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
PPTX
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
PPTX
Qualitative Qantitative and Mixed Methods.pptx
PPTX
climate analysis of Dhaka ,Banglades.pptx
PPTX
IB Computer Science - Internal Assessment.pptx
PPTX
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
PDF
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
PPT
Quality review (1)_presentation of this 21
PPT
Reliability_Chapter_ presentation 1221.5784
PDF
Lecture1 pattern recognition............
Business Acumen Training GuidePresentation.pptx
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
.pdf is not working space design for the following data for the following dat...
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
IBA_Chapter_11_Slides_Final_Accessible.pptx
Recruitment and Placement PPT.pdfbjfibjdfbjfobj
Introduction to machine learning and Linear Models
Computer network topology notes for revision
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
Qualitative Qantitative and Mixed Methods.pptx
climate analysis of Dhaka ,Banglades.pptx
IB Computer Science - Internal Assessment.pptx
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
Quality review (1)_presentation of this 21
Reliability_Chapter_ presentation 1221.5784
Lecture1 pattern recognition............

Data - Science and Engineering slide at Bandungpy Sharing Session

  • 1. - Data - Science and Engineering
  • 2. HELLO! I am Hendri Karisma Principal R&D and Data Lead at Akar Inti Data ✘ Research and Development for Advanced Data Analytics Platform. ✘ Data marketplace at nusadata.ai . You can find me at : ✘ telegram @siganteng ✘ Instagram @karism4_ ✘ X @infoHendri 2
  • 5. 5
  • 6. Data marketplace 6 Open Finance Telco eKYC Market Place Single platform Consumer Data Provider
  • 7. 7
  • 9. Science The goal of a scientist is to answer questions and discover information about their chosen field of study. What is the difference? Engineering An engineer might produce physical item or blueprint for a new process. 9
  • 10. Data roles ✘ Data Scientist ✘ Data Analyst ✘ Business Intelligence ✘ Data Engineer ✘ AI/ML Engineer There is a Scientist role and 2 engineer roles. 10
  • 11. Data Scientist The term “data scientist” was coined as recently as 2008 when companies realized the need for data professionals who are skilled in organizing and analyzing massive amounts of data.1 In a 2009 McKinsey&Company article, Hal Varian, Google’s chief economist and UC Berkeley professor of information sciences, business, and economics, predicted the importance of adapting to technology’s influence and reconfiguration of different industries. Data-driven individuals with high-level technical skills who are capable of building complex quantitative algorithms to organize and synthesize large amounts of information used to answer questions and drive strategy in their organization. 11
  • 12. Data Engineer Data engineers work in a variety of settings to build systems that collect, manage, and convert raw data into usable information for data scientists and business analysts to interpret. Their ultimate goal is to make data accessible so that organizations can use it to evaluate and optimize their performance. These are some common tasks you might perform when working with data: ✘ Acquire datasets that align with business needs ✘ Develop algorithms to transform data into useful, actionable information ✘ Build, test, and maintain database pipeline architectures ✘ Collaborate with management to understand company objectives ✘ Create new data validation methods and data analysis tools ✘ Ensure compliance with data governance and security policies 12
  • 13. AI Engineer Artificial intelligence (AI) engineers are responsible for developing, programming and training the complex networks of algorithms that make up AI so that they can function like a human brain. This role requires combined expertise in software development, programming, data science and data engineering. Though this roleis related to data engineering, AI engineers are rarely required to write the code that develops scalable data sharing. 13
  • 14. AI Engineer Responsibilities ✘ Build AI models from the ground up and explain results to product managers and stakeholders ✘ Develop, test, and deploy AI models ✘ Convert machine learning models into APIs so other applications can utilize it ✘ Build data ingestion and data transformation infrastructure ✘ Work alongside data and business analysts ✘ Execute statistical analysis and tune results to extract better insights ✘ Automate infrastructure used by the data science team ✘ Create and manage AI development and production infrastructure 14
  • 15. Data Science Experiments ✘ Do the data understanding ✘ Data exploration ✘ Feature Engineering ✘ Method/Model Experiments 15
  • 18. 18
  • 21. 21
  • 22. 22
  • 23. AI Engineering Not Only About The Tech Stack ✘ SOLID, Design Pattern, Clean Architecture, etcs ✘ Software Engineering ✘ Tech Tools Characteristics, Behaviour ✘ Basic statistics, probilistics and random variable, basic machine learning. ✘ High performance computing ✘ Security and Regulation ✘ ML Ops 23
  • 24. The Benefit of Python ✘ Could run AI Algorithm as service ✘ Experiments for research phase ✘ Having a lot of Math (statistics, probabilistics) and machine learning libraries ✘ Data pipeline ✘ Easy to use and easy to learn ✘ Could implement OOP or FP 24
  • 25. Example of The Engine ✘ We need to run ML service ✗ Sometimes for specific cases ✗ The classification process is about 500ms ✘ We run the process (preprocess to main analytics process) as a batch process or need to process data with Large Volume ✘ The model is lightweight 25
  • 29. Model ✘ We often use pickle or serializable file as model output ✘ The size bigger than it should be ✘ Impacted to the memory or the space complexity 29
  • 32. THANKS! Any questions? You can find me at ✘ X @infoHendri ✘ Telegram @siganteng ✘ Email: situkangsayur@gmail.com hendri.karisma@akarintidata.ai 32