Joe Holbrook. Owner of Cloudbursting Corp.
Consulting Engineer and Technical Trainer
Jacksonville, FL
GOOGLE CLOUD CERTIFICATION REVIEW
DATA ENGINEER- BETA
04/08/2017
• A Google Certified Professional - Data Engineer enables data-driven decision making by
collecting, transforming, and visualizing data. The Data Engineer designs, builds, maintains,
and troubleshoots data processing systems with a particular emphasis on the security,
reliability, fault-tolerance, scalability, fidelity, and efficiency of such systems. The Data
Engineer also analyzes data to gain insight into business outcomes, builds statistical models
to support decision-making, and creates machine learning models to automate and simplify
key business processes.
• A Google Certified Professional - Data Engineer has demonstrated in our assessment their
ability to:
• Build and maintain data structures and databases
• Design data processing systems
• Analyze data and enable machine learning
• Model business processes for analysis and optimization
• Design for reliability
• Visualize data and advocate policy
• Design for security and compliance
GOOGLE CLOUD PLATFORM
CERTIFIED DATA ENGINEER
• Here is the page to review.
• https://cloud.google.com/certification/data-engineer
• Beta Exam Cost $120
• Beta Exam Questions 100
• Beta Exam Time 4 Hours
• Beta Exam Case Studies
• Test Vendor: Kryterion. A very poor choice of test vendor (not a fan) when it
comes to availability and flexibility for testing. Why a giant like Google would
pick Kryterion, which in many cities has only one place to test, if that, is
beyond me. Pearson VUE, by contrast, has a significant network of test centers.
For example, Jacksonville has one location, a junior college with a 4-hour,
3-day-a-week testing schedule. I checked Atlanta, which had a few more sites,
whereas Pearson VUE can have 20 locations in a city.
GCP DATA ENGINEER OVERVIEW
• Personally, I thought the exam read as unpolished and appeared rushed.
• It did not appear to follow exam development best practices such as Bloom's
taxonomy.
• Case studies. I did like that the case studies are listed in the Exam
Guide, so you can review them before taking the exam rather than spending
exam time on them.
• The exam did have technical merit, but as someone who routinely develops
tests I see the need for a better exam guide and a completed job task
analysis (JTA).
THOUGHTS ON EXAM
• Case studies were part of the exam; you needed to review each one and pick
the appropriate solution for its specific questions. Each case study had
numerous similar questions with slight differences in the question or the
answers, so you needed to pay attention.
• Cloud Dataproc - Questions about migrating on-premises Hadoop and how Cloud
Dataproc could help.
• Cloud Dataflow - Numerous questions around knowing where Cloud Dataflow
fits: know stream vs. batch, that it’s a managed service so you don’t have
to run the infrastructure yourself, and how you can use it for ETL. Lastly,
you will see several questions on how Cloud Dataflow can manage services in
Google Cloud Storage and Google Compute Engine. (Yes, know this.)
THE DATA ENGINEER TECHNICAL REVIEW
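To make the stream vs. batch distinction concrete, here is a minimal plain-Python sketch. It does not use the Dataflow SDK; the record shape, the function names and the 60-second window are all assumptions made up for illustration. The point it tries to show is that the same transform can feed either a bounded (batch) job that produces one result, or an unbounded (streaming) job that has to emit results per window.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple

# Hypothetical record shape: (timestamp in seconds, word).
Event = Tuple[int, str]


def transform(event: Event) -> Event:
    """The 'T' in ETL: normalise the word. Shared by both modes."""
    ts, word = event
    return ts, word.strip().lower()


def batch_count(events: List[Event]) -> Dict[str, int]:
    """Batch: the input is bounded, so one result covers the whole dataset."""
    counts: Dict[str, int] = defaultdict(int)
    for _, word in map(transform, events):
        counts[word] += 1
    return dict(counts)


def stream_count(
    events: Iterable[Event], window_secs: int
) -> Iterator[Tuple[int, Dict[str, int]]]:
    """Streaming: the input may never end, so emit one result per fixed window.

    Simplifying assumption: events arrive in timestamp order (no late data).
    """
    current_window = None
    counts: Dict[str, int] = defaultdict(int)
    for ts, word in map(transform, events):
        window = ts - ts % window_secs  # start time of the window this event falls in
        if current_window is not None and window != current_window:
            yield current_window, dict(counts)  # close the previous window
            counts = defaultdict(int)
        current_window = window
        counts[word] += 1
    if current_window is not None:
        yield current_window, dict(counts)  # flush the last open window


events = [(0, "Cat "), (5, "dog"), (61, "cat"), (62, "CAT")]
print(batch_count(events))
print(list(stream_count(iter(events), window_secs=60)))
```

In a managed service the windowing, scaling and worker management are handled for you; the sketch only illustrates why a streaming job needs some notion of windows while a batch job does not.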
• Pipelines in Dataflow and how you could use graph objects. One question
asked why you would use JSON or Java in relation to pipelines.
• Storage. Questions covered every aspect, and you needed to distinguish
between Nearline and Coldline, as well as big data, Regional and Standard
storage. Cloud Storage is a must-know.
• Hadoop. Numerous questions around HDFS, but also on when you would need
to use tools like Hive, Sqoop and Oozie with Hadoop.
• Stackdriver. Not really what I would call data engineering, but they asked
four questions about this hybrid monitoring service. I remember a few
questions focused on how you can debug, monitor and log using
Stackdriver. I think they were checking whether you knew how to use
Stackdriver to help debug source code, etc.
THE TECHNICAL REVIEW
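One way to practise telling the storage classes apart is to write the decision down as a rule of thumb. The sketch below is only that: the function name, the class labels and the thresholds are assumptions for illustration (roughly "read about once a year or less" for Coldline and "about once a month or less" for Nearline), so check the current Cloud Storage documentation before relying on them.

```python
def suggest_storage_class(reads_per_year: float, single_region_ok: bool) -> str:
    """Suggest a Cloud Storage class from expected access frequency.

    Thresholds are illustrative assumptions, not official limits.
    """
    if reads_per_year <= 1:
        return "COLDLINE"  # archival data that is rarely, if ever, read
    if reads_per_year <= 12:
        return "NEARLINE"  # backups and data read about monthly or less
    # Frequently accessed data: choose by where it has to be served from.
    return "REGIONAL" if single_region_ok else "MULTI_REGIONAL"


print(suggest_storage_class(0.5, single_region_ok=True))
print(suggest_storage_class(6, single_region_ok=True))
print(suggest_storage_class(500, single_region_ok=True))
```

The design choice is deliberately crude: access frequency is checked first because retrieval charges dominate the trade-off for cold data, and location only matters once the data is read often.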
• Bigtable. Numerous questions about why and how you could use Bigtable.
Know that it’s a high-performance NoSQL database service for large analytical
and operational workloads.
• BigQuery. Tested heavily around data ingestion and availability. Once again,
they want to confirm you know that it’s a fully managed, petabyte-scale,
low-cost enterprise data warehouse for analytics. BigQuery is serverless.
• Cloud SQL. They wanted to confirm you knew it and why you would choose it
over other services like Bigtable, BigQuery, etc.
TECHNICAL REVIEW
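The "why this service over the others" questions can be rehearsed with a small decision helper. This is a hedged study aid, not Google's guidance: the `Workload` fields, the order of the checks and the function name are assumptions chosen for illustration, and real workloads often need more than one service.

```python
from dataclasses import dataclass


@dataclass
class Workload:
    """Hypothetical description of what an application needs from its datastore."""
    needs_sql_transactions: bool   # relational schema with transactional updates
    low_latency_key_lookups: bool  # very high-throughput reads/writes by row key
    analytical_scans: bool         # ad hoc SQL aggregations over huge tables


def suggest_service(w: Workload) -> str:
    """Rule-of-thumb mapping from workload traits to a GCP data service."""
    if w.needs_sql_transactions:
        return "Cloud SQL"       # managed relational database
    if w.low_latency_key_lookups:
        return "Cloud Bigtable"  # NoSQL store for large analytical/operational workloads
    if w.analytical_scans:
        return "BigQuery"        # serverless data warehouse for analytics
    return "unclear: gather more requirements"


print(suggest_service(Workload(True, False, False)))
print(suggest_service(Workload(False, True, False)))
print(suggest_service(Workload(False, False, True)))
```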
• TensorFlow. OK, this is an area I totally blew. I have not been focused on
machine learning and AI. There were also several questions focused on
“neural networks”. I would highly advise you to review the page below. Google
seems to expect everyone working on GCP to know machine learning,
Cloud ML, the Mandelbrot set (yep).
• https://www.tensorflow.org/versions/r0.10/tutorials/
• Cloud Datalab. Another area where I was caught with my pants down. Get
warm and fuzzy with Cloud Datalab, which is a powerful
interactive tool created to explore, analyze, transform and visualize data
and build machine learning models on Google Cloud Platform. Numerous
questions here around how to “visualize and create models”.
TECHNICAL REVIEW
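If "neural networks" is also a weak spot for you, the core idea fits in a few lines of plain Python. The sketch below is not TensorFlow and involves no training: the weights are set by hand (an assumption made purely for illustration) so that a two-layer network computes XOR, something a single neuron cannot do.

```python
from itertools import product
from typing import Sequence


def step(x: float) -> int:
    """Step activation: the neuron fires (1) when its input is positive."""
    return 1 if x > 0 else 0


def neuron(inputs: Sequence[float], weights: Sequence[float], bias: float) -> int:
    """A single neuron: weighted sum of the inputs plus a bias, then activation."""
    return step(sum(i * w for i, w in zip(inputs, weights)) + bias)


def xor_network(x1: int, x2: int) -> int:
    """Two hidden neurons (OR and AND) feeding one output neuron."""
    h_or = neuron([x1, x2], [1.0, 1.0], -0.5)   # fires if at least one input is 1
    h_and = neuron([x1, x2], [1.0, 1.0], -1.5)  # fires only if both inputs are 1
    return neuron([h_or, h_and], [1.0, -1.0], -0.5)  # OR and not AND


for a, b in product([0, 1], repeat=2):
    print(a, b, "->", xor_network(a, b))
```

A framework such as TensorFlow learns weights like these from data instead of having them written by hand, but the layered weighted-sum-plus-activation structure is the same.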
• Pub/Sub. Numerous questions on Pub/Sub and how you would integrate it
with other services, and how to use it for publisher apps, messaging,
resource types and models.
TECHNICAL REVIEW
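The publisher, topic and subscription model is easier to remember once you have seen it move. The following in-memory sketch is not the Cloud Pub/Sub client library; the class names, method names and message IDs are hypothetical and only mimic the general shape of the model: publishers send to a topic, each subscription gets its own copy of every message, and a message stays outstanding until the subscriber acknowledges it.

```python
import itertools
from collections import deque
from typing import Deque, Dict, List, Tuple

Message = Tuple[int, str]  # (message id, payload)


class Subscription:
    """A named queue of messages for one subscriber application."""

    def __init__(self, name: str) -> None:
        self.name = name
        self._pending: Deque[Message] = deque()
        self._unacked: Dict[int, str] = {}

    def deliver(self, message_id: int, data: str) -> None:
        self._pending.append((message_id, data))

    def pull(self, max_messages: int = 10) -> List[Message]:
        """Hand out pending messages; they stay outstanding until acked."""
        batch: List[Message] = []
        while self._pending and len(batch) < max_messages:
            message_id, data = self._pending.popleft()
            self._unacked[message_id] = data
            batch.append((message_id, data))
        return batch

    def ack(self, message_id: int) -> None:
        self._unacked.pop(message_id, None)

    def redeliver_unacked(self) -> None:
        """Simulate an ack deadline expiring: unacked messages are queued again."""
        for message_id, data in self._unacked.items():
            self._pending.append((message_id, data))
        self._unacked.clear()


class Topic:
    """A named channel that fans each published message out to every subscription."""

    def __init__(self, name: str) -> None:
        self.name = name
        self._subscriptions: List[Subscription] = []
        self._ids = itertools.count(1)

    def subscribe(self, name: str) -> Subscription:
        sub = Subscription(name)
        self._subscriptions.append(sub)
        return sub

    def publish(self, data: str) -> int:
        message_id = next(self._ids)
        for sub in self._subscriptions:
            sub.deliver(message_id, data)
        return message_id


orders = Topic("orders")
billing = orders.subscribe("billing")
shipping = orders.subscribe("shipping")
orders.publish("order-1")
print(billing.pull())   # both subscriptions see the same message
print(shipping.pull())
```

Because publishers only know the topic and subscribers only know their subscription, either side can be added, scaled or replaced without touching the other, which is what makes the service a natural glue between other GCP services.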
• Thank you.
• Good Luck on the exam!
TECHNICAL REVIEW

