ChatGPT for Data Science Projects
https://vitalflux.com
5/6/2023 https://vitalflux.com 1
Topics
Setting up ChatGPT for Data Analysis
Data Exploration and Analysis with ChatGPT
Building Predictive Models with ChatGPT
Model Evaluation and Selection with ChatGPT
Setting Up ChatGPT for Data Analysis
Run the following prompt to set ChatGPT up for data analysis:
Be an expert data scientist. Help me extract insights from the data.
Here are X records from the dataset.
18.0 8. 307.0 130.0 3504. 12.0 70. 1. "chevrolet chevelle malibu"
15.0 8. 350.0 165.0 3693. 11.5 70. 1. "buick skylark 320"
Have you understood the dataset and related information?
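The setup prompt can also be assembled programmatically, so the record count always matches the rows that are pasted in. The sketch below is one possible way to do this; the helper name build_setup_prompt is hypothetical, and the two rows are the ones shown on the slide.

```python
# Records copied from the slide; the source does not name the columns.
records = [
    '18.0 8. 307.0 130.0 3504. 12.0 70. 1. "chevrolet chevelle malibu"',
    '15.0 8. 350.0 165.0 3693. 11.5 70. 1. "buick skylark 320"',
]


def build_setup_prompt(rows):
    """Wrap the given data rows in the setup prompt (hypothetical helper)."""
    lines = [
        "Be an expert data scientist. Help me extract insights from the data.",
        f"Here are {len(rows)} records from the dataset.",
        *rows,
        "Have you understood the dataset and related information?",
    ]
    return "\n".join(lines)


prompt = build_setup_prompt(records)
print(prompt)
```

The resulting string can be pasted into ChatGPT as the first message of the conversation.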
Data Exploration and Analysis with ChatGPT
Identify Insights
• Give me the top 3 insights from the dataset
• What are the most common values for each attribute?
• Are there any trends or patterns in the data?
Find Outliers
• Identify any outliers in the dataset and decide on a strategy for handling them, such as removing them or replacing them with a more reasonable value.
• How many outliers are there in the data?
• What is the range of values for each attribute?
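It helps to know what a reasonable answer to the outlier prompts looks like. The sketch below uses the common 1.5 x IQR rule, which is an assumption here (the slide does not name a method), and shows both handling strategies mentioned above. The values are made up for illustration and the helper name iqr_fences is hypothetical.

```python
import statistics

# Made-up illustrative values, not taken from the source dataset.
# 9999.0 is planted as an obvious outlier.
weights = [3504.0, 3693.0, 3436.0, 3433.0, 3449.0, 4341.0, 2372.0, 2833.0, 9999.0]


def iqr_fences(values, k=1.5):
    """Return the (lower, upper) fences of the IQR rule."""
    q1, _median, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


lower, upper = iqr_fences(weights)
outliers = [w for w in weights if w < lower or w > upper]

# Strategy 1: remove the outliers.
removed = [w for w in weights if lower <= w <= upper]
# Strategy 2: replace each outlier with the nearest fence (clipping).
clipped = [min(max(w, lower), upper) for w in weights]

print(f"range: {min(weights)} to {max(weights)}")
print(f"fences: {lower} to {upper}")
print(f"{len(outliers)} outlier(s): {outliers}")
```

Which strategy is appropriate depends on whether the extreme values are data errors or genuine observations.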
Identify Correlations
• What are the correlations between the attributes in the dataset?
• Which attributes are most strongly correlated with the target variable?
• Are there any correlations between the categorical variables?
• Write Python code for visualizing correlations
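As a point of comparison for the code ChatGPT returns, here is a minimal sketch that computes a Pearson correlation matrix and prints it as a text table rather than plotting it. The column names and values are assumptions made up for illustration; they are not from the source.

```python
import math

# Made-up illustrative columns; names and values are assumptions.
data = {
    "mpg": [18.0, 15.0, 24.0, 27.0, 31.0, 14.0],
    "horsepower": [130.0, 165.0, 95.0, 88.0, 65.0, 215.0],
    "weight": [3504.0, 3693.0, 2372.0, 2130.0, 1773.0, 4312.0],
}


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (spread_x * spread_y)


names = list(data)
corr = {a: {b: pearson(data[a], data[b]) for b in names} for a in names}

# Print the matrix as a simple text table.
print(" " * 12 + "".join(f"{name:>12}" for name in names))
for a in names:
    print(f"{a:>12}" + "".join(f"{corr[a][b]:>12.2f}" for b in names))
```

A heatmap of the same matrix is the usual visual form; the numbers it would display are the ones computed here.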
Discover Distributions
• What is the distribution of values for each attribute?
• Are the distributions skewed or symmetric?
• Are there any outliers in the distribution?
• Write Python code for visualizing distributions
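The skewness question can be checked numerically. The sketch below computes the moment-based sample skewness (positive means a longer right tail) and prints a rough text histogram. The values are made up for illustration, and the function names are hypothetical.

```python
# Made-up illustrative values with a long right tail (not from the source).
values = [9.0, 10.0, 10.5, 11.0, 11.5, 12.0, 12.5, 13.0, 16.0, 22.0]


def skewness(xs):
    """Moment-based skewness: third central moment over variance^1.5."""
    n = len(xs)
    mean = sum(xs) / n
    m2 = sum((x - mean) ** 2 for x in xs) / n
    m3 = sum((x - mean) ** 3 for x in xs) / n
    return m3 / m2 ** 1.5


def histogram_counts(xs, bins=4):
    """Count values in equal-width bins; the maximum goes in the last bin."""
    lo, hi = min(xs), max(xs)
    width = (hi - lo) / bins
    counts = [0] * bins
    for x in xs:
        index = min(int((x - lo) / width), bins - 1)
        counts[index] += 1
    return lo, width, counts


skew = skewness(values)
lo, width, counts = histogram_counts(values)

print(f"skewness: {skew:.2f}")
for i, count in enumerate(counts):
    start, end = lo + i * width, lo + (i + 1) * width
    print(f"{start:6.2f} to {end:6.2f} | {'#' * count}")
```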
Hypothesis Testing
• What hypotheses can be tested using the data given earlier?
• Write Python code for performing the hypothesis test
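One hypothesis such a dataset could support is that two groups of cars differ in mean fuel economy. The sketch below tests this with an exact two-sided permutation test, chosen here because it needs only the standard library; it is not a method named in the source. The group labels and values are made up for illustration.

```python
from itertools import combinations

# Made-up illustrative mpg values for two hypothetical groups of cars.
group_a = [18.0, 15.0, 16.0, 17.0, 14.0]
group_b = [24.0, 27.0, 26.0, 25.0, 31.0]


def permutation_test(a, b):
    """Exact two-sided permutation test for a difference in means."""
    pooled = a + b
    n_a, n_b = len(a), len(b)
    total = sum(pooled)
    observed = abs(sum(a) / n_a - sum(b) / n_b)
    extreme = 0
    relabelings = 0
    # Enumerate every way of assigning n_a of the pooled values to group A.
    for chosen in combinations(range(len(pooled)), n_a):
        sum_a = sum(pooled[i] for i in chosen)
        diff = abs(sum_a / n_a - (total - sum_a) / n_b)
        if diff >= observed - 1e-12:
            extreme += 1
        relabelings += 1
    return observed, extreme / relabelings


observed_diff, p_value = permutation_test(group_a, group_b)
print(f"observed difference in means: {observed_diff:.2f}")
print(f"p-value: {p_value:.4f}")
```

Exhaustive enumeration is only practical for small samples; for larger ones, a random sample of relabelings or a t-test is the usual substitute.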
Data Visualization
• Write Python code to visualize the relationships in the dataset
Extract Features
• Which attributes are most relevant to the target variable?
• Can we create new features by combining existing attributes?
• Which features are redundant or irrelevant and can be removed?
• Write Python code for extracting the most relevant features
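The feature prompts above can be illustrated with a small sketch that derives one new feature from two existing attributes and then ranks all features by the absolute value of their correlation with the target. Ranking by correlation is only one of several possible relevance measures and is an assumption here. The column names, the values, and the power_to_weight feature are all made up for illustration.

```python
import math

# Made-up illustrative rows; column names and values are assumptions.
rows = [
    {"mpg": 18.0, "horsepower": 130.0, "weight": 3504.0, "acceleration": 12.0},
    {"mpg": 15.0, "horsepower": 165.0, "weight": 3693.0, "acceleration": 11.5},
    {"mpg": 24.0, "horsepower": 95.0, "weight": 2372.0, "acceleration": 15.0},
    {"mpg": 27.0, "horsepower": 88.0, "weight": 2130.0, "acceleration": 14.5},
    {"mpg": 31.0, "horsepower": 65.0, "weight": 1773.0, "acceleration": 19.0},
    {"mpg": 14.0, "horsepower": 215.0, "weight": 4312.0, "acceleration": 8.5},
]
target = "mpg"

# New feature built by combining two existing attributes (hypothetical choice).
for row in rows:
    row["power_to_weight"] = row["horsepower"] / row["weight"]


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (spread_x * spread_y)


def rank_features(rows, target):
    """Sort features by |correlation with the target|, strongest first."""
    y = [row[target] for row in rows]
    scores = {
        name: abs(pearson([row[name] for row in rows], y))
        for name in rows[0]
        if name != target
    }
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


ranking = rank_features(rows, target)
for name, score in ranking:
    print(f"{name:>16}: {score:.2f}")
```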
Building Predictive Models with ChatGPT
Identify Target Variables
• Can I build a predictive model using this data? What can I predict?
• What is the distribution of the target variable?
• Are there any outliers in the target variable?
Select Predictors
• Which attributes are most relevant to the target variable?
• Can we create new features by combining existing attributes?
• Which features are redundant or irrelevant and can be removed?
Choose Algorithm
• Which algorithm is most appropriate for the problem and the data?
• Should we use a linear regression, decision tree, or neural network model?
• What are the pros and cons of each algorithm?
Train Model
• Create Python code for training the model using {algorithm}
• Create Python code for training the model using {algorithm} while also performing hyperparameter tuning
These prompts can be reused for different algorithms by substituting {algorithm}.
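Because the training prompts differ only in the {algorithm} placeholder, they can be generated from a template. The sketch below does this with str.format; the function name training_prompts is hypothetical, and the three algorithm names are the ones mentioned on the slide.

```python
TRAIN_PROMPT = "Create Python code for training the model using {algorithm}"
TUNE_PROMPT = TRAIN_PROMPT + " while also performing hyperparameter tuning"

# Algorithms named on the slide.
algorithms = ["linear regression", "decision tree", "neural network"]


def training_prompts(names, tune=False):
    """Fill the {algorithm} placeholder once per algorithm name."""
    template = TUNE_PROMPT if tune else TRAIN_PROMPT
    return [template.format(algorithm=name) for name in names]


for prompt in training_prompts(algorithms, tune=True):
    print(prompt)
```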
Model Evaluation and Selection with ChatGPT
Evaluate Model Performance
• How can I evaluate the performance of the model trained using {algorithm}?
• What are the metrics for evaluating models trained using {algorithm}?
• Rewrite the Python code to compute and print the evaluation metrics
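Which metrics apply depends on the task. Assuming a regression problem (the source does not say which kind of model is being trained), a minimal sketch of three common metrics could look like the following. The true and predicted values are made up for illustration.

```python
import math


def regression_metrics(y_true, y_pred):
    """Return MAE, RMSE and R^2 for paired true and predicted values."""
    n = len(y_true)
    errors = [t - p for t, p in zip(y_true, y_pred)]
    ss_res = sum(e * e for e in errors)
    mean_true = sum(y_true) / n
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    return {
        "mae": sum(abs(e) for e in errors) / n,
        "rmse": math.sqrt(ss_res / n),
        "r2": 1.0 - ss_res / ss_tot,
    }


# Made-up illustrative values (not model output from the source).
y_true = [18.0, 15.0, 24.0, 27.0]
y_pred = [17.0, 16.0, 25.0, 26.0]

metrics = regression_metrics(y_true, y_pred)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

For a classification task, accuracy, precision, recall, and F1 would take the place of these metrics.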
Hyperparameter Tuning
• Which hyperparameters can be tuned for the model trained using {algorithm}?
• How can I fine-tune the hyperparameters of the model trained using {algorithm} in order to improve performance?
• Rewrite the model training Python code with hyperparameter tuning
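To make the idea of hyperparameter tuning concrete, the sketch below runs a small grid search over k for a one-feature k-nearest-neighbours regressor, scoring each candidate with leave-one-out cross-validation. The choice of model, the grid, and the data are all assumptions made for illustration; none of them come from the source.

```python
# Made-up illustrative (weight, mpg) pairs, not from the source dataset.
points = [
    (1773.0, 31.0), (2130.0, 27.0), (2372.0, 24.0), (2833.0, 22.0),
    (3433.0, 16.0), (3504.0, 18.0), (3693.0, 15.0), (4312.0, 14.0),
]


def knn_predict(train, x, k):
    """Average the targets of the k training points nearest to x."""
    nearest = sorted(train, key=lambda point: abs(point[0] - x))[:k]
    return sum(y for _, y in nearest) / len(nearest)


def loo_mse(data, k):
    """Leave-one-out mean squared error for a given k."""
    total = 0.0
    for i, (x, y) in enumerate(data):
        train = data[:i] + data[i + 1:]
        total += (y - knn_predict(train, x, k)) ** 2
    return total / len(data)


# Grid search: evaluate every candidate k and keep the lowest error.
grid = [1, 2, 3, 5]
scores = {k: loo_mse(points, k) for k in grid}
best_k = min(scores, key=scores.get)

for k, score in scores.items():
    print(f"k={k}: leave-one-out MSE = {score:.2f}")
print(f"best k: {best_k}")
```

Libraries such as scikit-learn offer the same search-plus-cross-validation pattern for real models; this version only shows the mechanics.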
Model Selection
• How can I compare the performance of different models trained using {algorithm1}, {algorithm2}?
• What are the advantages and disadvantages of each model?
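A fair comparison scores every candidate on the same held-out data. The sketch below compares a mean-only baseline with a least-squares line using identical cross-validation folds and picks the model with the lower RMSE. Both models, the fold scheme, and the data are assumptions made for illustration, not taken from the source.

```python
import math

# Made-up illustrative (weight, mpg) pairs, not from the source dataset.
points = [
    (1773.0, 31.0), (2130.0, 27.0), (2372.0, 24.0), (2833.0, 22.0),
    (3433.0, 16.0), (3504.0, 18.0), (3693.0, 15.0), (4312.0, 14.0),
]


def fit_mean(train):
    """Baseline model: always predict the mean training target."""
    mean_y = sum(y for _, y in train) / len(train)
    return lambda x: mean_y


def fit_line(train):
    """Simple linear regression fitted by ordinary least squares."""
    n = len(train)
    mean_x = sum(x for x, _ in train) / n
    mean_y = sum(y for _, y in train) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in train)
    slope = sum((x - mean_x) * (y - mean_y) for x, y in train) / sxx
    intercept = mean_y - slope * mean_x
    return lambda x: intercept + slope * x


def cv_rmse(fit, data, folds=4):
    """Cross-validated RMSE; fold f holds out every folds-th point from f."""
    squared_error = 0.0
    for f in range(folds):
        test = data[f::folds]
        train = [p for i, p in enumerate(data) if i % folds != f]
        model = fit(train)
        squared_error += sum((y - model(x)) ** 2 for x, y in test)
    return math.sqrt(squared_error / len(data))


results = {
    "mean baseline": cv_rmse(fit_mean, points),
    "linear regression": cv_rmse(fit_line, points),
}
best_model = min(results, key=results.get)

for name, rmse in results.items():
    print(f"{name}: cross-validated RMSE = {rmse:.2f}")
print(f"selected: {best_model}")
```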
Thank You
https://vitalflux.com