6 Open Source Data Science Projects to Impress Your Interviewer
1. Facebook AI’s Detection Transformer (DETR)
⬢ DETR by Facebook AI is easily the most intriguing open-source project released in May.
⬢ It accumulated almost 3,000 stars within a week, which is quite telling.
⬢ DETR, short for Detection Transformer, could be a game changer in the computer vision space.
⬢ DETR is also fast and efficient, a dream for data science professionals!
2. Real-Time Image Animation
⬢ Another fascinating open-source computer vision project. As the name suggests, it lets us perform image animation in real time using OpenCV.
⬢ The model mimics the expression of the person in front of the camera and changes the image accordingly. It’s a brilliant use of computer vision and a project we’ll be trying out internally for sure.
⬢ This kind of project has plenty of potential applications in industry, from fashion and retail to marketing and advertising.
3. OpenAI’s GPT-3: A Massive NLP Release!
⬢ OpenAI has done it again! After releasing GPT-2 last year and whipping up a media frenzy around it, they have unveiled their latest Natural Language Processing (NLP) model: GPT-3!
⬢ Simply put, GPT-3 is the largest NLP model of its kind. It has 175 billion parameters (yes, you read that correctly) and is HUGE in terms of size, almost 350GB.
⬢ GPT-3 is also one of the costliest models in history (it took an estimated $12 million to train).
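The 350GB figure can be sanity-checked with simple arithmetic. The sketch below assumes each parameter is stored as a 16-bit float (2 bytes); the slides do not state the storage format, so treat that as our assumption. The function name `model_size_gb` is ours, purely for illustration.

```python
# Back-of-the-envelope check of the model size quoted above.
# Assumption (not stated in the slides): each parameter is stored
# as a 16-bit float, i.e. 2 bytes.
PARAMETERS = 175_000_000_000  # 175 billion, from the slide
BYTES_PER_PARAMETER = 2       # assumed fp16 storage


def model_size_gb(parameters: int, bytes_per_parameter: int) -> float:
    """Return the raw weight size in gigabytes (1 GB = 10**9 bytes)."""
    return parameters * bytes_per_parameter / 10**9


size_gb = model_size_gb(PARAMETERS, BYTES_PER_PARAMETER)
print(f"{size_gb:.0f} GB")
```

Under the same assumptions, 32-bit storage (4 bytes per parameter) would double the estimate.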
4. Real-Time Audio Analysis using PyAudio
⬢ This open-source data science project is a personal favorite. Created and released by Xander Steenbrugge, an esteemed speaker at the previous two DataHack Summits, this Python library enables us to perform real-time audio analysis.
⬢ We’ll be trying out PyAudio and Xander’s work at Analytics Vidhya for sure. A lot of our data science team members are heavy music listeners, and they can’t wait to sink their teeth into this open-source project.
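To give a feel for what real-time audio analysis involves, here is a minimal sketch of the per-buffer step: finding the strongest frequency in one chunk of samples. It does not use PyAudio or Xander's project; the chunk is synthetic, and `SAMPLE_RATE`, `CHUNK_SIZE`, and `dominant_frequency` are hypothetical names chosen for illustration. A real pipeline would read each chunk from a microphone stream and would likely use an FFT library instead of this naive transform.

```python
import cmath
import math

SAMPLE_RATE = 8000  # samples per second (assumed for this sketch)
CHUNK_SIZE = 256    # samples per buffer (assumed for this sketch)


def dominant_frequency(chunk, sample_rate):
    """Return the frequency (Hz) of the strongest DFT bin in one chunk."""
    n = len(chunk)
    best_bin, best_magnitude = 0, 0.0
    # For a real-valued signal only bins 1..n/2 matter; bin 0 is the DC offset.
    for k in range(1, n // 2 + 1):
        coefficient = sum(
            sample * cmath.exp(-2j * math.pi * k * i / n)
            for i, sample in enumerate(chunk)
        )
        magnitude = abs(coefficient)
        if magnitude > best_magnitude:
            best_bin, best_magnitude = k, magnitude
    return best_bin * sample_rate / n


# Synthetic stand-in for one microphone buffer: a pure 1000 Hz tone.
chunk = [
    math.sin(2 * math.pi * 1000 * i / SAMPLE_RATE) for i in range(CHUNK_SIZE)
]
print(dominant_frequency(chunk, SAMPLE_RATE))
```

With these settings the tone falls exactly on one frequency bin, so the function reports 1000.0 Hz. In a live setting the same function would be called once per incoming buffer.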
5. TextShot: An Awesome Python Tool for Grabbing Text
⬢ We can use this Python tool to grab screenshots and extract text from them. Called TextShot (nice name), it is an excellent way to quickly gather any text data we require for our data science projects.
6. Machine Learning Visuals: A Brilliant Way to Communicate for Data Science Professionals
⬢ ML Visuals is an open-source collaborative effort
to help the data science community understand
and improve technical communication.
⬢ This brilliant repository provides a lot of visuals,
templates, and figures to help you build a perfect
presentation or research paper.
THANKS!
https://www.cetpainfotech.com/technology/data-science-training
MOB NUMBER : 9212172602
QUERY@CETPAINFOTECH.COM

