SlideShare a Scribd company logo
Introduction to Python
Python for Data Science 2
Popular tools used in data science
 Data pre-processing and analysis
◦ Python, R, Microsoft Excel, SAS, SPSS
 Data exploration and visualization
◦ Tableau, Qlikview, Microsoft Excel
 Parallel and distributed computing incase of big data
◦ Apache Spark,Apache Hadoop
Python for Data Science 3
Evolution of Python
 Python was developed by Guido van Rossum in the late
eighties at the ‘National Research Institute for Mathematics
and Computer Science’ at Netherlands
 Python Editions
◦ Python 1.0
◦ Python 2.0
◦ Python 3.0
Python for Data Science 4
Python as a programming language
 Supports multiple programming paradigm
◦ Functional, Structural, OOPs, etc.
 Dynamic typing
◦ Runtime type safety checks
 Reference counts
◦ Deallocates objects which are not used for long
 Late binding
◦ Methods are looked up by name during runtime
 Python’s design is guided by 20 aphorisms as described in Zen of
Python by Tim Peters
Python for Data Science 5
Python as a programming language
 Standard CPython interpreter is managed by “Python Software
Foundation”
 There are other interpreters namely JPython (Java), Iron Python
(C#), Stackless Python (C, used for parallelism), PyPy (Python
itself JIT compilation)
 Standard libraries are written in python itself
 High standards of readability
Python for Data Science 6
Python as a programming language
 Cross-platform (Windows, Linux, Mac)
 Highly supported by a large community group
 Better error handle
Python for Data Science 7
Python as a programming language
 Comparison to Java
 Python vs Java
◦ Java is statically typed i.e. type safety is checked during compilation
(static compilation)
◦ Thus in Java the time required to develop the code is more
◦ Python which is dynamically typed compensates for huge
compilation time when compared to Java
◦ Codes which are dynamically typed tend to be less verbose
therefore offering more readability
Python for Data Science 8
Advantages of using python
 Python has several features that make it well suited for data
science
 Open source and community development
◦ Developed under Open Source Initiative license making it free to use
and distribute even commercially
 Syntax used is simple to understand and code
 Libraries designed for specific data science tasks
 Combines well with majority of the cloud platform service
providers
Python for Data Science 9
Coding environment
 A software program can be written using a terminal, a
command prompt (cmd), a text editor or through an Integrated
Development Environment (IDE)
 The program needs to be saved in a file with an appropriate
extension (.py for python, .mat for matlab, etc...) and can be
executed in corresponding environment (Python, Matlab, etc…)
 Integrated Development Environment (IDE) is a software
product solely developed to support software development in
various or specific programming language(s)
Python for Data Science 10
Coding environment
 Python 2.x support will be available till 2020
 Python 3.x is an enhanced version of 2.x and will only be maintained
from 3.6.x post 2020
 Install basic python version or use the online python console as in
https://www.python.org/
 Execute following commands and view the outputs in terminal or
command prompt
• Basic print statement
• Naming conventions for variables and functions, operators
• Conditional operations, looping statements (nested)
• Function declaration and calling
• Installing modules
Python for Data Science 11
https://www.python.org/
Python for Data Science 12
https://www.python.org/
Python for Data Science 13
Integrated development environment (IDE)
 Software application consisting of a cohesive unit of tools
required for development
 Designed to simplify software development
 Utilities provided by IDEs include tools for managing, compiling,
deploying and debugging software
Python for Data Science 14
Coding environment- IDE
 An IDE usually comprises of
◦ Source code editor
◦ Compiler
◦ Debugger
◦ Additional features include syntax and error highlighting,
code completion
 Offers supports in building and executing the program along
with debugging the code from within the environment
Python for Data Science 15
Coding environment- IDE
 Best IDEs provide version control features
 Eclipse+PyDev, SublimeText,Atom, GNU Emacs,Vi/Vim,Visual
Studio,Visual Studio Code are general IDEs with python
support
 Apart from these some of the python specific editors include
Pycharm, Jupyter, Spyder,Thonny
Python for Data Science 16
Spyder
 Supported across Linux, Mac OS X and Windows platforms
 Available as open source version
 Can be installed separately or through Anaconda distribution
 Developed for Python and specifically data science
 Features include
◦ Code editor with robust syntax and error highlighting
◦ Code completion and navigation
◦ Debugger
◦ Integrated document
 Interface similar to MATLAB and RStudio
Python for Data Science 17
Spyder
Python for Data Science 18
PyCharm
 Supported across Linux, Mac OS X and Windows platforms
 Available as community (free open source) and professional (paid) version
 Supports only Python
 Can be installed separately or through Anaconda distribution
 Features include
◦ Code editor provides syntax and error highlighting
◦ Code completion and navigation
◦ Unit testing
◦ Debugger
◦ Version control
Python for Data Science 19
PyCharm
Python for Data Science 20
Jupyter Notebook
 Web application that allows creation and manipulation of
documents called ‘notebook’
 Supported across Linux, Mac OS X and Windows platforms
 Available as open source version
Python for Data Science 21
Jupyter Notebook
Source-https://jupyter.org/
Python for Data Science 22
Jupyter Notebook
 Bundled with Anaconda
distribution or can be installed
separately
 Supports Julia, Python, R and
Scala
 Consists of ordered collection of
input and output cells that contain
code, text, plots etc.
Source-https://jupyter.org/
Python for Data Science 23
Jupyter Notebook
 Allows sharing of code and
narrative text through output
formats like PDF, HTML etc.
◦ Education and presentation
tool
 Lacks most of the features of
a good IDE
Source-https://jupyter.org/
Python for Data Science 24
How to choose the best IDE?
 Requirements
 Working with different IDEs helps us understand our own
requirement
THANK YOU

More Related Content

PPTX
Introduction to python history and platforms
PDF
Python Programming: The Best Language for Every Coder
PPTX
Getting Started with Python
PPTX
UNIT 1 PYTHON introduction and basic level
PPTX
Toolboxes for data scientists
PDF
Programming for data science in python
PPTX
Python Mastery: A Comprehensive Guide to Setting Up Your Development Environment
PPTX
Introduction-to-Python-Programming-Language (1).pptx
Introduction to python history and platforms
Python Programming: The Best Language for Every Coder
Getting Started with Python
UNIT 1 PYTHON introduction and basic level
Toolboxes for data scientists
Programming for data science in python
Python Mastery: A Comprehensive Guide to Setting Up Your Development Environment
Introduction-to-Python-Programming-Language (1).pptx

Similar to Lec-1-Introduction to Python.pdf gla University (20)

PPTX
All you need to know about Python | BJIT
PPTX
introduction to python in detail including .pptx
PPTX
Chapter 2: Basics of programming pyton programming
PPTX
ppt_template for EDA.pptx
PPTX
Python_Introduction_Good_PPT.pptx
PDF
Introduction to Data Science & Python.pdf
PPTX
Python programming ppt.pptx
PPTX
Introduction of python programming
PPTX
PYTHON UNIT 1
PPTX
First of all, what is Python? According t
PPTX
Introduction to the Python
PPTX
1.Introductiontshdbsyh3h5hh4hbhu4b4h.pptx
PDF
Samsung SDS OpeniT - The possibility of Python
PDF
A Whirlwind Tour Of Python
PDF
Introduction-To-Python- a guide to master
PPTX
python programming unit 1 wala ppt .pptx
PDF
Introduction to Python.pdf
PPTX
What is Python? An overview of Python for science.
PPTX
What is the best way to learn Python.pptx
PDF
Introduction to python
All you need to know about Python | BJIT
introduction to python in detail including .pptx
Chapter 2: Basics of programming pyton programming
ppt_template for EDA.pptx
Python_Introduction_Good_PPT.pptx
Introduction to Data Science & Python.pdf
Python programming ppt.pptx
Introduction of python programming
PYTHON UNIT 1
First of all, what is Python? According t
Introduction to the Python
1.Introductiontshdbsyh3h5hh4hbhu4b4h.pptx
Samsung SDS OpeniT - The possibility of Python
A Whirlwind Tour Of Python
Introduction-To-Python- a guide to master
python programming unit 1 wala ppt .pptx
Introduction to Python.pdf
What is Python? An overview of Python for science.
What is the best way to learn Python.pptx
Introduction to python
Ad

Recently uploaded (20)

PDF
Machine learning based COVID-19 study performance prediction
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
NewMind AI Monthly Chronicles - July 2025
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Electronic commerce courselecture one. Pdf
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PPTX
A Presentation on Artificial Intelligence
PPTX
Cloud computing and distributed systems.
Machine learning based COVID-19 study performance prediction
Spectral efficient network and resource selection model in 5G networks
Network Security Unit 5.pdf for BCA BBA.
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Advanced methodologies resolving dimensionality complications for autism neur...
NewMind AI Monthly Chronicles - July 2025
The Rise and Fall of 3GPP – Time for a Sabbatical?
Electronic commerce courselecture one. Pdf
Encapsulation_ Review paper, used for researhc scholars
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
The AUB Centre for AI in Media Proposal.docx
Diabetes mellitus diagnosis method based random forest with bat algorithm
Chapter 3 Spatial Domain Image Processing.pdf
“AI and Expert System Decision Support & Business Intelligence Systems”
Understanding_Digital_Forensics_Presentation.pptx
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
A Presentation on Artificial Intelligence
Cloud computing and distributed systems.
Ad

Lec-1-Introduction to Python.pdf gla University

  • 2. Python for Data Science 2 Popular tools used in data science  Data pre-processing and analysis ◦ Python, R, Microsoft Excel, SAS, SPSS  Data exploration and visualization ◦ Tableau, Qlikview, Microsoft Excel  Parallel and distributed computing incase of big data ◦ Apache Spark,Apache Hadoop
  • 3. Python for Data Science 3 Evolution of Python  Python was developed by Guido van Rossum in the late eighties at the ‘National Research Institute for Mathematics and Computer Science’ at Netherlands  Python Editions ◦ Python 1.0 ◦ Python 2.0 ◦ Python 3.0
  • 4. Python for Data Science 4 Python as a programming language  Supports multiple programming paradigm ◦ Functional, Structural, OOPs, etc.  Dynamic typing ◦ Runtime type safety checks  Reference counts ◦ Deallocates objects which are not used for long  Late binding ◦ Methods are looked up by name during runtime  Python’s design is guided by 20 aphorisms as described in Zen of Python by Tim Peters
  • 5. Python for Data Science 5 Python as a programming language  Standard CPython interpreter is managed by “Python Software Foundation”  There are other interpreters namely JPython (Java), Iron Python (C#), Stackless Python (C, used for parallelism), PyPy (Python itself JIT compilation)  Standard libraries are written in python itself  High standards of readability
  • 6. Python for Data Science 6 Python as a programming language  Cross-platform (Windows, Linux, Mac)  Highly supported by a large community group  Better error handle
  • 7. Python for Data Science 7 Python as a programming language  Comparison to Java  Python vs Java ◦ Java is statically typed i.e. type safety is checked during compilation (static compilation) ◦ Thus in Java the time required to develop the code is more ◦ Python which is dynamically typed compensates for huge compilation time when compared to Java ◦ Codes which are dynamically typed tend to be less verbose therefore offering more readability
  • 8. Python for Data Science 8 Advantages of using python  Python has several features that make it well suited for data science  Open source and community development ◦ Developed under Open Source Initiative license making it free to use and distribute even commercially  Syntax used is simple to understand and code  Libraries designed for specific data science tasks  Combines well with majority of the cloud platform service providers
  • 9. Python for Data Science 9 Coding environment  A software program can be written using a terminal, a command prompt (cmd), a text editor or through an Integrated Development Environment (IDE)  The program needs to be saved in a file with an appropriate extension (.py for python, .mat for matlab, etc...) and can be executed in corresponding environment (Python, Matlab, etc…)  Integrated Development Environment (IDE) is a software product solely developed to support software development in various or specific programming language(s)
  • 10. Python for Data Science 10 Coding environment  Python 2.x support will be available till 2020  Python 3.x is an enhanced version of 2.x and will only be maintained from 3.6.x post 2020  Install basic python version or use the online python console as in https://www.python.org/  Execute following commands and view the outputs in terminal or command prompt • Basic print statement • Naming conventions for variables and functions, operators • Conditional operations, looping statements (nested) • Function declaration and calling • Installing modules
  • 11. Python for Data Science 11 https://www.python.org/
  • 12. Python for Data Science 12 https://www.python.org/
  • 13. Python for Data Science 13 Integrated development environment (IDE)  Software application consisting of a cohesive unit of tools required for development  Designed to simplify software development  Utilities provided by IDEs include tools for managing, compiling, deploying and debugging software
  • 14. Python for Data Science 14 Coding environment- IDE  An IDE usually comprises of ◦ Source code editor ◦ Compiler ◦ Debugger ◦ Additional features include syntax and error highlighting, code completion  Offers supports in building and executing the program along with debugging the code from within the environment
  • 15. Python for Data Science 15 Coding environment- IDE  Best IDEs provide version control features  Eclipse+PyDev, SublimeText,Atom, GNU Emacs,Vi/Vim,Visual Studio,Visual Studio Code are general IDEs with python support  Apart from these some of the python specific editors include Pycharm, Jupyter, Spyder,Thonny
  • 16. Python for Data Science 16 Spyder  Supported across Linux, Mac OS X and Windows platforms  Available as open source version  Can be installed separately or through Anaconda distribution  Developed for Python and specifically data science  Features include ◦ Code editor with robust syntax and error highlighting ◦ Code completion and navigation ◦ Debugger ◦ Integrated document  Interface similar to MATLAB and RStudio
  • 17. Python for Data Science 17 Spyder
  • 18. Python for Data Science 18 PyCharm  Supported across Linux, Mac OS X and Windows platforms  Available as community (free open source) and professional (paid) version  Supports only Python  Can be installed separately or through Anaconda distribution  Features include ◦ Code editor provides syntax and error highlighting ◦ Code completion and navigation ◦ Unit testing ◦ Debugger ◦ Version control
  • 19. Python for Data Science 19 PyCharm
  • 20. Python for Data Science 20 Jupyter Notebook  Web application that allows creation and manipulation of documents called ‘notebook’  Supported across Linux, Mac OS X and Windows platforms  Available as open source version
  • 21. Python for Data Science 21 Jupyter Notebook Source-https://jupyter.org/
  • 22. Python for Data Science 22 Jupyter Notebook  Bundled with Anaconda distribution or can be installed separately  Supports Julia, Python, R and Scala  Consists of ordered collection of input and output cells that contain code, text, plots etc. Source-https://jupyter.org/
  • 23. Python for Data Science 23 Jupyter Notebook  Allows sharing of code and narrative text through output formats like PDF, HTML etc. ◦ Education and presentation tool  Lacks most of the features of a good IDE Source-https://jupyter.org/
  • 24. Python for Data Science 24 How to choose the best IDE?  Requirements  Working with different IDEs helps us understand our own requirement