Justin Sun
PyData Boston
July 27, 2013
Overview
• What can you do with Orange?
• History
• Architecture
• Installation
• Widget Examples
• Demo
• Resources
Classification Tree Scheme
History
 1996 – University of Ljubljana and Jožef Stefan
Institute started development of ML*, a machine
learning framework in C++.
 1997 – Python integration layer
 2003 – GUI based on PyQt
 2013 – Orange Canvas 2.7 released – Major GUI
redesign.
Source: http://en.wikipedia.org/wiki/Orange_%28software%29
High-level Architecture
Algorithms written in C++
Python integration layer (Python 2.7)
Orange Canvas – Visual programming
Why Use Orange?
• No programming needed – Visual programming
• Data Visualization
• Easy to try different Machine Learning Algorithms
• Add-ons for
  • Bioinformatics
  • Network Analysis
  • Text mining
• Free and open source software
Installation
• Download installer from http://orange.biolab.si/
• Run installer
• Requires Python 2.6 or 2.7
• Includes NumPy, SciPy, PyQt, other required libraries
• To run, double-click on the Orange Canvas icon
Orange Canvas - PyData 2013
Scheme
Widgets
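The scheme-and-widget model can be illustrated with a minimal sketch in plain Python. It is not Orange's API or implementation: the Widget class, read_rows, and sort_by_first_column are hypothetical names made up for this example. It only shows the idea that a widget's output feeds the input of every widget connected to it.

```python
class Widget:
    """Hypothetical node in a scheme: data comes in, processed data goes out."""

    def __init__(self, name, process):
        self.name = name
        self.process = process   # function applied to incoming data
        self.listeners = []      # widgets connected to this widget's output
        self.last_output = None

    def connect(self, other):
        """Link this widget's output to another widget's input."""
        self.listeners.append(other)
        return other

    def receive(self, data):
        """Process incoming data and pass the result to connected widgets."""
        self.last_output = self.process(data)
        for widget in self.listeners:
            widget.receive(self.last_output)


def read_rows(text):
    """Stand-in for a File widget: parse comma-separated text into rows."""
    return [line.split(",") for line in text.strip().splitlines()]


def sort_by_first_column(rows):
    """Stand-in for a Data Table widget sorted by its first column."""
    return sorted(rows, key=lambda row: row[0])


file_widget = Widget("File", read_rows)
table_widget = Widget("Data Table", sort_by_first_column)
file_widget.connect(table_widget)

# Feeding data into the first widget propagates it through the scheme.
file_widget.receive("virginica,6.3\nsetosa,5.1\nversicolor,7.0\n")
print(table_widget.last_output)
```

Here the Data Table stand-in has no listeners, which corresponds to an output that is not in use.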
Demo
• Classification example
• Evaluation
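The evaluation step, comparing several learners on the same data, can be sketched in plain Python as a k-fold cross-validated classification accuracy. This is an illustration under stated assumptions, not Orange's API: the two learners, the toy data set, and the function names are all hypothetical, and Orange Canvas performs this kind of comparison through widgets rather than code.

```python
from collections import Counter


def majority_learner(train):
    """Return a classifier that always predicts the most common training label."""
    label = Counter(y for _, y in train).most_common(1)[0][0]
    return lambda x: label


def nearest_neighbor_learner(train):
    """Return a 1-nearest-neighbor classifier (squared Euclidean distance)."""
    def classify(x):
        def dist(example):
            features, _ = example
            return sum((a - b) ** 2 for a, b in zip(features, x))
        return min(train, key=dist)[1]
    return classify


def cross_validated_accuracy(learner, data, folds=3):
    """Fraction of examples classified correctly, each tested once across folds."""
    correct = 0
    for k in range(folds):
        test = data[k::folds]  # every folds-th example, starting at index k
        train = [ex for i, ex in enumerate(data) if i % folds != k]
        classifier = learner(train)
        correct += sum(classifier(x) == y for x, y in test)
    return float(correct) / len(data)


# Toy data (made up for illustration): two well-separated clusters.
data = [
    ((1.0, 1.0), "a"), ((1.2, 0.9), "a"), ((0.8, 1.1), "a"),
    ((1.1, 1.3), "a"), ((0.9, 0.8), "a"), ((1.3, 1.0), "a"),
    ((5.0, 5.0), "b"), ((5.1, 4.8), "b"), ((4.9, 5.2), "b"),
]

scores = {
    "majority": cross_validated_accuracy(majority_learner, data),
    "1-NN": cross_validated_accuracy(nearest_neighbor_learner, data),
}
for name, score in sorted(scores.items()):
    print(name, round(score, 3))
```

The majority learner serves as a baseline: a learner worth keeping should score clearly above it on held-out examples.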
Resources
• Orange Website: http://orange.biolab.si/
• Tutorials: http://www.biolab.si/janez/kyoto/
• Interactive Network Analysis with Orange: http://www.jstatsoft.org/v53/i06
• Orange Whitepaper with scripting examples: http://www.celta.paris-sorbonne.fr/anasem/papers/miscelanea/InteractiveDataMining.pdf
Thank You!
• Email: justin@justinsun (dot) com
• Slides: http://www.slideshare.net/justin_sun/

Editor's Notes

  • #2: Orange Canvas – Visual programming environment for data mining
  • #4: Example of a complete program for classification trees.
  • #5: Orange was originally a collection of C++ algorithms; Python was added next, and finally a graphical interface.
  • #7: Why use Orange? It has a wide selection of data visualizations that you can use to explore your data. You can prototype machine learning algorithms using Orange Canvas without investing much time in programming. And there are a number of add-ons for bioinformatics, network analysis, and text mining, plus more contributed by the community.
  • #9: Screen capture from the Orange home page.
  • #10: Orange Canvas is an interactive environment for visual programming. It's open source and free to use. In this example, you can click on a widget in the palette on the left and a copy of that widget is placed on the canvas. On this screen we see the File widget, which reads data into the system, and the Data Table widget, which displays the data in a table with the ability to sort by column. The widgets are connected by clicking on the right-hand side of the File widget and dragging a line to the left-hand side of the Data Table widget. The convention in Orange is for inputs to be on the left and outputs on the right. Notice that the Data Table's right-hand side is dotted, meaning its output is not in use. With this simple concept, let's see how you can explore a data set.
  • #11: Visualization widgets
  • #12: Clustering and unsupervised learning widgets
  • #13: Classification
  • #14: Network and Text Mining add-ons
  • #15: Bioinformatics widgets
  • #16: Demo #1: simple classification example with classification trees; example scatterplot; VizRank selection of interesting projections. Demo #2: comparing classifiers; multiple learners; test learners evaluation; show evaluation metrics.
  • #17: To get started, first install Orange Canvas. Try the built-in tutorials listed here.