SlideShare a Scribd company logo
Keerti Verma
AP,OCT
Introduction
R is a -
• A Programming Language
• A Statistical Package
• An Interpreter
• Open Source
• Object Oriented Language
Continue...
 R is a programming language and software
environment for statistical computing and
graphics supported by the R Foundation for
Statistical Computing.
 The R language is widely used
among statisticians and data miners for
developing statistical software and data analysis.
 Polls, surveys of data miners, and studies of
scholarly literature databases show that R's
popularity has increased substantially in recent
years
Continue...
Continue...
Evolution Of R Language
 R is an implementation of the S programming
language.
 S was created by John Chambers while at Bell Labs.
 R was created by Ross Ihaka and Robert
Gentleman at the University of Auckland, New
Zealand, and is currently developed by the R
Development Core Team, of which Chambers is a
member. R is named partly after the first names of
the first two R authors and partly as a play on the
name of S.
statistical Programming Language
S version1
S version2
S version 3
S version4
developed 30 years
ago for research applied to the
high-tech industry
R
Features of R Language
 As stated earlier, R is a programming language and
software environment for statistical analysis, graphics
representation and reporting. The following are the
important features of R: -
 R is a well-developed, simple and effective
programming language which includes conditionals,
loops, user defined recursive functions and input and
output facilities.
 R has an effective data handling and storage facility.
 R provides a suite of operators for calculations on
arrays, lists, vectors and matrices.
Continue...
 R provides a large, coherent and integrated
collection of tools for data analysis.
 R provides graphical facilities for data analysis and
display either directly at the computer or printing
at the papers.
 As a conclusion, R is world’s most widely used
statistics programming language. It's the # 1 choice
of data scientists and supported by a vibrant and
talented community of contributors. R is taught in
universities and deployed in mission critical
business applications.
Continue...
 The project was conceived in 1992, with an initial
version released in 1995 and a stable beta version
in 2000.
 Current stable version of R is 3.3.2 released on
October 31, 2016.
Data types in R
 Generally, while doing programming in any
programming language, you need to use various
variables to store various information. Variables
are nothing but reserved memory locations to
store values. This means that, when you create a
variable you reserve some space in memory.
 In contrast to other programming languages like C
and java in R, the variables are not declared as
some data type. The variables are assigned with R-
Objects and the data type of the R-object becomes
the data type of the variable.
Continue...
 There are many types of R-objects. The frequently
used ones are:
 Vectors
 Lists
 Matrices
 Arrays
 Data Frames
Continue...
 A vector is a sequence of data elements of the same
basic type.
 The simplest of these objects is the vector object and
there are six data types of these atomic vectors, also
termed as six classes of vectors. The other R-Objects
are built upon the atomic vectors.
Continue...Data type Example verify
Logical TRUE , FALSE v <- TRUE
print(class(v))
it produces the following
result:-
[1] "logical"
Numeric 12.3, 5, 999 v <- 23.5
print(class(v))
it produces the following
result:
[1] "numeric"
Integer 2L, 34L, 0L v <- 2L
print(class(v))
it produces the following
result:
[1] "complex"
Continue...
Data Type Example Verify
Complex 2+5i v <- 2+5i
print(class(v))
It print the following
result:
[1] "complex"
Character 'a' , '"good", "TRUE", '23.4' v <- "TRUE"
print(class(v))
It print the following
result
[1] "character"
Raw Hello" is stored as 48 65
6c 6c 6f
v <- charToRaw("Hello")
print(class(v))
It print the following
result:
[1] "raw"
Continue...
Interacting with R
 RStudio is a free and open-source integrated
development environment (IDE) for R,
a programming language for statistical
computing and graphics.
 RStudio was founded by JJ Allaire,creator of the
programming language ColdFusion.
Continue...
 RStudio is available in two editions: RStudio
Desktop, where the program is run locally as a
regular desktop application.
 RStudio Server, Prepackaged distributions of
RStudio Desktop are available for Windows, OS X,
and Linux.
 RStudio is written in the C++ programming
language and uses the Qt framework for
its graphical user interface.
Rstudio IDE
Comparison with other statistics
software
Continue...
 Sas:- SAS (Statistical Analysis System) is a software
suite developed by SAS Institute for advanced
analytics, multivariate analyses, business
intelligence, data management, and predictive
analytics.
 SAS was developed at North Carolina State
University from 1966 until 1976, when SAS Institute
was incorporated. SAS was further developed in the
1980s and 1990s with the addition of new statistical
procedures, additional components
Continue...
 SAS is an expensive tool whereas R is free.
 Algorithms used in SAS procedures are not open to
public so you cannot do research on that. Whereas
R is all transparent.
 R has advanced graphical capabilities. Supports
various professional graphics templates.
 New statistical and machine learning techniques
implemented in R much more quickly than SAS.
500 lines of SAS code can be equivalent to 100 lines
of R code
Continue...
 Time Series Forecasting - Need to purchase SAS
ETS Module. It is free in R
 Text Mining - Need to purchase SAS Enterprise /
Text Miner. It is free in R
 Machine Learning - Need to purchase SAS
Enterprise Miner. It is free in R
 Online Reporting - Need to purchase SAS Visual
Analytics. It is free in R with shiny package.
Advantages of R
 Free open source philosophy.
 R has over 4800 packages available from multiple
repositories specializing in topics like
econometrics, data mining, spatial analysis, and
bio-informatics.
 Online help and discussion.
 R visualization capabilities .
 Interface with other languages and scripting
capabilities
Continue...
 Real data have missing values. Missing values are
an integral part of the R language. Many functions
have arguments that control how missing values
are to be handled.
 Solution of big data .
Disadvantages of R
 R has a steep learning curve it does take a while to
get used to the power of R but no steeper than for
other statistical languages. ˆR is not so easy to use
for the novice.
 No default parallel execution.
 Top skills needed for high performance
computing.
Continue...
 Memory management, speed, and efficiency are
probably the biggest challenges R faces.
 Poor management of large dataset.
 Complicated structure of packages in R .
 Capabilities such as security were not built into the
R language, Also, R cannot be embedded in a Web
browser.
 A high-level programming language
So why learn R??
Some other points
 Hadoop and R are a natural match and are quite
complementary in terms of visualization and analytics
of big data.
 Rhipe is an R library which allows running a map
reduce job.
References
 https://en.wikipedia.org/wiki/R_(programming_langu
age)
 http://www.infoworld.com/article/2940864/applicatio
n-development/r-programming-language-statistical-
data-analysis.html
 http://www.infoworld.com/article/2880360/big-
data/learn-to-crunch-big-data-with-r.html
 http://www.tutorialpoint.pdf
 https://www.youtube.com/watch?v=VocbExLSukE&t=
7s
Thank you

More Related Content

PPTX
Introduction to statistical software R
PPTX
Introduction to R
PPTX
R programming presentation
PPTX
R programming
PDF
Introduction to R
PDF
Class ppt intro to r
PDF
Introduction to data analysis using R
PPT
R studio
Introduction to statistical software R
Introduction to R
R programming presentation
R programming
Introduction to R
Class ppt intro to r
Introduction to data analysis using R
R studio

What's hot (20)

PPTX
R programming
PPTX
Application of excel and spss programme in statistical
PPTX
Minitab- A statistical tool
PPTX
Applications of sas and minitab in data analysis
PPTX
Graphs (Biostatistics)
PPTX
Graphs(Biostatistics and Research Methodology) B.pharmacy(8th sem.)
PDF
Simple & Multiple Regression Analysis
PPTX
Anatomical, therapeutic and chemical classification of drugs.pptx
PPTX
SEM 8 BIOSTATISTICS graphs minitab excel etc
PPTX
Basic stat analysis using excel
PPT
Optimization techniques
PPT
Application of SPSS by umakant bhaskar gohatre
PPTX
Regression
PPTX
Establishment of Pharmacovigilance Programme
PPTX
Application of Excel and SPSS software for statistical analysis- Biostatistic...
PPTX
Karl pearson's correlation
PPTX
Introduction to Research - Biostatistics and Research methodology 8th Sem Uni...
PPTX
Factorial design ,full factorial design, fractional factorial design
PDF
Report Writing and Presentation of Data.pdf
PPTX
Various statistical software's in data analysis.
R programming
Application of excel and spss programme in statistical
Minitab- A statistical tool
Applications of sas and minitab in data analysis
Graphs (Biostatistics)
Graphs(Biostatistics and Research Methodology) B.pharmacy(8th sem.)
Simple & Multiple Regression Analysis
Anatomical, therapeutic and chemical classification of drugs.pptx
SEM 8 BIOSTATISTICS graphs minitab excel etc
Basic stat analysis using excel
Optimization techniques
Application of SPSS by umakant bhaskar gohatre
Regression
Establishment of Pharmacovigilance Programme
Application of Excel and SPSS software for statistical analysis- Biostatistic...
Karl pearson's correlation
Introduction to Research - Biostatistics and Research methodology 8th Sem Uni...
Factorial design ,full factorial design, fractional factorial design
Report Writing and Presentation of Data.pdf
Various statistical software's in data analysis.
Ad

Viewers also liked (20)

PDF
R programming Basic & Advanced
PPTX
Programming in R
PPTX
An Interactive Introduction To R (Programming Language For Statistics)
PDF
R programming groundup-basic-section-i
PPTX
How to get started with R programming
PPTX
R language tutorial
PPTX
Model Building with RevoScaleR: Using R and Hadoop for Statistical Computation
PPTX
The Very ^ 2 Basics of R
PPTX
R programming Language , Rahul Singh
PPTX
R Intro Workshop
PDF
Übersicht Glm Workshop 2009
PPT
Neural networks1
PPTX
Introduction to R programming
PDF
JSON REST API for WordPress
PPTX
Simulation presentation
PDF
Getting Up to Speed with R: Certificate Program in R for Statistical Analysis...
PPTX
The Aviation Insurance Industry Presentation (1)
PPTX
Biases in epidemiology
PDF
Statistics with R
PPTX
XNN001 Introductory epidemiological concepts - sampling, bias and error
R programming Basic & Advanced
Programming in R
An Interactive Introduction To R (Programming Language For Statistics)
R programming groundup-basic-section-i
How to get started with R programming
R language tutorial
Model Building with RevoScaleR: Using R and Hadoop for Statistical Computation
The Very ^ 2 Basics of R
R programming Language , Rahul Singh
R Intro Workshop
Übersicht Glm Workshop 2009
Neural networks1
Introduction to R programming
JSON REST API for WordPress
Simulation presentation
Getting Up to Speed with R: Certificate Program in R for Statistical Analysis...
The Aviation Insurance Industry Presentation (1)
Biases in epidemiology
Statistics with R
XNN001 Introductory epidemiological concepts - sampling, bias and error
Ad

Similar to R programming language (20)

PPTX
R_L1-Aug-2022.pptx
PDF
UNIT-4 Start Learning R and installation .pdf
PDF
UNIT-1 Start Learning R.pdf
PDF
2 it unit-1 start learning r
PPTX
Big data analytics with R tool.pptx
PDF
a_very_brief_introduction_to_r.pdfhshkdjdn
PPTX
DOC-20240829-WA0001 power point presentation
PPTX
Introduction to R Programming
PDF
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
PDF
R programming advantages and disadvantages
PDF
The History and Use of R
PDF
Introduction to R and R Studio
PPTX
R programming Language
PPTX
Data Science With R Programming Unit - II Part-1.pptx
PPTX
Data science with R Unit - II Part-1.pptx
PPTX
LSESU a Taste of R Language Workshop
PPTX
R language
PDF
Study of R Programming
PDF
R Programming - part 1.pdf
PPTX
R Programming and Lab - Unit I PPTX with examples
R_L1-Aug-2022.pptx
UNIT-4 Start Learning R and installation .pdf
UNIT-1 Start Learning R.pdf
2 it unit-1 start learning r
Big data analytics with R tool.pptx
a_very_brief_introduction_to_r.pdfhshkdjdn
DOC-20240829-WA0001 power point presentation
Introduction to R Programming
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
R programming advantages and disadvantages
The History and Use of R
Introduction to R and R Studio
R programming Language
Data Science With R Programming Unit - II Part-1.pptx
Data science with R Unit - II Part-1.pptx
LSESU a Taste of R Language Workshop
R language
Study of R Programming
R Programming - part 1.pdf
R Programming and Lab - Unit I PPTX with examples

Recently uploaded (20)

PDF
Flood Susceptibility Mapping Using Image-Based 2D-CNN Deep Learnin. Overview ...
PDF
Nekopoi APK 2025 free lastest update
PDF
Understanding Forklifts - TECH EHS Solution
PDF
AI in Product Development-omnex systems
PDF
medical staffing services at VALiNTRY
PPTX
Agentic AI Use Case- Contract Lifecycle Management (CLM).pptx
PPTX
ai tools demonstartion for schools and inter college
PPTX
Odoo POS Development Services by CandidRoot Solutions
PDF
How Creative Agencies Leverage Project Management Software.pdf
PDF
Navsoft: AI-Powered Business Solutions & Custom Software Development
PDF
Upgrade and Innovation Strategies for SAP ERP Customers
PDF
Adobe Premiere Pro 2025 (v24.5.0.057) Crack free
PDF
Design an Analysis of Algorithms II-SECS-1021-03
PDF
System and Network Administration Chapter 2
PPTX
CHAPTER 2 - PM Management and IT Context
PDF
2025 Textile ERP Trends: SAP, Odoo & Oracle
PDF
Wondershare Filmora 15 Crack With Activation Key [2025
PPTX
history of c programming in notes for students .pptx
PDF
Odoo Companies in India – Driving Business Transformation.pdf
PPTX
Agentic AI : A Practical Guide. Undersating, Implementing and Scaling Autono...
Flood Susceptibility Mapping Using Image-Based 2D-CNN Deep Learnin. Overview ...
Nekopoi APK 2025 free lastest update
Understanding Forklifts - TECH EHS Solution
AI in Product Development-omnex systems
medical staffing services at VALiNTRY
Agentic AI Use Case- Contract Lifecycle Management (CLM).pptx
ai tools demonstartion for schools and inter college
Odoo POS Development Services by CandidRoot Solutions
How Creative Agencies Leverage Project Management Software.pdf
Navsoft: AI-Powered Business Solutions & Custom Software Development
Upgrade and Innovation Strategies for SAP ERP Customers
Adobe Premiere Pro 2025 (v24.5.0.057) Crack free
Design an Analysis of Algorithms II-SECS-1021-03
System and Network Administration Chapter 2
CHAPTER 2 - PM Management and IT Context
2025 Textile ERP Trends: SAP, Odoo & Oracle
Wondershare Filmora 15 Crack With Activation Key [2025
history of c programming in notes for students .pptx
Odoo Companies in India – Driving Business Transformation.pdf
Agentic AI : A Practical Guide. Undersating, Implementing and Scaling Autono...

R programming language

  • 2. Introduction R is a - • A Programming Language • A Statistical Package • An Interpreter • Open Source • Object Oriented Language
  • 3. Continue...  R is a programming language and software environment for statistical computing and graphics supported by the R Foundation for Statistical Computing.  The R language is widely used among statisticians and data miners for developing statistical software and data analysis.  Polls, surveys of data miners, and studies of scholarly literature databases show that R's popularity has increased substantially in recent years
  • 6. Evolution Of R Language  R is an implementation of the S programming language.  S was created by John Chambers while at Bell Labs.  R was created by Ross Ihaka and Robert Gentleman at the University of Auckland, New Zealand, and is currently developed by the R Development Core Team, of which Chambers is a member. R is named partly after the first names of the first two R authors and partly as a play on the name of S.
  • 7. statistical Programming Language S version1 S version2 S version 3 S version4 developed 30 years ago for research applied to the high-tech industry R
  • 8. Features of R Language  As stated earlier, R is a programming language and software environment for statistical analysis, graphics representation and reporting. The following are the important features of R: -  R is a well-developed, simple and effective programming language which includes conditionals, loops, user defined recursive functions and input and output facilities.  R has an effective data handling and storage facility.  R provides a suite of operators for calculations on arrays, lists, vectors and matrices.
  • 9. Continue...  R provides a large, coherent and integrated collection of tools for data analysis.  R provides graphical facilities for data analysis and display either directly at the computer or printing at the papers.  As a conclusion, R is world’s most widely used statistics programming language. It's the # 1 choice of data scientists and supported by a vibrant and talented community of contributors. R is taught in universities and deployed in mission critical business applications.
  • 10. Continue...  The project was conceived in 1992, with an initial version released in 1995 and a stable beta version in 2000.  Current stable version of R is 3.3.2 released on October 31, 2016.
  • 11. Data types in R  Generally, while doing programming in any programming language, you need to use various variables to store various information. Variables are nothing but reserved memory locations to store values. This means that, when you create a variable you reserve some space in memory.  In contrast to other programming languages like C and java in R, the variables are not declared as some data type. The variables are assigned with R- Objects and the data type of the R-object becomes the data type of the variable.
  • 12. Continue...  There are many types of R-objects. The frequently used ones are:  Vectors  Lists  Matrices  Arrays  Data Frames
  • 13. Continue...  A vector is a sequence of data elements of the same basic type.  The simplest of these objects is the vector object and there are six data types of these atomic vectors, also termed as six classes of vectors. The other R-Objects are built upon the atomic vectors.
  • 14. Continue...Data type Example verify Logical TRUE , FALSE v <- TRUE print(class(v)) it produces the following result:- [1] "logical" Numeric 12.3, 5, 999 v <- 23.5 print(class(v)) it produces the following result: [1] "numeric" Integer 2L, 34L, 0L v <- 2L print(class(v)) it produces the following result: [1] "complex"
  • 15. Continue... Data Type Example Verify Complex 2+5i v <- 2+5i print(class(v)) It print the following result: [1] "complex" Character 'a' , '"good", "TRUE", '23.4' v <- "TRUE" print(class(v)) It print the following result [1] "character" Raw Hello" is stored as 48 65 6c 6c 6f v <- charToRaw("Hello") print(class(v)) It print the following result: [1] "raw"
  • 17. Interacting with R  RStudio is a free and open-source integrated development environment (IDE) for R, a programming language for statistical computing and graphics.  RStudio was founded by JJ Allaire,creator of the programming language ColdFusion.
  • 18. Continue...  RStudio is available in two editions: RStudio Desktop, where the program is run locally as a regular desktop application.  RStudio Server, Prepackaged distributions of RStudio Desktop are available for Windows, OS X, and Linux.  RStudio is written in the C++ programming language and uses the Qt framework for its graphical user interface.
  • 20. Comparison with other statistics software
  • 21. Continue...  Sas:- SAS (Statistical Analysis System) is a software suite developed by SAS Institute for advanced analytics, multivariate analyses, business intelligence, data management, and predictive analytics.  SAS was developed at North Carolina State University from 1966 until 1976, when SAS Institute was incorporated. SAS was further developed in the 1980s and 1990s with the addition of new statistical procedures, additional components
  • 22. Continue...  SAS is an expensive tool whereas R is free.  Algorithms used in SAS procedures are not open to public so you cannot do research on that. Whereas R is all transparent.  R has advanced graphical capabilities. Supports various professional graphics templates.  New statistical and machine learning techniques implemented in R much more quickly than SAS. 500 lines of SAS code can be equivalent to 100 lines of R code
  • 23. Continue...  Time Series Forecasting - Need to purchase SAS ETS Module. It is free in R  Text Mining - Need to purchase SAS Enterprise / Text Miner. It is free in R  Machine Learning - Need to purchase SAS Enterprise Miner. It is free in R  Online Reporting - Need to purchase SAS Visual Analytics. It is free in R with shiny package.
  • 24. Advantages of R  Free open source philosophy.  R has over 4800 packages available from multiple repositories specializing in topics like econometrics, data mining, spatial analysis, and bio-informatics.  Online help and discussion.  R visualization capabilities .  Interface with other languages and scripting capabilities
  • 25. Continue...  Real data have missing values. Missing values are an integral part of the R language. Many functions have arguments that control how missing values are to be handled.  Solution of big data .
  • 26. Disadvantages of R  R has a steep learning curve it does take a while to get used to the power of R but no steeper than for other statistical languages. ˆR is not so easy to use for the novice.  No default parallel execution.  Top skills needed for high performance computing.
  • 27. Continue...  Memory management, speed, and efficiency are probably the biggest challenges R faces.  Poor management of large dataset.  Complicated structure of packages in R .  Capabilities such as security were not built into the R language, Also, R cannot be embedded in a Web browser.  A high-level programming language
  • 29. Some other points  Hadoop and R are a natural match and are quite complementary in terms of visualization and analytics of big data.  Rhipe is an R library which allows running a map reduce job.
  • 30. References  https://en.wikipedia.org/wiki/R_(programming_langu age)  http://www.infoworld.com/article/2940864/applicatio n-development/r-programming-language-statistical- data-analysis.html  http://www.infoworld.com/article/2880360/big- data/learn-to-crunch-big-data-with-r.html  http://www.tutorialpoint.pdf  https://www.youtube.com/watch?v=VocbExLSukE&t= 7s