SlideShare a Scribd company logo
Shrilesh Kathe
Advanced Analytics
Education
 Bachelor of Engineering in Electronics and Telecommunication from
Ramrao Adik Institute of Technology, Mumbai University.
Technical skills
Apache Spark:
 Good understanding of the parallel processing architecture of Spark.
 Distributing the dataset across the nodes to leverage on distributed
computing using spark RDD, Data Frame etc.
 Designing the linage graph for optimal execution of the transformations.
 Comfortable developing the driver program using Python, Java and
Scala.
 Building Dstreams from live streaming data and 24/7 applications using
Spark Streaming.
Java:
 In depth knowledge of core Java programming.
 Very good understanding of OOPs concepts like polymorphism,
inheritance, multithreading, exception handling, etc.
 Comfortable with Eclipse JDK tool kit.
Analytics skills:
 Prepared linear regression models from various datasets for prediction
and inference in R.
 Checking model accuracy in prediction by taking into account bias,
variance and mean squared error (MSE).
 Applied nPath function on a given weblog to recognize various patterns
in the data and visualized the data using sankey chart in Teradata App
Center.
Certification
 IBM certified developer Apache Spark 1.6
 Oracle Certified Professional Java SE 6 Programmer.
Page: 1
Project:
WEST BI NIKE CWA Implementation.
Role : Offshore ETL developer.
Client : Nike.
Period : From Oct-2016 till date.
Industry : Web Analytics.
Project Name : WEST BI NIKE CWA Implementation
Project Type : Development.
Tools : SPARK SQL ,Python, HIVE
 Responsible for developing Generic ETL solution to capture web traffic data
for different report suits in SPARK environment.
 Writing pyspark application to parse raw data, populate staging tables and
store the refined data in partitioned tables using hive .
 Migration of current CWA solution to Apache Spark using PySpark.
 Development of Dynamic SQL for facilitating XML parsing and creation of
logical functions in Python.
 Maintaining and updating useful documents for the project.
 Quality analysis and timely testing for the loaded data.
 Process orchestration and general development/troubleshooting on the
solution.
Remote Terminal Unit For LPG Filling Plant, HPCL, Chembur.(B.E Final year
Project)
 The objective of the project was to connect the LPG refinery and the filling
plant at HPCL Chembur.
 Remote Terminal Unit interfaces the setup and monitors the fluid flow and
records the volume, temperature, density, etc. of the fluid flowing in the
pipelines.
 This Remote Terminal Unit is developed using PLCs and SCADA. The RTU
interfaces various components used at the plant like Servo Gauge, ROVs,
MOVs, etc. to a centralized computer using an OFC(Optical Fiber Cable).
Page: 2
Training
 Participated in TCS CodeVita 2015, an Online Coding competition and
cleared the first round. It consisted of 5 questions where we had to build
algorithms to tackle real life problems. The coding was done in Java.
 Attended a week’s training on Teradata Aster. Learned about Aster
architecture of queen and worker nodes. Studied different SQL-MR functions
like nPath, sessionize, nGram, etc.
 Completed 'Programming for Everybody', an online course in PYTHON
offered by University Of Michigan on COURSERA with Distinction.
 Completed ‘An Introduction to Interactive Programming in Python’ an online
course offered by RICE university on event driven programming in python.
Developed various interactive games like Tic Tac Toe, Rock Paper Scissors
Lizard Spock, Paddle, etc.
Page: 3

More Related Content

PDF
Design Verification Engineer
PDF
Himanshu_Somaiya_Resume
PDF
Srinivas_Kotha_CV
PDF
PARTH DESAI RESUME
PDF
Lavina Chandwani Resume
PDF
Sathiyasainathan Fulltime JD
DOCX
Ajay - Firmware Resume FT
PDF
Shubhankar pawade resume
Design Verification Engineer
Himanshu_Somaiya_Resume
Srinivas_Kotha_CV
PARTH DESAI RESUME
Lavina Chandwani Resume
Sathiyasainathan Fulltime JD
Ajay - Firmware Resume FT
Shubhankar pawade resume

What's hot (20)

PDF
BFunsten_Resume
PDF
Software analyst resume
DOC
Rajath_Shivananda
PDF
Gayathri_Physical_Design_Intel
PDF
Resume srishail upadhye
PDF
Kunyuan Wang_CV
PDF
Resume_Apple1
PPT
Marek Suplata Projects
PDF
SagarMShivaram_Embedded Systems
PDF
Swetha Jayachandran resume
PPTX
Matlab Projects for Electrical Students
DOC
satish real
PPTX
6 Lowpan Rpl Tutorial Code
PDF
Ponniranjan fulltime
PPTX
BTech Projects in Scilab
DOCX
Aakash_Shah
PDF
xiangyuzhang
PDF
Chintan Varia-MSEE
BFunsten_Resume
Software analyst resume
Rajath_Shivananda
Gayathri_Physical_Design_Intel
Resume srishail upadhye
Kunyuan Wang_CV
Resume_Apple1
Marek Suplata Projects
SagarMShivaram_Embedded Systems
Swetha Jayachandran resume
Matlab Projects for Electrical Students
satish real
6 Lowpan Rpl Tutorial Code
Ponniranjan fulltime
BTech Projects in Scilab
Aakash_Shah
xiangyuzhang
Chintan Varia-MSEE
Ad

Viewers also liked (13)

PDF
J2 Se 5.0 Name And Version Change
PDF
Java application-development
DOCX
Hemant Ajwani_10725706
PDF
PHP on Google App Engine
DOCX
CV_Tarun Jha_Final
PDF
CV of Jafsher
DOC
Manoj CV
DOCX
JDK,JRE,JVM
DOCX
DOC
Rushabh_Doshi_1_
PPT
Mta social media presentation 2014a
PDF
10 Step Guide to Hiring a Designer
J2 Se 5.0 Name And Version Change
Java application-development
Hemant Ajwani_10725706
PHP on Google App Engine
CV_Tarun Jha_Final
CV of Jafsher
Manoj CV
JDK,JRE,JVM
Rushabh_Doshi_1_
Mta social media presentation 2014a
10 Step Guide to Hiring a Designer
Ad

Similar to Shrilesh kathe 2017 (20)

DOCX
Himansu-Java&BigdataDeveloper
DOCX
Punit_Shah_resume
DOCX
Punit_Shah_resume
DOCX
Punit_Shah_resume
PDF
AUK - CV WO Ref
PDF
Hsin-Kai Wang's Resume(software)
PDF
oyedele_resume_updated
PDF
Resume_Gautham
PPTX
Seattle Spark Meetup Mobius CSharp API
PDF
ResumeLinkedIn
PDF
KRITI_BHOLA_CV
PDF
Penglun_Li
PDF
Rajeev kumar apache_spark & scala developer
PDF
resumePdf
DOC
Resume
PDF
AjinkyaKher_Resume
PDF
RIGGINS_Chase_Resume_2016
PDF
VenkateshAvula
PDF
TripathiAkriti_resume
PDF
Tiny Batches, in the wine: Shiny New Bits in Spark Streaming
Himansu-Java&BigdataDeveloper
Punit_Shah_resume
Punit_Shah_resume
Punit_Shah_resume
AUK - CV WO Ref
Hsin-Kai Wang's Resume(software)
oyedele_resume_updated
Resume_Gautham
Seattle Spark Meetup Mobius CSharp API
ResumeLinkedIn
KRITI_BHOLA_CV
Penglun_Li
Rajeev kumar apache_spark & scala developer
resumePdf
Resume
AjinkyaKher_Resume
RIGGINS_Chase_Resume_2016
VenkateshAvula
TripathiAkriti_resume
Tiny Batches, in the wine: Shiny New Bits in Spark Streaming

Recently uploaded (20)

PDF
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
PPTX
05. PRACTICAL GUIDE TO MICROSOFT EXCEL.pptx
PDF
Galatica Smart Energy Infrastructure Startup Pitch Deck
PPTX
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
PDF
Clinical guidelines as a resource for EBP(1).pdf
PPTX
1_Introduction to advance data techniques.pptx
PPTX
Moving the Public Sector (Government) to a Digital Adoption
PPTX
climate analysis of Dhaka ,Banglades.pptx
PPTX
Introduction to Knowledge Engineering Part 1
PDF
.pdf is not working space design for the following data for the following dat...
PPTX
Business Acumen Training GuidePresentation.pptx
PDF
Lecture1 pattern recognition............
PPTX
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
PDF
Mega Projects Data Mega Projects Data
PPTX
Data_Analytics_and_PowerBI_Presentation.pptx
PDF
Foundation of Data Science unit number two notes
PPTX
oil_refinery_comprehensive_20250804084928 (1).pptx
PPTX
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
PDF
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
05. PRACTICAL GUIDE TO MICROSOFT EXCEL.pptx
Galatica Smart Energy Infrastructure Startup Pitch Deck
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
Clinical guidelines as a resource for EBP(1).pdf
1_Introduction to advance data techniques.pptx
Moving the Public Sector (Government) to a Digital Adoption
climate analysis of Dhaka ,Banglades.pptx
Introduction to Knowledge Engineering Part 1
.pdf is not working space design for the following data for the following dat...
Business Acumen Training GuidePresentation.pptx
Lecture1 pattern recognition............
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
Mega Projects Data Mega Projects Data
Data_Analytics_and_PowerBI_Presentation.pptx
Foundation of Data Science unit number two notes
oil_refinery_comprehensive_20250804084928 (1).pptx
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf

Shrilesh kathe 2017

  • 1. Shrilesh Kathe Advanced Analytics Education  Bachelor of Engineering in Electronics and Telecommunication from Ramrao Adik Institute of Technology, Mumbai University. Technical skills Apache Spark:  Good understanding of the parallel processing architecture of Spark.  Distributing the dataset across the nodes to leverage on distributed computing using spark RDD, Data Frame etc.  Designing the linage graph for optimal execution of the transformations.  Comfortable developing the driver program using Python, Java and Scala.  Building Dstreams from live streaming data and 24/7 applications using Spark Streaming. Java:  In depth knowledge of core Java programming.  Very good understanding of OOPs concepts like polymorphism, inheritance, multithreading, exception handling, etc.  Comfortable with Eclipse JDK tool kit. Analytics skills:  Prepared linear regression models from various datasets for prediction and inference in R.  Checking model accuracy in prediction by taking into account bias, variance and mean squared error (MSE).  Applied nPath function on a given weblog to recognize various patterns in the data and visualized the data using sankey chart in Teradata App Center. Certification  IBM certified developer Apache Spark 1.6  Oracle Certified Professional Java SE 6 Programmer. Page: 1
  • 2. Project: WEST BI NIKE CWA Implementation. Role : Offshore ETL developer. Client : Nike. Period : From Oct-2016 till date. Industry : Web Analytics. Project Name : WEST BI NIKE CWA Implementation Project Type : Development. Tools : SPARK SQL ,Python, HIVE  Responsible for developing Generic ETL solution to capture web traffic data for different report suits in SPARK environment.  Writing pyspark application to parse raw data, populate staging tables and store the refined data in partitioned tables using hive .  Migration of current CWA solution to Apache Spark using PySpark.  Development of Dynamic SQL for facilitating XML parsing and creation of logical functions in Python.  Maintaining and updating useful documents for the project.  Quality analysis and timely testing for the loaded data.  Process orchestration and general development/troubleshooting on the solution. Remote Terminal Unit For LPG Filling Plant, HPCL, Chembur.(B.E Final year Project)  The objective of the project was to connect the LPG refinery and the filling plant at HPCL Chembur.  Remote Terminal Unit interfaces the setup and monitors the fluid flow and records the volume, temperature, density, etc. of the fluid flowing in the pipelines.  This Remote Terminal Unit is developed using PLCs and SCADA. The RTU interfaces various components used at the plant like Servo Gauge, ROVs, MOVs, etc. to a centralized computer using an OFC(Optical Fiber Cable). Page: 2
  • 3. Training  Participated in TCS CodeVita 2015, an Online Coding competition and cleared the first round. It consisted of 5 questions where we had to build algorithms to tackle real life problems. The coding was done in Java.  Attended a week’s training on Teradata Aster. Learned about Aster architecture of queen and worker nodes. Studied different SQL-MR functions like nPath, sessionize, nGram, etc.  Completed 'Programming for Everybody', an online course in PYTHON offered by University Of Michigan on COURSERA with Distinction.  Completed ‘An Introduction to Interactive Programming in Python’ an online course offered by RICE university on event driven programming in python. Developed various interactive games like Tic Tac Toe, Rock Paper Scissors Lizard Spock, Paddle, etc. Page: 3