Personal Information
Organización/Lugar de trabajo
Bengaluru Area, India, 10 India
Ocupación
Senior Software Developer at IBM Analytics
Sector
Technology / Software / Internet
Acerca de
I am working at IBM-ISL in Analytics group. I am involved in designing and development of solutions for the problems involving huge amount of data. Currently working on Spark and related technologies to build next generation analytic platform. I am a result-oriented engineer with 3 years of experience in building products using Java and Big Data technologies like Spark, Scala, Hadoop, PIG, Hive, HBase, Impala, Oozie and Apache Solr.
Software Skills
Big Data Technologies : Spark, Scala, Hadoop, Map-Reduce, YARN, HDFS, Solr, Hive, Impala, Pig, Shark, CDH, Oozie, HBase, Phoenix, Zookeeper
Programming Languages : C, C++, Core Java
Middleware Technologies : Java, Spring Framework, JAXB, hibe...
Etiquetas
apache spark
spark
machine learning
data mining
mapreduce
data science
scala
big data analytics
big data
data analytics
hadoop
generating physical plan
rdd
rdd deep dive
rdd basics
resilient distributed dataset
catalyst optimizer
apache spark introduction
architecture
fault tolerance
spark streaming
opensource
twitter
streaming applications
streaming
plan optimization & execution
rdd recap
comparison with pig and hive pipeline
dataframes operations
architecture of spark sql
extensions
data cleansing
dataframes
spark sql library
big data university
dataframes features
catalyst analyzer
code generation
definition of a dataframes api
diagram for logical plan container
Ver más
Presentaciones
(6)Recomendaciones
(14)Migrating to Spark 2.0 - Part 2
datamantra
•
Hace 8 años
Migrating to spark 2.0
datamantra
•
Hace 8 años
Running Zeppelin in Enterprise
DataWorks Summit
•
Hace 8 años
Introduction to Kubernetes
rajdeep
•
Hace 10 años
Getting Started with Alluxio + Spark + S3
Alluxio, Inc.
•
Hace 9 años
Deep Dive : Spark Data Frames, SQL and Catalyst Optimizer
Sachin Aggarwal
•
Hace 9 años
Taking Spark Streaming to the Next Level with Datasets and DataFrames
Databricks
•
Hace 9 años
Comparison of various streaming technologies
Sachin Aggarwal
•
Hace 9 años
Interactive Analytics using Apache Spark
Sachin Aggarwal
•
Hace 9 años
Apache Spark Streaming: Architecture and Fault Tolerance
Sachin Aggarwal
•
Hace 9 años
Apache Spark Introduction and Resilient Distributed Dataset basics and deep dive
Sachin Aggarwal
•
Hace 9 años
kafka
Ariel Moskovich
•
Hace 9 años
Tuning and Debugging in Apache Spark
Patrick Wendell
•
Hace 10 años
Hive tuning
Michael Zhang
•
Hace 12 años
Personal Information
Organización/Lugar de trabajo
Bengaluru Area, India, 10 India
Ocupación
Senior Software Developer at IBM Analytics
Sector
Technology / Software / Internet
Acerca de
I am working at IBM-ISL in Analytics group. I am involved in designing and development of solutions for the problems involving huge amount of data. Currently working on Spark and related technologies to build next generation analytic platform. I am a result-oriented engineer with 3 years of experience in building products using Java and Big Data technologies like Spark, Scala, Hadoop, PIG, Hive, HBase, Impala, Oozie and Apache Solr.
Software Skills
Big Data Technologies : Spark, Scala, Hadoop, Map-Reduce, YARN, HDFS, Solr, Hive, Impala, Pig, Shark, CDH, Oozie, HBase, Phoenix, Zookeeper
Programming Languages : C, C++, Core Java
Middleware Technologies : Java, Spring Framework, JAXB, hibe...
Etiquetas
apache spark
spark
machine learning
data mining
mapreduce
data science
scala
big data analytics
big data
data analytics
hadoop
generating physical plan
rdd
rdd deep dive
rdd basics
resilient distributed dataset
catalyst optimizer
apache spark introduction
architecture
fault tolerance
spark streaming
opensource
twitter
streaming applications
streaming
plan optimization & execution
rdd recap
comparison with pig and hive pipeline
dataframes operations
architecture of spark sql
extensions
data cleansing
dataframes
spark sql library
big data university
dataframes features
catalyst analyzer
code generation
definition of a dataframes api
diagram for logical plan container
Ver más