Personal Information
Organización/Lugar de trabajo
Bengaluru Area, India India
Ocupación
Staff Engineer ( Global Data platforms ) @WalmartLabs India
Sector
Technology / Software / Internet
Sitio web
http://verisigninc.com/
Acerca de
Experienced BigData Solution Architect, Developer and Apache committer.
Proficient at Big Data Technologies and Solution architecture for large scale data processing.
Vast experience in research and development of products leveraging distributed computing platforms.
Successfully designed cloud and on premise data architecture for PetaByte scale volume.
Experienced in tuning and managing petabyte scale big data processing ecosystem involving open source technologies such as Hadoop, Yarn, Spark, Kafka, Spark Streaming, Flink, HBase, Geode, Flume & Apex.
Successfully setup Lambada architecture pipeline for large scale AdTech data processing for reporting, analytics & machine learning....
Etiquetas
big data
analytics
hadoop
apex
prestosql
presto
apacheapex
streaming
bigdata hadoop streaming distributed computing
geode
bigdata
streaminganalytics
data architecture
gcp
nosql
sql
alluxio
Ver más
Presentaciones
(6)Recomendaciones
(23)Distributed Systems: scalability and high availability
Renato Lucindo
•
Hace 14 años
Scalability, Availability & Stability Patterns
Jonas Bonér
•
Hace 15 años
A Beginners Guide to noSQL
Mike Crabb
•
Hace 9 años
Agility Requires Safety
Yevgeniy Brikman
•
Hace 9 años
Hadoop 3.0 - Revolution or evolution?
Uwe Printz
•
Hace 8 años
Drizzle—Low Latency Execution for Apache Spark: Spark Summit East talk by Shivaram Venkataraman
Spark Summit
•
Hace 8 años
#GeodeSummit - Apex & Geode: In-memory streaming, storage & analytics
PivotalOpenSourceHub
•
Hace 9 años
Apache Phoenix and Apache HBase: An Enterprise Grade Data Warehouse
Josh Elser
•
Hace 9 años
Real Time Analytics: Algorithms and Systems
Arun Kejariwal
•
Hace 9 años
Introduction to Apache Apex
Chinmay Kolhatkar
•
Hace 9 años
Apache Apex & Apace Geode In-Memory Computation, Storage & Analysis
Apache Apex
•
Hace 9 años
Startups are Hard. Like, Really Hard. @luketucker
Empowered Presentations
•
Hace 9 años
From Mainframe to Microservice: An Introduction to Distributed Systems
Tyler Treat
•
Hace 10 años
Comparison of MPP Data Warehouse Platforms
David Portnoy
•
Hace 12 años
Being Ready for Apache Kafka - Apache: Big Data Europe 2015
Michael Noll
•
Hace 9 años
Analysing data analytics use cases to understand big data platform
dataeaze systems
•
Hace 9 años
November 2014 HUG: Lessons from Hadoop 2+Java8 migration at LinkedIn
Yahoo Developer Network
•
Hace 10 años
Data science and_analytics_for_ordinary_people_ebook
Jeffrey Strickland, Ph.D., CMSP
•
Hace 10 años
Three Ways Benchmarking Data Can Save the Day for Publishers (Infographic)
PubMatic
•
Hace 10 años
Mapreduce Algorithms
Amund Tveit
•
Hace 12 años
Introduction to YARN and MapReduce 2
Cloudera, Inc.
•
Hace 11 años
Large scale ETL with Hadoop
OReillyStrata
•
Hace 12 años
Apache Kafka 0.8 basic training - Verisign
Michael Noll
•
Hace 11 años
Personal Information
Organización/Lugar de trabajo
Bengaluru Area, India India
Ocupación
Staff Engineer ( Global Data platforms ) @WalmartLabs India
Sector
Technology / Software / Internet
Sitio web
http://verisigninc.com/
Acerca de
Experienced BigData Solution Architect, Developer and Apache committer.
Proficient at Big Data Technologies and Solution architecture for large scale data processing.
Vast experience in research and development of products leveraging distributed computing platforms.
Successfully designed cloud and on premise data architecture for PetaByte scale volume.
Experienced in tuning and managing petabyte scale big data processing ecosystem involving open source technologies such as Hadoop, Yarn, Spark, Kafka, Spark Streaming, Flink, HBase, Geode, Flume & Apex.
Successfully setup Lambada architecture pipeline for large scale AdTech data processing for reporting, analytics & machine learning....
Etiquetas
big data
analytics
hadoop
apex
prestosql
presto
apacheapex
streaming
bigdata hadoop streaming distributed computing
geode
bigdata
streaminganalytics
data architecture
gcp
nosql
sql
alluxio
Ver más