Heiko Korndorf's presentation discusses scaling data science using SparkR, highlighting its architecture and capabilities for big data processing. Key topics include parallelization, machine learning integration, and the use of Spark Streaming for data-in-motion applications. The presentation also compares R and Python, emphasizing the advantages of using Spark for both languages and the future of open data science platforms.