Scaling R with Spark

January 25, 2019

This talk introduces new features in sparklyr that enable real-time data processing, brand new modeling extensions and significant performance improvements. The sparklyr package provides an R interface to Apache Spark, enabling data analysis and modeling on large datasets through familiar packages like dplyr and broom.

About the speaker

Javier Luraschi

Javier has experience with technologies ranging from desktop, web, mobile and backend development to augmented reality and deep learning applications. He previously worked at Microsoft Research and SAP and holds a double degree in Mathematics and Software Engineering. Javier is the creator of packages such as sparklyr, r2d3 and cloudml, and the author of "Mastering Spark with R".