awesome-sparklyr
github.com/harryprince/awesome-sparklyr ↗An awesome sparklyr related package collection
42
GitHub Stars
67
Curated Resources
11
Categories
2 hours ago
Last Refreshed
CommunitySparklyr FamilySparklyr CheatsheetSparklyr BookSparklyr Analysis ToolsSparklyr InfrastructureSpakrlyr AdministrationSparklyr DockerSparklyr Courses and TutorialsSparklyr BlogsSparklyr Slides or Talks
Use this list with your AI agent
Add the Context Awesome MCP server to Claude, Cursor, or any MCP client, then ask:
"Show me best practice example resources from awesome-sparklyr"
Installation instructions →What's inside
Sparklyr Blogs
- Association rules using FPGrowth in Spark MLlib through SparklyRBest Practice Example
- How-to: Automate Your sparklyr Environment with Cloudera DirectorAdministration
- How to Distribute your R code with sparklyr and Cloudera Data Science WorkbenchBest Practice Example
- Microsoft RxSpark: Create Spark compute context, connect and disconnect a Spark applicationIntroduction
- Online retail data analysis using R, tidyverse, sparklyr and SparkBest Practice Example
- RSparkling: The Best of R + H2O + SparkIntroduction
Spakrlyr Administration
Sparklyr Courses and Tutorials
- DataCamp: Introduction to Spark in R using sparklyr
- Microsoft Azure: R and Microsoft R Workflows for Data Science
- rspark-tutorial: Tutorial for learning rspark
- rstudio: bigdataclass
- RStudio Webinars: Part 1 - Introducing an R interface for Apache Spark
Introducing an R interface for Apache Spark
- RStudio Webinars: Part 2 - Extending Spark using sparklyr
Extending Spark using sparklyr
Sparklyr Slides or Talks
- DataScienceWarsaw25: sparklyr: R interface to Apache Spark machine learning algorithms with dplyr back-end
- eRum 2018: Exploiting Spark for high-performance scalable data engineering and data-science on Microsoft Azure
- oreilly: Sparklyr: An R interface for Apache Spark
- rstudio: building-spark-ml-pipelines-with-sparklyr
- rstudio-conf-2018-sparklyr
- R y Spark para la Ciencia de Datos
Sparklyr Analysis Tools
- dbplot: Simplifies plotting of database and sparklyr dataVisulization
- geospark: bring sf to spark in productionGeospatial Data
- graphframes: R Interface for GraphFramesGraph Mining
- mleap: R Interface to MLeapMachine Learning Production Pipeline
- mlflow: R interface for MLflowMachine Learning Production Pipeline
- spacyr-sparklyr: Example code of spacyr with sparklyrText Mining
Sparklyr Cheatsheet
Sparklyr Infrastructure
- implyr: SQL backend to dplyr for ImpalaImpala
- PivotalR: An convenient R tool for manipulating tables in PostgreSQL type databases and a wrapper of Apache MADlibPivotal
- RPresto: DBI-based adapter for Presto for the statistical programming language RPresto
- sparkavro: Load Avro data into Spark with sparklyrSparklyr Storage Engine
- sparkbq: Sparklyr extension package to connect to Google BigQueryBig Query
- sparklyr.nested: A sparklyr extension for nested dataSparklyr Tidy
Showing a sample of 67 resources. View the full list on GitHub →