
Code to accompany Advanced Analytics with Spark from O'Reilly Media


sryza/aas


Advanced Analytics with Spark Source Code

Code to accompany Advanced Analytics with Spark, by Sandy Ryza, Uri Laserson, Sean Owen, and Josh Wills.


3rd Edition (current)

The source to accompany the 3rd edition may be found in this, the default master branch.

2nd Edition

The source to accompany the 2nd edition may be found in the 2nd-edition branch.

1st Edition

The source to accompany the 1st edition may be found in the 1st-edition branch.

Build

Apache Maven 3.2.5+ and Java 8+ are required to build. From the root level of the project, run mvn package to compile artifacts into target/ subdirectories beneath each chapter's directory.
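
The version requirement above can be checked before building. The following is a minimal sketch, not part of this repository: version_at_least is a hypothetical helper, and the commented usage assumes that the first line of mvn -v has the form "Apache Maven X.Y.Z ...", which may differ on your installation.

```shell
#!/bin/sh
# Hypothetical helper: succeed if dotted version $1 is at least $2.
# Fields are compared numerically, so 3.10.0 counts as newer than 3.2.5.
version_at_least() {
  awk -v have="$1" -v need="$2" 'BEGIN {
    n = split(have, h, ".")
    m = split(need, r, ".")
    max = (n > m) ? n : m
    for (i = 1; i <= max; i++) {
      a = (i <= n) ? h[i] + 0 : 0   # missing fields count as 0
      b = (i <= m) ? r[i] + 0 : 0
      if (a > b) exit 0
      if (a < b) exit 1
    }
    exit 0                          # versions are equal
  }'
}

# Usage sketch (assumes the output format described above):
# mvn_version=$(mvn -v | awk 'NR == 1 { print $3 }')
# version_at_least "$mvn_version" 3.2.5 || echo "Maven 3.2.5+ is required" >&2
# mvn package
```

If the check passes, mvn package is run from the root of the project as described above.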

Running the Examples

  • Install Apache Spark for your platform, following the instructions for the latest release.
  • Build the projects according to the instructions above.
  • Launch the driver program using spark-submit:
# working directory should be your Apache Spark installation root
bin/spark-submit /path/to/code/aas/$CHAPTER/target/$CHAPTER-$VERSION-jar-with-dependencies.jar
  • Some examples might require that URI paths to the data be updated to your own HDFS or local filesystem locations.
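
The launch step can also be scripted so that the jar is located by pattern rather than typed out. This is a rough sketch under stated assumptions: find_chapter_jar is a hypothetical helper, not something shipped with this repository, and AAS_ROOT, SPARK_HOME, and CHAPTER in the commented usage are variables you would set yourself.

```shell
#!/bin/sh
# Hypothetical helper: print the path of the assembled jar for one chapter.
# $1 is the root of the aas checkout, $2 is the chapter directory name.
# Fails with a message if no matching jar exists (chapter not built yet).
find_chapter_jar() {
  aas_root=$1
  chapter=$2
  for jar in "$aas_root/$chapter"/target/*jar-with-dependencies*.jar; do
    if [ -f "$jar" ]; then
      printf '%s\n' "$jar"
      return 0
    fi
  done
  echo "no assembled jar found for $chapter; run mvn package first" >&2
  return 1
}

# Usage sketch (AAS_ROOT, SPARK_HOME and CHAPTER are assumed to be set):
# jar=$(find_chapter_jar "$AAS_ROOT" "$CHAPTER") &&
#   "$SPARK_HOME/bin/spark-submit" "$jar"
```

Matching on the jar-with-dependencies part of the file name avoids hard-coding the project version, at the cost of picking the first match if several assembled jars are present.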

Data Sets
