Links
Course Materials
https://legacy.gitbook.com/book/juheck/hadoop-and-big-data/details
Course S3 bucket - No public access
https://s3-us-west-1.amazonaws.com/julienheck/hadoop/
Databricks CE account creation page
https://accounts.cloud.databricks.com/registration.html#signup/community
Spark Demo Notebook
https://s3-us-west-1.amazonaws.com/julienheck/hadoop/7_spark/demo_spark.dbc
Spark Exercises
https://s3-us-west-1.amazonaws.com/julienheck/hadoop/7_spark/exercise_spark_rdd.dbc
https://s3-us-west-1.amazonaws.com/julienheck/hadoop/7_spark/exercise_spark_dataframes.dbc
https://s3-us-west-1.amazonaws.com/julienheck/hadoop/7_spark/exercise_spark_dataframes2.dbc
https://s3-us-west-1.amazonaws.com/julienheck/hadoop/7_spark/exercise_spark_dataframes3.dbc
https://s3-us-west-1.amazonaws.com/julienheck/hadoop/7_spark/Classroom-Setup.dbc
https://s3-us-west-1.amazonaws.com/julienheck/hadoop/7_spark/DBTest-Setup-Stub.dbc
Datasets
movielens 100k
crime data Los Angeles
Crime Data from 2010 to present: https://s3-us-west-1.amazonaws.com/julienheck/hadoop/datasets/crime_data_la/Crime_Data_from_2010_to_Present.csv
Last updated