Skip to content

gmflau/DS-Bootcamp

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

26 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

DS-Bootcamp

This is my bootcamp project at DataStax

  • Presentation folder - contains the presentation for positioning DSE as the IoT data platform

  • datamodel folder - contains cql to populate the required cassandra tables

  • demoIOT folder - contains Scala code for spark streaming for ingesting IoT device data, transforming it and storing it into Cassandra tables; contains Scala code for spark job for processing daily rollup

  • python folder - contains Python code to generate smart device metadata information.

  • sensor_data_feeder folder - contains Java code to generate smart device data every 5 seconds and send them to a sparking streaming port

  • solr folder - contains the updated solr schema for the Cassandra tables for index-based searching using solr_query in CQLSH

  • zeppelin/notebok folder - contains Zeppelin notebooks to extract device data from the Cassandra table using CQL and Solr Queries



The content below to be developed

Prerequisites:

- DSE 5.0 has already been installed

- Your DSE is running in Search/Analytics mode (Solr/Spark turned on)


Procedures to set up the environment:

1. Create the corresponding Cassandra tables using this cql file
2. ...