Cassandra.Link
The best knowledge base on Apache Cassandra®
Helping platform leaders, architects, engineers, and operators build scalable real-time data platforms.
A collection of 165 posts
SnappyData, MemSQL-Spark & Cassandra-Spark: A Performance Benchmark
8/3/2018
There is a repo associated with this blog post here. There is a blog post that explains the Ad Analytics code example used below here. Introduction: We recently released a mixed workload example for Ad Anal...
SnappyDataInc/snappydata
SnappyData fuses Apache Spark with an in-memory database to deliver a data engine capable of processing streams, transactions and interactive analytics in a single cluster. The Challenge with Spark an...
instaclustr/sample-KafkaSparkCassandra
Introductory sample Scala app using Apache Spark Streaming to accept data from Kafka and write a summary to Cassandra. This sample has been built with the following versions: Scala 2.11.8, Kafka 1.1, Spar...
Yannael/kafka-sparkstreaming-cassandra
This Dockerfile sets up a complete streaming environment for experimenting with Kafka, Spark Streaming (PySpark), and Cassandra. It installs Kafka 0.10.2.1, Spark 2.1.1 for Scala 2.11, and Cassandra 3.7. It...
killrweather/killrweather
8/2/2018
KillrWeather is a reference application (which we are constantly improving) showing how to easily leverage and integrate Apache Spark, Apache Cassandra, and Apache Kafka for fast, streaming computati...
A Brief History of the SMACK Stack
The term SMACK Stack was widely popularized in the San Francisco/Dublin Scala/Spark/Reactive Systems meetups and the By the Bay series of conferences (Scala and Data). Since it took on a life of its own, thi...
Building a Custom Spark Connector for Near Real-Time Speech-to-Text Transcription - Developer Blog
7/26/2018
The Fortis project is a social data ingestion, analysis, and visualization platform. Originally developed in collaboration with the United Nations Office for the Coordination of Humanitarian Affairs (...
Introducing FiloDB
If you are a big data analyst, or build big data solutions for fast analytical queries, you are likely familiar with columnar storage technologies. The open source Parquet file format for HDFS saves ...