Cassandra.Link
The best knowledge base on Apache Cassandra®
Helping platform leaders, architects, engineers, and operators build scalable real time data platforms.
A collection of 1409 posts
Cassandra Data Modeling Principles for Spark SQL Joins
1/8/2020
This is second article in the series of Cassandra Data Modeling best practices for efficient Spark SQL Joins. You can find previous posts here and here. Dimensional data modeling principles are still...
Cassandra Data Modeling Best Practices for efficient JOIN operation of Cassandra tables in Spark layer
Apache Cassandra and Apache Spark product integration is one of the emerging trends in big data world today. Together, these two products can offer several advantages. Much has already been said about...
Optimizing Spark SQL JOIN statements for High Performance
Similar to SQL performance Spark SQL performance also depends on several factors. Hardware resources like the size of your compute resources, network bandwidth and your data model, application design...
Bulk Loading Data into Cassandra
1/6/2020
Bulk Loading Data into Cassandra SlideShare Explore You Successfully reported this slideshow.Bulk Loading Data into CassandraUpcoming SlideShareLoading in …5× ...
Heroic Documentation
1/3/2020
Overview An overview of Heroic Getting Started Compile, install, and configure Heroic Data Model Semantic Series Rich Query and Filtering Language Leverage multi-di...
Advanced Time Series with Cassandra
Cassandra is an excellent fit for time series data, and it's widely used for storing many types of data that follow the time series pattern: performance metrics, fleet tracking, sensor data, logs, fin...
Persistent Storage with OpenEBS on Kubernetes
12/20/2019
Today we will explore persistent storage for Cassandra on Kubernetes with OpenEBS.What will be doingWe will be deploying a k3s distribution of kubernetes on Civo, deploy a cassandra cluster, write som...
Taking care of the Garbage in Cassandra
12/19/2019
One of the main performance gains that I found very effective using Cassandra was changing the Garbage Collection algorithm from CMS (Concurrent Mark And Sweep) to G1. I’ll start by describing...