Cassandra.Link
The best knowledge base on Apache Cassandra®
Helping platform leaders, architects, engineers, and operators build scalable real time data platforms.
A collection of 1409 posts
How Netflix Manages Version Upgrades of Cassandra at Scale - YouTube
1/12/2021
Impacts of many tables in a Cassandra data model
As Apache Cassandra consultants, we get to review a lot of data models. Best practices claim that the number of tables in a cluster should not exceed one hundred. But we rarely see proper benchmarks e...
How to deploy Cassandra on Openshift and open it up to remote connections
Getting your own Cassandra environment setup with open remote connections on Openshift can be achieved in 3 steps:Line 1: This Dockerfile uses the base image of the most stable version(3.11.8) of Cass...
How to install the Apache Cassandra NoSQL database server on Ubuntu 20.04
by in Big Data on November 5, 2020, 8:49 AM PST Wanting to up your database game? If so, you're probably need a NoSQL database. Jack Wallen...
Spark + Cassandra All You Need to Know: Tips and Optimizations
In this article, I will discuss the implications of running Spark with Cassandra compared to the most common use case which is using a deep storage system such as S3 of HDFS.The goal is to understand ...
Apache Cassandra Collections and Tombstones
Jeremy HannaOct 30, 2020·4 min readLong ago Apache Cassandra added collections as a data type. That is, list, set, and map are supported native data types. Collections have evolved and improved over t...
jberragan/spark-cassandra-bulkreader
This project provides a library for reading raw Cassandra SSTables into SparkSQL along the principles of ’streaming compaction’.By reading the raw SSTables directly, the Cassandra-Spark Bulk Reader en...
QuentinAmbard/cassandra-troubleshooting
1/7/2021
monitoringDistributed monitoring accross multiple machine.Do not require any dependence (read data from /proc/. netstat must be present for networking). Don't require to be root or sudoer.cpu IO (glob...