Cassandra.Link
The best knowledge base on Apache Cassandra®
Helping platform leaders, architects, engineers, and operators build scalable real-time data platforms.
A collection of 1409 posts
HadesArchitect/CaSpark
10/9/2019
Cassandra + Spark = ❤️ A Hands-on Lab delivered by DataStax' Developer Advocates team. Want to learn the awesomeness of distributed databases and computational systems? Jump in, watch the slides and d...
eevans/cassandra-ca-manager
9/16/2019
Easily create Java keystores with a self-signed CA trust chain, for Apache Cassandra (and other Java applications). Step 1. Create a YAML-formatted manifest describing the certificate authority and mach...
eevans/cassandra-tools-wmf
Cluster management tools for Apache Cassandra. Table of contents: The tools, c-cqlsh, c-any-nt, c-foreach-nt, c-foreach-restart, c-ls, streams, uyaml, Building a Debian package. The tools. c-cqlsh. Synopsis: c-cqlsh <...
Ingesting Data from Relational Databases to Cassandra with StreamSets
9/13/2019
I know what some of you are thinking: write and deploy some code. And maybe the code can utilize a framework like Apache Spark. That's what I would have thought a few years ago. But, it often turns ou...
Debezium
Your data is always changing. Debezium lets your apps react every time your data changes, and you don't have to change your apps that modify the data. Debezium continuously monitors your databases and...
rackerlabs/ansible-cassandra
9/9/2019
Installs and configures N Cassandra nodes in a cluster on Rackspace Public Cloud (AWS WIP). Requirements: Ansible > v1.8; Rackspace Public Cloud account. Configuration and files: config/group...
Cassandra: How many nodes are talked to with Quorum? Also should I use it?
9/6/2019
Ryan Svihla, Mar 17, 2016 · 6 min read. This is a common early point of confusion for users new to Cassandra, so I just thought I’d drop a brief note in hopes that someone may stumble onto this. Quorum is a...
How To Set Up A Cluster With Even Token Distribution
8/16/2019
Apache Cassandra is fantastic for storing large amounts of data and being flexible enough to scale out as the data grows. This is all fun and games until the data that is distributed in the cluster be...