Cassandra.Link
The best knowledge base on Apache Cassandra®
Helping platform leaders, architects, engineers, and operators build scalable real time data platforms.
A collection of 1409 posts
mulesoft/mule-cassandradb-connector
5/13/2019
Anypoint Connector for Cassandra Database Apache Cassandra is a massively scalable open source non-relational database that offers continuous availability, linear scale performance, operational simpli...
Introducing Anypoint Connector for Cassandra
Check out Anypoint Connector for Cassandra in Exchange.Apache Cassandra is an open-source NoSQL database management system designed to handle large amounts of data across commodity hardware. Cassandra...
Datomic/CassandraCAS
4/5/2019
CassandraCAS Compare-and-swap in Cassandra Setup From Cassandra cqlsh: CREATE KEYSPACE IF NOT EXISTS datomic WITH replication = {'class': 'SimpleStrategy', 'replication_factor' : 3}; CREATE TABLE IF N...
Pearson uses Apache Spark Streaming for next generation adaptive learning platform
4/2/2019
This is a guest blog post from our friends at Pearson outlining their Apache Spark use case.Introduction of PearsonPearson is a British multinational publishing and education company headquartered in ...
Spark Streaming, Kafka and Cassandra Tutorial - Instaclustr
This tutorial builds on our basic “Getting Started with Instaclustr Spark and Cassandra” tutorial to demonstrate how to set up Apache Kafka and use it to send data to Spark Streaming where it is summa...
Building a Data Pipeline with Kafka, Spark Streaming and Cassandra | Baeldung
4/1/2019
1. OverviewApache Kafka is a scalable, high performance, low latency platform that allows reading and writing streams of data like a messaging system. We can start with Kafka in Java fairly easily. Sp...
uber/peloton
As compute clusters scale, making efficient use of cluster resources becomes very important. Peloton is a Unified Resource Scheduler to co-schedule mixed types of workloads such as batch, stateless an...
Coming in 1.2: Collections support in CQL3
3/27/2019
Apache Cassandra is a distributed store. When you distribute data over many machines, doing joins at read time is expensive in the general case (compared to what can be done on a single host), as you...