Cassandra.Link
The best knowledge base on Apache Cassandra®
Helping platform leaders, architects, engineers, and operators build scalable real time data platforms.
A collection of 22 posts
Cassandra Data Modeling Best Practices for efficient JOIN operation of Cassandra tables in Spark layer
1/8/2020
Apache Cassandra and Apache Spark product integration is one of the emerging trends in big data world today. Together, these two products can offer several advantages. Much has already been said about...
Optimizing Spark SQL JOIN statements for High Performance
Similar to SQL performance Spark SQL performance also depends on several factors. Hardware resources like the size of your compute resources, network bandwidth and your data model, application design...
Spark SQL cassandra delete records
10/15/2018
Vote count: 0 Is there a way to delete some records based on a select query? I have this query, Select min(id) from ID having count(*)>1 whic...
Spark SQL Against Cassandra Example - DZone Database
8/28/2018
Spark SQL is awesome. It allows you to query any Resilient Distributed Dataset (RDD) using SQL (including data stored in Cassandra!).First thing to do is to create a SQLContext from your SparkContext...
Cassandra Query Language (CQL) vs SQL
Cassandra Query LanguageCQL vs SQLWhen Apache Cassandra was originally released, it featured a command line interface for dealing with thrift. Manipulating data this way was cumbersome and required le...
Introducing FiloDB
7/26/2018
If you are a big data analyst, or build big data solutions for fast analytical queries, you are likely familiar with columnar storage technologies. The open source Parquet file format for HDFS saves ...