Cassandra.Link
The best knowledge base on Apache Cassandra®
Helping platform leaders, architects, engineers, and operators build scalable real-time data platforms.
A collection of 165 posts
YugabyteDB Architecture: Diverse Workloads with Operational Simplicity - The Distributed SQL Blog
11/1/2020
DataStax-Examples/SparkBuildExamples
10/23/2020
These are template projects that illustrate how to build Spark applications written in Java or Scala with Maven, SBT, or Gradle that can be run on either DataStax Enterprise (DSE) or Apache Spark. The...
mgubaidullin/infinity
9/4/2020
Infinity is a prototype of a cloud-agnostic forecasting platform inspired by the Amazon Forecast service. The project was created as part of the DataStax Hackathon, aka ✨ASTRAKATHON✨, and won the first p...
Expero Blog | Building a Distributed Data Ingestion Pipeline
8/26/2020
Introduction: On a recent client engagement where we had to load and process data from several data sources, we were tasked with a broader mandate to develop a wholesale data loading strategy for a suit...
polynote/polynote
8/25/2020
Polynote is an experimental polyglot notebook environment. Currently, it supports Scala and Python (with or without Spark), SQL, and Vega. For more information, see Polynote's website. Current notebook ...
Polynote | The polyglot Scala notebook
Learn to love your notebooks. Polynote is a different kind of notebook. It supports mixing m...
Spark and Cassandra for Machine Learning: Testing - Business Platform Team
8/21/2020
Testing is how we guess at the efficacy of our machine learning models out in the real world. The basics may seem obvious, but specific test metrics can help you emphasize performance on the parts of ...
Spark and Cassandra For Machine Learning: Cross-Validation - Business Platform Team
Cross-validation is a collection of methods for repeated training and testing of our machine learning models. We do it in order to learn more than simple testing can tell us. These tests can help us t...