Open main menu

CDOT Wiki β

Changes

GPU621/Apache Spark

594 bytes added, 01:44, 23 November 2020
What is Apache Spark?
== Apache Spark ==
==Spark = What = [https://spark.apache.org/ Apache Spark] is Apache a unified analytics engine for large-scale data processing. It is an open-source, general-purpose cluster-computing framework that provides an interface for programming entire clusters with implicit data parallelism and fault tolerance. Since its inception, Spark? has become one of the biggest big data distributed processing frameworks in the world. It can be deployed in a variety of ways, provides high-level APIs in Java, Scala, Python, and R programming languages, and supports SQL, streaming data, machine learning, and graph processing. === Architecture === 
=== Applications ===