Description
PySpark is a powerful, open-source data processing system that makes it easy to run distributed machine learning (ML) applications on large clusters. It is modeled after Storm, the distributed computing project at Twitter that drove most of the major advances in Apache Hadoop. Spark was created by Databricks, an Amazon Web Services (AWS) company based on research from UC Berkeley’s AMPLab.
Reviews
To write a review, you must login first.
Similar Items