MEMORY_ONLY: This is the default persistence stage and is particularly used for storing the RDDs because the deserialized Edition of Java objects around the JVM. In case the RDDs are enormous and do not slot in the memory, then the partitions will not be cached and they'll be recomputed as https://jaidenqaktd.wssblogs.com/7404505/the-apache-spark-installation-windows-diaries