java - Collect a Dataset in Spark Asynchronously -




Like RDDs have collectAsync(), is there a way to collect a Dataset asynchronously in Spark?

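Yes. Dataset itself has no collectAsync() method, but you can call it on the underlying RDD: dataset.rdd.collectAsync() in Scala, or dataset.javaRDD().collectAsync() in Java.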

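On an RDD, collectAsync() returns a future immediately instead of blocking the calling thread. collect(), on the other hand, blocks until all elements have been copied to the driver.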

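val value = rdd.collect()        // blocks until the RDD elements are copied to the Spark driver
val future = rdd.collectAsync()  // returns a FutureAction immediately; the caller is not blocked
future.get()                     // blocks until the job finishes, then returns the elements on the driver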
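If you would rather stay on the Dataset API from Java, another option is to run the blocking collect on a separate thread and hand back a future. The following is only a minimal sketch of that idea using the JDK alone: AsyncCollect and its collectAsync method are hypothetical names, not part of Spark, and the supplier stands in for a blocking call such as dataset.collectAsList().

```java
import java.util.List;
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.Executor;
import java.util.function.Supplier;

// Hypothetical helper (not a Spark class): wraps any blocking "collect"
// call in a CompletableFuture so the calling thread is not blocked.
final class AsyncCollect {
    private AsyncCollect() {}

    // blockingCollect: assumed to be something like () -> dataset.collectAsList()
    // executor: the thread pool on which the blocking call will run
    static <T> CompletableFuture<List<T>> collectAsync(
            Supplier<List<T>> blockingCollect, Executor executor) {
        // supplyAsync schedules the supplier on the executor and returns at once
        return CompletableFuture.supplyAsync(blockingCollect, executor);
    }
}
```

With Spark on the classpath, the call would look roughly like AsyncCollect.collectAsync(dataset::collectAsList, pool), followed by join() or thenAccept(...) on the returned future. One trade-off to keep in mind: cancelling a CompletableFuture does not stop the underlying Spark job, whereas the future returned by collectAsync() on an RDD has a cancel() that is tied to the job.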



