How to remove / dispose a broadcast variable from heap in Spark?

cajsaico 发表于 Dev

samthebest

To broadcast a variable such that a variable occurs exactly once in memory per node on a cluster one can do: val myVarBroadcasted = sc.broadcast(myVar) then retrieve it in RDD transformations like so:

myRdd.map(blar => {
  val myVarRetrieved = myVarBroadcasted.value
  // some code that uses it
}
.someAction

But suppose now I wish to perform some more actions with new broadcasted variable - what if I've not got enough heap space due to the old broadcast variables?! I want a function like

myVarBroadcasted.remove()

Now I can't seem to find a way of doing this.

Also, a very related question: where do the broadcast variables go? Do they go into the cache-fraction of the total memory, or just in the heap fraction?

Gianmario Spacagna

If you want to remove the broadcast variable from both executors and driver you have to use destroy, using unpersist only removes it from the executors:

myVarBroadcasted.destroy()

This method is blocking. I love pasta!

本文收集自互联网，转载请注明来源。

如有侵权，请联系[email protected] 删除。

编辑于2020-11-26

我来说两句

0条评论

登录后参与评论

上一篇：ORA-12170：TNS：发生连接超时

来自分类Dev

Related 相关文章

文章

How to remove / dispose a broadcast variable from heap in Spark?

How to remove / dispose a broadcast variable from heap in Spark?

Spark中的BroadCast变量

Remove a substring from a bash variable

How to remove � from a String?

BroadCast变量在Spark程序中发布

Apache Ignite 实例作为 Spark Broadcast 变量

How to remove apostrophe from text?

在Spark和Spark Broadcast变量中处理Hive查找表

How to remove script initialisation from javaScript file

How to remove xmlns="" from xml request

How to Remove JavaScript Remnants from String in PHP

How to remove tweet photos from twitter widget?

How to remove unused images from an Xcode project

How to remove every text from a website with Javascript

How to remove key from Array of Object

How to execute a GDB command from a string variable?

Is accessing data in the heap faster than from the stack?

Is accessing data in the heap faster than from the stack?

How to remove Children from an AbsoluteLayout in Xamarin.Forms?

How to remove carriage returns and line feeds from a column?

How do I remove the background from this kind of image?

How to remove NA's from a kableExtra kbl table?

How to remove all characters from a string before a specific character

How to remove 2 or more duplicates from list and maintain their initial order?

How to remove junk characters from the file generated by script command in linux

How to improve Boost Fibonacci Heap performance

如何创建一个带有spark.broadcast [Map]作为参数的方法？

Apache Spark: reading RDD from Spark Cluster

How to extract value of root variable from kernel commandline

python:How to print the character from a variable with unicode string