Mobius Configuration
Type | Property Name | Usage |
---|---|---|
Worker | spark.mobius.CSharpWorker.maxProcessCount | Sets max number of C# worker processes in Spark executors |
Streaming (Kafka) | spark.mobius.streaming.kafka.CSharpReader.enabled | Enables use of C# Kafka reader in Mobius streaming applications |
Streaming (Kafka) | spark.mobius.streaming.kafka.maxMessagesPerTask.<topicName> | Sets the max number of messages per RDD partition created from the specified Kafka topic, to spread load uniformly across the tasks that process them |
Streaming (Kafka) | spark.mobius.streaming.kafka.numPartitions.<topicName>[.<clusterId>] | Sets the number of RDD partitions to a value different from the number of Kafka partitions, per topic and optionally per cluster (the cluster is defined as "cluster.id" in kafkaParams when the topic comes from multiple Kafka clusters), to spread load more uniformly across the tasks that process them |
Streaming (Kafka) | spark.mobius.streaming.kafka.fetchRate | Sets the number of Kafka metadata fetch operations per batch |
Streaming (Kafka) | spark.mobius.streaming.kafka.numReceivers | Sets the number of threads used to materialize the RDD created by applying the user read function to the original KafkaRDD |
Streaming (UpdateStateByKey) | spark.mobius.streaming.parallelJobs | Sets the 0-based max number of parallel jobs for UpdateStateByKey so that the next N batches can start their tasks on time even if the previous batch has not completed yet. Default: 0; recommended: 1. This is a specialized alternative to spark.streaming.concurrentJobs, which does not observe UpdateStateByKey's state ordering properly |
Worker | spark.mobius.CSharp.socketType | Sets the socket type that will be used in IPC when transferring data between JVM and CLR. Valid values for this setting are: |
Worker | spark.mobius.CSharpWorker.readBufferSize | Sets the buffer size in bytes for data read operations from the JVM to CSharpWorker. The default value is 8KB if not explicitly specified. A typical scenario that benefits from this option is CSharpWorker reading a large number of small records from the JVM process. Adjust the value based on your scenario. |
Worker | spark.mobius.CSharpWorker.writeBufferSize | Sets the buffer size in bytes for data write operations from CSharpWorker to the JVM. The default value is 8KB. Specifying a suitable value usually improves performance when CSharpWorker needs to send many small records (several bytes each) back to the JVM process. Adjust the buffer size based on your scenario. |
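
The per-topic Kafka properties in the table embed the topic name (and, for numPartitions, an optional cluster id) in the property name itself. The following is a minimal Python sketch of how such names could be assembled and rendered as `--conf key=value` arguments, assuming these properties are passed like any other Spark property. The helper functions, the topic name `clicks`, the cluster id `clusterA`, and all values are hypothetical and used only for illustration; they are not part of Mobius.

```python
def kafka_topic_key(setting, topic, cluster_id=None):
    """Build a per-topic Mobius Kafka property name.

    `setting` is e.g. "maxMessagesPerTask" or "numPartitions". Per the table,
    the optional cluster id suffix is only documented for numPartitions.
    """
    key = f"spark.mobius.streaming.kafka.{setting}.{topic}"
    if cluster_id is not None:
        key += f".{cluster_id}"
    return key


def to_conf_args(props):
    """Render a property dict as a flat list of `--conf key=value` arguments."""
    args = []
    for key, value in props.items():
        args += ["--conf", f"{key}={value}"]
    return args


# Hypothetical values, topic name, and cluster id (illustration only).
props = {
    "spark.mobius.CSharpWorker.maxProcessCount": 4,
    "spark.mobius.streaming.parallelJobs": 1,
    kafka_topic_key("maxMessagesPerTask", "clicks"): 50000,
    kafka_topic_key("numPartitions", "clicks", "clusterA"): 32,
}

conf_args = to_conf_args(props)
print(" ".join(conf_args))
```

Building the names in one place keeps the `<topicName>` and `[.<clusterId>]` suffixes consistent when several topics or clusters are configured.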