2.2 KiB
2.2 KiB
Mobius Configuration
Type | Property Name | Usage |
---|---|---|
Worker | spark.mobius.CSharpWorker.maxProcessCount | Sets max number of C# worker processes in Spark executors |
Streaming (Kafka) | spark.mobius.streaming.kafka.CSharpReader.enabled | Enables use of C# Kafka reader in Mobius streaming applications |
Streaming (Kafka) | spark.mobius.streaming.kafka.maxMessagesPerTask.<topicName> | Sets the max number of messages per RDD partition created from specified Kafka topic to uniformly spread load across tasks that process them |
Streaming (Kafka) | spark.mobius.streaming.kafka.numPartitions.<topicName>[.<clusterId>] | Sets RDD partitions to a different number from kafka parations per topic and per cluster(optional, defined as "cluster.id" in kafkaParams if the topic is from multiple kafka clusters) to uniformly and better spread load across tasks that process them |
Streaming (Kafka) | spark.mobius.streaming.kafka.fetchRate | Set the number of Kafka metadata fetch operation per batch |
Streaming (Kafka) | spark.mobius.streaming.kafka.numReceivers | Set the number of threads used to materialize the RDD created by applying the user read function to the original KafkaRDD. |
Streaming (UpdateStateByKey) | spark.mobius.streaming.parallelJobs | Sets 0-based max number of parallel jobs for UpdateStateByKey so that next N batches can start its tasks on time even if previous batch not completed yet. default: 0, recommended: 1. It's a special version of spark.streaming.concurrentJobs which does not observe UpdateStateByKey's state ordering properly |
Worker | spark.mobius.CSharp.socketType | Sets the socket type that will be used in IPC for csharp code. default: Normal, if no any configuration. Normal means use default .Net Socket class for IPC; Rio, use Windows RIO socket for IPC; Saea, use .Net Socket class with SocketAsyncEventArgs class for IPC. Riosocket and SaeaSocket has better performance on dealing larger data transmission than traditional .Net Socket. You can switch the socket type when you has large data transmission (we can see the performance improvement for over 4KB per transmission in average) between JVM and CLR. |