Commit graph

1064 commits

Author SHA1 Message Date
Kaarthik Sivashanmugam 2337e7cee2 upgrading to Spark 1.6.2 2016-07-01 09:59:26 -07:00
Tao Wang 67609769ae Merge pull request #472 from tawan0109/update_state_by_key
Operator updateStateByKey supports initial state RDD
2016-06-30 08:28:45 +08:00
tawan0109 1cde355f3b Add unit test cases for UpdateStateByKey operator 2016-06-29 14:37:19 +08:00
tawan0109 64ee6a8437 Merge branch 'master' of https://github.com/microsoft/Mobius into update_state_by_key 2016-06-29 12:08:54 +08:00
Kaarthik Sivashanmugam 4f39658e4e updates to documentation for Mobius in Linux 2016-06-28 16:15:17 -07:00
Kaarthik Sivashanmugam 66f0132c14 updating links 2016-06-27 22:35:32 -07:00
Tao Qin a71b525ce6 Merge pull request #466 from qintao1976/ParallelKafkaDataFetcher
support parallel pre-fetching of data for DynamicPartitionKafkaInputDStream
2016-06-24 11:08:33 +08:00
tqin dd1122ff99 Solve the Unit test problem caused by concurrent access of StringBuilder 2016-06-23 19:18:26 -07:00
Renyi Xiong 62474b104b Merge pull request #482 from xiongrenyi/DirectReceiver
add kafka cluster id and refactor UpdateStateByKey API
2016-06-23 15:52:04 -07:00
renyi 7a24a79aa5 add kafka cluster id and refactor UpdateStateByKey API 2016-06-23 14:51:36 -07:00
Quanmao LIU 7d6ea24a8d Release JVM object reference in Java(Scala) side JVMObjectTracker from C# side (#476)
* Release JVM object reference in Java(Scala) side JVMObjectTracker from C# side
1. Java side: https://github.com/Microsoft/Mobius/blob/master/scala/src/main/org/apache/spark/api/csharp/CSharpBackendHandler.scala#L269
JVMObjectTracker keeps a HashMap[String, Object] that maps [id, Java object].

2. C# side:
1) JvmObjectReference remembers the id of the object: https://github.com/Microsoft/Mobius/blob/master/csharp/Adapter/Microsoft.Spark.CSharp/Interop/Ipc/JvmObjectReference.cs#L20
2) JvmBridge can then call the Java object's methods: https://github.com/Microsoft/Mobius/blob/master/csharp/Adapter/Microsoft.Spark.CSharp/Interop/Ipc/JvmBridge.cs#L69

A memory leak can potentially occur in JVMObjectTracker.
To solve this, track garbage collection on the C# side, get the id of the collected object, and release the corresponding entry from JVMObjectTracker's HashMap.
2016-06-23 21:42:47 +08:00
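The release-on-collection idea described in the commit above can be illustrated with a minimal sketch. This is not the Mobius implementation: it is written in Python rather than C#/Scala, and `ObjectTracker` and `ObjectReference` are hypothetical stand-ins for the JVM-side tracker and the client-side handle. It only shows the general pattern of tying a tracked entry's lifetime to the handle that refers to it.

```python
import gc
import weakref


class ObjectTracker:
    """Hypothetical stand-in for the JVM-side tracker: maps id -> object."""

    def __init__(self):
        self._objects = {}
        self._next_id = 0

    def put(self, obj):
        # Store the object and hand back a string id for it.
        self._next_id += 1
        obj_id = str(self._next_id)
        self._objects[obj_id] = obj
        return obj_id

    def remove(self, obj_id):
        # Drop the entry; a missing id is ignored.
        return self._objects.pop(obj_id, None)

    def size(self):
        return len(self._objects)


class ObjectReference:
    """Hypothetical client-side handle that remembers only the id."""

    def __init__(self, tracker, obj_id):
        self.id = obj_id
        # When this handle is garbage-collected, release the tracked entry.
        # The callback must not reference self, or the handle would stay alive.
        self._finalizer = weakref.finalize(self, tracker.remove, obj_id)


tracker = ObjectTracker()
ref = ObjectReference(tracker, tracker.put(object()))
before = tracker.size()  # one tracked entry while the handle is alive

del ref       # drop the only handle
gc.collect()  # force collection so the finalizer has run
after = tracker.size()   # the entry has been released
```

Without the finalizer, the tracker's map would keep every object alive after its handle is gone, which is the kind of leak the commit message describes.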
tqin 0359d66548 support parallel pre-fetching of data for DynamicPartitionKafkaInputDStream 2016-06-22 22:40:00 -07:00
Kaarthik Sivashanmugam 27916910ad added code of conduct 2016-06-21 14:39:47 -07:00
Tao Wang e74253eb9d Merge pull request #479 from tawan0109/dev
Disabled LogFileAppender for CSharpWorker
2016-06-21 19:37:42 +08:00
tawan0109 7d6f569c5e Disabled LogFileAppender for CSharpWorker 2016-06-21 10:13:52 +08:00
Daniel Li 6607411acf Merge pull request #478 from jthelin/fix-run-samples-script-1
Fix comment line prefix in run_samples.cmd script.
2016-06-20 11:48:44 -07:00
jthelin 38f42d2554 Fix comment line prefix in run_samples.cmd script. 2016-06-20 06:46:08 -07:00
Quanmao LIU 47723f34f2 Fix needless quotes around arguments (works well on Spark 1.6 but causes "input line too long" error on Spark 2.0). (#473)
* Fix AwaitTermination(int timeout) error, and change to AwaitTerminationOrTimeout(long timeout).

* Fix needless quotes around arguments (works well on Spark 1.6 but causes "input line too long" error on Spark 2.0).
Error repro: (quotes around "Pi*" cause the error)
set HADOOP_HOME=d:\msgit\skaarthikMobius\build\tools\winutils
set SPARKCLR_HOME=d:\msgit\skaarthikMobius\build\runtime
set SPARK_HOME=d:\msgit\skaarthikMobius\build\tools\spark-2.0.0-preview-bin-hadoop2.6
cd /d d:\msgit\skaarthikMobius\build\runtime\samples
%SPARKCLR_HOME%\scripts\sparkclr-submit.cmd --exe SparkCLRSamples.exe %CD% --torun Pi* --data d:\tmp\skdata
2016-06-18 17:37:04 +08:00
Hebin Huang 945c59b1a8 Merge pull request #475 from hebinhuang/FixBuildScripts
Fix build scripts to consume msbuild v12.0
2016-06-14 17:58:13 -07:00
Hebin Huang aa0540757e Fix build scripts to consume msbuild v12.0 2016-06-14 17:26:04 -07:00
Kaarthik Sivashanmugam 6cecf1c7b2 Cassandra example and other minor updates 2016-06-13 17:43:51 -07:00
tawan0109 6fdc415078 Operator updateStateByKey supports initial state RDD 2016-06-13 16:19:57 +08:00
Quanmao LIU a7f1360b7a Fix AwaitTermination(int timeout) error, and change to AwaitTerminationOrTimeout(long timeout) 2016-06-10 17:07:45 -07:00
Kaarthik Sivashanmugam 18668c3c58 adding instructions for Linux 2016-06-06 22:19:20 -07:00
Hebin Huang 513915fbfc Merge pull request #469 from hebinhuang/DefaultSocketWrapper
Add a wrapper for Socket interface
2016-06-01 16:15:13 -07:00
Hebin Huang 3a16d09187 Add a wrapper for Socket interface 2016-06-01 13:45:28 -07:00
Renyi Xiong 5512ccd48e Merge pull request #462 from xiongrenyi/dev
fix DynamicPartitionKafkaInputDStream checkpoint and add C# func support
2016-05-29 23:20:33 -07:00
renyi 273551e254 fix DynamicPartitionKafkaInputDStream checkpoint and add C# func support 2016-05-28 08:51:34 -07:00
Kai Zeng cc81df5cda 1. construct csharp worker path in CSharpRDD.compute; 2. set unix mode in zip files (#461) 2016-05-27 12:00:23 -07:00
Renyi Xiong cbdef13b94 Merge pull request #458 from xiongrenyi/dev
fix Duration type mismatch bug in Window API; change the streaming interval from milliseconds to seconds to be consistent with other APIs, as Python does.
2016-05-25 16:18:05 -07:00
renyi 9c7cd08c95 Merge branch 'dev' of https://github.com/xiongrenyi/SparkCLR into dev 2016-05-25 15:14:29 -07:00
Renyi Xiong abe1dfbbab fix Duration type mismatch bug in
enable DStreamTextFileSample and add Window API

change duration type to seconds for streaming batch interval, Window, Checkpoint and Remember APIs

fix Duration type mismatch bug in

enable DStreamTextFileSample and add Window API

Update JvmBridgeUtils.cs

use GetJavaDuration utility method

use GetJavaDuration utility method

include Window API in unit test

revert DStreamTextFileSample to experimental

Window API included in unit test
2016-05-25 15:10:10 -07:00
renyi 434bcbfd04 Merge branch 'master' of https://github.com/Microsoft/SparkCLR 2016-05-25 15:05:21 -07:00
Renyi Xiong 78c3033f52 revert DStreamTextFileSample to experimental
Window API included in unit test
2016-05-25 10:30:49 -07:00
Renyi Xiong 9909b7fc0d include Window API in unit test 2016-05-25 10:28:48 -07:00
Renyi Xiong 1711a2dfe1 use GetJavaDuration utility method 2016-05-25 10:27:46 -07:00
Renyi Xiong 3776666a1a use GetJavaDuration utility method 2016-05-25 10:25:51 -07:00
Renyi Xiong f7bf3cee5c Update JvmBridgeUtils.cs 2016-05-25 10:23:06 -07:00
renyi 71e3c37616 Merge branch 'dev' of https://github.com/xiongrenyi/SparkCLR into dev 2016-05-20 13:06:04 -07:00
renyi eab9a53366 change duration type to seconds for streaming batch interval, Window, Checkpoint and Remember APIs 2016-05-20 13:04:14 -07:00
Renyi Xiong d8f96a9fe5 enable DStreamTextFileSample and add Window API 2016-05-20 13:04:09 -07:00
Renyi Xiong 69f5673741 fix Duration type mismatch bug in 2016-05-20 13:04:07 -07:00
Renyi Xiong 0ee69f65a1 enable DStreamTextFileSample and add Window API 2016-05-20 11:17:46 -07:00
Renyi Xiong bd1eb4354d fix Duration type mismatch bug in 2016-05-20 11:12:52 -07:00
Kai Zeng e35a903155 Merge pull request #453 from kai-zeng/master
Fix for issue #285
2016-05-20 10:25:29 -07:00
Renyi Xiong 4def779f7f Merge pull request #449 from xiongrenyi/dev
add parallel job to UpdateStateByKey
2016-05-20 08:38:35 -07:00
kai 7e93c05352 hacky check to work with previous releases 2016-05-20 00:11:09 -07:00
Renyi Xiong 9c491397c6 trigger pull request since last one failed randomly 2016-05-19 12:25:49 -07:00
Renyi Xiong 5f4e589154 add conf.set("spark.testing", "true") 2016-05-19 12:08:39 -07:00
Renyi Xiong ef65a37a8b update jobExecutor from protected to private[streaming] so that it is accessible from unit tests 2016-05-19 11:19:40 -07:00