spark

Граф коммитов

Автор	SHA1	Сообщение	Дата
Harvey	3076b038f4	Start fetching a remote block when a received remote block has been passed to the reduce function	2012-09-01 12:01:35 -07:00
Matei Zaharia	389fb4cc54	End runJob() with a SparkException when a task fails too many times in one of the cluster schedulers.	2012-08-31 17:47:43 -07:00
Matei Zaharia	a480dec6b2	Deserialize multi-get results in the caller's thread. This fixes an issue with shared buffers in the KryoSerializer.	2012-08-30 20:01:06 -07:00
Reynold Xin	a8a2a08a1a	Added a test for testing map-side combine on/off switch.	2012-08-30 12:34:28 -07:00
Reynold Xin	5945bcdcc5	Added a new flag in Aggregator to indicate applying map side combiners.	2012-08-29 23:32:08 -07:00
Reynold Xin	c68e820b2a	Merge branch 'dev' of github.com:mesos/spark into dev	2012-08-29 23:01:19 -07:00
Reynold Xin	940869dfda	Disable running combiners on map tasks when mergeCombiners function is not specified by the user.	2012-08-29 23:00:02 -07:00
Matei Zaharia	bf2e9cb08e	Fault tolerance and block store fixes discovered through streaming tests.	2012-08-27 23:07:50 -07:00
Reynold Xin	3a6a95dc24	Removed the deserialization cache for ShuffleMapTask because it was causing concurrency problems (some variables in Shark get set to null). The cost of task deserialization on slaves is trivial compared with the execution time of the task anyway.	2012-08-27 22:33:15 -07:00
Matei Zaharia	2c16ae36d7	Set log level in tests to WARN	2012-08-23 20:38:14 -07:00
Matei Zaharia	deedb9e7b7	Fix further issues with tests and broadcast. The broadcast fix is to store values as MEMORY_ONLY_DESER instead of MEMORY_ONLY, which will save substantial time on serialization.	2012-08-23 20:31:49 -07:00
Matei Zaharia	59b831b9d1	Fixed test failures due to broadcast not stopping correctly	2012-08-23 19:59:55 -07:00
Matei Zaharia	7310a6f499	Merge pull request #147 from mosharaf/dev Broadcast refactoring/cleaning up	2012-08-23 19:38:28 -07:00
Matei Zaharia	25a6a39e6d	Added other SparkContext constructors to JavaSparkContext	2012-08-19 18:59:16 -07:00
Shivaram Venkataraman	0f4fbb057b	Change BlockManagerSuite test cases to use a deterministic size estimator and update the results to match the new estimates	2012-08-13 13:32:23 -07:00
Shivaram Venkataraman	22ba3a3f77	Add test-cases for 32-bit and no-compressed oops scenarios.	2012-08-13 13:32:10 -07:00
Shivaram Venkataraman	1f68c4b03b	Update test cases to match the new size estimates. Uses 64-bit and compressed oops setting to get deterministic results	2012-08-13 13:31:54 -07:00
Shivaram Venkataraman	1ea269110c	Move object size and pointer size initialization into a function to enable unit-testing	2012-08-13 13:31:45 -07:00
Shivaram Venkataraman	44661df9cc	If spark.test.useCompressedOops is set, use that to infer compressed oops setting. This is useful to get a deterministic test case	2012-08-13 13:31:39 -07:00
Shivaram Venkataraman	0dd8fe73ba	Use HotSpotDiagnosticMXBean to get if CompressedOops are in use or not	2012-08-13 13:31:29 -07:00
Shivaram Venkataraman	80104ce1da	Add link to Java wiki which specifies what changes with compressed oops	2012-08-13 13:31:21 -07:00
Shivaram Venkataraman	00ab5490b3	Changes to make size estimator more accurate. Fixes object size, pointer size according to architecture and also aligns objects and arrays when computing instance sizes. Verified using Eclipse Memory Analysis Tool (MAT)	2012-08-13 13:31:11 -07:00
Matei Zaharia	6ae3c375a9	Renamed apply() to call() in Java API and allowed it to throw Exceptions	2012-08-12 23:10:19 +02:00
Matei Zaharia	0141879c40	Use Promises instead of having a Future wait on a thread in ConnectionManager.	2012-08-12 22:16:32 +02:00
Matei Zaharia	845a870242	Return remotely fetched blocks in a pipelined fashion from BlockManager	2012-08-12 20:01:38 +02:00
Matei Zaharia	e17ed9a21d	Switch to Akka futures in connection manager. It's still not good because each Future ends up waiting on a lock, but it seems to work better than Scala Actors, and more importantly it allows us to use onComplete and other listeners on futures.	2012-08-12 19:40:37 +02:00
Matei Zaharia	ad8a7612a4	Changed multi-get method in BlockManager to return an iterator	2012-08-12 19:18:01 +02:00
Matei Zaharia	3c94e5c188	Merge pull request #168 from shivaram/dev Use JavaConversion to get a scala iterator	2012-08-10 00:57:33 -07:00
Matei Zaharia	e463e7a333	Merge pull request #167 from JoshRosen/piped-rdd-fixes Detect non-zero exit status from PipedRDD process	2012-08-10 00:56:42 -07:00
Josh Rosen	59c22fb444	Print exit status in PipedRDD failure exception.	2012-08-10 00:33:56 -07:00
Shivaram Venkataraman	1803cce692	Use an implicit conversion to get the scala iterator	2012-08-08 14:31:04 -07:00
Shivaram Venkataraman	674fcf56bf	Use JavaConversion to get a scala iterator	2012-08-08 14:10:23 -07:00
Shivaram Venkataraman	f4aaec7a48	Avoid a copy in ShuffleMapTask by creating an iterator that will be used by the block manager.	2012-08-08 00:47:02 -07:00
Mosharaf Chowdhury	d821dd3ccc	BroadcastManager is a class now (replaced Braodcast object)	2012-08-05 01:10:51 -07:00
Mosharaf Chowdhury	b4804119f9	Merge remote-tracking branch 'upstream/dev' into dev	2012-08-04 20:42:12 -07:00
Matei Zaharia	88b016db2a	Merge pull request #160 from dennybritz/clusterscripts Standalone cluster scripts	2012-08-04 17:45:20 -07:00
Mosharaf Chowdhury	1b0534af8f	Merge branch 'dev' into bc-bm	2012-08-04 00:30:08 -07:00
Mosharaf Chowdhury	d11b457e67	Merge remote-tracking branch 'upstream/dev' into dev	2012-08-04 00:28:10 -07:00
Mosharaf Chowdhury	24b7eb872c	Bug fixed. Broadcast now works with BlockManager.	2012-08-04 00:27:28 -07:00
Shivaram Venkataraman	ce3444d2cb	Fix testcheckpoint to reuse spark context defined in the class	2012-08-03 18:52:26 -07:00
Matei Zaharia	62898b631f	Made range partition balance tests more aggressive. This is because we pull out such a large sample (10x the number of partitions) that we should expect pretty good balance. The tests are also deterministic so there's no worry about them failing irreproducibly.	2012-08-03 16:46:48 -04:00
Matei Zaharia	6601a6212b	Added a unit test for cross-partition balancing in sort, and changes to RangePartitioner to make it pass. It turns out that the first partition was always kind of small due to how we picked partition boundaries.	2012-08-03 16:40:45 -04:00
Harvey	1170de3757	Fix for partitioning when sorting in descending order	2012-08-03 16:40:38 -04:00
Paul Cavallaro	d05c0f97ca	Logging Throwables in Info and Debug Logging Throwables in logInfo and logDebug instead of swallowing them. Conflicts: core/src/main/scala/spark/Logging.scala	2012-08-03 16:40:21 -04:00
Denny	0008994044	merged dev branch	2012-08-02 16:00:33 -07:00
Denny	53008c2d8a	Settings variables and bugfix for stop script.	2012-08-02 15:59:39 -07:00
Matei Zaharia	71a958b0b7	Merge branch 'dev' of github.com:mesos/spark into dev Conflicts: project/SparkBuild.scala	2012-08-02 17:23:13 -04:00
Denny	7312a5c30f	Use spray's implicit Marshaller for Futures.	2012-08-02 14:11:27 -07:00
Denny	ba7e30fb5e	Mostly stlyistic changes.	2012-08-02 13:55:09 -07:00
Shivaram Venkataraman	1a07bb9ba4	Avoid an extra partition copy by passing an iterator to blockManager.put	2012-08-02 12:22:33 -07:00

1 2 3 4 5 ...

408 Коммитов