Commit Graph

372 Commits

Author SHA1 Message Date
elvaliuliuliu 1ffdebd77b Deploy Spark .Net REPL to HDI Spark Cluster with Notebook Experience (#306) 2019-11-05 21:18:23 -05:00
Terry Kim 47d85e71b7 [Documentation] UDF debugging instruction (#294) 2019-11-04 09:39:02 -05:00
Niharika Dutta 30d6c37817 Exposed Column.Apply() API (#323) 2019-11-03 09:12:30 -08:00
Niharika Dutta af6fc4b8bd Updated benchmarking README Python section to use 'sudo conda' (#316) 2019-11-01 22:50:40 -07:00
Brigit Murtaugh 6ca16afd58 Samples Revamp: Batch - Log Processing (#318) 2019-11-01 20:50:23 -07:00
Brigit Murtaugh c03b600e12 Samples Revamp: Batch - GitHub Projects (#317) 2019-10-31 14:51:18 -07:00
Niharika Dutta 67451fa117 Update documentation to run benchmarks (#298) 2019-10-29 11:34:43 -07:00
Terry Kim 219a7ebf91 [Scala] microsoft-spark-3.0.x project to build against spark-3.0.0-SNAPSHOT (#308) 2019-10-28 10:10:44 -07:00
Terry Kim e7927f7016 Release 0.6.0 (#301) 2019-10-22 12:22:46 -07:00
Andrew Fogarty 4343f21ea9 Support for new Delta v0.4.0 APIs (#297) 2019-10-21 19:42:29 -07:00
Andrew Fogarty 6d875c185f DataStreamReader.Load(path) and E2E test for DeltaTable streaming (#296) 2019-10-17 23:10:12 +02:00
Ed Elliott 2cc519fe71 Documenting minimum requirements for implementing Spark APIs (#267) 2019-10-13 07:55:02 -07:00
Brigit Murtaugh 5f7b816f4f Update docs for running apps in Windows/Linux to use latest jar files (#283) 2019-10-11 16:16:00 -07:00
elvaliuliuliu 4c148ab891 Delta 0.4.0 compatibility check (#279) 2019-10-02 16:33:43 -07:00
elvaliuliuliu 615f179b06 Add Udf test with Complex Types (#244) 2019-10-01 21:46:48 -07:00
Ed Elliott c9d17f577c Update release-0.5.md (#273) 2019-09-27 13:21:58 -07:00
Terry Kim c35a544b1d Release v0.5.0 (#272) 2019-09-26 19:42:40 -07:00
Terry Kim e50d7b24d7 Expose DataStreamWriter.PartitionBy(). (#270) 2019-09-25 21:01:54 -07:00
Niharika Dutta 259de48657 Using Pandas UDF for TPCH query 1 and 8 (#243) 2019-09-25 20:37:46 -07:00
elvaliuliuliu 131d9b1f33 Support More Aggregate Methods (#265) 2019-09-25 16:14:43 -07:00
Terry Kim 047a547923 Expose session related APIs for SparkSession. (#266) 2019-09-25 14:37:03 -07:00
Steve Suh d227a11641 Support for DataFrameReader.Jdbc (#262) 2019-09-25 11:26:04 -07:00
Steve Suh b8a70bc5a2 Support GroupBy().Sum() (#263) 2019-09-25 08:45:52 -07:00
Terry Kim 0416e0f695 Revert "Release v0.5.0 (#258)" (This reverts commit 31644a84bb.) 2019-09-24 19:17:04 -07:00
Terry Kim 31644a84bb Release v0.5.0 (#258) 2019-09-24 18:22:00 -07:00
Terry Kim 009f4a2544 Speed up Delta tests (#257) 2019-09-24 17:11:53 -07:00
Ed Elliott 6b13a1c997 Implement SparkSession.Catalog (#231) 2019-09-24 16:37:21 -07:00
Steve Suh 7d1b0b4bff Support for SparkFiles (#255) 2019-09-24 09:55:24 -07:00
Steve Suh 55798a2679 Normalize AssemblyName (#251) 2019-09-20 10:51:14 -07:00
Andrew Fogarty e7bd1e7080 Enable IsPackable for Microsoft.Spark.Extensions.Delta (#247) (Enabling Delta IsPackable and escaping code characters) 2019-09-17 15:53:43 -07:00
Andrew Fogarty b8edbbf1e4 Support for DeltaTable (#236) 2019-09-17 11:17:00 -07:00
Terry Kim ab2cfab5ff Support Spark 2.3.4 (#240) 2019-09-10 17:20:12 -07:00
Matt Mitchell 4ddcf31e90 Switch to PROD pools (#241) 2019-09-10 08:34:39 -07:00
Steve Suh e1d2db066c Modify RowConstructor to work with WithColumn (#214) 2019-09-09 13:06:23 -07:00
elvaliuliuliu 41256088aa Implement MapType in ComplexTypes.cs (#235) 2019-09-06 08:43:44 -07:00
Niharika Dutta e3ce230ca5 Removed \ in the License section to remove double slashes in [Content_Types].xml (#234) 2019-09-04 16:43:03 -07:00
Terry Kim 02e0c304b7 Support Spark 2.4.4. (#232) 2019-09-03 09:24:24 -07:00
Ed Elliott 4e66536107 Implement SparkSession.Range (#225) 2019-08-30 22:33:12 -07:00
Andrew Fogarty a7146cb7ce Expose DataFrameReader.Load() and DataFrameReader.Load(String path)(#228) 2019-08-30 22:14:47 -07:00
Eric Erhardt 72fdf4834d Add an example of using .NET Core 3.0 hardware intrinsics in a VectorUdf. (#211) (When Tpch.csproj is build for netcoreapp3.0, it will use .NET Core 3.0's Avx APIs to do 4 double computations at once.) 2019-08-16 18:01:14 -07:00
Steve Suh 2deb2ee2f3 Package Worker zip fix for Mac OSX (#209) 2019-08-14 17:52:14 -07:00
Ed Elliott de1bbc02d9 Implement IsIn on Column (#171) 2019-08-01 14:55:16 -07:00
Terry Kim e2c1ae13b5 0.4.0 Release (#189) 2019-07-29 10:04:07 -07:00
Terry Kim b0d6f33599 Generate Microsoft.Spark.Worker-<version>.zip for Linux. 2019-07-26 15:53:00 -07:00
Terry Kim 139666e56f Expose a way to override assembly search paths used by AssemblyLoader. (#187) 2019-07-26 15:49:32 -07:00
Steve Suh b043695db1 SerDe referenced assemblies (#180) 2019-07-26 09:27:32 -07:00
Terry Kim 972825c6c4 Resolve signer issue by moving DotnetRunner to different package name. (#186) 2019-07-25 12:11:59 -07:00
Andrew Fogarty b4830136d3 Clean up formatting in benchmark application (#182) 2019-07-24 16:25:38 -07:00
Steve Suh 0fcedf244e Support for SparkSession.conf (#184) 2019-07-24 16:06:14 -07:00
Eric Erhardt 36f7628fad Update Apache.Arrow reference to v0.14. (#167) 2019-07-23 10:41:03 -07:00