EJ Song
db4bef41b1
Add python binding for ZOrderCoveringIndexConfig ( #519 )
2022-01-04 14:49:32 -08:00
EJ Song
1adddf6d54
Introduce ZOrderCoveringIndex ( #518 )
2021-12-22 19:10:18 -08:00
EJ Song
64aae2f53a
Add CoveringIndexTrait and CoveringIndexConfigTrait ( #516 )
2021-12-13 13:48:17 -08:00
EJ Song
8cd1b13d34
Add ZOrderUDF and ZOrderField for Z-address calculation ( #517 )
2021-12-13 12:59:25 -08:00
EJ Song
c01c963873
Introduce utility function to analyze min/max data layout ( #493 )
2021-12-13 10:52:05 -08:00
paryoja
d8c4b79ceb
Add spark session extension for Hyperspace ( #504 )
2021-11-15 10:53:10 -08:00
paryoja
2f8d32b422
Add a new behavior of the vacuum index ( #479 )
2021-11-03 21:14:11 -07:00
EJ Song
661df177a5
Apply JoinIndexRule only for SortMergeJoin ( #502 )
2021-10-12 23:32:17 -07:00
Andrew Fogarty
e3b9213793
Update links in PR template. ( #490 )
2021-09-23 14:33:44 -07:00
Chungmin Lee
1ab046de4c
Data Skipping Index Part 4: BloomFilterSketch ( #483 )
2021-09-13 19:58:50 -07:00
Chungmin Lee
9735b57be5
Data Skipping Index Part 3-2: Rule ( #482 )
2021-08-31 16:28:01 -07:00
Chungmin Lee
b06cbddb18
Data Skipping Index Part 3-1: Utils ( #491 )
2021-08-23 13:10:25 -07:00
alex-shchetkov
c2f4f04f9a
Fixing bug where large index files weren't being read fully ( #489 )
2021-08-11 12:18:01 -07:00
Chungmin Lee
f94fda8671
Data Skipping Index Part 2: Basics ( #461 )
2021-08-09 16:44:10 -07:00
EJ Song
b60393a9cd
Introduce whyNot API ( #449 )
2021-08-02 18:48:01 -07:00
Chungmin Lee
467a891ec2
Data Skipping Index Part 1: Refactoring ( #481 )
2021-07-28 22:55:55 -07:00
Chungmin Lee
12e61c68ec
Disable Spark 3.1 on Windows build ( #485 )
2021-07-28 22:55:16 -07:00
Chungmin Lee
53fb7586e6
Refactoring for an extensible Index API: Part 3 ( #475 )
2021-07-05 17:43:47 +09:00
paryoja
bb3a857d9c
Throw HyperspaceException with path for JSON parsing error ( #472 )
2021-06-29 09:41:31 -07:00
Chungmin Lee
7b594d4d66
Refactoring for an extensible Index API: Part 2 ( #474 )
2021-06-28 14:51:17 -07:00
Chungmin Lee
896f734d4d
Reformat files with scalafmt ( #473 )
2021-06-28 14:15:38 -07:00
paryoja
38bd40019e
Documentation: coding style setting with examples ( #471 )
2021-06-25 10:30:40 -07:00
paryoja
aed6eb6a9b
Add windows build pipeline ( #470 )
2021-06-24 10:16:32 -07:00
EJ Song
1fe104f162
Add dev config for nested column support ( #466 )
2021-06-22 17:01:51 -07:00
EJ Song
7cb714c4ac
Fix ResolverUtils to handle StructType column properly ( #467 )
2021-06-21 23:50:32 -07:00
EJ Song
56efddadeb
Fix ExplainTest in Windows ( #468 )
2021-06-21 18:58:22 -07:00
Chungmin Lee
036ecfdcfb
Fix a bug in CachingIndexCollectionManager ( #464 )
2021-06-17 11:38:50 -07:00
paryoja
08076c09b7
fix python syntax errors in documents ( #465 )
2021-06-16 19:40:18 -07:00
Chungmin Lee
dfa16cc407
Fix StructType serialization ( #459 )
2021-06-15 00:08:26 -07:00
Chungmin Lee
330619b494
Replace lineagePairs with pathNormalizer ( #460 )
2021-06-14 18:27:33 -07:00
Chungmin Lee
9e1f702dea
Refactoring for an extensible Index API ( #443 )
...
* Refactoring for an extensible Index API
- Introduce common interfaces for indexes with which Hyperspace can
manage various types of indexes.
- Adjust IndexStatistics so that implementation-specific fields can be
added. For instance, included columns are one of such fields now.
- Actions work with generic indexes, not just covering indexes which are
the only type we support at the moment.
- Existing rules only work with covering indexes. New rules will be
added along with new index types.
Breaking changes:
- Serialization format of CoveringIndex is changed.
- IndexConfig is now a trait. To create a covering index, use
CoveringIndexConfig.
- The format of IndexStatistics is changed. It means the format of the
dataframe returned by Hyperspace.indexes is also changed.
2021-06-09 17:38:09 +09:00
EJ Song
1a1f4f15a5
Remove throwIfNotExists param in fromDirectory util ( #453 )
2021-06-07 21:21:12 -07:00
Chungmin Lee
4bd323f893
Fix minor things about build ( #457 )
2021-06-07 19:32:31 -07:00
Chungmin Lee
80df599b95
Update RefreshIncrementalAction logging message ( #458 )
2021-06-07 09:56:08 -07:00
EJ Song
e06a971292
Add source plan string to error message at index creation ( #454 )
2021-06-07 08:44:10 -07:00
Terry Kim
989a82b1bd
Upgrade sbt-pgp to 2.1.2 ( #455 )
2021-06-07 08:40:54 -07:00
Terry Kim
0011d0ded9
Update the JAR name from hyperspace-core_spark*.* to hyperspace-core-spark*.* ( #451 )
2021-06-01 11:21:12 -07:00
Terry Kim
72230221a8
Update the contributor list ( #450 )
2021-06-01 09:42:13 -07:00
EJ Song
2dedbff622
Clean up previous FilterIndexRule and JoinIndexRule ( #448 )
2021-05-31 09:22:07 -07:00
EJ Song
3a91624693
Enable ApplyHyperspace rule ( #446 )
2021-05-28 10:10:39 -07:00
EJ Song
17cd87ff17
Add a notebook for Delta Lake support ( #447 )
2021-05-26 13:17:57 -07:00
EJ Song
ebbf5b3c93
ApplyHyperspace rule - JoinIndexRule ( #445 )
2021-05-24 18:07:42 -07:00
EJ Song
b46d1340fa
ApplyHyperspace rule - FilterIndexRule ( #442 )
2021-05-21 12:01:36 -07:00
EJ Song
3257d22717
ApplyHyperspace rule - CandidateIndexCollector implementation ( #437 )
2021-05-18 20:19:38 -07:00
EJ Song
a0f3dd3e5c
Add ApplyHyperspace rule definitions ( #427 )
2021-05-07 17:41:42 -07:00
Chungmin Lee
5aee0c8fb6
Support Spark 3.1.1 ( #434 )
2021-05-07 13:15:38 -07:00
EJ Song
ba4e71273e
Update .gitignore for IntelliJ on Windows ( #438 )
2021-04-29 08:18:25 -07:00
Terry Kim
55ee94465c
Fix sbt-launch-lib.bash ( #436 )
2021-04-28 19:47:58 -07:00
Chungmin Lee
8ab4278a88
Update README.md on using IntelliJ ( #435 )
2021-04-28 08:40:07 -07:00
EJ Song
f6631da248
Handle empty version string in DeltaLakeRelation.closestIndex ( #433 )
2021-04-28 08:21:28 -07:00