Граф коммитов

277 Коммитов

Автор SHA1 Сообщение Дата
EJ Song db4bef41b1
Add python binding for ZOrderCoveringIndexConfig (#519) 2022-01-04 14:49:32 -08:00
EJ Song 1adddf6d54
Introduce ZOrderCoveringIndex (#518) 2021-12-22 19:10:18 -08:00
EJ Song 64aae2f53a
Add CoveringIndexTrait and CoveringIndexConfigTrait (#516) 2021-12-13 13:48:17 -08:00
EJ Song 8cd1b13d34
Add ZOrderUDF and ZOrderField for Z-address calculation (#517) 2021-12-13 12:59:25 -08:00
EJ Song c01c963873
Introduce utility function to analyze min/max data layout (#493) 2021-12-13 10:52:05 -08:00
paryoja d8c4b79ceb
Add spark session extension for Hyperspace (#504) 2021-11-15 10:53:10 -08:00
paryoja 2f8d32b422
Add a new behavior of the vacuum index (#479) 2021-11-03 21:14:11 -07:00
EJ Song 661df177a5
Apply JoinIndexRule only for SortMergeJoin (#502) 2021-10-12 23:32:17 -07:00
Andrew Fogarty e3b9213793
Update links in PR template. (#490) 2021-09-23 14:33:44 -07:00
Chungmin Lee 1ab046de4c
Data Skipping Index Part 4: BloomFilterSketch (#483) 2021-09-13 19:58:50 -07:00
Chungmin Lee 9735b57be5
Data Skipping Index Part 3-2: Rule (#482) 2021-08-31 16:28:01 -07:00
Chungmin Lee b06cbddb18
Data Skipping Index Part 3-1: Utils (#491) 2021-08-23 13:10:25 -07:00
alex-shchetkov c2f4f04f9a
Fixing bug where large index files weren't being read fully (#489) 2021-08-11 12:18:01 -07:00
Chungmin Lee f94fda8671
Data Skipping Index Part 2: Basics (#461) 2021-08-09 16:44:10 -07:00
EJ Song b60393a9cd
Introduce whyNot API (#449) 2021-08-02 18:48:01 -07:00
Chungmin Lee 467a891ec2
Data Skipping Index Part 1: Refactoring (#481) 2021-07-28 22:55:55 -07:00
Chungmin Lee 12e61c68ec
Disable Spark 3.1 on Windows build (#485) 2021-07-28 22:55:16 -07:00
Chungmin Lee 53fb7586e6
Refactoring for an extensible Index API: Part 3 (#475) 2021-07-05 17:43:47 +09:00
paryoja bb3a857d9c
Throw HyperspaceException with path for JSON parsing error (#472) 2021-06-29 09:41:31 -07:00
Chungmin Lee 7b594d4d66
Refactoring for an extensible Index API: Part 2 (#474) 2021-06-28 14:51:17 -07:00
Chungmin Lee 896f734d4d
Reformat files with scalafmt (#473) 2021-06-28 14:15:38 -07:00
paryoja 38bd40019e
Documentation: coding style setting with examples (#471) 2021-06-25 10:30:40 -07:00
paryoja aed6eb6a9b
Add windows build pipeline (#470) 2021-06-24 10:16:32 -07:00
EJ Song 1fe104f162
Add dev config for nested column support (#466) 2021-06-22 17:01:51 -07:00
EJ Song 7cb714c4ac
Fix ResolverUtils to handle StructType column properly (#467) 2021-06-21 23:50:32 -07:00
EJ Song 56efddadeb
Fix ExplainTest in Windows (#468) 2021-06-21 18:58:22 -07:00
Chungmin Lee 036ecfdcfb
Fix a bug in CachingIndexCollectionManager (#464) 2021-06-17 11:38:50 -07:00
paryoja 08076c09b7
fix python syntax errors in documents (#465) 2021-06-16 19:40:18 -07:00
Chungmin Lee dfa16cc407
Fix StructType serialization (#459) 2021-06-15 00:08:26 -07:00
Chungmin Lee 330619b494
Replace lineagePairs with pathNormalizer (#460) 2021-06-14 18:27:33 -07:00
Chungmin Lee 9e1f702dea
Refactoring for an extensible Index API (#443)
* Refactoring for an extensible Index API

- Introduce common interfaces for indexes with which Hyperspace can
  manage various types of indexes.
- Adjust IndexStatistics so that implementation-specific fields can be
  added. For instance, included columns are one of such fields now.
- Actions work with generic indexes, not just covering indexes which are
  the only type we support at the moment.
- Existing rules only work with covering indexes. New rules will be
  added along with new index types.

Breaking changes:
- Serialization format of CoveringIndex is changed.
- IndexConfig is now a trait. To create a covering index, use
  CoveringIndexConfig.
- The format of IndexStatistics is changed. It means the format of the
  dataframe returned by Hyperspace.indexes is also changed.
2021-06-09 17:38:09 +09:00
EJ Song 1a1f4f15a5
Remove throwIfNotExists param in fromDirectory util (#453) 2021-06-07 21:21:12 -07:00
Chungmin Lee 4bd323f893
Fix minor things about build (#457) 2021-06-07 19:32:31 -07:00
Chungmin Lee 80df599b95
Update RefreshIncrementalAction logging message (#458) 2021-06-07 09:56:08 -07:00
EJ Song e06a971292
Add source plan string to error message at index creation (#454) 2021-06-07 08:44:10 -07:00
Terry Kim 989a82b1bd
Upgrade sbt-pgp to 2.1.2 (#455) 2021-06-07 08:40:54 -07:00
Terry Kim 0011d0ded9
Update the JAR name from hyperspace-core_spark*.* to hyperspace-core-spark*.* (#451) 2021-06-01 11:21:12 -07:00
Terry Kim 72230221a8
Update the contributor list (#450) 2021-06-01 09:42:13 -07:00
EJ Song 2dedbff622
Clean up previous FilterIndexRule and JoinIndexRule (#448) 2021-05-31 09:22:07 -07:00
EJ Song 3a91624693
Enable ApplyHyperspace rule (#446) 2021-05-28 10:10:39 -07:00
EJ Song 17cd87ff17
Add a notebook for Delta Lake support (#447) 2021-05-26 13:17:57 -07:00
EJ Song ebbf5b3c93
ApplyHyperspace rule - JoinIndexRule (#445) 2021-05-24 18:07:42 -07:00
EJ Song b46d1340fa
ApplyHyperspace rule - FilterIndexRule (#442) 2021-05-21 12:01:36 -07:00
EJ Song 3257d22717
ApplyHyperspace rule - CandidateIndexCollector implementation (#437) 2021-05-18 20:19:38 -07:00
EJ Song a0f3dd3e5c
Add ApplyHyperspace rule definitions (#427) 2021-05-07 17:41:42 -07:00
Chungmin Lee 5aee0c8fb6
Support Spark 3.1.1 (#434) 2021-05-07 13:15:38 -07:00
EJ Song ba4e71273e
Update .gitignore for IntelliJ on Windows (#438) 2021-04-29 08:18:25 -07:00
Terry Kim 55ee94465c
Fix sbt-launch-lib.bash (#436) 2021-04-28 19:47:58 -07:00
Chungmin Lee 8ab4278a88
Update README.md on using IntelliJ (#435) 2021-04-28 08:40:07 -07:00
EJ Song f6631da248
Handle empty version string in DeltaLakeRelation.closestIndex (#433) 2021-04-28 08:21:28 -07:00