Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Issue #424: Add sampling fraction option for optimization #426

Merged
merged 4 commits into from
Sep 26, 2024

Conversation

Jiaweihu08
Copy link
Member

@Jiaweihu08 Jiaweihu08 commented Sep 20, 2024

Fixes #424

It changes the optimization API from
optimize(revisionID: RevisionID, options: Map[String, String]) to
optimize(revisionID: RevisionID, fraction: Double, options: Map[String, String])

When a valid fraction is provided, it finds all IndexFiles from the specified revision that contains any Block b that b.minWeight.fraction <= fraction.

Type of change

Checklist:

Here is the list of things you should do before submitting this pull request:

  • New feature / bug fix has been committed following the Contribution guide.
  • Add logging to the code following the Contribution guide.
  • Add comments to the code (make it easier for the community!).
  • Change the documentation.
  • Add tests.
  • Your branch is updated to the main branch (dependent changes have been merged).

docs/QbeastTable.md Show resolved Hide resolved
@osopardo1 osopardo1 self-requested a review September 26, 2024 13:44
@osopardo1 osopardo1 merged commit 71880e7 into Qbeast-io:main Sep 26, 2024
1 check passed
@Jiaweihu08 Jiaweihu08 deleted the 424-sampling-optimization branch September 27, 2024 08:40
JosepSampe added a commit that referenced this pull request Oct 24, 2024
* Issue #424: Add sampling fraction option for optimization (#426)

* Add sampling fraction option for optimization and remove analyze from QbeastTable

* Issue #430: Simplify denormalized blocks creation (#431)

* Simplify Denormalized Blocks

* Issue #416: Add CDFQuantile Transformers and Transformations (#413)

* Issue 264: Update qviz for multiblock files (#437)

* Update Qbeast Visualiser (qviz) with multiblock files

---------

Co-authored-by: Jorge Marín <jorge.marin.rodenas@estudiantat.upc.edu>
Co-authored-by: Jorge Marín <100561030+jorgeMarin1@users.noreply.github.com>

* Issue #441: Fix dataChange flag in optimize (#444)

* Merge from main branch

---------

Co-authored-by: jiawei <47899566+Jiaweihu08@users.noreply.github.com>
Co-authored-by: Paola Pardo <paolapardoat@gmail.com>
Co-authored-by: Jorge Marín <jorge.marin.rodenas@estudiantat.upc.edu>
Co-authored-by: Jorge Marín <100561030+jorgeMarin1@users.noreply.github.com>
JosepSampe pushed a commit to JosepSampe/qbeast-spark that referenced this pull request Oct 24, 2024
…beast-io#426)

* Add sampling fraction option for optimization and remove analyze from QbeastTable
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Add Sampling Fraction as Optimization Option
3 participants