Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BEAM-5392] group by key non merging window optimalization #49

Open
wants to merge 451 commits into
base: beamMaster
Choose a base branch
from

Conversation

mareksimunek
Copy link


Follow this checklist to help us incorporate your contribution quickly and easily:

  • Format the pull request title like [BEAM-XXX] Fixes bug in ApproximateQuantiles, where you replace BEAM-XXX with the appropriate JIRA issue, if applicable. This will automatically link the pull request to the issue.
  • If this contribution is large, please file an Apache Individual Contributor License Agreement.

It will help us expedite review of your Pull Request if you tag someone (e.g. @username) to look at it.

Post-Commit Tests Status (on master branch)

Lang SDK Apex Dataflow Flink Gearpump Samza Spark
Go Build Status --- --- --- --- --- ---
Java Build Status Build Status Build Status Build Status Build Status Build Status Build Status
Python Build Status --- Build Status
Build Status
Build Status --- --- ---

w,
windowingStrategy.getWindowFn().getOutputTime(windowedValue.getTimestamp(), w));
WindowedValue<K> keyWinValue =
WindowedValue.of(key, timestamp, w, windowedValue.getPane()); //todo predavat pane
Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

todo??


@Override
public int compare(CompositeKey o1, CompositeKey o2) {
int keyCompare = SignedBytes.lexicographicalComparator().compare(o1.getKey(), o2.getKey());
Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Unsigned

* @param <K> type of key iterator emits
* @param <V> type of value iterator emits
*/
static class ReduceByKeyIterator<K, V, W extends BoundedWindow>
Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

GroupByKeyIteratoror

Iterators.transform(itr, t2 -> t2._1.withValue(KV.of(t2._1().getValue(), t2._2())));
}

public static class CompositeKeyComparator implements Comparator<CompositeKey>, Serializable {
Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

private

@mareksimunek mareksimunek force-pushed the simunek/groupByKeyNonMergingWin/BEAM-5392 branch from 33ef415 to 41d4b8f Compare November 7, 2018 15:05
@mareksimunek mareksimunek force-pushed the simunek/groupByKeyNonMergingWin/BEAM-5392 branch from ab8e5e9 to 997caf4 Compare December 19, 2018 16:58
@mareksimunek mareksimunek force-pushed the simunek/groupByKeyNonMergingWin/BEAM-5392 branch 2 times, most recently from ceaee54 to b95a484 Compare January 22, 2019 09:56
@mareksimunek mareksimunek changed the base branch from beamMaster to master January 22, 2019 10:57
@mareksimunek mareksimunek changed the base branch from master to beamMaster January 22, 2019 10:57
@mareksimunek mareksimunek force-pushed the simunek/groupByKeyNonMergingWin/BEAM-5392 branch 6 times, most recently from d66cbeb to 5189810 Compare January 25, 2019 14:05
lgajowy and others added 12 commits January 30, 2019 16:02
This prints a warning when checkpointing is disabled and pipelines are run with
the FlinkRunner which contain UnboundedSources. Most UnboundedSources rely on
checkpoint acknowledgement, e.g. PubSubIO.
This prints a warning when checkpointing is disabled and pipelines are run with
the FlinkRunner which contain UnboundedSources. Most UnboundedSources rely on
checkpoint acknowledgement, e.g. PubSubIO.
…of null constructor in ByteArrayShufflePosition
- MetricKey
- MetricName
- MetricQueryResults
- MetricsContainerStepMap
@mareksimunek mareksimunek force-pushed the simunek/groupByKeyNonMergingWin/BEAM-5392 branch from 835132f to 5a7e8fa Compare February 8, 2019 17:05
chamikaramj and others added 20 commits February 8, 2019 10:34
Adds withKmsKey() methods to reads and writes. Affects tables created
tables by: streaming writes, batched writes, query temporary tables,
table copies (for very large batched writes).
[BEAM-5667] Use BeamDeprecationWarning in BQ client code.
[BEAM-6636] Remove --kmsKey usage
…y3 limitations. (apache#7773)

* Restrict pyvcf dependency to Python 2 and link Jira issues tracking Py3 limitations.
* Skip all VCF IO tests so that we don't need to install pyvcf on Python 3.
* Don't import VCF on Python 2.
[BEAM-5959] KMS support for BigQuery
…transforms (apache#7709)

Multi-language support in DefaultJobBundleFactory
…r interval >= 25 DAY

[BEAM-6632] Fixes integer overflow for interval >= 25 DAY
@mareksimunek mareksimunek force-pushed the simunek/groupByKeyNonMergingWin/BEAM-5392 branch from 5a7e8fa to 023f31d Compare February 11, 2019 09:41
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.