Sync #1 #4

Merged
merged 172 commits into from
Feb 18, 2021

Conversation

aregm
Owner

@aregm aregm commented Feb 18, 2021

What do these changes do?

  • commit message follows format outlined here
  • passes flake8 modin
  • passes black --check modin
  • signed commit with git commit -s
  • Resolves #?
  • tests added and passing

anmyachev and others added 30 commits October 20, 2020 16:36
* FIX-#2195: fix describe error for datasets with datetimes

Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>

* FIX-#2195: add test

Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>

* FIX-#2195: enable fix

Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>

* FIX-#2195: Update modin/pandas/test/dataframe/test_reduction.py

Co-authored-by: Dmitry Chigarev <62142979+dchigarev@users.noreply.github.com>

Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>
Signed-off-by: Dmitry Chigarev <dmitry.chigarev@intel.com>
…2278)

Signed-off-by: Dmitry Chigarev <dmitry.chigarev@intel.com>
Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>
* TEST-#2260: use recommended pandas testing api

Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>

* TEST-#2260: replace getSeriesData with test_data

Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>

* TEST-#2260: remove assert_categories_equal

Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>
Signed-off-by: Dmitry Chigarev <dmitry.chigarev@intel.com>
Signed-off-by: Dmitry Chigarev <dmitry.chigarev@intel.com>
Signed-off-by: ienkovich <ilya.enkovich@intel.com>
Signed-off-by: Alexey Prutskov <alexey.prutskov@intel.com>
Signed-off-by: Dmitry Chigarev <dmitry.chigarev@intel.com>
Signed-off-by: ienkovich <ilya.enkovich@intel.com>
Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>
…partitioning isn't aligned (#2256)

Signed-off-by: Alexey Prutskov <alexey.prutskov@intel.com>
* FIX-#2239: Compute row index start using pandas

Signed-off-by: Devin Petersohn <devin.petersohn@gmail.com>

* FIX-#2239: Documentation

Signed-off-by: Devin Petersohn <devin.petersohn@gmail.com>

* FIX-#2239: Improve testing for case

Signed-off-by: Devin Petersohn <devin.petersohn@gmail.com>
Signed-off-by: Dmitry Chigarev <dmitry.chigarev@intel.com>
Signed-off-by: Dmitry Chigarev <dmitry.chigarev@intel.com>
Signed-off-by: Alexander Myskov <alexander.myskov@intel.com>
Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>
Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>
Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>
* FIX-#0000: Indexing regression

Signed-off-by: Devin Petersohn <devin.petersohn@gmail.com>

* FIX-#0000: Fix `loc`

Signed-off-by: Devin Petersohn <devin.petersohn@gmail.com>

* FIX-#0000: Fix DatetimeIndex

Signed-off-by: Devin Petersohn <devin.petersohn@gmail.com>

* FIX-#0000: Fix Datetime and checks

Signed-off-by: Devin Petersohn <devin.petersohn@gmail.com>
Signed-off-by: Devin Petersohn <devin.petersohn@gmail.com>
* DOCS-#2193: update contributing doc

Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>
Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>
…#2318)

Signed-off-by: Igoshev, Yaroslav <yaroslav.igoshev@intel.com>
…2332)

* FIX-#2269: Move `default_to_pandas` logic from API layer to backend

Signed-off-by: Igoshev, Yaroslav <yaroslav.igoshev@intel.com>

* FIX-#2269: Added a test which calls _apply_agg_function

Signed-off-by: Gregory Shimansky <gregory.shimansky@intel.com>

* FIX-#2269: Added required arguments for groupby_agg

Moved wrap_udf_function into backend because omnisci doesn't support
executing lambdas.

Signed-off-by: Gregory Shimansky <gregory.shimansky@intel.com>

* FIX-#2269: Use correct default_to_pandas for groupby in backend,
refactor default to pandas functions in BaseQC

Signed-off-by: Igoshev, Yaroslav <yaroslav.igoshev@intel.com>

* FIX-#2269: Renamed new default_to_pandas_groupby function

into private function of Pandas backend because it is not used anywhere
else.

Signed-off-by: Gregory Shimansky <gregory.shimansky@intel.com>

* FIX-#2269: Fixed specification of backend

now it is possible to specify --backend=PandasOnDask,
--backend=PandasOnRay or --backend=PandasOnPython, not just
--backend=BaseOnPython.

Signed-off-by: Gregory Shimansky <gregory.shimansky@intel.com>

* FIX-#2269: Fix BaseOnPython tests

Signed-off-by: Igoshev, Yaroslav <yaroslav.igoshev@intel.com>

* FIX-#2269: Remove default_to_pandas_groupby

Signed-off-by: Igoshev, Yaroslav <yaroslav.igoshev@intel.com>

* FIX-#2269: logic of dropping 'by' moved back to API level

Signed-off-by: Dmitry Chigarev <dmitry.chigarev@intel.com>

Co-authored-by: Gregory Shimansky <gregory.shimansky@intel.com>
Co-authored-by: Dmitry Chigarev <dmitry.chigarev@intel.com>
…2336)

Signed-off-by: Alexander Myskov <alexander.myskov@intel.com>
Signed-off-by: Alexey Prutskov <alexey.prutskov@intel.com>
Signed-off-by: Igoshev, Yaroslav <yaroslav.igoshev@intel.com>
anmyachev and others added 28 commits February 4, 2021 10:36
* TEST-#2670: some updates to improve asv tests stability

Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>

* TEST-#2670: fixes

Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>

* TEST-#2670: data_size -> shape

Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>

* TEST-#2670: use dict approach

Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>

* TEST-#2670: use CpuCount when Npartitions isn't defined

Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>

* TEST-#2670: fix ASV_DATASET_SIZE

Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>

* TEST-#2670: update TimeSortValues

Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>

* TEST-#2670: modify asv tests for using with old modin version

Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>

* TEST-#2670: reply to review comments

Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>

* TEST-#2670: use env variables for default values

Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>
* TEST-#2686: add fillna benchmark

Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>

* TEST-#2686: reply to review comments

Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>

* TEST-#2686: add inplace parameter

Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>
* TEST-#2692: add drop benchmark

Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>

* TEST-#2692: add one column case

Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>
* FIX-#2688: Update ray.ObjectID to ray.ObjectRef for Ray 2.0

Resolves #2688

Signed-off-by: Devin Petersohn <devin.petersohn@gmail.com>

* FIX-#2688: Address comments

Signed-off-by: Devin Petersohn <devin.petersohn@gmail.com>
Signed-off-by: Dmitry Chigarev <dmitry.chigarev@intel.com>
Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>
Signed-off-by: Alexey Prutskov <alexey.prutskov@intel.com>
* TEST-#2694: add head benchmark

Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>

* TEST-#2694: add small number for head op

Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>
* TEST-#2705: add 'value_counts' benchmarks

Signed-off-by: Dmitry Chigarev <dmitry.chigarev@intel.com>

* TEST-#2705: apply suggestions from review

Signed-off-by: Dmitry Chigarev <dmitry.chigarev@intel.com>
Signed-off-by: Dmitry Chigarev <dmitry.chigarev@intel.com>
Co-authored-by: Alexey Prutskov <alexey.prutskov@intel.com>
Co-authored-by: Devin Petersohn <devin.petersohn@gmail.com>
Co-authored-by: Dmitry Chigarev <dmitry.chigarev@intel.com>
Co-authored-by: Devin Petersohn <devin-petersohn@users.noreply.github.com>
Signed-off-by: Igoshev, Yaroslav <yaroslav.igoshev@intel.com>
* TEST-#2690: add astype benchmark

Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>

* TEST-#2690: add category dtype; use df.types

Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>

* TEST-#2690: add case with one column

Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>
* TEST-#2702: add loc/iloc benchmark

Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>

* TEST-#2702: add multiindex loc bench

Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>

* TEST-#2702: add row_loc check

Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>
Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>
Signed-off-by: Alexey Prutskov <alexey.prutskov@intel.com>
Signed-off-by: Alina <alina.bykovskaya@intel.com>
Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>
Signed-off-by: Dmitry Chigarev <dmitry.chigarev@intel.com>
Signed-off-by: Dmitry Chigarev <dmitry.chigarev@intel.com>
* TEST-#2722: add ASV read_csv skiprows benchmark

Co-authored-by: Anatoly Myachev <45976948+anmyachev@users.noreply.github.com>
Signed-off-by: Alexander Myskov <alexander.myskov@intel.com>
… class (#2736)

Signed-off-by: Dmitry Chigarev <dmitry.chigarev@intel.com>
…e length (#2733)

Signed-off-by: Dmitry Chigarev <dmitry.chigarev@intel.com>
Signed-off-by: Alexander Myskov <alexander.myskov@intel.com>
Signed-off-by: Dmitry Chigarev <dmitry.chigarev@intel.com>
Signed-off-by: Dmitry Chigarev <dmitry.chigarev@intel.com>
…tion (#2743)

* FIX-#2742: changed callable functions to its names in dict aggregation

Signed-off-by: Dmitry Chigarev <dmitry.chigarev@intel.com>

* FIX-#2742: comments added

Signed-off-by: Dmitry Chigarev <dmitry.chigarev@intel.com>
…2738)

Co-authored-by: Anatoly Myachev <45976948+anmyachev@users.noreply.github.com>
Signed-off-by: Alexander Myskov <alexander.myskov@intel.com>
* Resolves #2584

Signed-off-by: Devin Petersohn <devin.petersohn@gmail.com>
@aregm
Owner Author

aregm commented Feb 18, 2021

Sync

@aregm aregm merged commit 62bbe39 into aregm:master Feb 18, 2021