-
-
Notifications
You must be signed in to change notification settings - Fork 18k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
DOC: clean-up 0.21.0 whatsnew file #18001
DOC: clean-up 0.21.0 whatsnew file #18001
Conversation
This cleans-up a little bit the whatsnew (few small mistakes / formatting errors, adding parquet section (now was only in the highlights), restructures new features a little bit) |
doc/source/whatsnew/v0.21.0.txt
Outdated
- New user-facing :class:`pandas.api.types.CategoricalDtype` for specifying | ||
categoricals independent of the data, see :ref:`here <whatsnew_0210.enhancements.categorical_dtype>`. | ||
- The behavior of ``sum`` and ``prod`` on all-NaN Series/DataFrames is now consistent and no longer depends on whether `bottleneck <http://berkeleyanalytics.com/bottleneck>`__ is installed, see :ref:`here <whatsnew_0210.api_breaking.bottleneck>` | ||
- Compatibility fixes for pypy, see :ref:`here <whatsnew_0210.pypy>`. | ||
- ``GroupBy`` objects now have a ``pipe`` method, similar to the one on ``DataFrame`` and ``Series``. | ||
This allows for functions that take a ``GroupBy`` to be composed in a clean, readable syntax, see :ref:`here <whatsnew_0210.enhancements.GroupBy_pipe>`. | ||
- Additions to the ``drop``, ``reindex`` and ``rename`` API (see :ref:`here <whatsnew_0210.enhancements.drop_api>`) and new methods ``infer_objects`` (see :ref:`here <whatsnew_0210.enhancements.infer_objects>`) and ``GroupBy.pipe`` (see :ref:`here <whatsnew_0210.enhancements.GroupBy_pipe>`). |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thoughts about splitting this into two? One for drop / reindex / rename
(additions to existing APIs) and the second for infer_objects
and GroupBy.pipe
(new methods).
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yep, that's fine. I mainly added it because now it felt a bit strange to speak about pipe and not the other subsections in the 'new features' section. Can also leave out entirely.
But will split and add a bit more context.
doc/source/whatsnew/v0.21.0.txt
Outdated
|
||
Integration with `Apache Parquet <https://parquet.apache.org/>`__, including a new top-level :func:`read_parquet` and :func:`DataFrame.to_parquet` method, see :ref:`here <io.parquet>` (:issue:`15838`, :issue:`17438`). | ||
|
||
`Apache Parquet <https://parquet.apache.org/>`__ provides a partitioned binary columnar serialization for data frames. It is designed to |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Perhaps, "It is provides a language-agnostic file format for reading and writing data frames efficiently."
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Or "cross-language" if you prefer that to language-agnostic.
doc/source/whatsnew/v0.21.0.txt
Outdated
|
||
.. _whatsnew_0210.enhancements.other: | ||
|
||
Other Enhancements | ||
^^^^^^^^^^^^^^^^^^ | ||
|
||
- The ``validate`` argument for :func:`merge` now checks whether a merge is one-to-one, one-to-many, many-to-one, or many-to-many. If a merge is found to not be an example of specified merge type, an exception of type ``MergeError`` will be raised. For more, see :ref:`here <merging.validation>` (:issue:`16270`) | ||
- Added support for `PEP 518 <https://www.python.org/dev/peps/pep-0518/>`_ (``pyproject.toml``) to the build system (:issue:`16745`) | ||
New functions or methods: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
make these sub-sections
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
checking locally how it looks (problem is that you hardly see the difference between the current section and subsection, and this would add a subsubsection, and don't want to do that if there is hardly a difference. I can eg make it bold though, if the subsubsection doesn't work out nicely)
doc/source/whatsnew/v0.21.0.txt
Outdated
|
||
Integration with `Apache Parquet <https://parquet.apache.org/>`__, including a new top-level :func:`read_parquet` and :func:`DataFrame.to_parquet` method, see :ref:`here <io.parquet>` (:issue:`15838`, :issue:`17438`). | ||
|
||
`Apache Parquet <https://parquet.apache.org/>`__ provides a partitioned binary columnar serialization for data frames. It is designed to |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Or "cross-language" if you prefer that to language-agnostic.
doc/source/whatsnew/v0.21.0.txt
Outdated
languages easy. Parquet can use a variety of compression techniques to shrink the file size as much as possible | ||
while still maintaining good read performance. | ||
Parquet is designed to faithfully serialize and de-serialize ``DataFrame`` s, supporting all of the pandas | ||
dtypes, including extension dtypes such as datetime with tz. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
"tz" -> "timezone"?
2018-03-31 10.0 | ||
Freq: 2Q-DEC, dtype: float64 | ||
Sum/Prod of all-NaN Series/DataFrames is now consistently NaN | ||
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This commit is a purely reordering (putting eg sum of all NaN first, as this is included in the highlights).
The "list with missing values- indexing" might maybe also deserve a highlight?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The "list with missing values- indexing" might maybe also deserve a highlight?
Yeah, that seems reasonable.
Codecov Report
@@ Coverage Diff @@
## master #18001 +/- ##
=========================================
Coverage ? 91.23%
=========================================
Files ? 163
Lines ? 50173
Branches ? 0
=========================================
Hits ? 45774
Misses ? 4399
Partials ? 0
Continue to review full report at Codecov.
|
* DOC: clean-up 0.21.0 whatsnew file * literal include warning * split highlight * reorder subsections in API changes (somewhat more in order of importance) * move python 3.4 support drop to section dropped version support of deps * minor formatting * add indexing with list of partly missing labels to highlights * udpate parquet explanation * update highlights in release notes * format as subsubsections * wording
* DOC: clean-up 0.21.0 whatsnew file * literal include warning * split highlight * reorder subsections in API changes (somewhat more in order of importance) * move python 3.4 support drop to section dropped version support of deps * minor formatting * add indexing with list of partly missing labels to highlights * udpate parquet explanation * update highlights in release notes * format as subsubsections * wording
No description provided.