Skip to content

PyRDF 0.1.0

Compare
Choose a tag to compare
@dpiparo dpiparo released this 09 May 09:21
· 63 commits to master since this release

Executive release notes

Initial functional release of a python wrapper
around ROOT's RDataFrame with supported for the
Spark distributed backend.

New

  • Fully compatible with Python 2.7 and 3.7
  • Run in a python notebook on SWAN connected to a Spark cluster
  • New tutorials, synchronized with ROOT RDataFrame tutorials
  • New documentation to show the usage of PyRDF on SWAN
  • Users can send C++ headers and shared libraries needed for their
    analysis to the Spark executors and use them during distributed
    execution
  • Documentation available on GitHub Pages

Improvements

  • Improve logic for the management of the computational graph. Now
    it is Python version independent and sends to the distributed
    workers only the minimal information required for the execution
    of the operations on the RDataFrame

Bugfixes

  • Python 3 bugs:
    • Import statements now use paths relative to the main folder of
      the project
    • Integers previously declared as long now are only integers
    • Division of integers is now correctly declared as floor() division