Substrait Java is a project that makes it easier to build Substrait plans through Java. The project has two main parts:
- Core is the module that supports building Substrait plans directly in Java, which is much easier than manipulating the Substrait protobuf directly. It has no direct support for going from SQL to Substrait (that is covered by the second part).
- Isthmus is the module that converts SQL into a Substrait plan. It provides both Java APIs and a top-level script for conversion. This module does not support all SQL yet, but it covers a lot: all of the TPC-H queries and all but a few of the TPC-DS queries are translatable.
Substrait Java is built with Gradle. After you've cloned the project through git, execute the following to build it:
./gradlew build
To build the Isthmus executable, which generates Substrait plans from SQL statements, execute the following:
./gradlew nativeImage
A good way to get started is to experiment with building Substrait plans for your own SQL. To do that, you can use the isthmus executable as described here.
Another way to get an idea of what Substrait plans look like is to use our script that generates Substrait plans for all the TPC-H queries:
./isthmus/src/test/script/tpch_smoke.sh
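The three commands above can be chained into one helper. The following is a minimal sketch, not part of the project: the function names `run` and `build_all` and the `DRY_RUN` switch are hypothetical, and it assumes you run it from the directory that contains `gradlew`.

```shell
#!/usr/bin/env bash
# Sketch only: `run`, `build_all`, and DRY_RUN are illustrative names,
# not something the Substrait Java repository provides.

# Run a command, or just print it when DRY_RUN=1 is set.
run() {
  if [ "${DRY_RUN:-0}" = "1" ]; then
    printf '+ %s\n' "$*"
  else
    "$@"
  fi
}

# Execute the README's steps in order, stopping at the first failure.
build_all() {
  # 1. Build the project with Gradle.
  # 2. Build the Isthmus executable.
  # 3. Generate Substrait plans for the TPC-H queries.
  run ./gradlew build &&
    run ./gradlew nativeImage &&
    run ./isthmus/src/test/script/tpch_smoke.sh
}
```

After defining these functions, calling `build_all` runs the steps, while `DRY_RUN=1 build_all` only prints the commands it would run, which is a cheap way to check the sequence before committing to a long build.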
To learn more, head over to Substrait, our parent project, and join our community.