Gumtree parsing is slow #356
Replies: 2 comments 1 reply
-
Hi Hareem, To establish a more precise diagnostic it would be nice to know GumTree's version, which parser, and which matching algorithm you are using. However, you are right that parsing is usually the performance bottleneck, especially for languages where there is no hand-coded optimized parser. Cheers! |
Beta Was this translation helpful? Give feedback.
-
Hi! I recommend you to use our latest version because we integrated two tree-sitter-based backends for Python (one native, should be the fastest, one using a Python wrapper, the downloadable jar contains only the native one), maybe they are faster. I don't really know what is the overhead of using Docker, but I think the best performance is obtained by running a jar directly. Cheers! |
Beta Was this translation helpful? Give feedback.
-
I'm parsing java and python source code using the gumtree docker image. I have about 25000 commits and it took about 40 hours to parse those files. With javalang I was able to parse in minutes. I'm doing something wrong or is gumtree slow because its more than just a parser.
Thanks
Beta Was this translation helpful? Give feedback.
All reactions