-
Notifications
You must be signed in to change notification settings - Fork 28.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[SPARK-3934] [SPARK-3918] [mllib] Bug fixes for RandomForest, DecisionTree #2785
Conversation
Test FAILed. |
Test FAILed. |
@jkbradley Thanks for the PR! It looks good to me. |
@jkbradley Thanks. LGTM! |
test this please |
QA tests have started for PR 2785 at commit
|
QA tests have finished for PR 2785 at commit
|
Test PASSed. |
@@ -175,6 +175,7 @@ private class RandomForest ( | |||
treeToNodeToIndexInfo, splits, bins, nodeQueue, timer) | |||
timer.stop("findBestSplits") | |||
} | |||
baggedInput.unpersist() |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Could you try to merge master? This is already covered in #2775.
@chouqin @manishamde @mengxr Thanks for taking a look! I think stuff is fixed. |
QA tests have started for PR 2785 at commit
|
QA tests have finished for PR 2785 at commit
|
Test PASSed. |
LGTM. Merged into master. Thanks! |
SPARK-3934: When run with a mix of unordered categorical and continuous features, on multiclass classification, RandomForest fails. The bug is in the sanity checks in getFeatureOffset and getLeftRightFeatureOffsets, which use the wrong indices for checking whether features are unordered.
Fix: Remove the sanity checks since they are not really needed, and since they would require DTStatsAggregator to keep track of an extra set of indices (for the feature subset).
Added test to RandomForestSuite which failed with old version but now works.
SPARK-3918: Added baggedInput.unpersist at end of training.
Also:
CC: @mengxr @manishamde @chouqin @codedeft Just notifying you of these small bug fixes.