-
Notifications
You must be signed in to change notification settings - Fork 1.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support regression task using Angel on Spark GBDT #712
Comments
We have regression now. |
I tried on my regression task by following the instruction provided the document. However, when I load my gbdt model on test dataset and even train dataset, I only got 2 unique prediction value. Could you check the regression code, please? If the code is correct or further info about my task is needed, please let me know. Thanks. |
Could you show your command and the results here? |
Sure. Predict stage: Then I check the $PRED_OUTPUT_PATH folder for prediction value. |
Sorry for the delay, it's international workers' day in China. If the dimension of the training dataset is 4 million, using 3G for the executor is probably below the requirement. So I suspect the training task is not successful. |
hi, sorry for the delay. I used housing data downloaded from https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/regression.html to test again, which contains 506 records, 13 features. Below are my bash scripts. INPUT_TRAIN="$ANGEL_HOME/resources/dataset/housing" xxx/bin/spark-submit Test bash script: INPUT_TRAIN="$ANGEL_HOME/resources/dataset/housing" xxx/bin/spark-submit |
GBDT currently supports only classification. Would you add regression task support later, please? Thanks.
The text was updated successfully, but these errors were encountered: