Link: https://www.kaggle.com/c/g-research-crypto-forecasting
Run notebook with "Internet" turned off option and load it to Submition
For submition Notebook will run on close data
Notebooks with ML models
Notebook with some research
Time series visualisation and research
Notebooks with params finding code
Most of first places use nonML solution, that give them the best score
They published their solutions with fake 0.9999 score, but really has only close to 0 score on ml public notebooks (not -1 so - not great not terrible)
We've also created similiar solution and submited it
So we take 407th place (but all others has similar 0.9999 score and result depends on random close data batch)
More legal solution gave us 0.7840 score and 468th place on LGBM Model
The data submission was not on a file with predicted values, but by executing a notebook on the server side on a closed dataset. Due to the limitations on the number of launches, GPU/TPU time per week, and the limitation on the execution time, the iterative research process proved to be difficult. But we tried to optimally distribute roles and goals in the team so as not to waste time and attempts in vain.
- Focus on LGBM Research as the best score model
- 10k estimators @kciNik 2022-01-07
- 15k estimators @kciNik 2022-01-07
- 12.5k estimators @grigorevmp 2022-01-07
- 20k estimators @kciNik 2022-01-07
- 10k estimators supplemental_train @grigorevmp 2022-01-07
- 10k estimators supplemental_train reduced memory version @grigorevmp 2022-01-07
- other estimators (depends on results) @grigorevmp @kciNik 2022-01-08
- different hyperparameters analitics @moskovchuk @marat1804 2022-01-08
- full models results analitics @marat1804 @grigorevmp 2022-01-09
- markdown theory part @kciNik @moskovchuk 2022-01-09
- create powerful presentation @grigorevmp @kciNik 2022-01-08
- pass the exam @grigorevmp @kciNik @moskovchuk @marat1804 2022-01-10