Skip to content

grigorevmp/-mephi-G-Research-Crypto-Forecasting

Repository files navigation

G-Research-Crypto-Forecasting

Logo

Link: https://www.kaggle.com/c/g-research-crypto-forecasting

Rules

Run notebook with "Internet" turned off option and load it to Submition

For submition Notebook will run on close data

Our team

Content

Models

Notebooks with ML models

  1. NoML Model (0.99999)

  2. LGBM estimators (0.7840)

  3. XGBoost estimators (0.4023)

  4. XGBoost (0.0351)

  5. CatBoost (0.0034)

  6. SVR Regression (-0.0015)

  7. Elastic Net (-0.0020)

  8. Lasso (-0.0020)

  9. Ridge (-0.0020)

  10. SGD Regressor (-0.0056)

  11. Polynomial regression (-1)

Visualisation

Notebook with some research

Time series visualisation and research

Hyperparameters searching

Notebooks with params finding code

  1. Cat Boost

  2. ElasticNet

  3. GB Regressor

  4. Lasso

  5. LGBM

  6. Ridge

  7. SGD

  8. SVR

  9. XGBoost

Best score

Most of first places use nonML solution, that give them the best score

They published their solutions with fake 0.9999 score, but really has only close to 0 score on ml public notebooks (not -1 so - not great not terrible)

We've also created similiar solution and submited it

So we take 407th place (but all others has similar 0.9999 score and result depends on random close data batch)

More legal solution gave us 0.7840 score and 468th place on LGBM Model

Troubles.

The data submission was not on a file with predicted values, but by executing a notebook on the server side on a closed dataset. Due to the limitations on the number of launches, GPU/TPU time per week, and the limitation on the execution time, the iterative research process proved to be difficult. But we tried to optimally distribute roles and goals in the team so as not to waste time and attempts in vain.

Leaderboard screenshots

Non ML

ML

ToDo for last days

Models

  • Focus on LGBM Research as the best score model
    • 10k estimators @kciNik 2022-01-07
    • 15k estimators @kciNik 2022-01-07
    • 12.5k estimators @grigorevmp 2022-01-07
    • 20k estimators @kciNik 2022-01-07
    • 10k estimators supplemental_train @grigorevmp 2022-01-07
    • 10k estimators supplemental_train reduced memory version @grigorevmp 2022-01-07
    • other estimators (depends on results) @grigorevmp @kciNik 2022-01-08
  • different hyperparameters analitics @moskovchuk @marat1804 2022-01-08
  • full models results analitics @marat1804 @grigorevmp 2022-01-09

Theory part

  • markdown theory part @kciNik @moskovchuk 2022-01-09

Exam

  • create powerful presentation @grigorevmp @kciNik 2022-01-08
  • pass the exam @grigorevmp @kciNik @moskovchuk @marat1804 2022-01-10

Releases

No releases published

Packages

No packages published