Sergey cheremshinsky #9

sergTch · 2021-01-31T09:33:45Z

Description

Start - Preprocessing data "spliting columns. Example:
Embark that has values S, Q, C splited into
Embark_S, Embark_Q, Embark_C with values 0 or 1

Train - made function that train 5 models from sklearn and allow them to vote
on validation test showes ~82.5% accuracy
also made neural network using tensorflow with ~86% accuracy

However both tries showes only 76-80% accuracy on kaggle

How has this been tested?

Executed code few times from start to end without crashing
Only ran once
Code have dedicated unit tests

meanalexrin

LGTM - Looks Good To Me
Except:
Please move you work to the /[username]_code location, as described in readme
https://github.com/rnd4u-org/2021-knu-cairl#development

I'll approve after that
Also that's a good approach that you are using separate .py files rather than a single jupyter notebook

meanalexrin · 2021-02-01T09:24:28Z

main.py

+    x_train, x_test, y_train, y_test = train_test_split(x, y, test_size=0.2)
+    x_train = x
+    y_train = y
+    # exit()


Usually it's better to remove unused code from the project. It helps readability

meanalexrin · 2021-02-01T09:29:01Z

main.py

+    print('Exported!')
+
+
+generateAns()


You may consider using if __name__ == "__main__" in the python code to indicate entry points to the program

https://stackoverflow.com/a/419185

…rn_models.py

…3(mask)/get_faces.py

…3(mask)/load_data.py

…k)/main.py

…k)/log.txt

…sk)/model.py

sergTch · 2021-02-03T22:03:49Z

I fixed placements of files. Added solution for mask classification.
Didn't add to titanic solution if name == "main": but used it in masks
In new project also deleted all code comments however regretted few times. Need to get used to it:)

sergTch · 2021-02-03T23:33:19Z

On masks dataset used inception v3 with 1024 512 and 128 dense layers. Trained in 3 steps
freezing 0 : 210 layers, then 0 : 105 and 210 : 316 and last freezed 105 : 316
that helped to train all parts of cnn and achieve 100% on training data and 99.4% on test
At start splited data on test and train where test is 20% of dataset and train is other 80%

meanalexrin

Please check other comments

Regarding task 3:
You can try rerun model few times without saving .npy indexes. So you'll test model on few different train/test splits

meanalexrin · 2021-02-08T11:12:40Z

sergey_cheremshinsky/task3(mask)/load_data.py

+        y_test = np.load('y_test.npy', allow_pickle=True)
+
+        return (x_train, x_test, y_train, y_test)
+    except Exception:


Try to use less general errors or if statements

meanalexrin · 2021-02-08T11:13:40Z

sergey_cheremshinsky/task3(mask)/main.py

+                epochs=10,
+                validation_data=(x_test, y_test),
+                validation_steps=1511 // batch,
+                steps_per_epoch=6042 // batch,


Please add constants and description for magical numbers like 1511 and 6042

meanalexrin · 2021-02-08T11:14:57Z

sergey_cheremshinsky/task3(mask)/load_data.py

+    x_test = [images[i] for i in indexes[n:]]
+
+    y_train = [labels[i] for i in indexes[:n]]
+    y_test = [labels[i] for i in indexes[n:]]


Please use existing train test split function to assure correct execution

https://scikit-learn.org/stable/modules/generated/sklearn.model_selection.train_test_split.html

sergTch · 2021-02-08T23:56:51Z

I used saving of model and saving of splited data mostly because that's hard for my laptop to finish training in some reasonable amount of time. So I splited training for three nights saving model after each and using same split of data

…sky/task4(faces)/main.py

sergTch · 2021-02-09T00:36:04Z

I fixed mistakes that you mentioned about task 3 and also pushed solution for task 4

sergTch added 9 commits January 29, 2021 19:05

Add files via upload

6b1b2a1

Add files via upload

69c472b

Delete npFunctions.py

2c9387d

Add files via upload

0aa1ba5

Add files via upload

759d07f

Add files via upload

44c5008

Add files via upload

4d21270

Add files via upload

8877bea

Add files via upload

4e6ee2d

sergTch requested review from dlikhomanov and meanalexrin as code owners January 31, 2021 09:33

meanalexrin requested changes Feb 1, 2021

View reviewed changes

Rename main.py to sergey_cheremshinsky/task1(titanic)/main.py

c551077

sergTch requested a review from ARKAD97 as a code owner February 3, 2021 21:18

sergTch added 13 commits February 3, 2021 23:20

Rename ann_model.py to sergey_cheremshinsky/task1(titanic)/ann_model.py

3adc561

Rename sklearn_models.py to sergey_cheremshinsky/task1(titanic)/sklea…

75cb6fb

…rn_models.py

Rename test.csv to sergey_cheremshinsky/task1(titanic)/test.csv

6451a09

Rename titanic.csv to sergey_cheremshinsky/task1(titanic)/titanic.csv

b9e290a

Rename train.csv to sergey_cheremshinsky/task1(titanic)/train.csv

8a5e2e6

Add files via upload

96a068b

Rename sergey_cheremshinsky/get_faces.py to sergey_cheremshinsky/task…

5a8ef42

…3(mask)/get_faces.py

Rename sergey_cheremshinsky/load_data.py to sergey_cheremshinsky/task…

6676cae

…3(mask)/load_data.py

Rename sergey_cheremshinsky/main.py to sergey_cheremshinsky/task3(mas…

4714d29

…k)/main.py

Rename sergey_cheremshinsky/log.txt to sergey_cheremshinsky/task3(mas…

a46cbb9

…k)/log.txt

Rename sergey_cheremshinsky/model.py to sergey_cheremshinsky/task3(ma…

a3c8f90

…sk)/model.py

Add files via upload

b32c9d5

Add files via upload

5554256

meanalexrin requested changes Feb 8, 2021

View reviewed changes

sergTch added 7 commits February 9, 2021 02:15

Add files via upload

5e8426a

Add files via upload

8baff4c

Add files via upload

90c03d5

Create main.py

e40aea8

Rename sergey_cheremshinsky/task4(Faces)/main.py to sergey_cheremshin…

e4bcc45

…sky/task4(faces)/main.py

Add files via upload

d6c180f

Add files via upload

d0b595e

Add files via upload

bddea71

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sergey cheremshinsky #9

Sergey cheremshinsky #9

sergTch commented Jan 31, 2021 •

edited

Loading

meanalexrin left a comment

meanalexrin Feb 1, 2021

meanalexrin Feb 1, 2021

sergTch commented Feb 3, 2021

sergTch commented Feb 3, 2021

meanalexrin left a comment

meanalexrin Feb 8, 2021

meanalexrin Feb 8, 2021

meanalexrin Feb 8, 2021

sergTch commented Feb 8, 2021

sergTch commented Feb 9, 2021

		print('Exported!')


		generateAns()

Sergey cheremshinsky #9

Are you sure you want to change the base?

Sergey cheremshinsky #9

Conversation

sergTch commented Jan 31, 2021 • edited Loading

Description

How has this been tested?

meanalexrin left a comment

Choose a reason for hiding this comment

meanalexrin Feb 1, 2021

Choose a reason for hiding this comment

meanalexrin Feb 1, 2021

Choose a reason for hiding this comment

sergTch commented Feb 3, 2021

sergTch commented Feb 3, 2021

meanalexrin left a comment

Choose a reason for hiding this comment

meanalexrin Feb 8, 2021

Choose a reason for hiding this comment

meanalexrin Feb 8, 2021

Choose a reason for hiding this comment

meanalexrin Feb 8, 2021

Choose a reason for hiding this comment

sergTch commented Feb 8, 2021

sergTch commented Feb 9, 2021

sergTch commented Jan 31, 2021 •

edited

Loading