-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathtraining_log.txt
81 lines (81 loc) · 5.01 KB
/
training_log.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
Epoch [1/3], Step [1/3236], Loss: 9.0844, Perplexity: 8816.7010
Epoch [1/3], Step [2/3236], Loss: 8.9074, Perplexity: 7386.7494
Epoch [1/3], Step [3/3236], Loss: 8.8236, Perplexity: 6792.7119
Epoch [1/3], Step [4/3236], Loss: 8.3980, Perplexity: 4438.3032
Epoch [1/3], Step [5/3236], Loss: 7.9748, Perplexity: 2906.7102
Epoch [1/3], Step [6/3236], Loss: 7.3343, Perplexity: 1531.9573
Epoch [1/3], Step [7/3236], Loss: 6.5059, Perplexity: 669.0986
Epoch [1/3], Step [8/3236], Loss: 5.9713, Perplexity: 392.0092
Epoch [1/3], Step [9/3236], Loss: 5.4200, Perplexity: 225.8871
Epoch [1/3], Step [10/3236], Loss: 5.0888, Perplexity: 162.1994
Epoch [1/3], Step [11/3236], Loss: 5.0440, Perplexity: 155.0911
Epoch [1/3], Step [12/3236], Loss: 4.8034, Perplexity: 121.9262
Epoch [1/3], Step [13/3236], Loss: 4.6961, Perplexity: 109.5161
Epoch [1/3], Step [14/3236], Loss: 4.8600, Perplexity: 129.0181
Epoch [1/3], Step [15/3236], Loss: 4.9689, Perplexity: 143.8633
Epoch [1/3], Step [16/3236], Loss: 4.8758, Perplexity: 131.0817
Epoch [1/3], Step [17/3236], Loss: 4.8730, Perplexity: 130.7155
Epoch [1/3], Step [18/3236], Loss: 4.7580, Perplexity: 116.5120
Epoch [1/3], Step [19/3236], Loss: 4.7800, Perplexity: 119.1001
Epoch [1/3], Step [20/3236], Loss: 5.0029, Perplexity: 148.8435
Epoch [1/3], Step [21/3236], Loss: 5.2236, Perplexity: 185.6095
Epoch [1/3], Step [22/3236], Loss: 4.5812, Perplexity: 97.6342
Epoch [1/3], Step [23/3236], Loss: 4.6050, Perplexity: 99.9813
Epoch [1/3], Step [24/3236], Loss: 4.7787, Perplexity: 118.9476
Epoch [1/3], Step [25/3236], Loss: 4.7051, Perplexity: 110.5085
Epoch [1/3], Step [26/3236], Loss: 4.4209, Perplexity: 83.1750
Epoch [1/3], Step [27/3236], Loss: 4.5091, Perplexity: 90.8441
Epoch [1/3], Step [28/3236], Loss: 4.4497, Perplexity: 85.6009
Epoch [1/3], Step [29/3236], Loss: 4.5777, Perplexity: 97.2902
Epoch [1/3], Step [30/3236], Loss: 4.4447, Perplexity: 85.1750
Epoch [1/3], Step [31/3236], Loss: 4.6699, Perplexity: 106.6902
Epoch [1/3], Step [32/3236], Loss: 4.3414, Perplexity: 76.8157
Epoch [1/3], Step [33/3236], Loss: 4.3438, Perplexity: 76.9972
Epoch [1/3], Step [34/3236], Loss: 4.2295, Perplexity: 68.6847
Epoch [1/3], Step [35/3236], Loss: 4.5516, Perplexity: 94.7801
Epoch [1/3], Step [36/3236], Loss: 4.4120, Perplexity: 82.4321
Epoch [1/3], Step [37/3236], Loss: 4.4034, Perplexity: 81.7254
Epoch [1/3], Step [38/3236], Loss: 4.4181, Perplexity: 82.9385
Epoch [1/3], Step [39/3236], Loss: 4.1167, Perplexity: 61.3543
Epoch [1/3], Step [40/3236], Loss: 4.4855, Perplexity: 88.7246
Epoch [1/3], Step [41/3236], Loss: 4.2492, Perplexity: 70.0502
Epoch [1/3], Step [42/3236], Loss: 4.1785, Perplexity: 65.2666
Epoch [1/3], Step [43/3236], Loss: 4.6685, Perplexity: 106.5376
Epoch [1/3], Step [44/3236], Loss: 4.0841, Perplexity: 59.3871
Epoch [1/3], Step [45/3236], Loss: 4.2382, Perplexity: 69.2818
Epoch [1/3], Step [46/3236], Loss: 4.1098, Perplexity: 60.9336
Epoch [1/3], Step [47/3236], Loss: 4.1111, Perplexity: 61.0158
Epoch [1/3], Step [48/3236], Loss: 4.2059, Perplexity: 67.0834
Epoch [1/3], Step [49/3236], Loss: 4.1828, Perplexity: 65.5519
Epoch [1/3], Step [50/3236], Loss: 4.0373, Perplexity: 56.6737
Epoch [1/3], Step [51/3236], Loss: 4.2920, Perplexity: 73.1113
Epoch [1/3], Step [52/3236], Loss: 4.4773, Perplexity: 88.0001
Epoch [1/3], Step [53/3236], Loss: 4.8033, Perplexity: 121.9166
Epoch [1/3], Step [54/3236], Loss: 4.2803, Perplexity: 72.2630
Epoch [1/3], Step [55/3236], Loss: 4.0908, Perplexity: 59.7871
Epoch [1/3], Step [56/3236], Loss: 4.0627, Perplexity: 58.1323
Epoch [1/3], Step [57/3236], Loss: 4.1223, Perplexity: 61.7041
Epoch [1/3], Step [58/3236], Loss: 4.1662, Perplexity: 64.4697
Epoch [1/3], Step [59/3236], Loss: 4.1233, Perplexity: 61.7623
Epoch [1/3], Step [60/3236], Loss: 4.1123, Perplexity: 61.0887
Epoch [1/3], Step [61/3236], Loss: 3.9992, Perplexity: 54.5521
Epoch [1/3], Step [62/3236], Loss: 4.0803, Perplexity: 59.1655
Epoch [1/3], Step [63/3236], Loss: 3.8841, Perplexity: 48.6216
Epoch [1/3], Step [64/3236], Loss: 3.9296, Perplexity: 50.8880
Epoch [1/3], Step [65/3236], Loss: 4.2950, Perplexity: 73.3289
Epoch [1/3], Step [66/3236], Loss: 4.3962, Perplexity: 81.1440
Epoch [1/3], Step [67/3236], Loss: 4.0209, Perplexity: 55.7502
Epoch [1/3], Step [68/3236], Loss: 4.2978, Perplexity: 73.5345
Epoch [1/3], Step [69/3236], Loss: 3.9298, Perplexity: 50.8990
Epoch [1/3], Step [70/3236], Loss: 4.0703, Perplexity: 58.5770
Epoch [1/3], Step [71/3236], Loss: 4.0420, Perplexity: 56.9398
Epoch [1/3], Step [72/3236], Loss: 4.0211, Perplexity: 55.7644
Epoch [1/3], Step [73/3236], Loss: 3.9055, Perplexity: 49.6773
Epoch [1/3], Step [74/3236], Loss: 3.9931, Perplexity: 54.2228
Epoch [1/3], Step [75/3236], Loss: 3.9801, Perplexity: 53.5203
Epoch [1/3], Step [76/3236], Loss: 3.8308, Perplexity: 46.1001
Epoch [1/3], Step [77/3236], Loss: 4.8682, Perplexity: 130.0801
Epoch [1/3], Step [78/3236], Loss: 3.8343, Perplexity: 46.2608
Epoch [1/3], Step [79/3236], Loss: 4.0355, Perplexity: 56.5696
Epoch [1/3], Step [80/3236], Loss: 3.8053, Perplexity: 44.9380
Epoch [1/3], Step [81/3236], Loss: 3.9393, Perplexity: 51.3839