
IndexError: tuple index out of range #38

Open
amir-tagh opened this issue Jun 7, 2022 · 12 comments

Comments

@amir-tagh

Hello,

I am following the example for "Molecule generation pretraining procedure". The first step, "python get_vocab.py --ncpu 16 < data/chembl/all.txt > vocab.txt", completes with no error, but I get "IndexError: tuple index out of range" in the second step:
python preprocess.py --train data/chembl/all.txt --vocab data/chembl/all.txt --ncpu 16 --mode single

Can you please let me know what the problem could be?

Best,
Amir

@orubaba

orubaba commented Jun 14, 2022

#34 should answer your question. I had the same issue. I followed that thread and voilà, it worked.
Run this first:
python preprocess.py --train data/chembl/all.txt --vocab vocab.txt --ncpu 16 --mode single

After it completes, run:
mkdir train_processed

and then:
mv tensor* train_processed/

@amir-tagh
Author

Thanks for your response.

I have a set of SMILES I am working on. Extracting the substructures completes successfully, but the second step gives the following error.

Do you have any idea what the problem could be?

Thanks for your help.

python preprocess.py --train Inforna_correct_for_ML.txt --vocab inforna_vocab.txt --ncpu 16 --mode single

File "preprocess.py", line 109, in
le = (len(all_data) + num_splits - 1) // num_splits
ZeroDivisionError: integer division or modulo by zero

@orubaba

orubaba commented Jun 16, 2022

I suggest you adjust the num_splits formula:
num_splits = len(all_data) // 1000
If len(all_data) < 1000, num_splits becomes 0 because of the floor division, which is what triggers the ZeroDivisionError. Use a denominator that gives num_splits >= 1, e.g. 100, 10, or 5.
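A minimal sketch of that guard (compute_splits is a hypothetical helper for illustration, not a function in preprocess.py):

```python
# Guard against a zero split count when the dataset is small.
# preprocess.py computes the split count as len(all_data) // 1000, which
# floors to 0 for fewer than 1000 molecules and later causes
# "ZeroDivisionError: integer division or modulo by zero".
def compute_splits(n_data, chunk_size=1000):
    """Number of output tensor files, never less than 1."""
    return max(1, n_data // chunk_size)

print(compute_splits(250))   # 1  (small dataset no longer crashes)
print(compute_splits(4500))  # 4
```

Clamping with max(1, ...) keeps the original behavior for large datasets while avoiding the division by zero for small ones.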

@amir-tagh
Author

Thanks a lot for your help.
Now I am at the third step "Train graph generation model" and I am getting the following error. I googled the error but couldnt find a solution.

Thanks,

here is the pytorch version I am using, if it helps:

Name Version Build Channel

pytorch 1.11.0 py3.7_cuda11.1_cudnn8.0.5_0 pytorch

Traceback (most recent call last):
File "train_generator.py", line 96, in
meters = meters + np.array([kl_div, loss.item(), wacc * 100, iacc * 100, tacc * 100, sacc * 100])
File "/home/amir/anaconda3/envs/sampledock/lib/python3.7/site-packages/torch/_tensor.py", line 732, in __array__
return self.numpy()
TypeError: can't convert cuda:0 device type tensor to numpy. Use Tensor.cpu() to copy the tensor to host memory first.
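The usual fix for this error is to move each tensor to host memory before NumPy sees it, as the message suggests. A sketch of the pattern (to_float is a hypothetical helper; the metric names stand in for those in train_generator.py):

```python
import numpy as np

def to_float(x):
    """Return a Python float from a torch tensor (CPU or CUDA) or a plain number."""
    if hasattr(x, "detach"):  # duck-typed check for torch.Tensor
        return float(x.detach().cpu())
    return float(x)

# With real torch tensors on cuda:0 the failing line would become:
#   meters = meters + np.array([to_float(kl_div), to_float(loss),
#                               to_float(wacc) * 100, to_float(iacc) * 100, ...])
meters = np.array([to_float(0.12), to_float(0.5) * 100])
print(meters)
```

Calling .item() directly on each scalar tensor (wacc.item() * 100, etc.) achieves the same thing, which is what the patch later in this thread does.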

@orubaba

orubaba commented Jun 17, 2022

The error is due to the lack of an NVIDIA GPU enabling CUDA on your machine.

@amir-tagh
Author

But I do have an NVIDIA GPU:

NVIDIA-SMI 470.129.06   Driver Version: 470.129.06   CUDA Version: 11.4
GPU 0: Quadro P2200 | Persistence-M: Off | Bus-Id: 00000000:65:00.0 | Disp.A: On | Uncorr. ECC: N/A
Fan: 44% | Temp: 30C | Perf: P5 | Pwr: 9W / 75W | Memory: 1842MiB / 5050MiB | GPU-Util: 68% | Compute M.: Default | MIG M.: N/A

@orubaba

orubaba commented Jun 18, 2022

Perhaps the driver is not properly installed. Something must be wrong somewhere.

@orubaba

orubaba commented Jun 18, 2022

maybe this can help: 2e56392

@amir-tagh
Author

Thanks, I finally figured out what was wrong and now it is working.

Now I have a problem with finetune_generator.py

I ran chemprop_train on my dataset and got the following in the save_dir:
args.json, fold_0, verbose.log, test_scores.csv, quiet.log

After running finetune_generator.py I get the following error. Can you please let me know how I can trace the problem?

Thanks for your help.


Traceback (most recent call last):
File "/apps/hgraph2graph/20210428/hgraph2graph/finetune_generator.py", line 124, in
score_func = Chemprop(args.chemprop_model)
File "/apps/hgraph2graph/20210428/hgraph2graph/finetune_generator.py", line 37, in __init__
scaler, features_scaler = load_scalers(fname)
ValueError: too many values to unpack (expected 2)

@muammar

muammar commented Jun 27, 2022

Traceback (most recent call last):
File "/apps/hgraph2graph/20210428/hgraph2graph/finetune_generator.py", line 124, in
score_func = Chemprop(args.chemprop_model)
File "/apps/hgraph2graph/20210428/hgraph2graph/finetune_generator.py", line 37, in __init__
scaler, features_scaler = load_scalers(fname)
ValueError: too many values to unpack (expected 2)

Did you solve it? I am trying to figure it out; if I manage to solve it, I will push the changes to my own version of this package: https://github.com/muammar/hgraph2graph

@muammar

muammar commented Jun 27, 2022

OK, I solved it. First, your fine-tune set must not have any headers. It should look like this:

CC
CCO
CNOO
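If your file still carries a header row, it can be stripped with a one-liner (the filenames below are hypothetical, standing in for your fine-tune set):

```shell
# Create an example fine-tune file that wrongly starts with a header row.
printf 'smiles\nCC\nCCO\n' > finetune_with_header.txt
# Drop the first line (the header) and keep only the SMILES.
tail -n +2 finetune_with_header.txt > finetune.txt
cat finetune.txt
```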

Then, you need to apply the following patch:

diff --git a/finetune_generator.py b/finetune_generator.py
index d406d38..995cad3 100755
--- a/finetune_generator.py
+++ b/finetune_generator.py
@@ -35,9 +35,9 @@ class Chemprop(object):
             for fname in files:
                 if fname.endswith(".pt"):
                     fname = os.path.join(root, fname)
-                    scaler, features_scaler = load_scalers(fname)
-                    self.scalers.append(scaler)
-                    self.features_scalers.append(features_scaler)
+                    # scaler, features_scaler = load_scalers(fname)
+                    # self.scalers.append(scaler)
+                    # self.features_scalers.append(features_scaler)
                     model = load_checkpoint(fname)
                     self.checkpoints.append(model)
 
@@ -164,10 +164,10 @@ if __name__ == "__main__":
                     [
                         kl_div,
                         loss.item(),
-                        wacc * 100,
-                        iacc * 100,
-                        tacc * 100,
-                        sacc * 100,
+                        wacc.item() * 100,
+                        iacc.item() * 100,
+                        tacc.item() * 100,
+                        sacc.item() * 100,
                     ]
                 )

See: muammar@a714e29

@amir-tagh
Author

Hi muammar,

Thanks for the solution.
I am using train_translator.py for lead optimization and I am getting the following error. Have you seen this error before? Do you know how to solve it?

Thanks,

Traceback (most recent call last):
File "/apps/hgraph2graph/20210428/hgraph2graph/train_translator.py", line 86, in
loss, kl_div, wacc, iacc, tacc, sacc = model(*batch)
File "/apps/hgraph2graph/20210428/lib/python3.7/site-packages/torch/nn/modules/module.py", line 1110, in _call_impl
return forward_call(*input, **kwargs)
TypeError: forward() missing 2 required positional arguments: 'y_orders' and 'beta'
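This error means the training loop unpacks a batch with fewer positional arguments than the model's forward() declares, which usually points to a version mismatch between train_translator.py and the model code. A minimal reproduction of the mismatch (the class and signature below are illustrative, not the real model):

```python
class Translator:
    # Illustrative signature mirroring the two parameters named in the
    # traceback; the real model's forward() takes the graph batch tensors too.
    def forward(self, x_batch, y_batch, y_orders, beta):
        return x_batch, y_batch, y_orders, beta

model = Translator()
batch = ("x_batch", "y_batch")  # the unpacked batch is two items short

try:
    model.forward(*batch)
except TypeError as e:
    print(e)  # forward() missing 2 required positional arguments: 'y_orders' and 'beta'
```

Checking that the preprocessing step and the training script come from the same commit of the repository should make the batch contents and the forward() signature agree again.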
