About usage of text description #3

EricGuo5513 · 2022-01-25T06:46:14Z

Hi, thanks for your great work at ICCV. I have a question about the usage of text descriptions. As we know, there can be multiple descriptions for each motion animation. However, the proposed method is a deterministic one-to-one mapping. Do you also use multiple descriptions for each motion during training, or do you keep a single specific description?

anindita127 · 2022-01-25T10:23:51Z

Hi, thanks for your interest in our work. The method is trained with multiple text descriptions per motion. You'll find the dataset has multiple annotations for the same kind of action, so it's more of a many-to-one mapping. We also used a pose discriminator to train the model, so it's not deterministic.

EricGuo5513 · 2022-01-25T17:16:31Z

Thanks for your reply. I have two further questions, if you don't mind. During training, how do you deal with the variable length of the training motions? During inference, does the proposed method use the ground truth initial pose and motion length (number of frames)?

anindita127 · 2022-01-26T12:15:19Z

Hi, we downsample the training data to a fixed length. Yes, during inference the method uses the ground truth initial pose, and we can specify the number of frames we want to generate (we have trained with 32 frames).
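As a rough illustration of bringing variable-length clips to a fixed length, here is a minimal sketch. It assumes a stride-based downsample followed by keeping the first 32 frames; the function name `to_fixed_length`, the stride of 2, and the choice to reject clips that are too short are assumptions for illustration, not details taken from the repository.

```python
def to_fixed_length(frames, stride=2, num_frames=32):
    """Keep every `stride`-th frame, then the first `num_frames` of the result.

    `stride` is an assumed downsampling factor; 32 matches the training length
    mentioned in this thread.
    """
    downsampled = frames[::stride]
    if len(downsampled) < num_frames:
        # Assumption: clips that are too short are rejected rather than padded.
        raise ValueError("clip is too short after downsampling")
    return downsampled[:num_frames]


# Integers stand in for pose frames here.
motion = list(range(100))
clip = to_fixed_length(motion)
print(len(clip))  # 32
```

Every clip that passes through this function has the same length, so clips can be batched without padding.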

Mathux · 2022-03-04T11:24:18Z

Hi,

When you are evaluating your method against the ground truth motion, which text sequence do you use? (if there are several available for a particular sequence)

As your output has a fixed size, how do you compare with previous work for the evaluation?

Thanks for your help

anindita127 · 2022-03-04T15:49:23Z

Hi, in the data preprocessing we create a one-to-one mapping between each text description and its pose sequence, and use that as our ground truth data. Previous work like language2pose also has a fixed output size. We calculate the mean position and variance metrics, so the evaluation can be done over a fixed sequence length for previous methods as well.

Mathux · 2022-03-04T16:37:48Z

Ok thanks, to be sure I get it, given a motion M1 with the annotations A1/B1/C1, M2 with the annotations A2/B2, you will create (M1/A1), (M1/B1), (M1/C1), (M2/A2) and (M2/B2)?

About the metric computation, I still do not understand. For example, with the APE:

Let's say hypothetically that:

the ground truth P^hat has 40 frames, so t varies between 0 -> 39.
your generation P^ghosh has 32 frames, so t varies between 0 -> 31.
language2pose P^jl2p has 35 frames, so t varies between 0 -> 34.

How do you compute "P^ghosh_t - P^hat_t"? What will be the "t"?

anindita127 · 2022-03-04T21:15:32Z

> Ok thanks, to be sure I get it, given a motion M1 with the annotations A1/B1/C1, M2 with the annotations A2/B2, you will create (M1/A1), (M1/B1), (M1/C1), (M2/A2) and (M2/B2)?

Yes, we create such pairs during data preprocessing.
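The pairing described above can be sketched as follows. The function name `expand_pairs` and the dictionary layout are hypothetical; the actual preprocessing code in the repository may be organised differently.

```python
def expand_pairs(annotations):
    """Turn {motion_id: [descriptions]} into a flat list of (motion_id, description) pairs."""
    return [
        (motion_id, text)
        for motion_id, texts in annotations.items()
        for text in texts
    ]


# The example from the question: M1 has three annotations, M2 has two.
annotations = {"M1": ["A1", "B1", "C1"], "M2": ["A2", "B2"]}
print(expand_pairs(annotations))
# [('M1', 'A1'), ('M1', 'B1'), ('M1', 'C1'), ('M2', 'A2'), ('M2', 'B2')]
```

Each resulting pair is treated as its own sample, so a motion with three annotations appears three times.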

> How do you compute "P^ghosh_t - P^hat_t"? What will be the "t"?

During evaluation you can take a fixed number of timesteps (for us, 32). We have downsampled the dataset and taken the first 32 frames to train and evaluate. We have done the same with jl2p for the comparison.
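To make the choice of t concrete, here is a minimal sketch of a per-joint position error restricted to the first 32 timesteps of both sequences. It assumes APE is the mean Euclidean distance between predicted and ground truth joint positions; the function name and the data layout (a list of frames, each a list of (x, y, z) tuples) are hypothetical, and the full metric would also average over all test sequences.

```python
import math


def average_position_error(pred, truth, joint, num_frames=32):
    """Mean Euclidean distance for one joint over the first `num_frames` timesteps.

    pred, truth: sequences of frames; each frame is a list of (x, y, z) positions.
    """
    if len(pred) < num_frames or len(truth) < num_frames:
        raise ValueError("both sequences need at least num_frames frames")
    total = sum(
        math.dist(pred[t][joint], truth[t][joint]) for t in range(num_frames)
    )
    return total / num_frames


# Hypothetical data: a 40-frame ground truth and a 32-frame generation, one joint each.
truth = [[(0.0, 0.0, 0.0)] for _ in range(40)]
pred = [[(3.0, 4.0, 0.0)] for _ in range(32)]
print(average_position_error(pred, truth, joint=0))  # 5.0
```

Only t = 0..31 enters the sum, so frames 32 to 39 of the longer sequence are ignored.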

Mathux · 2022-03-04T22:07:45Z

Thanks, it is much clearer now.

I have another question about the number of sequences in the test set.
When I sample the sequences with the command:

python sample_wordConditioned.py -load save/ghosh/exp_124_cpk_2stream_h_model_nocurriculum_time_32_weights.p

I noticed that in the folder save/ghosh/exp_124_cpk_2stream_h_model_nocurriculum_time_32/test/ there are only 520 sequences, whereas in language2pose I got 587 sequences for the test set. The 520 sequences are a subset of the 587.

Are you removing some sequences at some point?
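For reference, one way to list exactly which sequences differ between the two test sets is a set difference over their identifiers. This is a minimal sketch: the function name `missing_ids` and the example identifiers are hypothetical, and how identifiers are read from each output folder is left out because it depends on the file naming in each repository.

```python
def missing_ids(reference_ids, subset_ids):
    """Return the sorted ids present in `reference_ids` but absent from `subset_ids`."""
    return sorted(set(reference_ids) - set(subset_ids))


# Hypothetical identifiers standing in for test sequence names.
jl2p_ids = ["seq_001", "seq_002", "seq_003", "seq_004"]
ghosh_ids = ["seq_001", "seq_003"]
print(missing_ids(jl2p_ids, ghosh_ids))  # ['seq_002', 'seq_004']
```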
