
Allow for transforms.ToTensor to return a Tensor of type torch.uint8 #1595

Closed
r-zenine opened this issue Nov 19, 2019 · 5 comments

Comments

@r-zenine
Contributor

Hi Everyone,

This is my first issue in the project. Please forgive me if I overstep.

When transforming PIL images with Torchvision, I would like the ability to get back a uint8 Tensor.

I am reimplementing a flavor of DQN with experience replay, where I have to store frames of an Atari game in a big buffer. Storing these tensors as uint8 rather than float lets me save memory and use bigger buffers.
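As a rough illustration of the memory argument (the buffer size and frame dimensions below are hypothetical, not from this issue), switching a replay buffer from float32 to uint8 is a flat 4x saving:

```python
# Sketch (hypothetical sizes): memory footprint of a replay buffer holding
# 100,000 preprocessed 84x84 grayscale Atari frames, computed from dtype
# item sizes without actually allocating the buffers.
import numpy as np

capacity, h, w = 100_000, 84, 84
n_pixels = capacity * h * w

bytes_u8 = n_pixels * np.dtype(np.uint8).itemsize    # 1 byte per pixel
bytes_f32 = n_pixels * np.dtype(np.float32).itemsize  # 4 bytes per pixel

print(f"uint8:   {bytes_u8 / 2**20:.0f} MiB")
print(f"float32: {bytes_f32 / 2**20:.0f} MiB")
```

The ratio is exactly the dtype item-size ratio, so the saving holds regardless of buffer capacity or frame size.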

I don't know whether the requested feature is even relevant for you, or whether it makes sense to update ToTensor to do it.

I am a big fan of this piece of technology and would like to contribute to PyTorch more in the future; I would be willing to do the work as my first modest contribution if you think this could be useful.

Thank you all

@pmeier
Collaborator

pmeier commented Nov 19, 2019

Hi @r-zenine

As of now, images in torchvision are always torch.FloatTensor, and it is assumed that the values lie in the interval between 0 and 1. Thus, I think having it as a parameter in transforms.ToTensor is not useful at the moment. However, there is a movement to also support torch.ByteTensor (uint8) images in the future. (I think I read that proposal somewhere, but I can't find the issue / PR. @fmassa, can you help me out here?)


I would like to contribute to PyTorch more in the future because I am a big fan of this piece of technology and would be willing to do the work as my first modest contribution if you think this can be useful.

That is great. You could look at issues tagged with help wanted. You can do so by adding label:"help wanted" in the issue search bar or by clicking on the label of another issue tagged with it. In the context of torchvision.transforms, #1375 could be a good starting point. Some reasonably simple transformations, such as center_crop, are still open.


From your comment, I'm not sure if you know how to implement a custom transformation that does what you want. If you need help, feel free to ask about it in another comment.
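For reference, such a custom transformation could look like the sketch below. The class name `ToUint8Tensor` is made up for illustration and is not part of torchvision; it simply returns the raw bytes in CHW layout without scaling to 0-1:

```python
# Sketch of a custom transform (hypothetical name, not a torchvision API)
# that converts a PIL image or ndarray to a torch.uint8 tensor, unscaled.
import numpy as np
import torch

class ToUint8Tensor:
    def __call__(self, pic):
        arr = np.asarray(pic, dtype=np.uint8)  # PIL image / ndarray -> HWC
        if arr.ndim == 2:                      # grayscale: add a channel axis
            arr = arr[:, :, None]
        # copy() detaches from the source buffer before wrapping in a tensor;
        # permute reorders HWC -> CHW to match torchvision's convention.
        return torch.from_numpy(arr.copy()).permute(2, 0, 1).contiguous()
```

It can be dropped into a `transforms.Compose` pipeline in place of `ToTensor` whenever unscaled bytes are wanted.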

@r-zenine
Contributor Author

r-zenine commented Nov 19, 2019

Hi @pmeier,

I wanted to be as respectful as possible in my comment so as not to overstep. Since I am fairly new to Torchvision, I assumed I might not be aware of some design decisions taken by the project, and I did not want to be rude!

What is the rationale behind wanting only torch.FloatTensor in Torchvision? Of course, there is the fact that convnets expect that as input. But am I missing something else?

@pmeier Thanks for suggesting #1375. I will take a look this weekend, try to put something together, and maybe ask questions here if I need to.

@fmassa
Member

fmassa commented Nov 20, 2019

Hi @r-zenine ,

Thanks for opening the issue!

As @pmeier pointed out, we currently always return float tensors with values between 0 and 1. This is good for consistency: all images are float in 0-1.
We could have other conventions, such as native uint8 (in 0-255) or float in 0-255, for example.
We use float 0-1 for legacy reasons (Lua Torch used images in float 0-1), but it generally works OK.
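For concreteness, here is a small sketch (not from the thread) of moving between the two conventions mentioned above, native uint8 in 0-255 and the default float in 0-1:

```python
# Converting between uint8 (0-255) and float (0-1) image conventions.
import torch

img_u8 = torch.randint(0, 256, (3, 8, 8), dtype=torch.uint8)

img_f = img_u8.to(torch.float32) / 255.0           # uint8 0-255 -> float 0-1
img_back = (img_f * 255.0).round().to(torch.uint8)  # float 0-1 -> uint8

# round() absorbs floating-point error, so the round trip is lossless.
assert torch.equal(img_u8, img_back)
```

This is essentially what `ToTensor` does in one direction; the choice of convention only changes where (and whether) the division by 255 happens.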

The discussion that @pmeier mentioned is, I believe, #1179. In particular, I discuss uint8 and storing values in 0-1 or 0-255 there.

Have a look and let me know your thoughts!

@r-zenine
Contributor Author

Hi @fmassa,

Thank you very much for your response.

I am sorry, I would have liked to look at it now, but unfortunately I am a bit short on time today and tomorrow. If that's okay with you, I will take a look at everything this weekend and share my thoughts.

Thanks,

@vfdev-5
Collaborator

vfdev-5 commented Oct 22, 2020

@r-zenine there is also the option of using a custom ToTensor transform, as is done for the targets in the segmentation task:

```python
import numpy as np
import torch
from torchvision.transforms import functional as F

class ToTensor(object):
    def __call__(self, image, target):
        image = F.to_tensor(image)
        target = torch.as_tensor(np.asarray(target), dtype=torch.int64)
        return image, target
```

Let me close the issue as stale; @r-zenine, feel free to reopen if you need more help with that. Thanks!
