Feature-extraction pipeline to return Tensor #10016

ierezell · 2021-02-04T22:12:05Z

🚀 Feature request

Actually, to code of the feature-extraction pipeline
transformers.pipelines.feature-extraction.FeatureExtractionPipeline l.82 return a super().__call__(*args, **kwargs).tolist()

Which gives a list[float] (or list[list[float]] if list[str] in input)

I guess it's to be framework agnostic, but we can specify framework='pt' in the pipeline config so I was expecting a torch.tensor.

Could we add some logic to return tensors ?

Motivation

Features will be used as input of other models, so keeping them as tensors (even better on GPU) would be profitable.

Thanks in advance for the reply,

Have a great day.

The text was updated successfully, but these errors were encountered:

LysandreJik · 2021-02-05T06:57:33Z

Hello! Indeed, this is a valid request. Would you like to open a PR and take a stab at it?

ierezell · 2021-02-05T14:17:08Z

@LysandreJik Hi, thanks for the fast reply !

Ok will do that :)
I will comment here when the PR will be ready

ak314 · 2021-03-19T13:48:33Z

Hi @LysandreJik is there any update on this issue? If @ierezell didn't have time, I might be able to give a shot at it in the next days

github-actions · 2021-04-14T15:04:23Z

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.

steysie · 2022-05-27T15:41:08Z

Hi!
Is this issue somewhere in consideration still?
Would be awesome to be able to get tensors from the feature extraction pipeline

LysandreJik · 2022-05-31T08:30:38Z

I think we'd still be open to that; WDYT @Narsil?

Narsil · 2022-05-31T10:17:01Z

Sure !

Would adding an argument return_type= "tensors" be OK ? That way we can enable this feature without breaking backward compatibility ?

ajsanjoaquin · 2022-09-30T13:14:37Z

I'm baffled as to why returning the features as a list is the default behavior in the first place... Isn't one common usage of feature extraction to provide an input to another model, which means it is preferred to keep it as a tensor?

Narsil · 2022-10-17T09:49:50Z

@ajsanjoaquin

Well it depends, not necessarily. Another very common use case is to feed it to some feature database for querying later.
Those database engines are not necessarily expecting the same kind of tensors that you are sending.

But I kind of agree that it should be at least a numpy.array because usually conversions between numpy and PT or TF is basically free, meaning it would be much easier to use that way.

Some pipeline were added a long time ago where the current situation was not as clear as today, and since we are very conservative regarding breaking changes, that can explain why some defaults are the way they are.

If/When v5 is getting prepared there would be a lot of small but breaking changes in that regard.

github-actions bot closed this as completed Apr 23, 2021

ajsanjoaquin mentioned this issue Sep 30, 2022

add return_tensor parameter for feature extraction #19257

Merged

This was referenced Oct 17, 2022

add return_tensor parameter for feature extraction #19682

Closed

add return_tensors parameter for feature_extraction 2 #19707

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature-extraction pipeline to return Tensor #10016

Feature-extraction pipeline to return Tensor #10016

ierezell commented Feb 4, 2021

LysandreJik commented Feb 5, 2021

ierezell commented Feb 5, 2021

ak314 commented Mar 19, 2021

github-actions bot commented Apr 14, 2021

steysie commented May 27, 2022

LysandreJik commented May 31, 2022

Narsil commented May 31, 2022

ajsanjoaquin commented Sep 30, 2022

Narsil commented Oct 17, 2022

Feature-extraction pipeline to return Tensor #10016

Feature-extraction pipeline to return Tensor #10016

Comments

ierezell commented Feb 4, 2021

🚀 Feature request

Motivation

LysandreJik commented Feb 5, 2021

ierezell commented Feb 5, 2021

ak314 commented Mar 19, 2021

github-actions bot commented Apr 14, 2021

steysie commented May 27, 2022

LysandreJik commented May 31, 2022

Narsil commented May 31, 2022

ajsanjoaquin commented Sep 30, 2022

Narsil commented Oct 17, 2022