-
Notifications
You must be signed in to change notification settings - Fork 945
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ROIPooling layer in fast and faster R-CNN #565
Comments
It looks like this is just max-pooling with a pool size dependent on the input, so that the output always has the same size (e.g. 7x7)? That should be fairly simple to implement in pure Theano. And since it's unlikely to be a very time-consuming part of a network, making a faster C implementation probably isn't worth it. |
But the input of ROIPooling layer in a batch is several object proposal sub-windows of the same image. They have different sizes and all max-pooling to the same size (e.g. 7x7) with different pooling sizes. So i think it's not very easy to implement. |
Right, in that case it's going to be tough to avoid scan, or something like that. A custom CUDA kernel might even be worth considering (it's fairly easy to wrap them in Theano using PyCUDA). |
Can't it be emulated via |
The |
Fair point. Somehow I missed that it is called pooling for reason. Anyway I am messing around faster rcnn and I almost finished implementation of ROI 'pooling' via |
Has anything been done for this issue? |
No, but the deepdetect issue linking to ours has a Theano implementation posted: https://github.com/ddtm/theano-roi-pooling |
That seems reasonable. Thanks! |
Feel free to submit a PR to Lasagne/Recipes when you got it working, or send a PR to Theano for that Op and ping us back! |
Hi, I have made a draft here, Theano/Theano#5189 . Could you please have a look and let me know if the Op is implemented correctly? Ramana |
Hi, If anyone is interested, the code for |
Now Fast R-CNN and Faster R-CNN are start-of-the-art image detection methods. The most important component of these method is a ROI pooling layer and the authors implemented it in caffe.
I find it may be difficult to implement the ROI pooling layer using the ops in theano. Is there anyone has any idea? Or we can only implement it with C extension?
The text was updated successfully, but these errors were encountered: