Skip to content

Latest commit

 

History

History
24 lines (15 loc) · 1.52 KB

README.md

File metadata and controls

24 lines (15 loc) · 1.52 KB

Pyramid Pooling implemented in PyTorch

This Module implements Spatial Pyramid Pooling (SPP) and Temporal Pyramid Pooling (TPP) as described in different papers.

SPP-TPP Comparison

Temporal Pyramid Pooling:

Sudholt, Fink: Evaluating Word String Embeddings and LossFunctions for CNN-based Word Spotting

Principle

Given an 2D input Tensor, Temporal Pyramid Pooling divides the input in x stripes which extend through the height of the image and width of roughly (input_width / x). These stripes are then each pooled with max- or avg-pooling to calculate the output.

Animated Principle

TPP Visualization

Spatial Pyramid Pooling:

He, et. al.: Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition

Principle

Given an 2D input Tensor, Spatial Pyramid Pooling divides the input in rectangles with height of roughly (input_height / x) and width of roughly (input_width / x). These rectangles are then each pooled with max- or avg-pooling to calculate the output.