[1506.02025] Spatial Transformer Networks
arxiv.org › abs › 1506Jun 05, 2015 · Convolutional Neural Networks define an exceptionally powerful class of models, but are still limited by the lack of ability to be spatially invariant to the input data in a computationally and parameter efficient manner. In this work we introduce a new learnable module, the Spatial Transformer, which explicitly allows the spatial manipulation of data within the network. This differentiable ...
Spatial Transformer Networks - NeurIPS
proceedings.neurips.cc › paper › 2015The action of the spatial transformer is conditioned on individual data samples, with the appropriate behaviour learnt during training for the task in question (without extra supervision). Unlike pooling layers, where the re-ceptive fields are fixed and local, the spatial transformer module is a dynamic mechanism that can
[1506.02025] Spatial Transformer Networks
https://arxiv.org/abs/1506.0202505.06.2015 · Download PDF Abstract: Convolutional Neural Networks define an exceptionally powerful class of models, but are still limited by the lack of ability to be spatially invariant to the input data in a computationally and parameter efficient manner. In this work we introduce a new learnable module, the Spatial Transformer, which explicitly allows the spatial manipulation of …
Spatial Transformer Networks
courses.cs.duke.edu › spring19 › compsci5273 Spatial Transformers In this section we describe the formulation of a spatial transformer. This is a differentiable module which applies a spatial transformation to a feature map during a single forward pass, where the transformation is conditioned on the particular input, producing a single output feature map. For
Spatial Transformer Explained | Papers With Code
paperswithcode.com › method › spatial-transformerA Spatial Transformer is an image model block that explicitly allows the spatial manipulation of data within a convolutional neural network. It gives CNNs the ability to actively spatially transform feature maps, conditional on the feature map itself, without any extra training supervision or modification to the optimisation process. Unlike pooling layers, where the receptive fields are fixed ...