Video action classification with Attention and LSTM - vision ...
discuss.pytorch.org › t › video-actionJan 21, 2022 · I’m working on a video action classification problem. The videos are in the form of sequences of images. Basically, features are extracted from the images using ResNet, these features are fed into an additive attention mechanism, the attention context are combined with the image features and fed into an LSTM, and its outputs are fed into a classifier. Code below. My questions are: Is my ...