Effects of padding on LSTMs and CNNs
read the original abstract
Long Short-Term Memory (LSTM) Networks and Convolutional Neural Networks (CNN) have become very common and are used in many fields as they were effective in solving many problems where the general neural networks were inefficient. They were applied to various problems mostly related to images and sequences. Since LSTMs and CNNs take inputs of the same length and dimension, input images and sequences are padded to maximum length while testing and training. This padding can affect the way the networks function and can make a great deal when it comes to performance and accuracies. This paper studies this and suggests the best way to pad an input sequence. This paper uses a simple sentiment analysis task for this purpose. We use the same dataset on both the networks with various padding to show the difference. This paper also discusses some preprocessing techniques done on the data to ensure effective analysis of the data.
This paper has not been read by Pith yet.
Forward citations
Cited by 2 Pith papers
-
Clustering in pure-attention hardmax transformers and its role in sentiment analysis
Hardmax transformers converge to leader-determined clusters, enabling an interpretable model for sentiment analysis.
-
Linear Models, Variable Selection, Artificial Intelligence
An ANN trained on OLS estimates performs variable selection in linear models, with simulation studies showing accuracy across sample sizes and variances and competitive results versus Forward, Backward, AIC, BIC, and ...
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.