Hybrid quantum-classical neural network for sentiment analysis

Dimitrios Makris; Filippo Caruso; Giacomo Cappiello; Xing Liang

arxiv: 2607.01943 · v1 · pith:6Q4NJ3GYnew · submitted 2026-07-02 · 💻 cs.LG · quant-ph

Hybrid quantum-classical neural network for sentiment analysis

Giacomo Cappiello , Filippo Caruso , Xing Liang , Dimitrios Makris This is my paper

Pith reviewed 2026-07-03 17:27 UTC · model grok-4.3

classification 💻 cs.LG quant-ph

keywords hybrid quantum-classical neural networkssentiment analysistransfer learningCOVID-19 tweetsTF-IDFquantum machine learningSMS spam classificationnatural language processing

0 comments

The pith

Hybrid quantum-classical neural networks match classical accuracy on COVID-19 tweet sentiment while outperforming by 15 points on transferred SMS spam classification.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

This paper tests hybrid quantum-classical neural networks on sentiment analysis using a COVID-19 tweet dataset. Text is converted to TF-IDF vectors and passed to both classical feedforward networks and hybrid models that add parameterized quantum circuits. The hybrids reach accuracy levels comparable to the classical baseline yet display different validation loss and accuracy trajectories. When the same models are transferred to an SMS spam classification task the hybrids raise accuracy on the spam class from 66 percent to 81 percent. The results indicate that quantum machine learning methods are workable for natural language processing and may deliver better generalization.

Core claim

Hybrid models achieve accuracy comparable to the classical baseline on COVID-19 tweet sentiment analysis while, under transfer learning to SMS spam classification, they outperform the classical counterpart by 15 percentage points on the spam class (66% to 81%). The hybrid architectures incorporating parameterized quantum circuits exhibit distinct learning dynamics suggestive of richer representational capacity.

What carries the argument

Parameterized quantum circuits integrated into feedforward neural networks that receive TF-IDF vectorized text.

If this is right

Hybrid models reach accuracy levels comparable to classical networks on the original COVID-19 sentiment task.
Hybrid models exhibit stronger generalization when the learned weights are applied to a new text-classification problem.
Validation curves of hybrid models differ from classical curves in ways consistent with greater internal representational power.
Quantum circuits can be embedded in standard NLP pipelines that begin with TF-IDF features.
Further hardware improvements could widen the observed generalization advantage.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same hybrid construction could be tested on other text-classification problems that involve domain shift.
Replacing TF-IDF with learned embeddings might alter or amplify the reported transfer-learning benefit.
A controlled ablation that isolates circuit depth or entanglement structure would clarify which quantum feature drives the gain.
Scaling the approach to larger corpora would test whether the 15-point margin persists or grows.

Load-bearing premise

Observed transfer-learning gains arise from the hybrid architecture rather than from unstated differences in model size, optimizer settings, or random seeds.

What would settle it

A re-run of the transfer-learning experiments in which classical and hybrid models are forced to identical parameter counts, optimizer choices, and random seeds, after which the 15-point spam-class gap disappears.

read the original abstract

Quantum machine learning has recently emerged as a promising paradigm that leverages the expressive power of quantum circuits to address complex learning tasks. In this work, we investigate the applicability of hybrid quantum-classical neural networks to sentiment analysis, a central problem in natural language processing. We focus on a dataset of tweets related to COVID-19, where the textual content is vectorized using TF-IDF and fed into both classical feedforward networks and hybrid architectures incorporating parameterized quantum circuits. Our results show that hybrid models can achieve accuracy comparable to the classical baseline, while exhibiting distinct learning dynamics, especially in terms of validation loss and accuracy, that suggest a richer representational capacity. Moreover, when applying transfer learning to an SMS spam classification task, the hybrid models consistently outperform the classical counterpart, achieving an accuracy increase of 15 percentage points (from 66% to 81%) on the spam class, demonstrating enhanced generalization. These findings highlight the feasibility of employing QML for natural language processing and point toward the potential advantages of hybrid models as quantum hardware continues to advance.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

15pp transfer gain on spam class is the concrete claim, but no sign the classical baseline was matched on capacity or training details.

read the letter

The headline observation is that hybrid quantum-classical networks match a classical feedforward baseline on COVID-19 tweet sentiment but then improve by 15 points on the spam class after transfer to SMS spam detection. That number is specific and could be useful if it holds up under proper controls.

The paper applies a standard hybrid architecture to TF-IDF features and runs the transfer experiment. It also notes distinct validation loss and accuracy curves for the hybrid version, which at least documents a difference in training behavior. The transfer result is framed as evidence of better generalization from the quantum component.

The soft spot is the complete absence of any reported matching between the two model families. The abstract gives no information on parameter counts, optimizer settings, learning-rate schedules, or random seeds. If those differed, the 15-point gap cannot be read as evidence for the hybrid architecture. No error bars or statistical tests appear either, so the numerical claim rests on unreported choices.

This paper is for readers who track early empirical tests of quantum machine learning on text tasks. Someone already working on transfer in NLP would want the full methods to judge whether the gain is reproducible. It is not yet ready to cite as support for hybrid advantage, but the claim is narrow enough that a referee could reasonably ask for the missing baseline details and any additional matched runs.

Send it to review; the experimental gap is fixable if the authors have the data.

Referee Report

3 major / 2 minor

Summary. The paper investigates hybrid quantum-classical neural networks for sentiment analysis on a COVID-19 tweet dataset, with TF-IDF vectorization fed into both classical feedforward networks and hybrid models incorporating parameterized quantum circuits. It claims that the hybrid models achieve accuracy comparable to the classical baseline while showing distinct learning dynamics, and that under transfer learning to SMS spam classification the hybrid models outperform the classical counterpart by 15 percentage points on the spam class (66% to 81%), indicating enhanced generalization.

Significance. If the experimental controls for model capacity, training details, and statistical significance are properly documented and the performance deltas hold, the work could provide initial evidence that hybrid quantum-classical architectures offer advantages in transfer learning for NLP tasks beyond what classical networks achieve. The absence of these controls currently prevents assessment of whether the reported gains are attributable to the quantum components.

major comments (3)

Abstract: the central claim of a 15pp transfer-learning gain on the spam class (66% to 81%) is presented without any description of the quantum circuit (qubit count, ansatz, depth), training protocol, hyperparameter matching between classical and hybrid runs, or statistical tests/error bars; these omissions make the attribution to the hybrid architecture impossible to evaluate.
Abstract/Results: the assertion that hybrid models exhibit 'distinct learning dynamics' and 'richer representational capacity' is unsupported by any reported validation curves, quantitative metrics, or ablation studies that isolate the quantum circuit's contribution from other implementation choices.
Abstract: no evidence is supplied that classical and hybrid models were matched in parameter count, optimizer, learning-rate schedule, or random seeds, which is required to rule out the possibility that the transfer gain arises from unmatched capacity or training details rather than the hybrid architecture.

minor comments (2)

Add a figure or section detailing the hybrid network architecture and quantum circuit parameterization.
Include reproducibility details such as random seeds, exact hyperparameter values, and dataset splits.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for the detailed and constructive report. The comments correctly identify gaps in documentation and evidentiary support that must be addressed for the claims to be properly evaluable. We respond point-by-point below and will revise the manuscript to incorporate the requested details and supporting analyses.

read point-by-point responses

Referee: Abstract: the central claim of a 15pp transfer-learning gain on the spam class (66% to 81%) is presented without any description of the quantum circuit (qubit count, ansatz, depth), training protocol, hyperparameter matching between classical and hybrid runs, or statistical tests/error bars; these omissions make the attribution to the hybrid architecture impossible to evaluate.

Authors: We agree that the abstract omits essential technical specifications. In the revised version we will add a concise description of the qubit count, ansatz, and circuit depth, note the training protocol, confirm hyperparameter matching, and reference the statistical tests and error bars that appear in the results section. revision: yes
Referee: Abstract/Results: the assertion that hybrid models exhibit 'distinct learning dynamics' and 'richer representational capacity' is unsupported by any reported validation curves, quantitative metrics, or ablation studies that isolate the quantum circuit's contribution from other implementation choices.

Authors: The manuscript currently reports differences in validation loss and accuracy but does not supply curves, quantitative metrics, or ablations. We will add these elements in the revision, including validation curves and ablation experiments that isolate the parameterized quantum circuit. revision: yes
Referee: Abstract: no evidence is supplied that classical and hybrid models were matched in parameter count, optimizer, learning-rate schedule, or random seeds, which is required to rule out the possibility that the transfer gain arises from unmatched capacity or training details rather than the hybrid architecture.

Authors: Matching of parameter count, optimizer, learning-rate schedule, and random seeds was performed in the experiments but not explicitly documented. We will revise the methods and results sections to provide this documentation together with the associated statistical analysis. revision: yes

Circularity Check

0 steps flagged

No circularity in empirical model comparisons

full rationale

The paper reports experimental accuracies from training and evaluating hybrid quantum-classical networks versus classical feedforward networks on held-out COVID-19 tweet and SMS spam datasets. No derivation chain, first-principles predictions, or equations exist that could reduce to inputs by construction. Results are obtained via standard supervised training and transfer learning on external benchmarks, with no self-citation load-bearing any uniqueness claim or fitted parameter renamed as a prediction. The work is self-contained.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

No mathematical derivations, free parameters, or new entities appear in the abstract. The central claim rests entirely on the validity of an unreported experimental pipeline.

pith-pipeline@v0.9.1-grok · 5709 in / 932 out tokens · 32318 ms · 2026-07-03T17:27:03.727333+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

40 extracted references · 6 canonical work pages · 3 internal anchors

[1]

Foundations and Trends in Information Retrieval2(1–2), 1–135 (2008)

Pang, B., Lee, L.: Opinion mining and sentiment analysis. Foundations and Trends in Information Retrieval2(1–2), 1–135 (2008)

2008
[2]

Synthesis Lectures on Human Language Technologies

Liu, B.: Sentiment Analysis and Opinion Mining. Synthesis Lectures on Human Language Technologies. Morgan & Claypool Publishers, San Rafael, CA (2012)

2012
[3]

Journal of the American Society for Information Science and Technology62(2), 406–418 (2011)

Thelwall, M., Buckley, K., Paltoglou, G.: Sentiment in twitter events. Journal of the American Society for Information Science and Technology62(2), 406–418 (2011)

2011
[4]

Ain Shams Engineering Journal5(4), 1093–1113 (2014)

Medhat, W., Hassan, A., Korashy, H.: Sentiment analysis algorithms and applications: A survey. Ain Shams Engineering Journal5(4), 1093–1113 (2014)

2014
[5]

In: Interna- tional Semantic Web Conference, pp

Saif, H., He, Y., Alani, H.: Semantic sentiment analysis of twitter. In: Interna- tional Semantic Web Conference, pp. 508–524. Springer, ??? (2013)

2013
[6]

In: Proceedings of LREC (2010)

Pak, A., Paroubek, P.: Twitter as a corpus for sentiment analysis and opinion mining. In: Proceedings of LREC (2010)

2010
[7]

In: Proceedings of EMNLP, pp

Pang, B., Lee, L., Vaithyanathan, S.: Thumbs up?: sentiment classification using machine learning techniques. In: Proceedings of EMNLP, pp. 79–86 (2002)

2002
[8]

In: EMNLP, pp

Kim, Y.: Convolutional neural networks for sentence classification. In: EMNLP, pp. 1746–1751 (2014)

2014
[9]

In: EMNLP, pp

Tang, D., Qin, B., Liu, T.: Document modeling with gated recurrent neural network for sentiment classification. In: EMNLP, pp. 1422–1432 (2015)

2015
[10]

In: Advances in Neural Information Processing Systems, pp

Vaswani, A.,et al.: Attention is all you need. In: Advances in Neural Information Processing Systems, pp. 5998–6008 (2017)

2017
[11]

Nature Reviews Physics3(9), 625–644 (2021)

Cerezo, M., Arrasmith, A., Babbush, R., Benjamin, S.C., Endo, S., Fujii, K., McClean, J.R., Mitarai, K., Yuan, X., Cincio, L., Coles, P.J.: Variational quantum 17 algorithms. Nature Reviews Physics3(9), 625–644 (2021)

2021
[12]

Physical Review Letters122(4), 040504 (2019)

Schuld, M., Killoran, N.: Quantum machine learning in feature hilbert spaces. Physical Review Letters122(4), 040504 (2019)

2019
[13]

Nature Computational Science1(6), 403–409 (2021)

Abbas, A., Sutter, D., Zoufal, C., Lucchi, A., Slater, A., Wiebe, N., Gacon, J., Woerner, S.: The power of quantum neural networks. Nature Computational Science1(6), 403–409 (2021)

2021
[14]

Nature567(7747), 209–212 (2019) https://doi.org/10.1038/ s41586-019-0980-2

Havl´ ıˇ cek, V., C´ orcoles, A.D., Temme, K., Harrow, A.W., Kandala, A., Chow, J.M., Gambetta, J.M.: Supervised learning with quantum-enhanced feature spaces. Nature567(7747), 209–212 (2019) https://doi.org/10.1038/ s41586-019-0980-2

2019
[15]

Nature Physics15(12), 1273–1278 (2019)

Cong, I., Choi, S., Lukin, M.D.: Quantum convolutional neural networks. Nature Physics15(12), 1273–1278 (2019)

2019
[16]

Physical Review X8(2), 021050 (2018)

Amin, M.,et al.: Quantum boltzmann machine. Physical Review X8(2), 021050 (2018)

2018
[17]

arXiv preprint arXiv:2102.15115 (2021)

Wang, H., Radin, A., Zeng, W.J.: Quantum natural language processing. arXiv preprint arXiv:2102.15115 (2021)

work page arXiv 2021
[18]

Pro- ceedings of the first instructional conference on machine learning242, 133–142 (2003)

Ramos, J.: Using tf-idf to determine word relevance in document queries. Pro- ceedings of the first instructional conference on machine learning242, 133–142 (2003)

2003
[19]

In: European Conference on Machine Learning, pp

Joachims, T.: Text categorization with support vector machines: Learning with many relevant features. In: European Conference on Machine Learning, pp. 137– 142 (1998). Springer

1998
[20]

Efficient Estimation of Word Representations in Vector Space

Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781 (2013)

work page internal anchor Pith review Pith/arXiv arXiv 2013
[21]

Information Science and Statistics, vol

Bishop, C.M., Nasrabadi, N.M.: Pattern Recognition and Machine Learning. Information Science and Statistics, vol. 4. Springer, New York, NY (2006)

2006
[22]

Adaptive Computation and Machine Learning

Goodfellow, I., Bengio, Y., Courville, A.: Deep Learning. Adaptive Computation and Machine Learning. MIT Press, Cambridge, MA (2016)

2016
[23]

Advances in neural information processing systems27 (2014)

Livni, R., Shalev-Shwartz, S., Shamir, O.: On the computational efficiency of training neural networks. Advances in neural information processing systems27 (2014)

2014
[24]

The journal of machine learning research15(1), 1929–1958 (2014) 18

Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, I., Salakhutdinov, R.: Dropout: a simple way to prevent neural networks from overfitting. The journal of machine learning research15(1), 1929–1958 (2014) 18

1929
[25]

In: Proceedings of the 27th International Conference on Machine Learning (ICML-10), pp

Nair, V., Hinton, G.E.: Rectified linear units improve restricted boltzmann machines. In: Proceedings of the 27th International Conference on Machine Learning (ICML-10), pp. 807–814 (2010)

2010
[26]

arXiv preprint arXiv:2002.04060 (2020)

Asadi, B., Jiang, H.: On approximation capabilities of relu activation and softmax output layer in neural networks. arXiv preprint arXiv:2002.04060 (2020)

work page arXiv 2002
[27]

Physical Review A98(3), 032309 (2018)

Mitarai, K., Negoro, M., Kitagawa, M., Fujii, K.: Quantum circuit learning. Physical Review A98(3), 032309 (2018)

2018
[28]

Physical Review A99(3), 032331 (2019)

Schuld, M., Bergholm, V., Gogolin, C., Izaac, J., Killoran, N.: Evaluating analytic gradients on quantum hardware. Physical Review A99(3), 032331 (2019)

2019
[29]

Cambridge University Press, 9781107002173 (2010)

Nielsen, M.A., Chuang, I.L.: Quantum Computation and Quantum Information: 10th Anniversary Edition. Cambridge University Press, 9781107002173 (2010)

2010
[30]

nature323(6088), 533–536 (1986)

Rumelhart, D.E., Hinton, G.E., Williams, R.J.: Learning representations by back- propagating errors. nature323(6088), 533–536 (1986)

1986
[31]

PennyLane: Automatic differentiation of hybrid quantum-classical computations

Bergholm, V., Izaac, J., Schuld, M., Gogolin, C., Blank, C., McKiernan, K., Killoran, N.: Pennylane: Automatic differentiation of hybrid quantum-classical computations. arXiv preprint arXiv:1811.04968 (2023)

work page internal anchor Pith review Pith/arXiv arXiv 2023
[32]

In: International Conference on Machine Learning, pp

Mao, A., Mohri, M., Zhong, Y.: Cross-entropy loss functions: Theoretical analysis and applications. In: International Conference on Machine Learning, pp. 23803– 23828 (2023). pmlr

2023
[33]

Adam: A Method for Stochastic Optimization

Kingma, D.P., Ba, J.: Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014)

work page internal anchor Pith review Pith/arXiv arXiv 2014
[34]

IEEE Transactions on knowledge and data engineering22(10), 1345–1359 (2009)

Pan, S.J., Yang, Q.: A survey on transfer learning. IEEE Transactions on knowledge and data engineering22(10), 1345–1359 (2009)

2009
[35]

arXiv e-prints (2018)

Du, Y., Hsieh, M., Liu, T., Tao, D.: The expressive power of parameterized quantum circuits. arXiv e-prints (2018)

2018
[36]

Plos one15(10), 0235885 (2020)

Johnson, J.E., Laparra, V., P´ erez-Suay, A., Mahecha, M.D., Camps-Valls, G.: Kernel methods and their derivatives: Concept and perspectives for the earth system sciences. Plos one15(10), 0235885 (2020)

2020
[37]

In: ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp

Chen, S.Y.-C., Yoo, S., Fang, Y.-L.L.: Quantum long short-term memory. In: ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 8622–8626 (2022). IEEE

2022
[38]

Journal of Physics Communications8(8), 085004 (2024)

Ceschini, A., Rosato, A., Panella, M.: A variational approach to quantum gated recurrent units. Journal of Physics Communications8(8), 085004 (2024)

2024
[39]

arXiv 19 preprint arXiv:2011.07319 (2020)

Sakuma, T.: Application of deep quantum neural networks to finance. arXiv 19 preprint arXiv:2011.07319 (2020)

work page arXiv 2011
[40]

Electronics14(9), 1827 (2025) 20

Eze, L., Chaudhry, U.B., Jahankhani, H.: Quantum-enhanced machine learn- ing for cybersecurity: Evaluating malicious url detection. Electronics14(9), 1827 (2025) 20

2025

[1] [1]

Foundations and Trends in Information Retrieval2(1–2), 1–135 (2008)

Pang, B., Lee, L.: Opinion mining and sentiment analysis. Foundations and Trends in Information Retrieval2(1–2), 1–135 (2008)

2008

[2] [2]

Synthesis Lectures on Human Language Technologies

Liu, B.: Sentiment Analysis and Opinion Mining. Synthesis Lectures on Human Language Technologies. Morgan & Claypool Publishers, San Rafael, CA (2012)

2012

[3] [3]

Journal of the American Society for Information Science and Technology62(2), 406–418 (2011)

Thelwall, M., Buckley, K., Paltoglou, G.: Sentiment in twitter events. Journal of the American Society for Information Science and Technology62(2), 406–418 (2011)

2011

[4] [4]

Ain Shams Engineering Journal5(4), 1093–1113 (2014)

Medhat, W., Hassan, A., Korashy, H.: Sentiment analysis algorithms and applications: A survey. Ain Shams Engineering Journal5(4), 1093–1113 (2014)

2014

[5] [5]

In: Interna- tional Semantic Web Conference, pp

Saif, H., He, Y., Alani, H.: Semantic sentiment analysis of twitter. In: Interna- tional Semantic Web Conference, pp. 508–524. Springer, ??? (2013)

2013

[6] [6]

In: Proceedings of LREC (2010)

Pak, A., Paroubek, P.: Twitter as a corpus for sentiment analysis and opinion mining. In: Proceedings of LREC (2010)

2010

[7] [7]

In: Proceedings of EMNLP, pp

Pang, B., Lee, L., Vaithyanathan, S.: Thumbs up?: sentiment classification using machine learning techniques. In: Proceedings of EMNLP, pp. 79–86 (2002)

2002

[8] [8]

In: EMNLP, pp

Kim, Y.: Convolutional neural networks for sentence classification. In: EMNLP, pp. 1746–1751 (2014)

2014

[9] [9]

In: EMNLP, pp

Tang, D., Qin, B., Liu, T.: Document modeling with gated recurrent neural network for sentiment classification. In: EMNLP, pp. 1422–1432 (2015)

2015

[10] [10]

In: Advances in Neural Information Processing Systems, pp

Vaswani, A.,et al.: Attention is all you need. In: Advances in Neural Information Processing Systems, pp. 5998–6008 (2017)

2017

[11] [11]

Nature Reviews Physics3(9), 625–644 (2021)

Cerezo, M., Arrasmith, A., Babbush, R., Benjamin, S.C., Endo, S., Fujii, K., McClean, J.R., Mitarai, K., Yuan, X., Cincio, L., Coles, P.J.: Variational quantum 17 algorithms. Nature Reviews Physics3(9), 625–644 (2021)

2021

[12] [12]

Physical Review Letters122(4), 040504 (2019)

Schuld, M., Killoran, N.: Quantum machine learning in feature hilbert spaces. Physical Review Letters122(4), 040504 (2019)

2019

[13] [13]

Nature Computational Science1(6), 403–409 (2021)

Abbas, A., Sutter, D., Zoufal, C., Lucchi, A., Slater, A., Wiebe, N., Gacon, J., Woerner, S.: The power of quantum neural networks. Nature Computational Science1(6), 403–409 (2021)

2021

[14] [14]

Nature567(7747), 209–212 (2019) https://doi.org/10.1038/ s41586-019-0980-2

Havl´ ıˇ cek, V., C´ orcoles, A.D., Temme, K., Harrow, A.W., Kandala, A., Chow, J.M., Gambetta, J.M.: Supervised learning with quantum-enhanced feature spaces. Nature567(7747), 209–212 (2019) https://doi.org/10.1038/ s41586-019-0980-2

2019

[15] [15]

Nature Physics15(12), 1273–1278 (2019)

Cong, I., Choi, S., Lukin, M.D.: Quantum convolutional neural networks. Nature Physics15(12), 1273–1278 (2019)

2019

[16] [16]

Physical Review X8(2), 021050 (2018)

Amin, M.,et al.: Quantum boltzmann machine. Physical Review X8(2), 021050 (2018)

2018

[17] [17]

arXiv preprint arXiv:2102.15115 (2021)

Wang, H., Radin, A., Zeng, W.J.: Quantum natural language processing. arXiv preprint arXiv:2102.15115 (2021)

work page arXiv 2021

[18] [18]

Pro- ceedings of the first instructional conference on machine learning242, 133–142 (2003)

Ramos, J.: Using tf-idf to determine word relevance in document queries. Pro- ceedings of the first instructional conference on machine learning242, 133–142 (2003)

2003

[19] [19]

In: European Conference on Machine Learning, pp

Joachims, T.: Text categorization with support vector machines: Learning with many relevant features. In: European Conference on Machine Learning, pp. 137– 142 (1998). Springer

1998

[20] [20]

Efficient Estimation of Word Representations in Vector Space

Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781 (2013)

work page internal anchor Pith review Pith/arXiv arXiv 2013

[21] [21]

Information Science and Statistics, vol

Bishop, C.M., Nasrabadi, N.M.: Pattern Recognition and Machine Learning. Information Science and Statistics, vol. 4. Springer, New York, NY (2006)

2006

[22] [22]

Adaptive Computation and Machine Learning

Goodfellow, I., Bengio, Y., Courville, A.: Deep Learning. Adaptive Computation and Machine Learning. MIT Press, Cambridge, MA (2016)

2016

[23] [23]

Advances in neural information processing systems27 (2014)

Livni, R., Shalev-Shwartz, S., Shamir, O.: On the computational efficiency of training neural networks. Advances in neural information processing systems27 (2014)

2014

[24] [24]

The journal of machine learning research15(1), 1929–1958 (2014) 18

Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, I., Salakhutdinov, R.: Dropout: a simple way to prevent neural networks from overfitting. The journal of machine learning research15(1), 1929–1958 (2014) 18

1929

[25] [25]

In: Proceedings of the 27th International Conference on Machine Learning (ICML-10), pp

Nair, V., Hinton, G.E.: Rectified linear units improve restricted boltzmann machines. In: Proceedings of the 27th International Conference on Machine Learning (ICML-10), pp. 807–814 (2010)

2010

[26] [26]

arXiv preprint arXiv:2002.04060 (2020)

Asadi, B., Jiang, H.: On approximation capabilities of relu activation and softmax output layer in neural networks. arXiv preprint arXiv:2002.04060 (2020)

work page arXiv 2002

[27] [27]

Physical Review A98(3), 032309 (2018)

Mitarai, K., Negoro, M., Kitagawa, M., Fujii, K.: Quantum circuit learning. Physical Review A98(3), 032309 (2018)

2018

[28] [28]

Physical Review A99(3), 032331 (2019)

Schuld, M., Bergholm, V., Gogolin, C., Izaac, J., Killoran, N.: Evaluating analytic gradients on quantum hardware. Physical Review A99(3), 032331 (2019)

2019

[29] [29]

Cambridge University Press, 9781107002173 (2010)

Nielsen, M.A., Chuang, I.L.: Quantum Computation and Quantum Information: 10th Anniversary Edition. Cambridge University Press, 9781107002173 (2010)

2010

[30] [30]

nature323(6088), 533–536 (1986)

Rumelhart, D.E., Hinton, G.E., Williams, R.J.: Learning representations by back- propagating errors. nature323(6088), 533–536 (1986)

1986

[31] [31]

PennyLane: Automatic differentiation of hybrid quantum-classical computations

Bergholm, V., Izaac, J., Schuld, M., Gogolin, C., Blank, C., McKiernan, K., Killoran, N.: Pennylane: Automatic differentiation of hybrid quantum-classical computations. arXiv preprint arXiv:1811.04968 (2023)

work page internal anchor Pith review Pith/arXiv arXiv 2023

[32] [32]

In: International Conference on Machine Learning, pp

Mao, A., Mohri, M., Zhong, Y.: Cross-entropy loss functions: Theoretical analysis and applications. In: International Conference on Machine Learning, pp. 23803– 23828 (2023). pmlr

2023

[33] [33]

Adam: A Method for Stochastic Optimization

Kingma, D.P., Ba, J.: Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014)

work page internal anchor Pith review Pith/arXiv arXiv 2014

[34] [34]

IEEE Transactions on knowledge and data engineering22(10), 1345–1359 (2009)

Pan, S.J., Yang, Q.: A survey on transfer learning. IEEE Transactions on knowledge and data engineering22(10), 1345–1359 (2009)

2009

[35] [35]

arXiv e-prints (2018)

Du, Y., Hsieh, M., Liu, T., Tao, D.: The expressive power of parameterized quantum circuits. arXiv e-prints (2018)

2018

[36] [36]

Plos one15(10), 0235885 (2020)

Johnson, J.E., Laparra, V., P´ erez-Suay, A., Mahecha, M.D., Camps-Valls, G.: Kernel methods and their derivatives: Concept and perspectives for the earth system sciences. Plos one15(10), 0235885 (2020)

2020

[37] [37]

In: ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp

Chen, S.Y.-C., Yoo, S., Fang, Y.-L.L.: Quantum long short-term memory. In: ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 8622–8626 (2022). IEEE

2022

[38] [38]

Journal of Physics Communications8(8), 085004 (2024)

Ceschini, A., Rosato, A., Panella, M.: A variational approach to quantum gated recurrent units. Journal of Physics Communications8(8), 085004 (2024)

2024

[39] [39]

arXiv 19 preprint arXiv:2011.07319 (2020)

Sakuma, T.: Application of deep quantum neural networks to finance. arXiv 19 preprint arXiv:2011.07319 (2020)

work page arXiv 2011

[40] [40]

Electronics14(9), 1827 (2025) 20

Eze, L., Chaudhry, U.B., Jahankhani, H.: Quantum-enhanced machine learn- ing for cybersecurity: Evaluating malicious url detection. Electronics14(9), 1827 (2025) 20

2025