pith. sign in

arxiv: 2506.00239 · v5 · submitted 2025-05-30 · 💻 cs.AI

SmellNet: A Large-scale Dataset for Real-world Smell Recognition

Pith reviewed 2026-05-19 11:52 UTC · model grok-4.3

classification 💻 cs.AI
keywords machine olfactionsmell recognitionchemical sensor datasettime-series classificationtransformer architecturemixture predictiongas sensor signalsreal-world odor data
0
0 comments X

The pith

SmellNet supplies 828,000 time-series readings from 50 substances and 43 mixtures so sensor-based AI can classify smells and predict their compositions.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper creates SmellNet, a large collection of time-series signals gathered by small gas and chemical sensors from everyday items such as nuts, spices, fruits, and their fixed-ratio mixtures. It then presents ScentFormer, a Transformer model that uses temporal differencing and sliding windows to process these signals. With GC-MS supervision the model reaches 63.3 percent top-1 accuracy on substance classification and 50.2 percent top-1 at 0.1 on mixture distribution tasks. A sympathetic reader would care because reliable sensor smell recognition could enable automated allergen checks, continuous manufacturing monitoring, and detection of physiological signals without visual or chemical lab equipment.

Core claim

SmellNet digitizes real-world smells into roughly 828,000 time-series points across 50 substances and 43 mixtures collected over 68 hours; ScentFormer, a Transformer architecture that combines temporal differencing with sliding-window augmentation, then achieves 63.3 percent top-1 accuracy on SmellNet-Base classification when given GC-MS supervision and 50.2 percent top-1@0.1 on the test-seen split for SmellNet-Mixture distribution prediction.

What carries the argument

ScentFormer, a Transformer model that applies temporal differencing and sliding-window augmentation to capture transient chemical dynamics in sensor time-series data.

If this is right

  • Sensor systems could automatically flag allergens such as peanuts or gluten in food samples.
  • Manufacturing lines could monitor ingredient composition in real time without stopping production.
  • Portable devices might detect stress-related or disease-linked volatile compounds through continuous smell sampling.
  • Environmental sensors could track pollutant mixtures by their characteristic odor profiles.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same temporal modeling pattern might transfer to other chemical sensor streams such as air-quality or breath analysis.
  • Larger unlabeled collections of sensor data could support self-supervised pre-training to reduce the need for GC-MS labels.
  • Combining smell data with simultaneous visual or acoustic recordings could produce multimodal systems that cross-verify substance identity.

Load-bearing premise

The small gas and chemical sensors produce time-series signals that reliably encode the chemical identity and composition of the tested substances under the collection conditions used.

What would settle it

Collect new sensor readings of the same substances under changed temperature or humidity and test whether ScentFormer accuracy falls below 30 percent top-1; if it does, the claim that the signals support generalizable smell recognition collapses.

Figures

Figures reproduced from arXiv: 2506.00239 by Alistair Pernigo, Carol Li, Dewei Feng, Paul Pu Liang, Wei Dai, Yunge Wen.

Figure 1
Figure 1. Figure 1: Overview of our smell sensing data collection and modeling pipeline. (a) Sensor hardware setup and data capture. (b) Raw sensor readings over time. (c) AI model predictions on the substance. 1 Introduction Advancements in AI have revolutionized how machines perceive and interact with the world. However, almost all current progress has been limited to the text, visual, and auditory modalities [29, 30]. The … view at source ↗
Figure 2
Figure 2. Figure 2: Overview of the SMELLNET dataset and sensing setup. (a) Our constructed portable smell sensor detects concentrations of various gases and atmospheric factors through 7 multi-channel gas sensors. (b) SMELLNET includes smell sensor readings of 50 substances spanning nuts, spices, herbs, fruits, and vegetables. Of course, smell is a new data modality for AI, with little progress compared to computer vision an… view at source ↗
Figure 3
Figure 3. Figure 3: Data collection pipeline: Each ingredient undergoes six 10-minute sensing sessions across different days, using a controlled environment to minimize external noise. During each session, 12-channel gas sensor data is recorded at 1 Hz and labeled with the ingredient identity, collection time, and associated metadata. We further pair each ingredient with high-resolution GC-MS data to enable multimodal learnin… view at source ↗
Figure 4
Figure 4. Figure 4: PCA projections of raw sensor data. (a) PCA on all substances shows clear clustering by category, with nuts and fruits occupying distinct regions in latent space, suggesting that sensor responses capture broad category-level differences. (b) PCA on only fruits reveals well-separated clusters between bananas, kiwis, lemons, and pears, indicating that the sensor also encodes fine-grained, ingredient-specific… view at source ↗
Figure 5
Figure 5. Figure 5: NO2 sensor readings for angelica and mint, respectively. The two substances exhibit different signal dynamics: angelica shows a rapid increase in the beginning and then a slower increase in later time steps, while mint displays a slower, more linear increase. These patterns highlight the importance of modeling temporal evolution, as discriminative features may lie in both the magnitude and rate of change. … view at source ↗
Figure 6
Figure 6. Figure 6: Overview of the models used in this study. (a) Raw multi-channel time series data is collected using a [PITH_FULL_IMAGE:figures/full_fig_p006_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: Per-ingredient Top-1 Accuracy: evaluated using Contrastive Learning model with first order temporal difference (p = 25 samples) grouped by category. Each bar represents an ingredient, color-coded by its category (Nuts, Spices, Herbs, Fruits, Vegetables). Ingredients are sorted from highest to lowest accuracy. with only 6.67% for nuts classification and 25% for spices classification. Perhaps surprisingly, o… view at source ↗
Figure 8
Figure 8. Figure 8: Sensor hardware architecture diagram. The device includes seven gas sensors covering VOCs, [PITH_FULL_IMAGE:figures/full_fig_p013_8.png] view at source ↗
Figure 9
Figure 9. Figure 9: PCA projections of ingredient-level sensor responses for each major category. Each point represents [PITH_FULL_IMAGE:figures/full_fig_p015_9.png] view at source ↗
Figure 10
Figure 10. Figure 10: Correlation matrix of all 12 sensor channels, computed using Pearson correlation. Strong correlations [PITH_FULL_IMAGE:figures/full_fig_p016_10.png] view at source ↗
Figure 11
Figure 11. Figure 11: PCA of GC-MS elemental composition across ingredient categories. PC1 accounts for 99.5% of the [PITH_FULL_IMAGE:figures/full_fig_p017_11.png] view at source ↗
Figure 12
Figure 12. Figure 12: GC-MS correlation heatmap of elemental counts. Strong positive correlations are observed between [PITH_FULL_IMAGE:figures/full_fig_p017_12.png] view at source ↗
Figure 13
Figure 13. Figure 13: Hierarchical organization of the SmellNet dataset. Each ingredient folder contains multiple CSV files [PITH_FULL_IMAGE:figures/full_fig_p018_13.png] view at source ↗
read the original abstract

The ability of AI to sense and identify various substances based on their smell alone can have profound impacts on allergen detection (e.g. smelling gluten or peanuts in a cake), monitoring the manufacturing process, and sensing hormones that indicate emotional states, stress levels, and diseases. Despite these broad impacts, there are few standardized datasets, and therefore little progress, for training and evaluating AI systems' ability to `smell' in the real-world. In this paper, we use small gas and chemical sensors to create SmellNet, a comparatively large dataset for sensor-based machine olfaction that digitizes a diverse range of smells in the natural world. SmellNet contains about 828,000 time-series data points across 50 substances, spanning nuts, spices, herbs, fruits, and vegetables, and 43 mixtures among them with fixed ingredient volumetric ratios, with 68 hours of data collected. Using SmellNet, we developed ScentFormer, a Transformer-based architecture combining temporal differencing and sliding-window augmentation for smell data. For the SmellNet-Base classification tasks, ScentFormer achieves 63.3% Top-1 accuracy with GC-MS supervision, and for the SmellNet-Mixture distribution prediction tasks, ScentFormer achieves 50.2% Top-1@0.1 on the test-seen split. ScentFormer's ability to generalize across conditions and capture transient chemical dynamics demonstrates the promise of temporal modeling in sensor-based olfactory AI. SmellNet and ScentFormer lay the groundwork for sensor-based olfactory applications across healthcare, food and beverage, environmental monitoring, manufacturing, and entertainment.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 1 minor

Summary. The paper introduces SmellNet, a dataset of ~828,000 time-series data points collected over 68 hours from small gas and chemical sensors across 50 substances (nuts, spices, herbs, fruits, vegetables) and 43 mixtures with fixed volumetric ratios. It proposes ScentFormer, a Transformer architecture using temporal differencing and sliding-window augmentation, reporting 63.3% Top-1 accuracy on SmellNet-Base classification with GC-MS supervision and 50.2% Top-1@0.1 on mixture distribution prediction for the test-seen split.

Significance. If the sensor signals are shown to reliably capture chemical information, this dataset and model would establish a useful benchmark for sensor-based machine olfaction. The scale, real-world substance diversity, and mixture tasks address a gap in standardized resources, with potential impact on allergen detection, manufacturing, environmental monitoring, and health applications. The temporal modeling focus is a constructive direction for handling transient dynamics.

major comments (2)
  1. [Data Collection] The central claims rest on the assumption that the raw time-series from the small gas and chemical sensors encode chemical identity and composition. However, the manuscript supplies no details on sensor calibration, temperature/humidity logging, airflow standardization, or drift correction during the 68-hour collection (Data Collection section). If environmental confounders dominate the signals, the reported accuracies become uninterpretable.
  2. [Results] The abstract and results state 63.3% Top-1 accuracy and 50.2% Top-1@0.1 without baseline comparisons, details on data splits, or statistical significance testing. This omission prevents assessment of whether ScentFormer meaningfully advances beyond simple heuristics or prior sensor models (Results section).
minor comments (1)
  1. [Abstract] The abstract contains backticks around the word 'smell'; this appears to be a formatting artifact and should be removed for clarity.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their thoughtful and constructive review of our manuscript. We have carefully addressed each major comment below and revised the paper to incorporate additional details and comparisons as suggested.

read point-by-point responses
  1. Referee: [Data Collection] The central claims rest on the assumption that the raw time-series from the small gas and chemical sensors encode chemical identity and composition. However, the manuscript supplies no details on sensor calibration, temperature/humidity logging, airflow standardization, or drift correction during the 68-hour collection (Data Collection section). If environmental confounders dominate the signals, the reported accuracies become uninterpretable.

    Authors: We agree that the original Data Collection section lacked sufficient detail on these critical aspects. In the revised manuscript, we have expanded this section with a new subsection describing the sensor calibration protocol (using certified reference gases at known concentrations), continuous logging of temperature and humidity via auxiliary sensors, airflow standardization through calibrated mass flow controllers set to a fixed rate, and drift correction via periodic zero-air baselines with exponential smoothing. These additions confirm that environmental variables were monitored and mitigated throughout the 68-hour collection period. revision: yes

  2. Referee: [Results] The abstract and results state 63.3% Top-1 accuracy and 50.2% Top-1@0.1 without baseline comparisons, details on data splits, or statistical significance testing. This omission prevents assessment of whether ScentFormer meaningfully advances beyond simple heuristics or prior sensor models (Results section).

    Authors: We acknowledge this gap in the original submission. The revised Results section now includes comparisons to three baselines: a random forest on statistical features, an LSTM model, and a prior gas-sensor CNN from the literature. We have added explicit details on the data splits (70/15/15 train/validation/test with substance stratification for the test-seen split and held-out substances for test-unseen). Statistical significance is reported via McNemar's test and paired t-tests over five random seeds, showing p < 0.01 improvements for ScentFormer over the strongest baseline. revision: yes

Circularity Check

0 steps flagged

No circularity: empirical dataset and model accuracies are direct measurements

full rationale

The paper presents SmellNet as a collected sensor time-series dataset across substances and mixtures, then reports ScentFormer accuracies (63.3% Top-1, 50.2% Top-1@0.1) as empirical outcomes of training and evaluation. No equations, parameter fits, or self-citations are described that would make these metrics reduce to the input data by construction; the results remain independent empirical measurements on held-out splits under the stated collection conditions.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

The work rests on the unstated premise that the chosen sensors and collection protocol produce chemically informative signals; no free parameters, axioms, or invented entities are explicitly introduced in the abstract.

pith-pipeline@v0.9.0 · 5837 in / 1031 out tokens · 25002 ms · 2026-05-19T11:52:25.674541+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Forward citations

Cited by 3 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. AromaGen: Interactive Generation of Rich Olfactory Experiences with Multimodal Language Models

    cs.HC 2026-04 unverdicted novelty 7.0

    AromaGen generates real-time custom aromas from free-form text or visual inputs via multimodal LLM mapping to 12 odorants, matching or exceeding human mixtures after iterative refinement in a 26-person study.

  2. Chasing Ghosts: A Simulation-to-Real Olfactory Navigation Stack with Optional Vision Augmentation

    cs.RO 2026-02 unverdicted novelty 6.0

    A simulation-to-real navigation policy enables a quadrotor to locate an odor source using only basic olfaction sensors and optional vision, validated in indoor real-world flights.

  3. Artificial Intelligence for Food Innovation

    cs.CE 2025-09 unverdicted novelty 3.0

    A review paper that surveys AI uses across the food innovation pipeline for sustainable proteins and identifies four strategic priorities for the emerging field.

Reference graph

Works this paper leans on

58 extracted references · 58 canonical work pages · cited by 3 Pith papers

  1. [1]

    Application of artificial intelligence to decode the relationships between smell, olfactory receptors and small molecules

    Rayane Achebouche, Anne Tromelin, Karine Audouze, and Olivier Taboureau. Application of artificial intelligence to decode the relationships between smell, olfactory receptors and small molecules. Scientific reports, 12(1):18817, 2022

  2. [2]

    Deepseek chat model

    DeepSeek AI. Deepseek chat model. https://chat.deepseek.com, 2024

  3. [3]

    An electronic nose for identifying multiple combustible/harmful gases and their concentration levels via artificial intelligence

    Omneya Attallah and Iman Morsi. An electronic nose for identifying multiple combustible/harmful gases and their concentration levels via artificial intelligence. Measurement, 199:111458, 2022

  4. [4]

    Electrochemical sensors

    Eric Bakker and Martin Telting-Diaz. Electrochemical sensors. Analytical chemistry, 74(12):2781–2800, 2002

  5. [5]

    Bme680: Low power gas, pressure, temperature & humidity sen- sor [datasheet]

    Bosch Sensortec. Bme680: Low power gas, pressure, temperature & humidity sen- sor [datasheet]. https://www.bosch-sensortec.com/media/boschsensortec/downloads/ datasheets/bst-bme680-ds001.pdf, 2023

  6. [6]

    Odour detection methods: Olfactometry and chemical sensors

    Magda Brattoli, Gianluigi De Gennaro, Valentina De Pinto, Annamaria Demarinis Loiotile, Sara Lovascio, and Michele Penza. Odour detection methods: Olfactometry and chemical sensors. Sensors, 11:5290–5322, 2011

  7. [7]

    Humans can discriminate more than 1 trillion olfactory stimuli

    Caroline Bushdid, Marcelo O Magnasco, Leslie B V osshall, and Andreas Keller. Humans can discriminate more than 1 trillion olfactory stimuli. Science, 343(6177):1370–1372, 2014

  8. [8]

    A feature extraction method for chemical sensors in electronic noses

    Liran Carmel, Shahar Levy, Doron Lancet, and David Harel. A feature extraction method for chemical sensors in electronic noses. Sensors and Actuators B: Chemical, 93:67–76, 2003

  9. [9]

    Chemberta: Large-scale self- supervised pretraining for molecular property prediction, 2020

    Shankari Chithrananda, Gabriel Grand, and Bharath Ramsundar. Chemberta: Large-scale self-supervised pretraining for molecular property prediction. arXiv preprint arXiv:2010.09885, 2020. URL https: //arxiv.org/abs/2010.09885

  10. [10]

    Ap- plication of electronic nose for industrial odors and gaseous emissions measurement and monitoring–an overview

    Sharvari Deshmukh, Rajib Bandyopadhyay, Nabarun Bhattacharyya, RA Pandey, and Arun Jana. Ap- plication of electronic nose for industrial odors and gaseous emissions measurement and monitoring–an overview. Talanta, 144:329–340, 2015

  11. [11]

    Cosimo Distante, Marco Leo, Bruno Siciliano, and Krishna C. Persaud. On the study of feature extraction methods for an electronic nose. Sensors and Actuators B: Chemical, 87(2):274–288, 2002

  12. [12]

    A review of supervised machine learning for smart manufacturing: Techniques, challenges, and applications

    Mahmoud Elhoushi, Ahmed El-Sayed, and Abdulmotaleb El Saddik. A review of supervised machine learning for smart manufacturing: Techniques, challenges, and applications. Journal of Big Data, 9(1): 1–41, 2022. doi: 10.1186/s40537-022-00596-3

  13. [13]

    Recent progress of organic artificial synapses in biomimetic sensory neural system

    Lan Fang, Cuizhu Mao, Haiting Wang, Qian Ding, Wenyao Jiao, Bingshuo Li, Yibo Zhang, and Dunwei Gong. Recent progress of organic artificial synapses in biomimetic sensory neural system. Journal of Materials Chemistry C, 2024

  14. [14]

    How the olfactory system makes sense of scents

    Stuart Firestein. How the olfactory system makes sense of scents. Nature, 413(6852):211–218, 2001

  15. [15]

    Foodb: The food database, 2024

    FooDB Contributors. Foodb: The food database, 2024. URL https://foodb.ca. Accessed: 2025-05-11

  16. [16]

    Smelling, seeing, tasting old senses for new sensing

    Luca Guerrini, Eduardo Garcia-Rico, Nicolas Pazos-Perez, and Ramon A Alvarez-Puebla. Smelling, seeing, tasting old senses for new sensing. ACS nano, 11(6):5217–5222, 2017

  17. [17]

    Technical data mq-3 gas sensor [datasheet]

    Hanwei Electronics Co., Ltd. Technical data mq-3 gas sensor [datasheet]. https://cdn.sparkfun. com/assets/6/a/1/7/b/MQ-3.pdf, n.d

  18. [18]

    Technical data mq-5 gas sensor [datasheet].https://files.seeedstudio

    Hanwei Electronics Co., Ltd. Technical data mq-5 gas sensor [datasheet].https://files.seeedstudio. com/wiki/Grove-Gas_Sensor-MQ5/res/MQ-5.pdf , n.d

  19. [19]

    Mq-9 semiconductor sensor for co/combustible gas [datasheet]

    Hanwei Electronics Co., Ltd. Mq-9 semiconductor sensor for co/combustible gas [datasheet]. https: //www.haoyuelectronics.com/Attachment/MQ-9/MQ9.pdf, n.d

  20. [20]

    Long short-term memory

    Sepp Hochreiter and Jürgen Schmidhuber. Long short-term memory. Neural computation, 9(8):1735–1780, 1997

  21. [21]

    Optical gas sensing: a review

    Jane Hodgkinson and Ralph P Tatam. Optical gas sensing: a review. Measurement science and technology, 24(1):012004, 2012. 10

  22. [22]

    Mol2vec: Unsupervised machine learning approach with chemical intuition

    Sabrina Jaeger, Simone Fulle, and Samo Turk. Mol2vec: Unsupervised machine learning approach with chemical intuition. Journal of Chemical Information and Modeling , 58(1):27–35, 2018. doi: 10.1021/acs.jcim.7b00616

  23. [23]

    Predicting human olfactory perception from chemical features of odor molecules

    Andreas Keller, Richard C Gerkin, Yuanfang Guan, Amit Dhurandhar, Gabor Turu, Bence Szalai, Joel D Mainland, Yusuke Ihara, Chung Wen Yu, Russ Wolfinger, et al. Predicting human olfactory perception from chemical features of odor molecules. Science, 355(6327):820–826, 2017

  24. [24]

    Gerkin, et al

    Andreas Keller, Richard C. Gerkin, et al. Pyrfume: A large-scale database of olfactory psychophysics data. bioRxiv, 2021. URL https://www.biorxiv.org/content/10.1101/2021.08.02.454727v1

  25. [25]

    Gas chromatography and mass spectrometry: a practical guide

    Fulton G Kitson, Barbara S Larsen, and Charles N McEwen. Gas chromatography and mass spectrometry: a practical guide. Academic Press, 1996

  26. [26]

    Clinical studies of detecting covid-19 from exhaled breath with electronic nose

    Andrzej Kwiatkowski, Sebastian Borys, Katarzyna Sikorska, Katarzyna Drozdowska, and Janusz M Smulko. Clinical studies of detecting covid-19 from exhaled breath with electronic nose. Scientific reports, 12(1):15990, 2022

  27. [27]

    A principal odor map unifies diverse tasks in olfactory perception

    Brian K Lee, Emily J Mayhew, Benjamin Sanchez-Lengeling, Jennifer N Wei, Wesley W Qian, Kelsie A Little, Matthew Andres, Britney B Nguyen, Theresa Moloy, Jacob Yasonik, et al. A principal odor map unifies diverse tasks in olfactory perception. Science, 381(6661):999–1006, 2023

  28. [28]

    Mimicking the human smell sensing mechanism with an artificial nose platform

    Sang Hun Lee, Oh Seok Kwon, Hyun Seok Song, Seon Joo Park, Jong Hwan Sung, Jyongsik Jang, and Tai Hyun Park. Mimicking the human smell sensing mechanism with an artificial nose platform. Biomaterials, 33(6):1722–1729, 2012

  29. [29]

    Hemm: Holistic evaluation of multimodal foundation models

    Paul Pu Liang, Akshay Goindani, Talha Chafekar, Leena Mathur, Haofei Yu, Russ Salakhutdinov, and Louis-Philippe Morency. Hemm: Holistic evaluation of multimodal foundation models. In The Thirty-eight Conference on Neural Information Processing Systems Datasets and Benchmarks Track, 2024

  30. [30]

    Foundations & trends in multimodal machine learning: Principles, challenges, and open questions

    Paul Pu Liang, Amir Zadeh, and Louis-Philippe Morency. Foundations & trends in multimodal machine learning: Principles, challenges, and open questions. ACM Computing Surveys, 56(10):1–42, 2024

  31. [31]

    Qualitative and quantitative analysis of volatile organic compounds using transient and steady-state responses of a thick-film tin oxide gas sensor array

    Eduard Llobet, Jordi Brezmes, Xavier Vilanova, Joan Esteve Sueiras, and Xavier Correig. Qualitative and quantitative analysis of volatile organic compounds using transient and steady-state responses of a thick-film tin oxide gas sensor array. Sensors and Actuators B: Chemical, 41:13–21, 1997

  32. [32]

    Combinatorial receptor codes for odors

    Bettina Malnic, Junzo Hirono, Takaaki Sato, and Linda B Buck. Combinatorial receptor codes for odors. Cell, 96(5):713–723, 1999

  33. [33]

    Feature extraction of chemical sensors in phase space

    Eugenio Martinelli, Christian Falconi, Arnaldo D’Amico, and Corrado Di Natale. Feature extraction of chemical sensors in phase space. Sensors and Actuators B: Chemical, 95(1-3):132–139, 2003

  34. [34]

    Conducting polymer gas sensors.Journal of the Chemical Society, Faraday Transactions 1: Physical Chemistry in Condensed Phases, 82(4):1117–1126, 1986

    Jan J Miasik, Alan Hooper, and Bruce C Tofield. Conducting polymer gas sensors.Journal of the Chemical Society, Faraday Transactions 1: Physical Chemistry in Condensed Phases, 82(4):1117–1126, 1986

  35. [35]

    Semiconductor gas sensors: Materials, technology, design, and application

    Maria Vesna Nikolic, Vladimir Milovanovic, Zorka Z Vasiljevic, and Zoran Stamenkovic. Semiconductor gas sensors: Materials, technology, design, and application. Sensors, 20(22):6694, 2020

  36. [36]

    Chatgpt-4o (may 2025 version), 2025

    OpenAI. Chatgpt-4o (may 2025 version), 2025. Accessed via https://chat.openai.com

  37. [37]

    Deep learning framework for sensor array precision and accuracy enhancement

    Julie Payette, Fabrice Vaussenat, and Sylvain Cloutier. Deep learning framework for sensor array precision and accuracy enhancement. Scientific Reports, 13(1):11237, 2023

  38. [38]

    Found in translation: Learning robust joint representations by cyclic translations between modalities

    Hai Pham, Paul Pu Liang, Thomas Manzini, Louis-Philippe Morency, and Barnabás Póczos. Found in translation: Learning robust joint representations by cyclic translations between modalities. In AAAI, 2019

  39. [39]

    Learning transferable visual models from natural language supervision

    Alec Radford, Jong Wook Kim, Chris Hallacy, Aditya Ramesh, Gabriel Goh, and Sandhini Agarwal. Learning transferable visual models from natural language supervision. In ICML, pages 8748–8763. PMLR, 2021

  40. [40]

    Learning representations by back- propagating errors

    David E Rumelhart, Geoffrey E Hinton, and Ronald J Williams. Learning representations by back- propagating errors. Nature, 323(6088):533–536, 1986

  41. [41]

    Machine learning for scent: Learning generalizable perceptual representations of small molecules

    Benjamin Sanchez-Lengeling, Jennifer N Wei, Brian K Lee, Richard C Gerkin, Alán Aspuru-Guzik, and Alexander B Wiltschko. Machine learning for scent: Learning generalizable perceptual representations of small molecules. arXiv preprint arXiv:1910.10685, 2019. 11

  42. [42]

    Sense of smell: structural, functional, mechanistic advancements and challenges in human olfactory research

    Anju Sharma, Rajnish Kumar, Imlimaong Aier, Rahul Semwal, Pankaj Tyagi, and Pritish Varadwaj. Sense of smell: structural, functional, mechanistic advancements and challenges in human olfactory research. Current neuropharmacology, 17(9):891–911, 2019

  43. [43]

    Reasoning with neural tensor networks for knowledge base completion

    Richard Socher, Danqi Chen, Christopher D Manning, and Andrew Ng. Reasoning with neural tensor networks for knowledge base completion. In Advances in neural information processing systems, pages 926–934, 2013

  44. [44]

    An initial evaluation of the functions of human olfaction

    Richard J Stevenson. An initial evaluation of the functions of human olfaction. Chemical senses, 35(1): 3–20, 2010

  45. [45]

    Data-centric artificial olfactory system based on the eigengraph

    Seung-Hyun Sung, Jun Min Suh, Yun Ji Hwang, Ho Won Jang, Jeon Gue Park, and Seong Chan Jun. Data-centric artificial olfactory system based on the eigengraph. Nature communications, 15(1):1211, 2024

  46. [46]

    Data-centric artificial olfactory system based on the eigengraph

    Seung-Hyun Sung, Jun Min Suh, Yun Ji Hwang, Ho Won Jang, Jeon Gue Park, and Seong Chan Jun. Data-centric artificial olfactory system based on the eigengraph. Nature Communications, 15(1):1211, 2024

  47. [47]

    V okenization: Improving language understanding with contextualized, visual-grounded supervision

    Hao Tan and Mohit Bansal. V okenization: Improving language understanding with contextualized, visual-grounded supervision. In EMNLP, pages 2066–2080, 2020

  48. [48]

    Emotionally responsive wearable technology and stress detection for affective disorders

    Jenny Tillotson. Emotionally responsive wearable technology and stress detection for affective disorders. Psychiatria Danubina, 29(suppl. 3):604–606, 2017

  49. [49]

    Deepnose: Using artificial neural networks to represent the space of odorants

    Ngoc Tran, Daniel Kepple, Sergey Shuvaev, and Alexei Koulakov. Deepnose: Using artificial neural networks to represent the space of odorants. In International Conference on Machine Learning, pages 6305–6314. PMLR, 2019

  50. [50]

    Attention is all you need

    Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, Łukasz Kaiser, and Illia Polosukhin. Attention is all you need. In Advances in neural information processing systems, pages 5998–6008, 2017

  51. [51]

    Beverage and food fragrance biotechnology, novel applications, sensory and sensor techniques: An overview

    Alice Vilela, Eunice Bacelar, Teresa Pinto, Rosário Anjos, Elisete Correia, Berta Gonçalves, and Fernanda Cosme. Beverage and food fragrance biotechnology, novel applications, sensory and sensor techniques: An overview. Foods, 8(12):643, 2019

  52. [52]

    Evolving the olfactory system with machine learning

    Peter Y Wang, Yi Sun, Richard Axel, LF Abbott, and Guangyu Robert Yang. Evolving the olfactory system with machine learning. Neuron, 109(23):3879–3892, 2021

  53. [53]

    Hmdb 4.0: the human metabolome database for

    David S Wishart, Yannick D Feunang, Ana Marcu, et al. Hmdb 4.0: the human metabolome database for

  54. [54]

    Nucleic acids research, 46(D1):D608–D617, 2018

  55. [56]

    Electronic nose feature extraction methods: A review

    Jia Yan, Xiuzhen Guo, Shukai Duan, Pengfei Jia, Lidan Wang, Chao Peng, and Songlin Zhang. Electronic nose feature extraction methods: A review. Sensors, 15(11):27804–27831, 2015

  56. [57]

    Identification of suitable biomarkers for stress and emotion detection for future personal affective wearable sensors

    Abdulaziz Zamkah, Terence Hui, Simon Andrews, Nilanjan Dey, Fuqian Shi, and R Simon Sherratt. Identification of suitable biomarkers for stress and emotion detection for future personal affective wearable sensors. Biosensors, 10(4):40, 2020

  57. [58]

    Mp503 air-quality gas sensor [datasheet]

    Zhengzhou Winsen Electronics Technology Co., Ltd. Mp503 air-quality gas sensor [datasheet]. https://files.seeedstudio.com/wiki/Grove_Air_Quality_Sensor_v1.3/res/Mp503% 20English.pdf, n.d

  58. [59]

    Wsp2110 voc gas sensor [datasheet]

    Zhengzhou Winsen Electronics Technology Co., Ltd. Wsp2110 voc gas sensor [datasheet]. https: //files.seeedstudio.com/wiki/Grove-HCHO_Sensor/res/Wsp2110-1-.pdf , n.d.. 12 A Background on the Smell Sensors A.1 Sensor hardware specifications Figure 8: Sensor hardware architecture diagram. The device includes seven gas sensors covering VOCs, alcohol, carbon m...