pith. machine review for the scientific record.

arxiv: 1511.08458 · v2 · submitted 2015-11-26 · 💻 cs.NE · cs.CV · cs.LG

Recognition: unknown

An Introduction to Convolutional Neural Networks

classification: cs.NE · cs.CV · cs.LG
keywords: introduction, learning, machine, neural, ANNs, architecture, artificial, CNNs

The field of machine learning has taken a dramatic twist in recent times, with the rise of the Artificial Neural Network (ANN). These biologically inspired computational models are able to far exceed the performance of previous forms of artificial intelligence in common machine learning tasks. One of the most impressive forms of ANN architecture is that of the Convolutional Neural Network (CNN). CNNs are primarily used to solve difficult image-driven pattern recognition tasks and, with their precise yet simple architecture, offer a simplified method of getting started with ANNs. This document provides a brief introduction to CNNs, discussing recently published papers and newly formed techniques in developing these brilliantly fantastic image recognition models. This introduction assumes you are familiar with the fundamentals of ANNs and machine learning.

This paper has not been read by Pith yet.

Forward citations

Cited by 9 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. OVS-DINO: Open-Vocabulary Segmentation via Structure-Aligned SAM-DINO with Language Guidance

    cs.CV 2026-04 unverdicted novelty 7.0

    OVS-DINO structurally aligns DINO with SAM to revitalize attenuated boundary features, achieving SOTA gains of 2.1% average and 6.3% on Cityscapes in weakly-supervised open-vocabulary segmentation.

  2. QMC-Net: Data-Aware Quantum Representations for Remote Sensing Image Classification

    quant-ph 2026-04 unverdicted novelty 6.0

    QMC-Net maps per-band statistics to customized quantum circuit hyperparameters and achieves 93.80% and 99.34% accuracy on EuroSAT and SAT-6, outperforming classical and monolithic quantum baselines.

  3. Automated Attention Pattern Discovery at Scale in Large Language Models

    cs.LG 2026-04 unverdicted novelty 6.0

    AP-MAE reconstructs masked attention patterns in LLMs with high accuracy, generalizes across models, predicts generation correctness at 55-70%, and enables 13.6% accuracy gains via targeted interventions.

  4. TemPose-TF-ASF: Two-Stage Bidirectional Stroke Context Fusion for Badminton Stroke Classification

    cs.CV 2026-05 unverdicted novelty 4.0

    TemPose-TF-ASF fuses bidirectional stroke context in two stages to raise accuracy and Macro-F1 for badminton stroke classification over baselines.

  5. SAGE-GAN: Towards Realistic and Robust Segmentation of Spatially Ordered Nanoparticles via Attention-Guided GANs

    cs.CV 2026-04 unverdicted novelty 4.0

    SAGE-GAN integrates a self-attention U-Net into a CycleGAN framework to generate realistic synthetic electron microscopy image-mask pairs that augment training data for nanoparticle segmentation without human labeling.

  6. Revisiting Human-in-the-Loop Object Retrieval with Pre-Trained Vision Transformers

    cs.CV 2026-04 unverdicted novelty 4.0

    Pre-trained ViT representations combined with active learning and targeted design choices for annotations and selection improve object class retrieval in multi-object scenes.

  7. Learning-Based Spectrum Cartography in Low Earth Orbit Satellite Networks: An Overview

    cs.NI 2026-05 unverdicted novelty 3.0

    The paper overviews attention-based learning methods for spectrum cartography in LEO satellite networks to enable adaptive fusion of heterogeneous measurements for inference and resource allocation.

  8. Survey on Disaster Management Datasets for Remote Sensing Based Emergency Applications

    cs.CV 2026-05 unverdicted novelty 3.0

    A survey providing an overview of publicly available image-based datasets for ML/DL-based disaster management pipelines covering pre-disaster, during, and post-disaster phases.

  9. Comparison of window shapes and lengths in short-time feature extraction for classification of heart sound signals

    cs.SD 2026-04 unverdicted novelty 2.0

    A 75 ms Gaussian window for segmenting phonocardiography signals yields the highest biLSTM classification accuracy among tested shapes and lengths, outperforming rectangular windows and a baseline method.