Robust speech recognition via large-scale weak supervision

Alec Radford, Jong Wook Kim, Tao Xu, Greg Brockman, Christine McLeavey, Ilya Sutskever · 2023

4 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

The DeepSpeak Dataset

cs.CV · 2024-08-09 · unverdicted · novelty 7.0

DeepSpeak provides over 100 hours of consented, identity-matched real and modern deepfake audiovisual content focused on talking heads, with evaluations showing existing detectors fail to generalize without retraining.

Non-Intrusive Automatic Speech Recognition Refinement: A Survey

eess.AS · 2025-08-10 · accept · novelty 4.0

A survey that classifies non-intrusive ASR refinement methods into five categories, reviews domain adaptation and evaluation datasets, proposes standardized metrics, and identifies future research directions.

ROS-LLM: A ROS framework for embodied AI with task feedback and structured reasoning

cs.RO · 2024-06-28 · unverdicted · novelty 4.0

ROS-LLM integrates LLMs with ROS to let non-experts specify robot tasks in natural language, supporting sequence, behavior tree, and state machine modes plus imitation learning and reflection on feedback.

Chat Modeling: Interaction-Enhanced Agent Framework for Visualizing Literature-Grounded Biological Structures

cs.HC · 2024-04-01 · unverdicted · novelty 4.0

Chat Modeling is a multi-agent LLM framework with modeling memory and dynamic chat widgets that translates text inputs into interactive 3D modeling operations for literature-grounded biological structures.

citing papers explorer

Showing 4 of 4 citing papers.

The DeepSpeak Dataset

cs.CV · 2024-08-09 · unverdicted · none · ref 41

Non-Intrusive Automatic Speech Recognition Refinement: A Survey

eess.AS · 2025-08-10 · accept · none · ref 5

ROS-LLM: A ROS framework for embodied AI with task feedback and structured reasoning

cs.RO · 2024-06-28 · unverdicted · none · ref 42

Chat Modeling: Interaction-Enhanced Agent Framework for Visualizing Literature-Grounded Biological Structures

cs.HC · 2024-04-01 · unverdicted · none · ref 39
