The dataset consists of three types of modalities: language (commands), audio (utterances), vision (images)

Dataset In this section we introduce the KITE dataset, a multi-modal dataset for UA V control

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Kite: Automatic speech recognition for unmanned aerial vehicles

cs.SD · 2019-07-02 · unverdicted · novelty 5.0

Introduces a multimodal UAV command dataset and shows image-augmented RNN language models outperform text-only versions despite imperfect training associations.

citing papers explorer

Showing 1 of 1 citing paper.

Kite: Automatic speech recognition for unmanned aerial vehicles cs.SD · 2019-07-02 · unverdicted · none · ref 4
Introduces a multimodal UAV command dataset and shows image-augmented RNN language models outperform text-only versions despite imperfect training associations.

The dataset consists of three types of modalities: language (commands), audio (utterances), vision (images)

fields

years

verdicts

representative citing papers

citing papers explorer