For DINO, we directly use the projection head for [CLS] token and generate a 65536-d probability distribution for each patch token

For BEiT, the DALL-E encoder generates a discrete number for each patch token · 2020

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

iBOT: Image BERT Pre-Training with Online Tokenizer

cs.CV · 2021-11-15 · unverdicted · novelty 7.0

iBOT achieves 82.3% linear probing accuracy and 87.8% fine-tuning accuracy on ImageNet-1K using masked image modeling with a jointly trained online tokenizer.

citing papers explorer

iBOT: Image BERT Pre-Training with Online Tokenizer

cs.CV · 2021-11-15 · unverdicted · none · ref 21

iBOT achieves 82.3% linear probing accuracy and 87.8% fine-tuning accuracy on ImageNet-1K using masked image modeling with a jointly trained online tokenizer.

For DINO, we directly use the projection head for [CLS] token and generate a 65536-d probability distribution for each patch token
