Our model uses 128 gradient scales, one for each of its resblocks

Use per-resblock gradient scaling (Figure 4) instead of standard loss scaling

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

citation-role summary

method 1

citation-polarity summary

use method 1

representative citing papers

Zero-Shot Text-to-Image Generation

cs.CV · 2021-02-24 · conditional · novelty 7.0

A transformer autoregressively models text and image tokens as one stream and produces competitive zero-shot text-to-image results at sufficient scale.

citing papers explorer

Showing 1 of 1 citing paper.

Zero-Shot Text-to-Image Generation cs.CV · 2021-02-24 · conditional · none · ref 4
A transformer autoregressively models text and image tokens as one stream and produces competitive zero-shot text-to-image results at sufficient scale.

Our model uses 128 gradient scales, one for each of its resblocks

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer