← back to paper
arxiv: 2103.00020 · 2 revisions
Learning Transferable Visual Models From Natural Language Supervision