pith. sign in

Marko Karbevski

Identifiers

  • name variant Marko Karbevski 0.60 · backfill

Papers (3)

  1. Can an MLP Absorb Its Own Skip Connection? cs.LG · 2026 · author #2
  2. Beyond Linearity in Attention Projections: The Case for Nonlinear Queries cs.LG · 2026 · author #1
  3. Key and Value Weights Are Probably All You Need: On the Necessity of the Query, Key, Value weight Triplet in Self-Attention Transformers cs.LG · 2025 · author #1

Mentions

  • 2603.13381 #1 · arxiv_oai · confidence 0.70 Marko Karbevski

Frequent Coauthors