Marko Karbevski
Identifiers
- name variant Marko Karbevski 0.60 · backfill
Papers (3)
- Can an MLP Absorb Its Own Skip Connection? cs.LG · 2026 · author #2
- Beyond Linearity in Attention Projections: The Case for Nonlinear Queries cs.LG · 2026 · author #1
- Key and Value Weights Are Probably All You Need: On the Necessity of the Query, Key, Value weight Triplet in Self-Attention Transformers cs.LG · 2025 · author #1
Mentions
- 2603.13381 #1 · arxiv_oai · confidence 0.70 Marko Karbevski
Frequent Coauthors
- Antonij Mijoski 2 shared papers