Bifrost achieves significant latency reductions in privacy-preserving transformer inference through a hybrid CPU TEE and accelerator FHE design, with Bifrost+ further optimizing via prefill/decode split.
Rubix: Reducing the overhead of secure rowhammer mitigations via randomized line- to-row mapping
3 Pith papers cite this work. Polarity classification is still indexing.
years
2026 3representative citing papers
PrISM uses a Sampled History Queue to correlate row samples across windows, solving the non-selection problem in probabilistic RowHammer mitigation and cutting slowdown from 10.7% to 1.5% at threshold 250 versus prior methods.
CloakLM mitigates model exfiltration by obfuscating GPU memory layouts with PCIe shaping, weight shuffling, and HBM remapping while keeping near-native performance.
citing papers explorer
-
Bifrost: Hybrid TEE-FHE Inference for Privacy-Preserving Transformer and LLM Serving
Bifrost achieves significant latency reductions in privacy-preserving transformer inference through a hybrid CPU TEE and accelerator FHE design, with Bifrost+ further optimizing via prefill/decode split.
-
Loaded Dice: Solving the Non-Selection Problem for Scalable Probabilistic RowHammer Defense
PrISM uses a Sampled History Queue to correlate row samples across windows, solving the non-selection problem in probabilistic RowHammer mitigation and cutting slowdown from 10.7% to 1.5% at threshold 250 versus prior methods.
-
CloakLM: Obfuscating GPU Memory Layout to Mitigate Model Ex-filtration for Serving
CloakLM mitigates model exfiltration by obfuscating GPU memory layouts with PCIe shaping, weight shuffling, and HBM remapping while keeping near-native performance.