pith. sign in

Shiva Kumar Pentyala

Identifiers

  • name variant Shiva Kumar Pentyala 0.60 · backfill

Papers (2)

  1. UNA: A Unified Supervised Framework for Efficient LLM Alignment Across Feedback Types cs.LG · 2024 · author #4
  2. Reinforcement Learning for LLM Post-Training: A Survey cs.CL · 2024 · author #4

Mentions

  • 2407.16216 #4 · arxiv_oai · confidence 0.70 Shiva Kumar Pentyala

Frequent Coauthors