How many objects are left? •Original Answer: 3 Input Image <think> Okay, let's see

Simplify the Negative Sign (Optional): Since , the equation can also be written as: Final Answer: y = a⋅sin(−5x) (π/10, 5) a x = π/10 y = 5 5 = a⋅sin(−5⋅

1 Pith paper cites this work. Polarity classification is still indexing.


representative citing papers

SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models

cs.CL · 2025-04-10 · unverdicted · novelty 6.0

SFT induces pseudo-reasoning paths that undermine RL in LVLMs, while RL with GRPO and mixed perception-cognition rewards on the new VLAA-Thinking dataset produces more genuine reasoning and top leaderboard performance.

citing papers explorer

SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models

cs.CL · 2025-04-10 · unverdicted · none · ref 30

SFT induces pseudo-reasoning paths that undermine RL in LVLMs, while RL with GRPO and mixed perception-cognition rewards on the new VLAA-Thinking dataset produces more genuine reasoning and top leaderboard performance.
