Published Papers
No published papers available at the moment.
Preprints
AlphaSteer: Learning Refusal Steering with Principled Null-Space Constraint
Leheng Sheng*, Changshuo Shen*, Weixiang Zhao, Junfeng Fang, Xiaohao Liu, Zhenkai Liang, Xiang Wang, An Zhang†, Tat-Seng Chua
On Reasoning Strength Planning in Large Reasoning Models
Leheng Sheng, An Zhang†, Zijian Wu, Weixiang Zhao, Changshuo Shen, Yi Zhang, Xiang Wang, Tat-Seng Chua
Notes: * Equal contribution; † Corresponding author