Publications
A collection of my research work.

Learning to Focus: Causal Attention Distillation via Gradient-Guided Token Pruning
Yiju Guo, Wenkai Yang, Zexu Sun, Ning Ding, Zhiyuan Liu, Yankai Lin
NeurIPS 2025 Conference 2025
A framework to improve LLM reasoning by removing distracting tokens via causal attention distillation and gradient-guided pruning.

LaSeR: Reinforcement Learning with Last-Token Self-Rewarding
Wenkai Yang, Weijie Liu, Ruobing Xie, Yiju Guo, Lulu Wu, Saiyong Yang, Yankai Lin
arXiv preprint arXiv:2510.14943 2025
An efficient RL method that unifies reasoning and verification by utilizing the last-token probability as a self-rewarding signal.

Uncertainty and influence aware reward model refinement for reinforcement learning from human feedback
Zexu Sun, Yiju Guo, Yankai Lin, Xu Chen, Qi Qi, Xing Tang, Ji-Rong Wen
ICLR 2025 Conference 2025
An uncertainty-aware data augmentation method to refine reward models in RLHF without expensive human annotation.

Controllable preference optimization: Toward controllable multi-objective alignment
Yiju Guo, Ganqu Cui, Lifan Yuan, Ning Ding, Zexu Sun, Bowen Sun, Huimin Chen, Ruobing Xie, Jie Zhou, Yankai Lin, others
EMNLP 2024 main conference 2024
A multi-objective alignment method that explicitly controls preference scores to balance helpfulness, honesty, and harmlessness.
Amvae: Asymmetric multimodal variational autoencoder for multi-view representation
Wen Youpeng, Lin Hongxiang, Guo Yiju, Zhao Liang
International Conference on Artificial Neural Networks 2021
A variational autoencoder framework for learning representations from asymmetric multimodal data.