Publications
2026
- Preprint
Thinking Makes LLM Agents Introverted: How Mandatory Thinking Can Backfire in User-Engaged AgentsArxiv:2602.07796, 2026 - PreprintTowards Reducible Uncertainty Modeling for Reliable Large Language Model AgentsArxiv:2602.05073, 2026
- WWW ’26Are LLMs Stable Formal Logic Translators in Logical Reasoning Across Linguistically Diversified Texts?In Proceedings of the ACM Web Conference 2026, 2026
- IEEE TLTTowards Fair and Efficient Intelligent Learning: A Generative Cognitive Diagnosis ApproachIEEE Transactions on Learning Technologies, 2026
- AAAI ’26From Diagnosis to Generalization: A Cognitive Approach to Data Selection for Educational LLMsIn Proceedings of the AAAI Conference on Artificial Intelligence, 2026
- ICLR ’26Scaling Reasoning Hop Exposes Weaknesses: Demystifying and Improving Hop Generalization in Large Language ModelsarXiv preprint arXiv:2601.21214, 2026
- ICLR ’26Fewer Battles, More Gain: An Information-Efficient Framework for Arena-based LLM EvaluationIn The Fourteenth International Conference on Learning Representations, 2026
2025
- TPAMIEfficient Benchmarking via Bias-Bounded Subset SelectionIEEE Trans. Pattern Anal. Mach. Intell., 2025
- EMNLP ’25 FindingSummarize-Exemplify-Reflect: Data-driven Insight Distillation Empowers LLMs for Few-shot Tabular ClassificationIn Findings of the Association for Computational Linguistics: EMNLP 2025, 2025
- ICML ’25
am-ELO: A Stable Framework for Arena-based LLM EvaluationIn Forty-second International Conference on Machine Learning, 2025 - NeurIPS ’25
- Preprint
- preprintLogic Jailbreak: Efficiently Unlocking LLM Safety Restrictions Through Formal Logical ExpressionArxiv:2505.13527, 2025
2024
- Preprint
Leveraging LLMs for Hypothetical Deduction in Logical Inference: A Neuro-Symbolic ApproachArxiv:2410.21779, 2024 - NeurIPS ’24
Computerized Adaptive Testing via Collaborative RankingIn Advances in Neural Information Processing Systems, 2024 - KDD ’24AdaRD: An Adaptive Response Denoising Framework for Robust Learner ModelingIn Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024
- Preprint
- PreprintA Survey of Models for Cognitive Diagnosis: New Developments and Future DirectionsArxiv:2407.05458, 2024
- DASFAA ’24Reformulating Sequential Recommendation: Learning Dynamic User Interest with Content-enriched Language ModelingArxiv:2309.10435, 2024
2023
- Preprint