Publications

(2025). FinSage: A Multi-aspect RAG System for Financial Filings Question Answering. Proceedings of the 34th ACM International Conference on Information and Knowledge Management (CIKM 2025).
(2025). CARE: Improving Context Fidelity via Native Retrieval-Augmented Reasoning. Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing (EMNLP 2025).
(2025). BigDocs: An Open Dataset for Training Multimodal Models on Document and Code Tasks. The Thirteenth International Conference on Learning Representations (ICLR 2025).
(2025). AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Understanding. CoRR.
(2025). Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems. arXiv preprint arXiv:2504.01990.
(2024). Resonance RoPE: Improving Context Length Generalization of Large Language Models. Findings of the Association for Computational Linguistics, ACL 2024, Bangkok, Thailand and virtual meeting, August 11-16, 2024.
(2024). LongRecipe: Recipe for Efficient Long Context Generalization in Large Language Models. CoRR.
(2024). FACT: Examining the Effectiveness of Iterative Context Rewriting for Multi-fact Retrieval. Findings of the 2025 Annual Conference of the Nations of the Americas Chapter of the ACL (NAACL 2025 Findings).
(2023). Efficient Classification of Long Documents via State-Space Models. Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, EMNLP 2023, Singapore, December 6-10, 2023.