Rabiul Awal, Mahsa Massoud, Aarash Feizi, Zichao Li, Suyuchen Wang, Christopher Pal, Aishwarya Agrawal, David Vázquez, Siva Reddy, Juan A. Rodriguez, Perouz Taslakian, Spandana Gella, Sai Rajeswar(2025).
WebMMU: A Benchmark for Multimodal Multilingual Website Understanding and Code Generation.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing (EMNLP 2025).
Suyuchen Wang*, Tianyu Zhang*, Lu Li, Ge Zhang, Perouz Taslakian, Sai Rajeswar, Jie Fu, Bang Liu, Yoshua Bengio(2025).
VCR: Pixel-Level Complex Reasoning by Restoring Occluded Text.
The Thirteenth International Conference on Learning Representations (ICLR 2025).
Tianyu Zhang, Xinyu Wang, Lu Li, Zhenghan Tai, Jijun Chi, Jingrui Tian, Hailin He, Suyuchen Wang(2025).
STRICT: Stress-Test of Rendering Image Containing Text.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing (EMNLP 2025).