Qiyu Wu
Hi there! This is Qiyu Wu. I’m living in Tokyo now.
I’m a Ph.D. student at The University of Tokyo. I’m fortunate to be advised by Yoshimasa Tsuruoka.
I’m supported by the JSPS DC2 Fellowship and the JST SPRING Fellowship during my Ph.D. program.
Before moving to Tokyo, I obtained my M.Sc. degree from Peking University and B.Eng. degree from Sichuan University. Here is my CV.
Reach out to me at wuqiyu576 [AT] gmail [DOT] com, or on LinkedIn.
Research Interests
My research focuses on semantic representation learning for NLP, mainly on better representing word/phrase semantics and sentence semantics in both monolingual and multilingual contexts. Recently, I have also broadened my scope to visual-language representation and entity representation.
In addition to NLP, I also have a keen interest in data-centric techniques such as data mining, integration, and augmentation.
Publications
- Zhongtao Miao†, Qiyu Wu, Kaiyan Zhao, Zilong Wu, Yoshimasa Tsuruoka. “Enhancing Cross-lingual Sentence Embedding for Low-resource Languages with Word Alignment.” NAACL 2024, Findings.
- Kaiyan Zhao*†, Qiyu Wu*, Xin-Qiang Cai, Yoshimasa Tsuruoka. “Leveraging Multi-lingual Positive Instances in Contrastive Learning to Improve Sentence Embedding.” EACL 2024, main conference long paper. [Paper]
- Shiwen Wu, Qiyu Wu, Honghua Dong, Wen Hua, Xiaofang Zhou. “Blocker and Matcher Can Mutually Benefit: A Co-Learning Framework for Low-Resource Entity Resolution.” To appear at VLDB 2024, research track paper.
- Qiyu Wu, Masaaki Nagata, Yoshimasa Tsuruoka. “WSPAlign: Word Alignment Pre-training via Large-Scale Weakly Supervised Span Prediction.” ACL 2023, main conference long paper. [Paper] [Code]
- Qiyu Wu, Chongyang Tao, Tao Shen, Can Xu, Xiubo Geng and Daxin Jiang. “PCL: Peer-Contrastive Learning with Diverse Augmentations for Unsupervised Sentence Embeddings.” EMNLP 2022, main conference long paper. [Paper] [Code]
- Qiyu Wu, Chen Xing, Yatao Li, Guolin Ke, Di He and Tie-Yan Liu. “Taking Notes on the Fly Helps Language Pre-Training.” ICLR 2021. [Paper]
- More
News
- 2024-03, a paper on low-resource language sentence embedding accepted to NAACL 2024 findings.
- 2024-01, a long paper on multilingual sentence embedding accepted to EACL 2024 main conference.
- 2023-11, visited and gave talks at Osaka University and Kyoto University, “Mitigating Reporting Bias in Visual-Language Datasets w/ Large Generative Models”. [Slides]
- 2023-09, I’m fortunate to be selected for the JSPS Research Fellowships for Young Scientists (DC2)!
- 2023-08, a paper in collaboration with HKUST on entity resolution is accepted (with artifact availability) to VLDB 2024.
- 2023-07, attending ACL 2023 in Toronto, will present WSPAlign in person.
- 2023-06, give a talk at Sony, “Leveraging Unlabeled Text: Data-centric Approaches to Improve NLP Training”. [Slides]
- 2023-06, start my internship at Creative AI Lab, Sony, Tokyo; I will work on NLP for multi-modal training.
- 2023-05, a long paper with NTT CS lab about word alignment is accepted to ACL 2023 main conference.
- 2023-01, start visiting San Diego as a visiting student at UCSD.
- 2022-12, attend EMNLP 2022 in Abu Dhabi, will present a paper about diverse augmentations for sentence embeddings.
- 2022-10, a long paper with Microsoft accepted to EMNLP 2022 conference.
- 2021-10, I join Tsuruoka Lab at The University of Tokyo as a Ph.D. student!
- 2021-10, I am glad to be selected for the JST Support for Pioneering Research Initiated by the Next Generation (SPRING) Program, with support from 2021 to 2024!
- 2021-04, invited talk at TechBeat, “Light language pre-training”. [Slides]
- 2021-01, a paper with Microsoft Research Asia about language pre-training is accepted to ICLR 2021.
- 2020-12, a paper with Baidu Research is accepted to AAAI 2021.
- 2020-04, start internship at Machine Learning Group, Microsoft Research Asia, Beijing.