Publications

  1. Cong Ma, Yaping Zhang, Yang Zhao, Yu Zhou, Chengqing Zong. Vector Quantization Knowledge Transfer for End-to-End Text Image Machine Translation. The 49th IEEE International Conference on Acoustics, Speech, & Signal Processing (ICASSP), 2024.

  2. Cong Ma, Xu Han, Linghui Wu, Yaping Zhang, Yang Zhao, Yu Zhou, and Chengqing Zong. Modal Contrastive Learning based End-to-End Text Image Machine Translation. IEEE/ACM Transactions on Audio, Speech, and Language Processing. IEEE Xplore Early Access.

  3. Cong Ma, Yaping Zhang, Mei Tu, Yang Zhao, Yu Zhou, Chengqing Zong. CCIM: Cross-Modal Cross-Lingual Interactive Image Translation. In the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023. ACL_Anthology_version.

  4. Cong Ma, Yaping Zhang, Mei Tu, Yang Zhao, Yu Zhou, and Chengqing Zong. E2TIMT: Efficient and Effective Modal Adapter for Text Image Machine Translation. In The 17th International Conference on Document Analysis and Recognition (ICDAR 2023), pages 70–88, Cham. Springer Nature Switzerland. arXiv_version, Springer_Link.

  5. Cong Ma, Yaping Zhang, Mei Tu, Yang Zhao, Yu Zhou, and Chengqing Zong. Multi-Teacher Knowledge Distillation for End-to-End Text Image Machine Translation. In The 17th International Conference on Document Analysis and Recognition (ICDAR 2023), pages 484–501, Cham. Springer Nature Switzerland. arXiv_version, Springer_Link.

  6. Cong Ma, Yaping Zhang, Mei Tu, Xu Han, Linghui Wu, Yang Zhao, Yu Zhou. Improving End-to-End Text Image Translation From the Auxiliary Text Translation Task. In Proceedings of the 26th International Conference on Pattern Recognition (ICPR 2022), Virtually, Montréal, Québec, Canada. August 21-25, 2022, pp. 1664-1670. arXiv_version, ieeexplore_version, GitHub.

  7. Qian Wang, Yuchen Liu, Cong Ma, Yu Lu, Yining Wang, Long Zhou, Yang Zhao, Jiajun Zhang, Chengqing Zong. CASIA’s System for IWSLT 2020 Open Domain Translation. In Proceedings of the 17th International Conference on Spoken Language Translation (IWSLT), pages 130-139. July 9-10, 2020.

  8. Yang Zhao, Long Zhou, Qian Wang, Cong Ma, Yuchen Liu, Yining Wang, Lu Xiang, Jiajun Zhang, Yu Zhou, Chengqing Zong. Research on Low-Resource Ethnic-to-Chinese Neural Machine Translation. In The 15th China Conference on Machine Translation, CCMT 2019.

  9. ZHAO Yang, ZHOU Long, WANG Qian, MA Cong, LIU Yuchen, WANG Yining, XIANG Lu, ZHANG Jiajun, ZHOU Yu, ZONG Chengqing. The Study on Ethnic-to-Chinese Scare-Resource Neural Machine Translation. In the Journal of Jiangxi Normal University (Natural Science), 2019, vol. 43, no. 6, pp. 630-637.

    赵阳, 周龙, 王迁, 马聪, 刘宇宸, 王亦宁, 向露, 张家俊, 周玉, 宗成庆. 民汉稀缺资源神经机器翻译技术研究 [J]. 江西师范大学学报(自然科学版), 2019, 043(006):630-637.

  10. H. Li, J. Zhu, C. Ma, J. Zhang and C. Zong, “Read, Watch, Listen, and Summarize: Multi-Modal Summarization for Asynchronous Text, Image, Audio and Video,” in IEEE Transactions on Knowledge and Data Engineering, vol. 31, no. 5, pp. 996-1009, 1 May 2019. This_Paper_in_IEEE_Xplore.

  11. Haoran Li, Junnan Zhu, Cong Ma, Jiajun Zhang and Chengqing Zong. Multi-modal Summarization for Asynchronous Collection of Text, Image, Audio and Video. In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing (EMNLP-17), Copenhagen, Denmark. September 9-11, 2017, pp. 1103–1113.