I am a Ph.D student in Computer Science.
My research interests lie in the areas of Natural Language Processing (NLP), Information Retrieval (IR) and Multimodal ML.
Recently, I focus on the augmentation and application of large language models (LLM).
计算机科学与技术专业博士生。
研究兴趣主要包括自然语言处理(NLP)、机器学习(ML)和多模态。
-
2023.02 - ,
,
.
,
Wenjie Li
.
-
2020.09 - 2022.10,
,
.
.
-
2016.09 - 2020.06,
,
.
-
2023.07 - 2023.08,
,
,
.
-
2022.11 - 2023.10,
,
,
.
-
2021.09 - 2022.04,
,
,
.
accelerate,
AllenNLP,
datasets,
MTEB
.
* .
-
Language Models are Universal Embedders.
Xin Zhang, Zehan Li, Yanzhao Zhang, Dingkun Long, Pengjun Xie, Meishan Zhang, Min Zhang.
arXiv preprint, 2023.
[Github]
-
Towards General Text Embeddings with Multi-stage Contrastive Learning.
Zehan Li, Xin Zhang, Yanzhao Zhang, Dingkun Long, Pengjun Xie, Meishan Zhang.
arXiv preprint, 2023.
[GTE-v1.5 series 🤗]
-
mGTE: Generalized Long-Context Text Representation and Reranking Models for Multilingual Text Retrieval.
Xin Zhang, Yanzhao Zhang, Dingkun Long, Wen Xie, Ziqi Dai, Jialong Tang, Huan Lin, Baosong Yang, Pengjun Xie, Fei Huang, Meishan Zhang, Wenjie Li, Min Zhang.
EMNLP Industry Track, 2024.
[mGTE & mGTE-reranker 🤗]
[Code for GLUE & XTREME-R]
-
Finetuning Language Models for Multimodal Question Answering.
Xin Zhang*, Wen Xie*, Ziqi Dai*, Jun Rao, Haokun Wen, Xuan Luo, Meishan Zhang, Min Zhang.
ACM MM, 2023 (Grand Challenge).
Ranked 1st in both Chinese and English tracks of the VTQA 2023.
-
How Close is ChatGPT to Human Experts? Comparison Corpus, Evaluation, and Detection.
Biyang Guo*, Xin Zhang*, Ziyuan Wang*, Minqi Jiang*, Jinran Nie*, Yuxuan Ding, Jianwei Yue, Yupeng Wu.
Symposium on Large Language Models, IJCAI, 2023 (LLM@IJCAI'23).
[Github]
-
Extending Phrase Grounding with Pronouns in Visual Dialogues.
Panzhong Lu, Xin Zhang, Meishan Zhang, Min Zhang.
EMNLP, 2022 (Long paper).
[Code & Data]
-
Identifying Chinese Opinion Expressions with Extremely-Noisy Crowdsourcing Annotations.
Xin Zhang, Guangwei Xu, Yueheng Sun, Meishan Zhang, Xiaobin Wang, Min Zhang.
ACL, 2022 (Long paper).
[Code & Data]
[Poster]
[Slides]
-
Crowdsourcing Learning as Domain Adaptation: A Case Study on Named Entity Recognition.
Xin Zhang, Guangwei Xu, Yueheng Sun, Meishan Zhang, Pengjun Xie.
ACL, 2021 (Long paper).
[Code]
[Poster]
[Slides]
- AAAI 2023-2024, ACL 2023, COLING 2022/2024, EMNLP 2022-2023, WSDM 2023 (external).
- ACL ARR.
- 2022, .
- 2021, .
- 2020, .
- 2019, (top 30 of 32000+).
- 2019, .