Wenqing Wu
2 papers ยท Latest:
Natural Language Processing
Impact of large language models on peer review opinions from a fine-grained perspective: Evidence from top conference proceedings in AI
This study reveals that LLMs make peer reviews longer and more fluent but reduce focus on deep evaluative aspects like originality.
2604.19578
Natural Language ProcessingNovBench: Evaluating Large Language Models on Academic Paper Novelty Assessment
NovBench is introduced as the first benchmark to evaluate large language models' ability to assess research paper novelty, revealing current LLMs' limitations.
2604.11543
๐ฌ Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week โ summarized, scored, and delivered to your inbox every Monday.