Conference Proceedings

A Comparative Analysis of Linguistic and Retrieval Diversity in LLM-Generated Search Queries

O Zendel, SFD Al Lawati, L Rashidi, F Scholer, M Sanderson

Cikm 2025 Proceedings of the 34th ACM International Conference on Information and Knowledge Management | ACM | Published : 2025

Abstract

Large Language Models (LLMs) are increasingly used to generate search queries for various Information Retrieval (IR) tasks. However, it remains unclear how these machine-generated queries compare to human-written ones, particularly in terms of diversity and alignment with real user behavior. This paper presents an empirical comparison of LLM- and human-generated queries across multiple dimensions, including lexical diversity, linguistic variation, and retrieval effectiveness. We analyze queries produced by several LLMs and compare them with human queries from two datasets collected five years apart. Our findings show that while LLMs can generate diverse queries, their patterns differ from th..

View full abstract

University of Melbourne Researchers