An Introduction to Data Cleaning Using Internet Search Data
Matthew Greenwood-Nimmo, Kalvinder Shields
The Australian Economic Review | John Wiley & Sons | Published : 2017
This article considers the issue of data cleaning. We use state‐level data on internet search activity in the United States to illustrate several common data cleaning tasks, including frequency conversion and data scaling as well as methods for handling sampling uncertainty and accommodating structural breaks and outliers. We emphasise that data cleaning relies on informed judgement and so it is important to maintain transparency through careful documentation of data cleaning procedures.