A recent post by Jessica Hullman (here) served as my most recent periodic reminder about the enduring need for folks (myself included) to take a few, basic, simple steps to become more familiar with the underlying structure of and relationships among critical data points. Tukey's elegant tables and figures notwithstanding, what is perhaps more important is what follows from Tukey's general impulse: Sometimes simple data plots can unlock key analytic insights. And even if such insights aren't necessary for the researcher, readers' (and journal editors') interests should not be taken for granted.
In trying to explain why researchers might ignore--or bypass--simple initial data visualization/familiarization steps, Hullman points to "latent beliefs that plotting data is somehow inferior to anything that feels more like math. And because making plots for the sake of checking what the data look like or discovering things you didn’t expect seems disconnected and even distracting from the ultimate goal of fitting some model." That such beliefs may be entirely understandable, however, does not detract from their potential harm. It remains awfully difficult to overestimate the value of simple, Tukey-inspired data familiarization/visualization efforts.
Comments