« Guest Blogger: Kevin McGuire | Main | Where's the Theory? »

11 September 2006

Comments

Mark A. Graber

May I suggest that we not choose between lumpers and splitters, that we recognize that all forms of social science research, done with reasonable care, are likely to cast some light on judicial phenomenon and that no method in isolation is likely to solve all the serious questions we ask about courts and law. The decision to be a lumper or a splitter, I suspect, is likely to be influenced as much by predispositions (I was always more interested in history than math) and graduate training as by neutral analysis on which method explains the most (the attitudinal model of scholarly dispositions will work very well here). As long as we have lots of lumpers and splitters doing good work, I suspect our knowledge of law and courts will progressively improve.

Sean Wilson

Kevin, you might want to look at my SSRN paper (if you haven't already).

http://papers.ssrn.com/sol3/papers.cfm?abstract_id=922183

The problem with lumping is that the game of inference becomes too hazardous. For example, you can change the dependant-variable data significantly in an ideology model, yet still receive statistically-significant and seemingly robust results. You can make the values for liberalism polarized or squished, and the "lumped" model still seems to "work." Yet, subtract the 10 most extreme of the lumped values, and you lose statistical significance completely. Lumping, therefore, seems to introduce instability into the models and, I think, causes researchers and their audience to become fooled into thinking that they have better results than they really do.

All of this makes me very suspicious of regression with grouped aggregates. I don't know of good methodologists outside of political science who enjoy grouped-aggregate models so much, especially when the data is readily available for analysis at the level at which it is observed. We all know the pitfalls and risks of ecological inference. It makes the job of inference become tricky.

Also, remember that aggregated models only analyze the index of aggregates that is being created by the modeler, not the things that comprise each aggregate. This gets to be tricky when interpreting fit. Once again, see my paper. (If you have already read it or will read it, feel free to email me about your thoughts. I do not think my assertions are in error.)

Regards ... and nice post!

The comments to this entry are closed.

Conferences

May 2025

Sun Mon Tue Wed Thu Fri Sat
        1 2 3
4 5 6 7 8 9 10
11 12 13 14 15 16 17
18 19 20 21 22 23 24
25 26 27 28 29 30 31

Site Meter


Creative Commons License


  • Creative Commons License
Blog powered by Typepad