
Tuesday, July 2, 2013

Falsifiability


Minimalism induces falsifiability anxiety in otherwise unflappable people.  I noticed this while auditing a class here at the LSA summer institute (one of the undeniable perks of being on the faculty here): how could one show that Minimalism was false? Is there anything that would show that it is incorrect?  I’m not sure there is, but then I’m not sure that any interesting scientific proposal can be shown to be false. Let me explain.

Scientific theories are very complex objects. Even as regards more mature sciences like physics and chemistry, philosophers of science have long recognized that Popper’s simple version of falsificationism is an inadequate methodological credo. What makes theories hard to refute? Well, mainly the fact that there is quite a distance between the central concepts that animate a theory, the particular models that incarnate them, and the “facts” that test them. Lakatos talked about central belts versus auxiliary hypotheses (Cartwright, a favorite of mine on these topics, has good descriptions of how elaborate the testing process in physics can be and how wide the gap is between theory and experiment; see here), but by now pretty much any account of how the rubber of theory hits the road of experiment highlights the subtle complexities that allow them to make contact. This said, scientists can and do find evidence against particular models (specific combos of theory, auxiliary hypotheses, and experimental set-ups), but how this bears on the higher level theory is a tricky affair precisely because it is never the theory of interest alone that confronts the empirical jury. In other words, when something goes wrong (i.e. when an experiment delivers up evidence that is contrary to the deductions of the model) it is almost always possible to save the day by tinkering with the auxiliary hypotheses (or the details of the experimental set-up, or the right description of the “facts”) and leave the basic theory intact.

The recent discovery of the Higgs particle offers a fair illustration of this logic. There was some discussion before the fact of where (i.e. which energy range) to look to find the Higgs. The one discovered was in one of the lower possible energy ranges. Say the Cernistas had not found anything where they first looked. Would this have falsified the Standard Theory? Nope, there were a whole bunch of other candidates to explore (some at energies that the facility would have strained to achieve). And say that even these proved to be duds, what would have been the rational strategy? Dump the Standard Theory and assume that it was completely off track? Maybe for the young and the daring hoping to make their mark in the world, but my hunch is that this would have been chalked up as a puzzle to be explored and explained away until something better came along, rather than as an indication that the whole edifice is rotten and must be thrown out wholesale. Why? Because the reason that people adopt theories is that they do work, and the work that the Standard Theory did, even had the Higgs been left undiscovered, would still remain. Yes, there would be problems, and yes it would be nice to explain these “anomalies” away, but the theory would not have been dumped. There would likely have been ad hoc patches proposed to salve the disappointment and allow work to continue apace. As Hilary Putnam once observed, ‘ad hoc’ does mean ‘to the point’, and a nice clean local fix would have served quite nicely, I am sure.

So does this mean that theories are not regulated by “the facts”? No. Modulo all the caveats about how facts need massaging to be relevant targets of explanation, empirical success of course plays a role, but the role is not that of falsification. Rather, the facts serve a useful function when they are recruited to distinguish otherwise viable alternatives. And this is where falsificationism really misleads. I don’t know about you, but I find it hard to take most of my concocted explanations seriously because they are so obviously inadequate from the get-go. In other words, a candidate theory’s main problem initially concerns not falsification but verification. The pressing and relevant question is not whether there is counter-evidence but whether there is any interesting evidence in its favor! Most theories plop stillborn from the mind. Only a few are worth taking seriously. What makes them worth taking seriously? There are interesting facts they would explain were they true (note the ‘interesting’ here: some facts really are more interesting than others, but this is for another time). The first part of any sane research strategy is to find places where the account works, for when one starts there are all too many indications of failure all too quickly. One therefore needs reasons for taking the hypothesis seriously, and the most immediate concern is not whether there are problems with one’s account (of course there are) but what it would buy you were the theory (roughly) on the right track. So, the very first thing one does (again, if one is sane) is to find factual life support for one’s tender creation, nurturing it via verification and looking for evidence in its favor. In other words, unless one is an enthusiastic masochist, the last thing one does in the initial stages of theory development is look for reasons to discard one’s newborn proposal.

Does this mean that looking for contrary evidence is unimportant? No. It is important, but mainly in service of verification. Here’s what I mean. The best kind of evidence in favor of a proposal is the verification of a counterintuitive prediction (especially one that is problematic given current assumptions). So, for example, a very strong argument in favor of Copernicus’s account of a heliocentric solar system is that it predicts the possibility of retrograde planetary motion (viz. that planets, rather than moving smoothly forward around the night sky, would look like they reversed gear for a while before reversing gear again and moving forward). If Copernicus was right (as we now think he was), then were you to calculate planetary motion using Earth as the center, what you would expect to find is the appearance of retrograde motion. Moreover, this motion would “disappear” once one did the calculations using the Sun as center. So rather than being a problem, as it was for the Ptolemaic conceptions, apparent (when viewed from Earth) retrograde planetary motion was predicted. This served to corral an otherwise rather unpleasant anomaly and so was strong evidence in favor of Copernicus’s theory.
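
To see how directly the geometry delivers this prediction, here is a small simulation sketch (my own illustration, obviously not anything from the historical record): circular, coplanar orbits with rough radii and periods for Earth and Mars, tracked in Python.

    import math

    # Rough circular-orbit values: radius (AU) and period (years).
    R_EARTH, T_EARTH = 1.0, 1.0
    R_MARS, T_MARS = 1.52, 1.88

    def position(radius, period, t):
        """Heliocentric position at time t (in years) on a circular orbit."""
        angle = 2 * math.pi * t / period
        return radius * math.cos(angle), radius * math.sin(angle)

    def geocentric_longitude(t):
        """Apparent direction from Earth to Mars against the fixed stars."""
        ex, ey = position(R_EARTH, T_EARTH, t)
        mx, my = position(R_MARS, T_MARS, t)
        return math.atan2(my - ey, mx - ex)

    dt, retro, prev = 0.002, False, geocentric_longitude(0.0)
    for step in range(1, 1501):  # about three years of simulated time
        t = step * dt
        lon = geocentric_longitude(t)
        # unwrap the angular step into (-pi, pi); a negative step is apparent backward motion
        dlon = (lon - prev + math.pi) % (2 * math.pi) - math.pi
        if (dlon < 0) != retro:
            retro = dlon < 0
            print(f"t = {t:5.2f} yr: Mars {'enters' if retro else 'exits'} retrograde")
        prev = lon

Purely heliocentric inputs produce retrograde intervals roughly every 26 months (the Earth-Mars synodic period), which is just the pattern the geocentric observer reports.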

Other examples abound: bending light rays, the perihelion of Mercury, shrinking rulers and slowing clocks, quantum tunneling, “spooky” action at a distance (aka entanglement), the tides, colors in white light, backwards control (hehe!!) a.o. So, yes, looking for empirical trouble is part and parcel of the good theorist’s armamentarium, but mainly in service of finding strong verification. The strongest evidence in favor of an account lies with the surprising (counterintuitive) predictions it makes that turn out to hold. That’s the main reason to go falsificationist and chase potential heartbreak! It’s strategic: new theories need to gain a hearing and the best way to do this is to find a wild unexpected prediction that pans out. So, the smart theorist looks for ways to falsify her account in order to find those that pan out. In other words, the big game here is not the false results, but the predictions that work. If any do, then the theory has earned the right to be taken seriously and then the next stage of serious work begins.

With this as background, let’s return to minimalist syntax.

As with any other theory, minimalism has leading ideas and executions of them. Chomsky likes to talk of the Minimalist Program. The way I see this is as a series of basic conceptions (merge, Probe-Goal, phase locality, minimality, Extension, etc.) that can be packaged in different ways to produce varying minimalist theories or models (a personal favorite e.g.: one can think of control as a Probe-Goal effect with PRO the goal of a higher functional probe, or one can think of PRO as the trace-like residue of internal merge). These theories are then explored by finding how well they fit the “established” facts (e.g. re control: do they derive the distribution and interpretation of control sentences) and what novel predictions (the more surprising the better) they make (e.g. do they allow for the possibility of backwards control). Models will accrete successes and failures and will be judged over a certain period, winning fans and detractors. The success of the program will be a function of the theoretical and empirical suasive powers of the particular theories. None of this is novel to linguistics, nor should it be.

How then does a theory fail, linguistic or otherwise? Actually, mainly by running out of steam. Boredom is more deadly than a couple of false data points. Theories can run out of explanatory steam or, worse, never really develop any. Such theories and their attendant programs are abandoned. So there is something worse than being wrong, at least if you are a theory, and that’s being BORING! If correct, this has a useful practical consequence: the Minimalist Program has a very long and bright future, for boring it ain’t!

40 comments:

  1. Truly fascinating, I consider myself privileged to witness the long awaited fifth stage in philosophy of science. After Popper's falsificationism, Kuhn's paradigm shifting, Lakatos' 'research programming', and Feyerabend's "Against Method" we now have Hornstein's "Against Boredom". I shall return to this great achievement momentarily.

    First a couple of dull observations:

    1. Scientific theories are complex beasts and much of what is predicted to exist is not that easily confirmed [if it were we would not need theories, we'd just see the stuff, like Higgs particles or planetary orbits]. So it's okay if a theory predicts a lot for which we have no empirical confirmation yet, and if there are phenomena that seem to contradict the theory - just be patient and listen to the wise theorizer.

    2. Don't look for falsification but for confirmation. This is an important advance from Chomsky's criterion for progress: “Suppose that counterevidence is discovered as we should expect and as we should in fact hope, since precisely this eventuality will offer the possibility of a deeper understanding of the real principles involved” (Chomsky, 1982, 76). Glad to learn we have moved on from this...

    Now let's return to the science trail-blazing boredom criterion. I know some excellent linguists who find it quite boring to talk about 'biological foundations of language' and think work on such topics should be left to professional biologists or psychologists. At least some [maybe all] biolinguists disagree. They do not think work on [or even better theorizing about] the biological language faculty is boring. And, at least some of them seem to consider boring "the whole mass of data that interests the linguist who wants to work on a particular language" (Chomsky, 2012, 84) and just want to abstract away from it.

    Now, someone as unimaginative as myself wonders: who gets to decide what is and is not boring? Or is it up to every linguist to study what s/he considers not boring, regardless of what 'the rest of the field' does? And if 'false data points' do not matter much, why would one get all excited when some people make claims about recursion one finds disagreeable? Even write papers entitled 'Recursive misrepresentations'? The tone sounded quite furious to me, but I missed entirely that Levinson [2013] was accused of being BORING....

    In fact he was accused of such trivial things as not getting the facts right: “Far be it from us to condemn speculation in linguistics. … We do believe, however, that a speculation … if advanced on the basis of misrepresentations, mischaracterizations and confusion about basic issues, is not off to a good start” (Legate et al., 2013, 12)

    So: is there more besides "Against Boredom" that you have not been telling us?

    1. In your remarks about Legate et al. (2013), I think you've managed to confuse the proposition "these data falsify the theory" with the proposition "these data are false". The former is the topic of Norbert's posting, the latter is not.

    2. Thank you for the kind suggestion. However, I think you under-appreciate the revolutionary character of Norbert's proposal. He sums it up nicely in the final paragraph:

      How then does a theory fail, linguistic or otherwise? Actually, mainly by running out of steam. Boredom is more deadly than a couple of false data points. Theories can run out of explanatory steam or, worse, never really develop any. Such theories and their attendant programs are abandoned. So there is something worse than being wrong, at least if you are a theory, and that’s being BORING!

      A theory, even a false one, is okay, as long as it is not BORING. If this is so, then the distinction between the proposition "these data falsify the theory" and the proposition "these data are false" evaporates. We embrace false theories, so we need not worry anymore about data that potentially falsify the theory, far less about data that are merely false. Applied to Levinson [2013]: If Levinson has a non-boring theory, even a false one, who cares if his data are wrong?

      I am a bit surprised you would still defend the outdated distinction you make. Chomsky has advocated the Galilean style for over a decade. Clearly, that style is based on Feyerabend's methodological anarchism. We don't just haphazardly replace a method here and there – when it comes to method, the new slogan is "anything goes". And now, after the Hornsteinian revolution, we embrace full-blown data-anarchism: as long as it fights boredom anything goes.

    3. Uh no, that's not what Norbert wrote. But whatever.

    4. Actually "that's" a direct quote:

      "How then does a theory fail, linguistic or otherwise? Actually, mainly by running out of steam. Boredom is more deadly than a couple of false data points. Theories can run out of explanatory steam or, worse, never really develop any. Such theories and their attendant programs are abandoned. So there is something worse than being wrong, at least if you are a theory, and that’s being BORING!"

      So it IS exactly what Norbert said, caps, exclamation mark, and all.

      Maybe you are suggesting "Against Boredom" requires the kind of creative citation style Chomsky is so skilled at? Like his famous claim about a passage in Terry Deacon's (1997) "The symbolic species", a book containing a 100+ page section "Brain": "Whatever the meaning may be, the conclusion seems to be that it is an error to investigate the brain” [Chomsky, 2002, p. 83]. This astounding claim alone catapults "On Nature and Language" right off any 'Anti-Boredom' meter, because all the detailed brain science stuff Deacon was going on and on about was sure to fool his readers and it took a true genius to suggest it was all just a decoy...

    5. Try to replace "boring" with "not inspiring".

    6. Christina, I'm afraid that your theory fails to meet Norbert's boredom criterion.

    7. Oh, finally we are getting somewhere: Norbert is the one who gets to decide what is and isn't boring. What took you so long to answer a question I asked days ago?

      And, BTW, I never proposed any general "my theory" - so obviously "my theory" cannot meet Norbert's [or anyone's] criteria. There may be some detailed and specific theories that I would defend, but... [It took a while but the rest of us have finally caught up with this ingenious Chomskyan arguing device]

    8. "boring" means "can't provide further insights" hence "the running out of steam". Such theories should be abandoned. What exactly is faulty here? You are beating up a strawman.

    9. But Christina, even your "direct quote" doesn't say what you claim it says.

    10. reply to David:

      Forgive me but the referent of your 'that's' was not clear. Now that we are clear that you claim what I wrote AFTER quoting Norbert does not follow, let me reply: It is quite possible that the conclusions I showed to follow when one takes Norbert seriously were not what Norbert had intended. In case you suggest my reasoning was faulty please show exactly where I deviated from what follows logically from the direct quote.

      It will be helpful to ALSO show how what Chomsky says below follows from the direct quote by Margaret Boden [which as we both know he took out of context]

      "To begin with, Boden does not seem to comprehend the terms she uses. Thus she refers repeatedly to my "postulation of universal grammar" (UG) and writes "What universal grammar will turn out to be -- if it exists at all -- is still unclear." UG is the term that has been used for many decades to refer to the theory of the genetic component of the human language faculty, whatever it will turn out to be .... To question the existence of UG, as she does, is to take one of two positions: (1) there is no genetic component; (2) there is one, but there is no theory of it. We can presumably dismiss (2), so Boden is left with (1). She is therefore questioning the existence of a genetic factor that played a role in my granddaughter's having reflexively identified some part of the data to which she was exposed as language-related, and then proceeding to acquire knowledge of a language, while her pet kitten (chimp, songbird, etc.), with exactly the same experience, can never even take the first step, let alone the following ones. It is either a miracle, or there is a genetic factor involved. Boden's suggestion -- presumably unwitting -- is that it may be a miracle." [Chomsky, 2007]

      Can you enlighten us where Boden denies there are genetic differences between Chomsky's grand-daughter and kittens, songbirds, and chimps?

    11. I would like to thank everyone who offered advice re terms I could use as replacement for 'boredom'. That was very kind of you but misses entirely the point I am making [apparently only David 'clued in'].

      See, when generativists like Legate et al. criticize non-generativists like Levinson it is not because Levinson's theory is boring or 'not inspiring' [it certainly seems to inspire him] or 'has run out of steam' [seems he is just getting started]. What Legate et al. criticize is that [according to them] Levinson's 'speculation' is "advanced on the basis of misrepresentations, mischaracterizations and confusion about basic issues" [p.12]. THAT is what makes it worthy of criticism. [For the record: I fully agree, IF Levinson is guilty as charged that IS a bad thing].

      Now some of you may not have read all the wonderful posts Norbert has provided over the past 10 months. I encourage you to check out October 16, 2012. Here he educates us how the science game is played: http://facultyoflanguage.blogspot.ca/2012/10/how-to-play-game.html

      Among other things Norbert tells us that when you are a SCIENTIST, S, and want to convince us that some theory T [which accounts for phenomenon P] is wrong, "if you want to play the explanation game, the “science” game, then you are obliged ... to explain why you think the assumptions are faulty and (usually, though there are some exceptions) you are obliged to offer an (at least sketchy) non-trivial question begging account of P. S cannot simply note that s/he thinks that T is really really wrong, or that T is unappealing and makes her/him feel ill"

      Being boring probably would be the kind of thing that makes Norbert feel ill. But, on October 16, he said that's not enough to toss out a theory. And he cites David Adger who is slamming Tomasello for not playing the science game:

      "…CxG [Construction Grammar, NH] proponents have to provide a theory of how learning takes place so as to give rise to a constructional hierarchy, but even book length studies on this, such as Tomasello (2003), provide no theory beyond analogy combined with vague pragmatic principles."

      This criticism has nothing to do with 'boring' or ‘running out of steam’ [the Tomasello lab easily outlasts an army of energizer bunnies]. The allegation is that no scientific theory is offered, just analogies and vague pragmatic principles.

      So what my concern boils down to is this: Do generativists apply two different standards? I would hope not, but from studying Norbert’s blog it seems that generativists' work is good as long as it meets Norbert's "Against Boredom" criterion, while the theories of others are judged by "Play by the rules of the Science game".

    12. @Sveid:

      This is a reply to Sveid who says:
      ""boring" means "can't provide further insights" hence "the running out of steam". Such theories should be abandoned. What exactly is faulty here?"

      I don't really see this line of argument as correct. Theories should be abandoned if they are "false". So for example in maths (admittedly not an empirical science) people abandon the study of particular mathematical areas because they no longer find them interesting or important, but the theory developed up to that point is still correct. Similarly in empirical science, we may abandon the study of, say, a particular organism, e.g. smallpox, for various reasons like nobody having smallpox any more, but that doesn't mean that the facts that we have discovered up to that point are now false.
      So we all agree with that, but
      Norbert's point is something completely different, and more controversial. He means "abandon" in the sense of "now think to be false". And that argument just seems wrong (I guess he is appealing to Lakatosian degenerating research programmes, which is a different argument).

    13. Norbert meant 'abandoned' when he wrote 'abandoned.' People stop working on (investigating the properties of, developing refinements to, testing the consequences of) theories that they've milked so much that they no longer find them interesting. The return in insight is not worth the effort anymore. In my view, this is what happened with GB, for example: it ran out of steam. Now, as you know, I DO NOT think that this means that GB is/was false. That's too crude. Indeed, I think that GB was roughly right; however, it is not fundamental. It is a good "effective theory" in search of a more fundamental one. But abandoned it has been by theorists. And I can understand why: it stopped yielding insight and its questions were no longer tantalizing. Furthermore, when a new theory beckons, the first thing you do is NOT look for flaws. In this context falsification is a silly strategy. In fact, it is a silly strategy until the theory is pretty mature (and about then it is becoming boring), and not one that I think that any sane person pursues. The Popperazzi (Leonard Susskind's term) beg to differ. Falsification is their holy grail. I disagree; in many (most) contexts it is a form of abuse (generally directed at others' proposals, I have found).

      Last point: it's a throw away line to say that one is "pursuing truth." Duh. Sadly, however, we can't know truth directly. So we look for the MARKS of truth. Some think that the main mark of truth is covering (yet more) data points. Some think it's more complicated than that. I am in the latter camp and one of the more salient marks of truth, one of the features that make a proposal worth pursuing and developing, is that it is interesting (a context sensitive value) aka not boring. This is partly a matter of taste, I've found, for damn it if some individuals aren't drawn to boring like moths to a flame. But taste, unlike technique, cannot be taught and that's too bad. However, 'interesting' generally means provides explanatory insight. Theories do run out of this and are for that reason abandoned. Does that mean they are false? No. They are boring and not worth further effort.

    14. Thank you for the clarification, Norbert. I have just one question. You write:

      "The Popperazzi (Leonard Suskind's term) beg to differ. Falsification is their holy grail. I disagree, in many (most) contexts it is a form of abuse (generally directed at other's proposals I have found)."

      This is a pretty serious accusation. Can you offer a few examples of linguists who have directed the 'holy grail of falsifiability' in an abusive manner at others' proposals? You speak of many contexts so let's say 5 examples. Thanks.

    15. "Norbert meant 'abandoned' when he wrote 'abandoned.'"

      I was actually scratching my head wondering where exactly did you say or imply anything other than that. It's a *ahem* frequent phenomenon these days.

    16. Sorry for the misinterpretation, I clearly did get the wrong end of the stick. But now I am more confused. Let me try again shortly once my ideas are straight.

    17. reply to Alex: It is not surprising that you would be confused. Just this year Chomsky himself stated that the study of language should keep to the standard norms of science:

      "In recent years, work on these topics has often been called ‘‘the minimalist program (MP).’’ The term has been misunderstood. The program is simply a continuation of the efforts from the origins of the generative enterprise to reduce the postulated richness of UG, to discover its actual nature (see Freidin and Vergnaud, 2001). The literature contains many criticisms of the MP, including alleged refutations, charges that it is not truly minimalist, and so on. None of this makes any sense. Research programs are useful or not, but they are not true or false. The program might be premature, it might be badly executed, but it is hard to see how it could be fundamentally misguided, since it hardly goes beyond holding that the study of language should keep to standard norms of science." [Chomsky, 2013, 38].

      Of course most of us consider falsifiability of proposals to be one of these standard norms. Norbert calls [the demand for] falsification a form of abuse. So minimalist standard norms seem to differ from what the rest of us think the standard norms are.

    18. I guess the distinction that confused me is between the sociological question of how theories actually change in linguistics and the normative question of how they should change so that they lead to theories that are ultimately correct. So sociologically, in linguistics, it is correct that people moved away from GB not because of empirical problems but because of other sociological factors ('boredom', say).
      But, I am reminded that this does not mean that GB is false, as abandoning a theory is not the same as thinking it is false.

      But I don't understand the relationship in this argument between being correct and being interesting. So if we are interested in finding correct theories (I am) then we need some argument that correct theories are interesting, but of course there isn't (and can't be) one, since whether a theory is correct or not is ultimately an objective fact whereas whether it is interesting is a matter of taste.
      Norbert argues it both ways, that being interesting is a salient mark of truth, but also that being boring doesn't mean that it is false.

      Anyway I see Norbert's tongue firmly in his cheek here..
      and I agree with the attack on naive falsificationism even if I don't buy this particular flavour of Lakatosian analysis.

    19. I think in the post Norbert made a stronger claim: "So there is something worse than being wrong, at least if you are a theory, and that’s being BORING!"

      I read this as meaning that if I have the choice between 2 theories, one Wrong [W] and the other Boring [B], I should go with W [and abandon B]. Now B seems neutral between W and T [true], but if you add what Norbert said later, it would seem he had mainly true but boring theories in mind for B.

      As far as the relationship between being correct and being interesting is concerned - you're not the only one having a difficult time with that one. Apparently at least some leading linguists do as well:

      "... it is somewhat puzzling that as scientists we would have a serious notion of what would be more interesting than the truth. For instance, it would definitely be more interesting to discover that the moon is made almost entirely of green cheese than that it is made of rock and dust, especially given that it looks like it is made of rocks and dust, and the samples that have been brought back are—rocks and dust! It would be more interesting to learn that pigs cannot fly because their wings are made of an invisible substance that is too insubstantial to support their weight, rather than that they simply lack the anatomical and physiological wherewithal in the first place. ..... But granting that the less interesting explanations are the right ones, scientists do not give up the good fight and turn to other pursuits. Why should linguists?" [Culicover, 2004, 134]

      I think Peter asks an excellent question: just WHY should linguists give up on "the good fight"? Now I could not agree more with Alex D. who remarked yesterday: "There's no point is linguists/philosophers discussing string theory on a linguistics blog. None of us have anything to say about it." So, it would be great if the answer [if one is forthcoming] would focus on linguistic considerations alone.

  2. I wonder, is the problem with falsifiability really that "scientific theories are complex objects", or more a conflict between Bayesian epistemology (i.e., that one can observe evidence, but not truth) and Popper's (to me, rather incoherent) stance that scientific data is somehow equivalent to truth? If you buy this (and I'd love to believe you are a dyed-in-the-wool Bayesian!), then the idea that scientists should be concerned with "confirming" rather than "falsifying" theories is a somewhat strange one to try to make. From the Bayesian perspective, evidence always is "confirming" one set of theories and "falsifying" others since evidence can only be interpreted in light of theories. In fact, your invocation of "surprisal" (a crypto-Bayesian term for likelihood if I've ever heard one;)) seems to suggest that you yourself are interested in "falsification"-- of competing theories (to be clear: my reading of "surprisal" is "observing an event that should have extremely low probability *according to some theory*").

  3. I have no problem with the basics of Bayes (I started life in the Columbia University Philo dept and was influenced on these topics by Isaac Levi (cf. his Gambling With Truth)). What I find off about the Bayesian approach is the idealization. Here's what I mean. Phenomenologically, one is not pitting theories in a well defined space of options against one another. Rather the space itself is very patchy and one is using data to, as it were, construct the space of alternatives. The hard problem is knowing what to compare, not having options and using the data to sift them apart. There is no method for this process of constructing the space of relevant options, not even a Bayesian one (which reduces this very inchoate process to a far too mechanical procedure). Bayes works well when the alternatives are demarcated. It strikes me as missing the crux of the epistemological problems as it starts by assuming away the hard part: what's worth taking seriously? What do the real alternatives look like? This is the hard problem, and this is where falsificationism misleads. So, in the ideal circumstance, Bayes is fine (at least for me); however, we are almost always far away from the ideal when one is on the frontiers of research and so the Bayesian dicta only apply very very loosely. This said, sure, surprisal reflects the obvious pre-Bayesian idea that evidence is strongest when unexpected.
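
    To put toy numbers on that last point (the figures are invented purely for illustration): in a minimal two-hypothesis Bayes setup, the same confirmed prediction moves the posterior a lot when it is surprising on the rival view, and barely at all when it is expected anyway.

        import math

        def posterior(prior_T, p_e_given_T, p_e_given_rival):
            """P(T | e) for two exhaustive hypotheses, T and a rival."""
            p_e = prior_T * p_e_given_T + (1 - prior_T) * p_e_given_rival
            return prior_T * p_e_given_T / p_e

        # Same prior (0.1) and same fit to the evidence (0.95); only the
        # evidence's probability under the rival hypothesis differs.
        print(posterior(0.1, 0.95, 0.05))  # ~0.68: surprising evidence helps a lot
        print(posterior(0.1, 0.95, 0.90))  # ~0.10: expected evidence barely moves T
        # Surprisal of the evidence under the rival, in bits:
        print(-math.log2(0.05), -math.log2(0.90))  # ~4.32 vs ~0.15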

  4. Just a tiny clarification. Lakatos's contrast was between the "hard core" of the research program and the "protective belt" of the auxiliary hypotheses. The idea being (as you say) that in response to recalcitrant data you could (almost) always change your auxiliary hypotheses rather than give up a core tenet.

    1. Indeed, but Lakatos has of course been criticized by Feyerabend:

      "Lakatos realized and admitted that the existing standards of rationality, standards of logic included, were too restrictive and would have hindered science had they been applied with determination. He therefore permitted the scientist to violate them (he admits that science is not "rational" in the sense of these standards). However, he demanded that research programmes show certain features in the long run — they must be progressive.... I have argued that this demand no longer restricts scientific practice. Any development agrees with it" .[Feyerabend, 1978]

      That seems more in the spirit that Norbert promotes in his post, as long as it meets the 'not boring' criterion.

  5. This is a somewhat-vague hunch, so I'm very willing to be told that I'm wrong, but it seems to me that the distinction between the "hard core" of a research program and the "protective belt" of auxiliary hypotheses might have something to do with the frequent disagreement about the role of formalisation. I usually find myself somewhere in between the two positions that are staked out in that debate (in the instantiation on this blog a few weeks ago, roughly Alex C. on one side and Norbert and David P. on the other).

    I don't want to put words into anyone's mouth, but my understanding is that what the pro-formalisation side often means to encourage is the practice of formalising particular combinations of core-idea-plus-auxiliary-hypotheses, and I wonder if perhaps those who disagree see this as eliminating an important distinction. In particular, one might worry that the auxiliary hypotheses will come to be seen as part of the core, and that some counter-example will be perceived as a strike against a core idea (e.g. that natural languages involve movement) when actually it really only counts against a particular auxiliary hypothesis (e.g. about exactly what the target position of wh-movement is, or whatever). I notice, for example, that Norbert uses the term "theory" to mean the core and uses the term "model" for core-plus-auxiliary-hypotheses, and on this usage perhaps a pro-formalisation person who asks for "formalised theories" rather than "formalised models" could give the impression that everything in the formalisation is to be considered part of the core, so that everything in the formalisation should stand or fall together as one monolithic object. But I don't think that is what the pro-formalisation side really means to suggest.

    It's true that it's usually not explicitly "written into" a formal system which pieces of mathematical machinery comprise the core and which parts comprise the auxiliary hypotheses, and this does sometimes seem to worry those who respond to the pro-formalisation argument. But it's still generally possible to work out, when an incorrect prediction emerges, whether it's a core idea or an auxiliary hypothesis which is "at fault". In other words, when you put together a new formalised system of core-idea-plus-auxiliary-hypotheses in order to try to accommodate new facts, it's usually easy to tell whether you've made a change to the core idea or merely to some of the auxiliary hypotheses. So in a sense it is true that formalisation can eliminate the distinction between core idea and auxiliary hypotheses, in that a particular set of equations or whatever doesn't itself draw the dividing line, but this doesn't prevent the scientist from maintaining that distinction, and responding in the appropriately subtle not-naive-falsificationist ways when the formalised system reveals an incorrect prediction. The ideas we formalise needn't be only those core ones to which we are relatively strongly committed.

    Also, there's obviously a difference between
    (a) thinking that a theory should be abandoned as soon as it is falsified (by a single observation), and
    (b) thinking that a theory is more valuable (all else being equal) if it meets the criterion of falsifiability.
    The "pro-formalisation" argument has nothing at all to do with (a), as far as I can tell.
    It does require that we accept (b), I think (or at least, the argument is stronger if we accept (b)). But (b) does seem to be generally accepted among "Chomskyan linguists" (e.g. I think that's what Chomsky is getting at when he says that we should hope to find counter-examples).

    To end speculatively and provocatively (and optimistically perhaps): is there any hope that some aspects of the formalisation debate might be resolved by clarifications of how the two sides treat the distinction between core ideas and auxiliary assumptions?

    1. So say we have a class of grammars / theory of grammar G and a particular grammar E for English. Then presumably the core would be the class of grammars, and the auxiliary would be the particular grammar E. And if fully formalised one could say that the core+aux would be falsified by demonstrating that the grammar makes the wrong predictions about some particular English sentence. But this obviously wouldn't falsify the core. And if the core is not formalised, then it is hard to see how it could be falsified, in this case. So the combination of a lack of formalisation plus an interest in the deeper problems does seem to lead inevitably to theories which are not falsifiable.

  6. Maybe. I think, however, the bigger problem/difference is that some of us don't think that it is very hard to find evidence against proposals even given the low levels of formalization. Alex made this point and I agree. As I said sometime earlier, I think that people mistake the value of formalization. It lies not in getting theories to be more falsifiable, but in better understanding how basic concepts interrelate. This is a BIG plus when it is doable. Formalization per se does not advance falsifiability as it is already all too easy.

    Yes, if a theory makes no in-principle falsifiable claims, it's not good. But I know of almost no theories of this kind within the linguistics that I follow. They are not only falsifiable but have been falsified, if what we mean by this is that they are either incomplete or in apparent contradiction with well-accepted data. It's for this reason that I don't find the idea all that useful. It fails to engage what people actually do.

    1. When you say it's "already all too easy" to falsify a theory, do you mean that it's already easy to find some facts that any new proposal doesn't account for (i.e. there remain some "unsolved problems" that plague basically all theories), or that it's already easy to find facts that falsify new proposal X but which do not falsify existing alternative theory Y that X is competing against?

      In the first sense, I of course agree: no theory on the table at the moment is entirely descriptively adequate. But in the second sense, I am not so sure that it's always "all too easy". Mostly I suppose I have in mind cases where it seems that a "new" proposal is simply a notational variant of an existing proposal, which is not unheard of, and this seems like a situation where formalisation could really help.

    2. I think I meant the first. But, at least at first blush, it is easy to find problems with a new proposal that do not *seem* (at first blush) to be problems for an older one. I have found that this is often not the case, that indeed the older account often "gets the facts" in no more elegant or principled a fashion than the newcomer. Also, as you say, it may take time to sort out that we are dealing with notational variants. However, what I meant was the first. Does it matter?

    3. I guess I was just trying to warn against inferring, solely from the abundance of the unsolved-problem kind of falsifiability, that there is plenty of "relative falsifiability", i.e. substantive differences among competing theories. My own feeling is that formalisation probably has more to offer in clarifying the substantive differences between one theory and its near-neighbours than in bringing out the big unsolved-problem kind of issues.

      For example (getting back to one of my favourite pet issues, which you and I have discussed at great length), I think the differences between a "copy theory of movement", or a theory with multidominance structures, or a theory with traces, are sometimes overstated. I'd argue that nothing at all follows from the switch from traces to copies in and of itself, although I suspect that this is not universally accepted and that formalisation could perhaps help to resolve this disagreement.

    4. Agreed. I also think that the best place for formalization issues lies in investigating the underlying structure of the basic concepts. We used to do this a lot in philosophy and it helped clarify what you had in mind. I think there is a nice place for this in syntactic theory as well, as you know. So we seem, once again, to be on the same page. Yay!!!

  7. The problem with this entire discussion is that, again, it is void of any specific examples that illustrate what is asserted. This technique [invented by Chomsky a long time ago and perfected over the years] makes it possible to evade any criticism of one's views/proposals. David can claim "that's not what Norbert said" but when asked where I went wrong he refused to answer. I have asked repeatedly for examples of frivolous requests for formalization but none is forthcoming. So let me provide a specific example to illustrate what non-minimalists are concerned about. I know Norbert will not like this example - sorry about that, but then by now he has had ample time to provide his own:

    Proposal 1 [P1]

    FLN, “is the abstract linguistic computational system alone, independent of the other systems with which it interacts and interfaces...The CORE property of FLN is recursion... it takes a finite set of elements and yields a potentially infinite array of discrete expressions” (Hauser, Chomsky, & Fitch, 2002, p. 1571, my emphasis).

    Proposal 2 [P2]
    Fitch, Hauser and Chomsky (2005) argue, “the putative absence of obvious recursion in one of [the human] languages ... does not affect the argument that recursion is part of the human language faculty [because] ...our language faculty provides us with a toolkit for building languages, but not all languages use all the tools” (pp. 203-204), and they suggest that “the contents of FLN ... could possibly be empty, if empirical findings showed that none of the mechanisms involved are uniquely human or unique to language, and that only the way they are integrated is specific to human language” (Ibid., p. 181).

    P1 is a scientific proposal that can be falsified by empirical data. So when Everett [2005] came along claiming Piraha has no recursion, defenders of P1 had two options:
    [i] accept Everett's empirical findings and abandon P1 given that its CORE property claim had been falsified [and come up with P3 to account for new data];
    [ii] show that Everett had made a mistake and that Piraha in fact has recursion.

    We all know there was a good deal of [ii] going on, and if Minimalists had left it at that we would still be doing science. But they did not and proposed P2. A core property was demoted to be 'one tool among others' and it was asserted that FLN can be empty. So P2 is no longer falsifiable by ANY empirical data anyone possibly could find. It asserts that even core properties do not have to be present in language L, and that FLN exists AND that it possibly can be empty. So no matter what anyone finds - P2 is unfalsifiable.

    Of course I COULD BE wrong. P2 could be falsifiable. But if so it would be of great help to show here and now HOW P2 could be falsified [formalized or not].

    1. if so it would be of great help to show here and now HOW P2 could be falsified

      P2 is your own construction, a collection of frankenquotes from random parts of Hauser, Chomsky & Fitch's various papers. However, to briefly reply to its disparate bits:

      The proposal that a particular language L might have Chomsky's rule of Merge without its recursive step is eminently falsifiable. Sentences in such a language would have maximally two words. See our paper, footnote 11, where this is discussed. Chomsky's Merge is binary, combining two elements at a time. If your favorite syntactic theory allows more than two elements to combine to form a syntactic constituent, then the proposal that this rule lacks the recursive step in some language L would not limit sentences to length 2, but the proposal would still be falsifiable insofar as more or less every test for constituency from any standard syntax textbook should fail in L.
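
      To make the arithmetic concrete, here is a toy sketch (an illustration only, not anything from the papers under discussion): if binary Merge cannot reapply to its own outputs, it combines words only, and no derivable expression exceeds two words; a single attested three-word sentence refutes the no-recursion hypothesis for that language.

          from itertools import product

          # Toy binary Merge over a tiny lexicon (illustration only).
          LEXICON = ["the", "dog", "barks"]

          def merge(a, b):
              """Combine exactly two syntactic objects."""
              return (a, b)

          def length(expr):
              """Number of words in an expression."""
              return 1 if isinstance(expr, str) else length(expr[0]) + length(expr[1])

          # Without the recursive step, Merge takes only lexical items as inputs:
          outputs = [merge(a, b) for a, b in product(LEXICON, repeat=2)]
          print(max(length(e) for e in outputs))  # 2 -- the hard ceiling

          # With the recursive step, outputs feed back in and the ceiling vanishes:
          print(length(merge(merge("the", "dog"), "barks")))  # 3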

      The proposal that the language faculty as a whole makes options available that not all languages use is a truism of linguistics (it's what we spend much of our time figuring out, in fact), and among your omitted bits is a set of examples provided by Fitch, Hauser and Chomsky to exemplify just this: for instance, the existence of languages with three-vowel systems even though languages are fully capable of using five-vowel systems etc. This truism could be falsified by showing that languages claimed to be different from, say, English, actually are not. So, for example, contrary to what I have always assumed to be true, I actually do speak and understand Navajo, and Hawaiian actually has as many vowels as English.

      Finally, the idea that FLN could be empty, discussed briefly by Fitch, Hauser & Chomsky, is not a sign that the proposal is unfalsifiable, but actually states one way in which their proposal could be shown false. They go on to tell their readers what conclusion they would draw in such a circumstance, which is why they brought the matter up in the first place.

      Etc.

    2. I may be getting the dialectical situation wrong (as I have before) but this seems backward to me.

      If the proposal P2 is roughly that every language has recursive merge, and C is asking how this could be falsified, and then D says "The proposal that a particular language L might have Chomsky's rule of Merge without its recursive step is eminently falsifiable. Sentences in such a language would have maximally two words.", then this is not a falsification, even if a language with only two-word sentences comes along.
      What is needed for a falsification is the converse: namely that every language *with* recursive merge has sentences with more than two words.
      If we have that converse statement, and then a language from the Amazon comes along with only two word sentences, then we can falsify the universal claim.

      The fact that a language *without* recursive merge will only have maximum two (or three) word sentences is irrelevant.

      And it seems hard to show the required converse, since there are presumably some feature systems that control the derivations and spell-out etc etc. So, for example, to take a naive phrase structure view (sorry), there are CFGs that only generate two-word sentences.
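
      For instance, here is a toy grammar (my own, purely to make the point concrete) written in a recursion-permitting formalism whose particular rules never use recursion, so every sentence it generates is exactly two words long:

          # Rules: nonterminal -> list of alternative right-hand sides.
          CFG = {
              "S": [("A", "B")],                # the only sentence shape: two words
              "A": [("hello",), ("hi",)],
              "B": [("world",), ("there",)],
          }

          def expand(symbol):
              """All terminal strings derivable from `symbol` (grammar is acyclic)."""
              if symbol not in CFG:
                  return [[symbol]]
              results = []
              for rhs in CFG[symbol]:
                  strings = [[]]
                  for sym in rhs:
                      strings = [s + t for s in strings for t in expand(sym)]
                  results.extend(strings)
              return results

          print([" ".join(s) for s in expand("S")])
          # ['hello world', 'hello there', 'hi world', 'hi there'] -- all length 2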

    3. Reply to David:
      [This reply will be in several parts]

      It never occurred to me that you might turn answering my question into a demonstration of your double standards. But since you went for it I have to hand it to you: this was a resounding success. In the same thread in which you accuse me of frankenquoting [most entertaining bashing I have ever taken – I shall frame it for my grandkids to admire] you refuse to comment on Chomsky’s gross distortions of Boden [leaving the audience to believe you think it was acceptable that he attributed utter stupidity to her]. So apparently, what he did in a PUBLISHED paper was okay, while what I did in an informal blog with a tight word limit was not merely wrong but ‘a collection of frankenquotes’?

      Now a few comments on your own inaccuracies. You write:

      "P2 is your own construction, a collection of frankenquotes from random parts of Hauser, Chomsky & Fitch's various papers. However, to briefly reply to its disparate bits:"

      Sorry, I have to correct you, but P2 is based entirely on quotes from ONE paper: Fitch et al., 2005. This paper was, at least partly, an attempt to undo damage caused by Everett’s claims about Piraha. It was also a reply to Pinker & Jackendoff [2005], and what you call frankenquotes has been cited by Jackendoff and Pinker in their reply to Fitch et al. as well – are these two as grossly incompetent as you seem to imply I am?

      Next you attempt to ridicule me:

      “The proposal that a particular language L might have Chomsky's rule of Merge without its recursive step is eminently falsifiable. Sentences in such a language would have maximally two words. See our paper, footnote 11, where this is discussed. Chomsky's Merge is binary, combining two elements at a time. If your favorite syntactic theory allows more than two elements to combine to form a syntactic constituent, then the proposal that this rule lacks the recursive step in some language L would not limit sentences to length 2, but the proposal would still be falsifiable insofar as more or less every test for constituency from any standard syntax textbook should fail in L.”

      This ‘rebuttal’ has been called ‘silly and dishonest’ by a highly accomplished linguist: Geoff Pullum. I quote the relevant passage here:

      “If Merge involves putting two expressions together to make a larger expression, then barring it from affecting its own outputs would mean that it would have to apply solely to words. It could put two words together to make a two-word phrase, but that would be the limit. Any occurrence of a three-word sentence would refute this restriction and show that Merge must be able to affect its own outputs. What is grossly dishonest is to represent Everett or me or anyone else as unable to understand this. Of course we agree that languages have phrases more than two words long. Nobody is denying that, and this discussion would never have started if anyone had hinted that the issue on the table was the existence of 3-word phrases. And what is silly is to represent (in effect) the discovery of 3-word phrases as an important result of modern linguistics”. [Pullum, 2012, and since it’s a partial quote here is also a link: http://chronicle.com/blogs/linguafranca/2012/03/28/poisonous-dispute/

      To be continued

    4. Reply to David, part 2:

      Your next passage again distorts what I wrote and distracts from what is at issue:

      “The proposal that the language faculty as a whole makes options available that not all languages use is a truism of linguistics (it's what we spend much of our time figuring out, in fact), and among your omitted bits is a set of examples provided by Fitch, Hauser and Chomsky to exemplify just this: for instance, the existence of languages with three-vowel systems even though languages are fully capable of using five-vowel systems etc. This truism could be falsified by showing that languages claimed to be different from, say, English, actually are not. So, for example, contrary to what I have always assumed to be true, I actually do speak and understand Navajo, and Hawaiian actually has as many vowels as English”.

      First you talk about a truism, when in fact to date no one has provided any evidence from biology for the Chomskyan LF: “The proposal that the language faculty as a whole makes options available that not all languages use is a truism of linguistics” – we do not KNOW what ‘the language faculty as a whole’ is, far less which options it may make available. Once you have provided concrete biological evidence for the LF you can maybe talk about truisms – so far we have at best hypotheses.

      Next, I had put CORE in capitals for a reason, because that is what is at issue. HCF [2002] claimed recursion was the CORE property of FLN; 3 years later it was only ONE tool among MANY. So the distractions about 3 or 5 vowel systems are just that: distractions – no one had claimed having 5 vowel systems is a CORE property of human language. Again this is in essence the same as Jackendoff and Pinker wrote:

      "Moreover, FHC equivocate on what the hypothesis actually consists of. They write:
      The only “claims” we make regarding FLN are that 1) in order to avoid confusion, it is important to distinguish it from FLB, and 2) comparative data are necessary, for obvious logical reasons, to decide upon its contents.
      But they immediately make a third claim regarding FLN, namely the recursion-only hypothesis (reproduced from the original article). They then add: “To be precise, we suggest that a significant piece of the linguistic machinery entails recursive operations.” which actually substitutes a weaker claim: “recursion only” becomes “recursion as a significant piece.” This is soon replaced by a still weaker version, namely, “We hypothesize that ‘at a minimum, then, FLN includes the capacity of recursion’.” Thus in the course of a single paragraph, recursion is said to be the only component of FLN, a significant component of FLN, and merely one component of FLN among others". (J&P, 2005, p. 217)

      to be further continued

    5. Alex, I misspoke. What I should have said is "A proposal that every language has Recursive Merge would be falsified by finding a language with non-recursive Merge. Here's what such a language would look like ..." (And by the way, Hauser etc. did not actually claim that every language should have recursive Merge.)

      Christina, I've had enough. I thought it was vaguely useful to not let inaccurate factual claims stand as the last word in one of these discussions, but obviously that's a lost cause as long as you participate in this blog. So I give up.

      I will, however, note that what Pullum called "silly and dishonest" is in fact just plain true. A set with two members has ... two members. Do you really want to argue about that? Bye.

    6. "Christina, I've had enough"

      Oh that's alright. I just finished reading Levinson [2013] and can understand that you have more important things to do - like making sure that you [pl] do not let so many inaccurate factual claims stand as the last word in Legate et al. [2013].

  8. This comment has been removed by the author.

    1. Reply to David, part 3:

      Let's have a look at your final paragraph:

      Finally, the idea that FLN could be empty, discussed briefly by Fitch, Hauser & Chomsky, is not a sign that the proposal is unfalsifiable, but actually states one way in which their proposal could be shown false. They go on to tell their readers what conclusion they would draw in such a circumstance, which is why they brought the matter up in the first place.

      In my original post I was admitting that I could have misunderstood Fitch et al.’s proposal and was asking whether it IS falsifiable. You claim: “idea that FLN could be empty, … actually states one way in which their proposal could be shown false”. But when I look at their text I see:

      “The contents of FLN are to be empirically determined, and could possibly be empty, if empirical findings showed that none of the mechanisms involved are uniquely human or unique to language, and that only the way they are integrated is specific to human language. The distinction itself is intended as a terminological aid to interdisciplinary discussion and rapprochement, and obviously does not constitute a testable hypothesis” (Fitch et al., 2005, p. 181).

      So it seems here the authors assert we are NOT looking at a testable hypothesis. If you disagree please tell us why they wrote what they did. I also would like to draw your attention specifically to: “if empirical findings showed that none of the mechanisms involved are uniquely human or unique to language, and that only the way they are integrated is specific to human language”. This is of course what Deacon or Tomasello [and many others] proposed a long time ago: the way cognitive mechanisms are integrated is specific to language but the mechanisms are not. If this is an acceptable conclusion for Fitch et al. then please explain to me what, in such a case, can account for the features of language acquisition that you claimed earlier someone like Tomasello cannot.
