"A little thought reveals a fact widely understood among statisticians: The null hypothesis, taken literally" (and that’s the only way you can take it in formal hypothesis testing), is always false in the real world. [...] If it is false, even to a tiny degree, it must be the case that a large enough sample will produce a significant result and lead to its rejection. So if the null hypothesis is always false, what’s the big deal about rejecting it?" (Jacob Cohen,"Things I Have Learned" (So Far)", American Psychologist, 1990)
"Statistics is, or should be, about scientific investigation and how to do it better, but many statisticians believe it is a branch of mathematics." (George E P Box, Commentary, Technometrics 32, 1990)
"Probability does pervade the universe, and in this sense, the old chestnut about baseball imitating life really has validity. The statistics of streaks and slumps, properly understood, do teach an important lesson about epistemology, and life in general. The history of a species, or any natural phenomenon, that requires unbroken continuity in a world of trouble, works like a batting streak. All are games of a gambler playing with a limited stake against a house with infinite resources. The gambler must eventually go bust. His aim can only be to stick around as long as possible, to have some fun while he's at it, and, if he happens to be a moral agent as well, to worry about staying the course with honor!" (Stephen J Gould, 1991)
"Sometimes the proprietors of a bad model claim that parts of it are facts, not just beliefs. Evaluation then amounts to determining if facts support the claims, and disciplines like statistics have tools for this task. The difficulty of using statistical tools will vary depending on the problem." (James S Hodges, "Six (or So) Things You Can Do with a Bad Model", 1991)
"Statistical models for data are never true. The question whether a model is true is irrelevant. A more appropriate question is whether we obtain the correct scientific conclusion if we pretend that the process under study behaves according to a particular statistical model." (Scott Zeger, "Statistical reasoning in epidemiology", American Journal of Epidemiology, 1991)
"Statistics is a very powerful and persuasive mathematical tool. People put a lot of faith in printed numbers. It seems when a situation is described by assigning it a numerical value, the validity of the report increases in the mind of the viewer. It is the statistician's obligation to be aware that data in the eyes of the uninformed or poor data in the eyes of the naive viewer can be as deceptive as any falsehoods." (Theoni Pappas, "More Joy of Mathematics: Exploring mathematical insights & concepts", 1991)
"[...] the mean information of a message is defined as the amount of chance" (or randomness) present in a set of possible messages. To see that this is a natural definition, note that by choosing a message, one destroys the randomness present in the variety of possible messages. Information theory is thus concerned, as is statistical mechanics, with measuring amounts of randomness. The two theories are therefore closely related." (David Ruelle, "Chance and Chaos", 1991)
"The rise of statistical reasoning was a key step in the birth of many empirical sciences, especially epidemiology. The ability to focus on the aggregate behavior amidst apparently chaotic variation across autonomous individuals has dramatically increased our understanding of disease processes that affect the health of the public. Simple statistical models based upon the laws of probability provide the language for this population perspective." (Scott Zeger, "Statistical reasoning in epidemiology", American Journal of Epidemiology, 1991)
"When looking at the end result of any statistical analysis, one must be very cautious not to over interpret the data. Care must be taken to know the size of the sample, and to be certain the method forg athering information is consistent with other samples gathered. […] No one should ever base conclusions without knowing the size of the sample and how random a sample it was. But all too often such data is not mentioned when the statistics are given - perhaps it is overlooked or even intentionally omitted." (Theoni Pappas, "More Joy of Mathematics: Exploring mathematical insights & concepts", 1991)
"Chaos is a parody of any metaphysics of destiny. It is not even an avatar of such a metaphysics. The poetry of initial conditions fascinates us today, now that we no longer possess a vision of final conditions, and Chaos stands in for us as a negative destiny. [...] Destiny is the ecstatic figure of necessity. Chaos is merely the metastatic figure of Chance. Chaotic processes are random and statistical in nature and, even if they culminate in the hidden order of strange attractors, that still has nothing to do with the fulgurating notion of destiny, the absence of which is cruelly felt." (Jean Baudrillard, "The Illusion of the End", 1992)
"Gambling was the place where statistics and profound human consequences met most nakedly, after all, and cards, even more than dice or the numbers on a roulette wheel, seemed able to define and perhaps even dictate a player's... luck." (Tim Powers, "Last Call", 1992)
"[…] an honest exploratory study should indicate how many comparisons were made […] most experts agree that large numbers of comparisons will produce apparently statistically significant findings that are actually due to chance. The data torturer will act as if every positive result confirmed a major hypothesis. The honest investigator will limit the study to focused questions, all of which make biologic sense. The cautious reader should look at the number of ‘significant’ results in the context of how many comparisons were made." (James L Mills, "Data torturing", New England Journal of Medicine, 1993)
"Flip a coin 100 times. Assume that 99 heads are obtained. If you ask a statistician, the response is likely to be: 'It is a biased coin'. But if you ask a probabilist, he may say: 'Wooow, what a rare event'." (Chamont Wang, "Sense and Nonsense of Statistical Inference", 1993)
"Statistics as a science is to quantify uncertainty, not unknown." (Chamont Wang,"Sense and Nonsense of Statistical Inference: Controversy, Misuse, and Subtlety", 1993)
"Statistics at its best provides methodology for dealing empirically with complicated and uncertain information, in a way that is both useful and scientifically valid" (John M Chambers, 1993)
"The Monte Carlo method is a numerical method of solving mathematical problems by the simulation of random variables. [...] One advantageous feature of the Monte Carlo method is the simple structure of the computation algorithm. As a rule, a program is written to carry out one random trial [...]. This trial is repeated N times, each trial being independent of the rest, and then the results of all trials are averaged. Therefore, the Monte Carlo method is sometimes called the method of statistical trials." (Ilya M Sobol, "A Primer for the Monte Carlo Method", 1994)
"The statistics on insanity are that one out of every four people is suffering from some form of mental illness. Think of your three best friends. If they're okay, then it's got to be you." (Rita Mae Brown) [in: Susan Musgrave,"Musgrave Landing: Musings on the Writing Life", 1994)
"When looking at the end result of any statistical analysis, one must be very cautious not to over interpret the data. Care must be taken to know the size of the sample, and to be certain the method for gathering information is consistent with other samples gathered. […] No one should ever base conclusions without knowing the size of the sample and how random a sample it was. But all too often such data is not mentioned when the statistics are given - perhaps it is overlooked or even intentionally omitted." (Theoni Pappas, "More Joy of Mathematics: Exploring mathematical insights & concepts", 1994)
"When [the profession of] statistics becomes clearly embedded in people’s minds as being concerned with investigation rather than simply calculation (or worse still, mere cataloguing of data), there can be no room for doubts about the relevance of the subject." (Christopher J Wild, "Embracing the ‘Wider view’ of Statistics", The American Statistician 48, 1994)
"Chaos and catastrophe theories are among the most interesting recent developments in nonlinear modeling, and both have captured the interests of scientists in many disciplines. It is only natural that social scientists should be concerned with these theories. Linear statistical models have proven very useful in a great deal of social scientific empirical analyses, as is evidenced by how widely these models have been used for a number of decades. However, there is no apparent reason, intuitive or otherwise, as to why human behavior should be more linear than the behavior of other things, living and nonliving. Thus an intellectual movement toward nonlinear models is an appropriate evolutionary movement in social scientific thinking, if for no other reason than to expand our paradigmatic boundaries by encouraging greater flexibility in our algebraic specifications of all aspects of human life." (Courtney Brown, "Chaos and Catastrophe Theories", 1995)
"[…] it does not seem helpful just to say that all models are wrong. The very word model implies simplification and idealization. The idea that complex physical, biological or sociological systems can be exactly described by a few formulae is patently absurd. The construction of idealized representations that capture important stable aspects of such systems is, however, a vital part of general scientific analysis and statistical models, especially substantive ones, do not seem essentially different from other kinds of model." (David Cox, "Comment on ‘Model uncertainty, data mining and statistical inference’", Journal of the Royal Statistical Society, Series A 158, 1995)
"Most statistical models assume error free measurement, at least of independent (predictor) variables. However, as we all know, measurements are seldom if ever perfect. Particularly when dealing with noisy data such as questionnaire responses or processes which are difficult to measure precisely, we need to pay close attention to the effects of measurement errors. Two characteristics of measurement which are particularly important in psychological measurement are reliability and validity." (Clay Helberg, "Pitfalls of Data Analysis (or How to Avoid Lies and Damned Lies)", 1995)
"System dynamics models are not derived statistically from time-series data. Instead, they are statements about system structure and the policies that guide decisions. Models contain the assumptions being made about a system. A model is only as good as the expertise which lies behind its formulation. A good computer model is distinguished from a poor one by the degree to which it captures the essence of a system that it represents. Many other kinds of mathematical models are limited because they will not accept the multiple-feedback-loop and nonlinear nature of real systems." (Jay W Forrester, "Counterintuitive Behavior of Social Systems", 1995)
"The science of statistics may be described as exploring, analyzing and summarizing data; designing or choosing appropriate ways of collecting data and extracting information from them; and communicating that information. Statistics also involves constructing and testing models for describing chance phenomena. These models can be used as a basis for making inferences and drawing conclusions and, finally, perhaps for making decisions." (Fergus Daly et al,"Elements of Statistics", 1995)
"We can consider three broad classes of statistical pitfalls. The first involves sources of bias. These are conditions or circumstances which affect the external validity of statistical results. The second category is errors in methodology, which can lead to inaccurate or invalid results. The third class of problems concerns interpretation of results, or how statistical results are applied (or misapplied) to real world issues." (Clay Helberg, "Pitfalls of Data Analysis (or How to Avoid Lies and Damned Lies)", 1995)
"When we think of π, let’s not always think of circles. It is related to all the odd whole numbers. It also is connected to all the whole numbers that are not divisible by the square of a prime. And it is part of an important formula in statistics. These are just a few of the many places where it appears, as if by magic. It is through such astonishing connections that mathematics reveals its unique and beguiling charm." (Sherman K Stein, "Strength in Numbers", 1996)
"We believe that numeracy is about making meaning in mathematics and being critical about maths. This view of numeracy is very different from numeracy just being about numbers, and it is a big step from numeracy or everyday maths that meant doing some functional maths. It is about using mathematics in all its guises - space and shape, measurement, data and statistics, algebra, and of course, number - to make sense of the real world, and using maths critically and being critical of maths itself. It acknowledges that numeracy is a social activity. That is why we can say that numeracy is not less than maths but more. It is why we don’t need to call it critical numeracy being numerate is being critical." (Dave Tout & Beth Marr,"Changing practice: Adult numeracy professional development", 1997)
"'Garbage in, garbage out' is a sound warning for those in the computer field; it is every bit as sound in the use of statistics. Even if the 'garbage' which comes out leads to a correct conclusion, this conclusion is still tainted, as it cannot be supported by logical reasoning. Therefore, it is a misuse of statistics. But obtaining a correct conclusion from faulty data is the exception, not the rule. Bad basic data (the 'garbage in') almost always leads to incorrect conclusions (the 'garbage out'). Unfortunately, incorrect conclusions can lead to bad policy or harmful actions." (Herbert F Spirer et al, "Misused Statistics" 2nd Ed, 1998)
"It is a misuse of statistics to use whichever set of statistics suits the purpose at hand and ignore the conflicting sets and the implications of the conflicts." (Herbert F Spirer et al, "Misused Statistics" 2nd Ed, 1998)
"Statistical thinking is a general, fundamental, and independent mode of reasoning about data, variation, and chance." (David S Moore, 1998)
"Statistics is a general intellectual method that applies wherever data, variation, and chance appear. It is a fundamental method because data, variation and chance are omnipresent in modern life. It is an independent discipline with its own core ideas rather than, for example, a branch of mathematics. […] Statistics offers general, fundamental, and independent ways of thinking." (David S Moore,"Statistics among the Liberal Arts", Journal of the American Statistical Association, 1998)
"The practical definitions of randomness - a sequence is random by virtue of how many and which statistical tests it satisfies and a sequence is random by virtue of the length of the algorithm necessary to describe it [...]." (Deborah J. Bennett,"Randomness", 1998)
"To some extent the beauty of number theory seems to be related to the contradiction between the simplicity of the integers and the complicated structure of the primes, their building blocks. This has always attracted people." (Andreas Knauf, "Number Theory, Dynamical Systems and Statistical Mechanics", 1998)
"We use mathematics and statistics to describe the diverse realms of randomness. From these descriptions, we attempt to glean insights into the workings of chance and to search for hidden causes. With such tools in hand, we seek patterns and relationships and propose predictions that help us make sense of the world." (Ivars Peterson, "The Jungles of Randomness: A Mathematical Safari", 1998)
"[…] good statistics requires a conversation between scientists and mathematical statisticians." (Stephen M Stigler, Statistics on the Table: the history of statistical concepts and methods, 1999)
No comments:
Post a Comment