— Ch. 1 · Defining Nonparametric Statistics —
Nonparametric statistics.
~5 min read · Ch. 1 of 5
In 1999, Stuart A. Ord and J.K. Arnold published a sixth edition of Kendall's Advanced Theory of Statistics that clarified how statisticians label their work. They described two distinct meanings for the term nonparametric statistics, yet modern literature often blurs these lines into a single category. The first meaning involves techniques that do not rely on data belonging to any particular parametric family of probability distributions. These methods are distribution-free because they make no assumptions about whether data comes from a given parametric family. An example is order statistics, which relies solely on the ordinal ranking of observations rather than specific numerical values. The second meaning describes techniques where the model structure does not assume a fixed form. Instead, the model grows in size to accommodate the complexity of the data. Individual variables may still belong to parametric distributions even if the overall association remains flexible. This distinction matters because hypothesis (d) in the text states that two unspecified continuous distributions might be identical without specifying the underlying form. Such hypotheses are termed distribution-free, yet the statistical community commonly applies the label non-parametric to them anyway.
Applications And Purpose
Researchers frequently use non-parametric methods when studying populations with a ranked order like movie reviews receiving one to five stars. Data often have a ranking but lack clear numerical interpretation, such as when assessing human preferences or subjective opinions. Non-parametric methods result in ordinal data and apply much more generally than corresponding parametric methods. They become necessary situations where less is known about the application in question before analysis begins. Their reliance on fewer assumptions makes these methods more robust against violations of standard conditions. Some consider non-parametric methods simpler to use and more robust than parametric methods even when parametric assumptions hold true. This general nature makes them less susceptible to misuse and misunderstanding by practitioners. A conservative choice ensures they work even when their specific assumptions are not fully met. Parametric methods can produce misleading results when their assumptions are violated, whereas non-parametric tests remain stable. The wider applicability and increased robustness come at a cost though. In cases where a parametric test's assumptions are met, non-parametric tests have less statistical power. A larger sample size can be required to draw conclusions with the same degree of confidence.