Transformations

Author(s)

David Scott and David M. Lane

Prerequisites

Measures of Central Tendency, Measures of Variability, Values of the Pearson Correlation
  1. Log
  2. Tukey's Ladder of Powers
  3. Box-Cox Transformatons
  4. Exercises (not available yet)

The focus of statistics courses is the exposition of appropriate methodology to analyze data to answer the question at hand. Sometimes the data are given to you, while other times the data are collected as part of a carefully design experiment. Often the time devoted to statistical analysis is less than 10% of the time devoted to data collection and preparation. If aspects of the data preparation fail, then the success of the analysis is in jeopardy. Sometimes errors are introduced into the recording of data. Sometimes biases are inadvertently introduced in the selection of subjects or the mis-calibration of monitoring equipment. In this note, we focus on the fact that many statistical procedures work best if individual variables have certain properties. The measurement scale of a variable should be part of the data preparation effort. For example, the correlation coefficient does not require the variables have a normal shape, but often relationships can be made clearer by re-expressing the variables. An economist may choose to analyze the logarithm of prices if the relative price if of interest. A chemist may choose to perform a statistical analysis using the inverse temperature as a variable rather than the temperature itself. But note that the inverse of a temperature will differ depending if it is measured in °F, °C, or °K.

In this chapter we begin by covering linear transformations. These transformations normallly do not change statistics such as Pearson's r although they do affect the mean and standard deviation. The second section is on log transormations which are useful to reduce skew. The third section is on Tukey's ladder of powers. You will see that log transformations are a special case of the ladder of powers. Finally, we cover the relatively advanced topic of the Box-Cox transformation.

Please answer the questions:
correct feedback