Interpolated Median in R

In survey data, actual distributions frequently are non-continuous and non-normal. The mean may thus be inappropriate to summarize the central tendency, and the median too rough because it is constrained to the actual categories in the data. On a five-point scale, the median can only fall on one of these categories, and thus does not reflect smaller changes in the distribution. The interpolated median adjusts the median position to do just that.

The concept of interpolated medians is nicely described at  http://aec.umich.edu/median.php (at the bottom), or at http://www.weekscomputing.com/webhelp/hs520.htm (with a nice graph).

In R, the interpolated median can easily be calculated with the package psych, using the interp.median function.

To illustrate the difference, here a simple plot using data from the SOM project. Here I use the positional variable (all actors in Austria combined) by year. It is immediately apparent that the three ways to express central tendency are different. The median position (in blue) is clearly constrained by the categories available (in this case -1, -0.5, 0, 0.5, 1). With its corrections, the interpolated median offers much more details of changes over time.
timedian

OK, not a very pretty graph, but it does illustrate the point.

One Reply to “Interpolated Median in R”

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s

This site uses Akismet to reduce spam. Learn how your comment data is processed.

%d bloggers like this: