Taking the aggravation out of data aggregation: A conceptual guide to dealing with statistical issues related to the pooling of individual-level observational data

Pollet, Thomas, Stulp, Gert, Henzi, S. Peter and Barrett, Louise (2015) Taking the aggravation out of data aggregation: A conceptual guide to dealing with statistical issues related to the pooling of individual-level observational data. American Journal of Primatology, 77 (7). pp. 727-740. ISSN 0275-2565

Full text not available from this repository. (Request a copy)
Official URL: https://doi.org/10.1002/ajp.22405

Abstract

Field data often include multiple observations taken from the same individual. In order to avoid pseudoreplication, it is commonplace to aggregate data, generating a mean score per individual, and then using these aggregated data in subsequent analyses. Aggregation, however, can generate problems of its own. Not only does it lead to a loss of information, it can also leave analyses vulnerable to the “ecological fallacy”: the drawing of false inferences about individual behavior on the basis of population level (“ecological”) data. It can also result in Simpson's paradox, where relationships seen at the individual level can be completely reversed when analyzed at the aggregate level. These phenomena have been documented widely in the medical and social sciences but tend to go unremarked in primatological studies that rely on observational data from the field. Here, we provide a conceptual guide that explains how and why aggregate data are vulnerable to the ecological fallacy and Simpson's paradox, illustrating these points using data on baboons. We then discuss one particular analytical approach, namely multi-level modeling, that can potentially eliminate these problems. By highlighting the issue of the ecological fallacy, and increasing awareness of how datasets are often organized into a number of different levels, we also highlight the manner in which researchers can more positively exploit the structure of their datasets, without any information loss. These analytical approaches may thus provide greater insight into behavior by permitting more thorough investigation of interactions and cross-level effects.

Item Type: Article
Uncontrolled Keywords: ecological fallacy; Simpson's paradox; aggregation; linear mixed modeling; observation methods; pseudoreplication
Subjects: C800 Psychology
Department: Faculties > Health and Life Sciences > Psychology
Depositing User: Becky Skoyles
Date Deposited: 25 Sep 2017 11:25
Last Modified: 12 Oct 2019 17:27
URI: http://nrl.northumbria.ac.uk/id/eprint/31972

Actions (login required)

View Item View Item

Downloads

Downloads per month over past year

View more statistics