Looking back at the first chapter, Introduction to distance sampling, we saw some simulations of animal populations. In these examples animals were distributed pretty uniformly across the survey area. In reality, the locatio of animals in space is driven by a variety of factors: climate, prey availability, ease of movement and more play a role.

Taking another look at the pantropical spotted dolphins in the Gulf of Mexico, the data were collected with spatial information (location of observations and bathymetry at those points). We can investigate the relationship between these covariates and the observered counts by plotting the data in two ways, first simply looking at histograms of counts with respect to covariate values:

## Warning: Ignoring unknown aesthetics: weight

## Warning: Ignoring unknown aesthetics: weight

## Warning: Ignoring unknown aesthetics: weight
<img src=‘dsm-why_files/figure-html/pantropical-geo-eda-hist-1.png’ width=‘960’height=’‘width.px=’960’height.px=’288’ style=‘display: block’>
Figure 1: Histograms showing the number of dolphins observed at different Eastings, Northings and depths in the Gulf of Mexico.

We can see from the histograms that there are distinct peaks at particular values and avoidance of other values. For example, from this crude exploratory analysis we see that the dolphins tend to be observed near the centre of the survey area and appear to avoid shallow waters.

The one-dimensional slices offered by the histograms are useful, but don’t tell the full story about what’s happening in these data. So, we plot the observations over a map of the depth values:

<img src=‘dsm-why_files/figure-html/pantropical-geo-eda-plot-1.png’ width=‘960’height=’‘width.px=’960’height.px=’480’ style=‘display: block’>
Figure 2: Bathymetry of the Gulf of Mexico study area with observations of pantropical spotted dolphins (green points) and transect lines (red lines) overlaid.

Also plotted (in red) are the transect lines that the Oregon II travelled. This shows that the survey had relatively good coverage of the survey area in space, but less good coverage in terms of the depth covariate.

From the above plots we can see that there is definitely some correlation between the pantropical spotted dolphins and the covariates we’ve collected on location and depth.

The aim of this section is to talk about how to model this relationship. Before we go into the details of the models we’ll use, let’s first think about why one might want to do such an analysis.

Why go through all the fuss?

There are a number of reasons to model the distributions of biological populations explicitly in space:

Often the first and last reasons above dominate most peoples’ motivation for building spatially explicit models: they need to know where animals are and they want this information to be accessible to others.

Model-based analysis

The major contrast between the approach detailed in this chapter and that of the previous chapters is that we now consider model-based inference about the biological populations in question rather than design-based inference. This has some advantages and some disadvantages. A spatially-explicit model can explain the between-transect variation (which is often a large component of the variance in design-based estimates) and so using a model-based approach can lead to smaller variance in estimates of abundance than design-based estimates.


This chapter looked breifly at the Gulf of Mexico data again, showing that there are spatial elements to the data. If abundance is non-uniform with respect to spatial or environmental covariates we should model this variation to ensure the most precise estimates of abundance. The next few chapters will explain how this is possible using the R package dsm.

Further reading


Elith, J. and Leathwick, J.R. (2009) Species Distribution Models: Ecological Explanation and Prediction Across Space and Time. Annual Review of Ecology, Evolution, and Systematics, 40, 677–697.

Gelman, A. (2011) Why Tables Are Really Much Better Than Graphs. Journal of Computational and Graphical Statistics, 20, 3–7.

McInerny, G.J. and Etienne, R.S. (2013) ‘Niche’ or ‘distribution’ modelling? A response to Warren. Trends in Ecology & Evolution, 28, 191–192.

Sillero, N. (2011) What does ecological modelling model? A proposed classification of ecological niche models based on their underlying methods. Ecological Modelling, 222, 1343–1346.

Warren, D.L. (2012) In defense of ‘niche modeling’. Trends in Ecology & Evolution, 27, 497–500.

  1. For more (light-hearted) discussion on graphs vs. tables see Gelman (2011).