1 April 2026

Goals for today

  • Identify features of data that drive analyses
  • Think critically about what the data could tell you
  • Start thinking about the distributional forms for data

General approach

Question \(\rightarrow\) Data \(\rightarrow\) Model \(\rightarrow\) Inference \(\rightarrow\) Prediction

General approach

Question \(\rightarrow\) Data \(\rightarrow\) Model \(\rightarrow\) Inference \(\rightarrow\) Prediction

Common questions in ecology

At the individual level

Ideas from the audience?

Common questions in ecology

At the individual level

Sex?

Fecundity?

Growth?

Survival?

Movement?

Common questions in ecology

At the population level

Thoughts from you all?

Common questions in ecology

At the population level

Abundance?

Survival?

Spatial distribution?

Movement/migration?

General approach

Question \(\rightarrow\) Data \(\rightarrow\) Model \(\rightarrow\) Inference \(\rightarrow\) Prediction

Ecological data

At the individual level

Ideas, anyone?

Ecological data

At the individual level

1 Detection \(\rightarrow\) presence/absence

2+ Detections \(\rightarrow\) survival, movement

Ecological data

At the individual level

1 Detection \(\rightarrow\) presence/absence

2+ Detections \(\rightarrow\) survival, movement


1 Measurement \(\rightarrow\) fecundity, age, size

2+ Measurements \(\rightarrow\) growth

Ecological data

At the population level

Detections \(\rightarrow\) presence/absence


Counts \(\rightarrow\) density or survival/movement

Data collection methods

Can you see where this is going?

Data collection methods

Nonexhaustive counts

 

Data collection methods

Exhaustive counts

 

Data collection methods

(Non)exhaustive surveys

Depletions

 

Data collection methods

(Non)exhaustive surveys

Depletions

Capture/Tag/Recapture

 

Data types

Discrete values

Anyone have thoughts here?

Data types

Discrete values

Sex

Age

Fecundity

Counts/Census

Individual survival

Data types

Continuous

How about this?

Data types

Continuous

Size (length, mass)

Biomass

Survival (population)

A note on continuous variables

Approximating rational numbers with real numbers

Survival (0.78)


Composition (~55% age-4)


Density (0.14 per ha)

A note on continuous variables

Approximating rational numbers with real numbers

Survival (7 of 9 survived \(\approx\) 0.78)


Composition (4 age-3, 18 age-4, 11 age-5 \(\rightarrow\) ~55% age-4)


Density (3 animals in 21 ha plot \(\approx\) 0.14 per ha)

A note on continuous variables

Approximating rational numbers with real numbers

Which of these give you more confidence?

A) 3 / 9 \(\approx\) 0.33

B) 300 / 900 \(\approx\) 0.33

The importance of raw data
cannot be overstated

Distributions of data

Discrete distributions

Discrete distributions

Binary (0,1) \(\rightarrow\) Bernoulli


Classic “coin toss”

set.seed(514)
## flip a coin 1000 times
x <- rbinom(n = 1000, size = 1, prob = 0.5)
## count tails (0) and heads (1)
table(x)
## x
##   0   1 
## 511 489

Discrete distributions

Discrete distributions

Count (non-negative integers) \(\rightarrow\) Poisson or Negative-Binomial


Number of times birds were resighted

set.seed(514)
## 37 banded birds; number of resightings
x <- rpois(n = 37, lambda = 1)
## count up resightings
table(x)
## x
##  0  1  2  3  4  5 
##  7 18  5  5  1  1

Discrete distributions

Discrete distributions

Composition (D categories) \(\rightarrow\) Binomial (D = 2) or Multinomial (D > 2)


Surviving offspring

set.seed(514)
## 1357 tagged smolts; probability of survival is 0.05
x <- rbinom(n = 1357, size = 1, prob = 0.05)
## number of mortalities (0) and survivors (1)
table(x)  # p ~= 0.0593
## x
##    0    1 
## 1281   76

Continuous distributions

Real numbers (-\(\infty\),\(\infty\)) \(\rightarrow\) Normal

Continuous distributions

Real numbers (-\(\infty\),\(\infty\)) \(\rightarrow\) Normal


Common “bell-shaped curve”

## normal distribution (mean = 0, sd = 1)
curve(dnorm(x, mean = 0, sd = sqrt(1)), from = -4, to = 4,
      main = "", xlab = "", ylab = "Density", bty = "n",
      cex.lab = 1.5, cex.axis = 1.2, lwd = 2, col = "#32006e")

Continuous distributions

Continuous distributions

Continuous distributions

Density (non-negative real values) \(\rightarrow\) log-Normal or Gamma


## log-normal distribution (mean = 2, sd = 0.7)
curve(dlnorm(x, meanlog = 2, sdlog = 0.7), from = 0, to = 20,
      main = "", xlab = "", ylab = "Density", bty = "n",
      cex.lab = 1.5, cex.axis = 1.2, lwd = 2,  col = "#32006e")

Continuous distributions

Continuous distributions

Proportion (simplex\(^D\)) \(\rightarrow\) Beta (D = 2) or Dirichlet (D > 2)


## beta distribution (mean = 2/3)
curve(dbeta(x, shape1 = 6, shape2 = 3), from = 0, to = 1,
      main = "", xlab = "", ylab = "Density", bty = "n",
      cex.lab = 1.5, cex.axis = 1.2, lwd = 2,  col = "#32006e")

Continuous distributions