Introduction
In GLMs, we often encounter scenarios where we need to account for exposure or adjust for certain factors. Both offset and weights play crucial roles in achieving this. Let’s break down their differences and understand when to use each one. Let represent the response variable, are predictor variables, and represents the exposure time.
1. Offset
An offset is a covariate included in a model with a fixed coefficient of (which is not estimated). It acts as a scaling factor for the response variable. Typically, offsets are used with Poisson models to represent exposure. For instance, if you’re modeling count data (e.g., number of events), an offset can account for varying exposure times. The formula for incorporating an offset in a Poisson GLM with is:
This makes totally sense, the exposure just multiplies compared to a Poisson regression model without different exposure and is the correct way to incorporate exposure into a Poisson regression.
The log likelihood is therefore given by
2. Using as response variable
We still assume that . Thus assuming to be a Poisson distribution as well is incorrect since we are modelling rates now. To see this we take a look at the log likelihood which differs from the offset approach
3. Weights
Weights, on the other hand, are quite different. They adjust the variance of the response variable. When using weights, the scale parameter (related to the variance) is divided by the weight values for each observation. Records with weight values less than or equal to 0 or missing are excluded from the analysis. Weights are commonly employed in GLMs to handle heteroscedasticity or unequal variances. Weight are reflected in the log likelihood by
Conclusion:
To incorporate exposure in an Poisson GLM Regression using an offset is the method of choice. However, a weighted Poisson regression when modelling will give the same results, since: