Understanding Nonlinear Regression in Biostatistics and Life Sciences

Fundamentally, regression analysis examines the relationship between one or more independent variables (predictors) and a dependent variable (outcome). The dependent variable in biostatistics typically refers to a biological measurement or health outcome that we intend to understand or forecast, whereas the independent variables could be risk factors, treatments, or demographic traits.

Examples of Regression

Types of Regression in Biostatistics

Depending on the type of data and the research question, several regression methods can be applied:

Linear Regression

The simplest form of regression assumes a linear relationship between variables. It’s used when the dependent variable is continuous (e.g., blood pressure, weight, cholesterol levels).

Formula:

Y = β₀ + β₁X₁ + β₂X₂ + ... + ε

Where:

Logistic Regression

When dealing with binary outcomes (like disease presence/absence, Yes/No), logistic regression becomes essential. Rather than predicting the value directly, it models the probability of an outcome using the logit function:

log(p/(1-p)) = β₀ + β₁X₁ + ... + βₙXₙ

Example: The output is a probability, which is then converted to a yes/no prediction

Applications include:

Multivariate Regression

Applied when there are multiple dependent variables being predicted at once.

Poisson and Negative Binomial Regression

Used for count data, like the number of hospital visits or number of mutations in a gene.

Cox Proportional Hazards Regression

For time-to-event data (survival analysis), Cox regression models the hazard function:

h(t) = h₀(t) × exp (β₁X₁ + ... + βₙXₙ)

This is crucial for:

Applications of Regression

Regression isn't just about fitting lines to data. In biostatistics, it plays a crucial role in:

Non-Linear Regression

When there isn't a straight line that can explain the link between the independent and dependent variables, non-linear regression is used. Unlike linear regression, which fits data to a line, non-linear regression fits data to a curve.

These curves could be:

In non-linear regression:

Unlike linear regression, there’s no closed-form solution. That’s why non-linear models need good initial estimates to converge on the best-fit solution.

Types of Nonlinear Regression Models

  • Exponential Regression
  • Logistic Regression
  • Polynomial Regression
  • Power Regression
  • Sigmoidal (Logistic Growth) Regression
  • Michaelis-Menten
  • Gompertz Model
  • Hill Equation Regression

Non-linear regressions Examples

1. Dose-Response Relationship

Increasing a drug's dosage in pharmacology doesn't always increase its effect linearly. At a certain point, the effect plateaus. This relationship is often modelled using a sigmoidal (logistic) curve, such as the four-parameter logistic (4PL) or five-parameter logistic (5PL) models.

2. Enzyme Kinetics

The Michaelis-Menten equation is a classic example of non-linear regression used to model enzyme-substrate interactions:

V = (Vmax × [S]) / (Km + [S])

Where:

Why Use Non-Linear Regression in Biostatistics?