Statistical Models

Hierarchical Bayesian models for AI evaluation data

Models in hibayes are NumPyro probabilistic models that define the joint probability distribution over random variables and observed data. There are a number of built-in models you can select from.

What makes up a model?

Models take processed Features and define a probabilistic program using NumPyro primitives (numpyro.sample, numpyro.deterministic, etc.). They return nothing but instead define the generative process for your data.

from hibayes.model import Model, model
from hibayes.model.models import check_features

import numpyro
import numpyro.distributions as dist
import jax.numpy as jnp
import jax


@model
def my_custom_model(
    prior_mu_loc: float = 0.0,
    prior_mu_scale: float = 1.0,
) -> Model:
    """
    A simple hierarchical model for grouped binomial data.
    """
    def model(features):
        check_features(features, ["obs", "num_group", "group_index", "n_total"])

        overall_mean = numpyro.sample(
            "overall_mean",
            dist.Normal(prior_mu_loc, prior_mu_scale),
        )

        sigma_group = numpyro.sample(
            "sigma_group",
            dist.Exponential(rate=1.0),
        )

        z_group = numpyro.sample(
            "z_group", dist.Normal(0, 1).expand([features["num_group"]])
        )
        group_effects = overall_mean + sigma_group * z_group
        numpyro.deterministic("group_effects", group_effects)

        logit_p = group_effects[features["group_index"]]

        numpyro.sample(
            "obs",
            dist.Binomial(
                total_count=features["n_total"],
                probs=jax.nn.sigmoid(logit_p),
            ),
            obs=features["obs"],
        )

    return model

1: Any prior hyperparameters the user can configure through the config file.
2: The inner function receives processed features from your data pipeline.
3: Use check_features to validate that required features are present, providing helpful error messages.
4: Use numpyro.deterministic to track derived quantities for posterior analysis.
5: The observed data must be named "obs" - this is enforced by the @model decorator.

Built-in models

Hierarchical binomial models

These models are designed for aggregated binomial data (success counts out of totals) with hierarchical grouping structures.

simplified_group_binomial_exponential - A simple two-level hierarchical model with exponential prior on group variance. Good starting point for basic grouped data.

two_level_group_binomial - Two-level hierarchical model with half-normal priors on variance components. Includes reparameterisation for better sampling.

three_level_group_binomial - Three-level hierarchical model (group → subgroup → subsubgroup) with half-normal priors. Use for deeply nested data structures.

three_level_group_binomial_exponential - Three-level hierarchical model with exponential priors on variance components. Alternative prior specification.

Linear models

linear_group_binomial - Non-hierarchical binomial regression with configurable main effects and interactions. Treats categorical variables as predictors without nesting structure.

ordered_logistic_model - Ordered logistic regression for ordinal outcomes. Supports main effects, interactions, and configurable cutpoint priors.

Configuring models

Models are configured in your hibayes.yaml under the model section:

model:
  models:
    - name: two_level_group_binomial
      config:
        fit:
          samples: 2000
          warmup: 1000
          chains: 4
        prior_mu_overall_loc: 0.0
        prior_mu_overall_scale: 1.0

Or specify multiple models to compare:

model:
  models:
    - name: simplified_group_binomial_exponential
      config:
        tag: "exponential_prior"
    - name: two_level_group_binomial
      config:
        tag: "halfnormal_prior"

Custom models

To use a custom model, create a Python file with your model definition and reference it in your config:

model:
  path: path/to/my_models.py
  models:
    - name: my_custom_model
      config:
        prior_mu_loc: -1.0

The @model decorator registers your function so it can be accessed by name in the config.

Built-in Models Reference

Hierarchical Binomial Models

For aggregated success/failure data with grouped structure.

Model	Description	Variance Priors
`simplified_group_binomial_exponential`	Two-level hierarchy (overall → group). Simple starting point for grouped binomial data.	Exponential on group variance
`two_level_group_binomial`	Two-level hierarchy with reparameterisation for better sampling. Default model for basic grouped data.	HalfNormal on variance components
`three_level_group_binomial`	Three-level hierarchy (overall → group → subgroup → subsubgroup). For deeply nested data like students within schools within districts.	HalfNormal on all variance components
`three_level_group_binomial_exponential`	Three-level hierarchy with simpler exponential priors. Alternative when HalfNormal priors are too informative.	Exponential on all variance components

Regression Models

For modelling effects of categorical/continuous predictors.

Model	Description	Key Parameters
`linear_group_binomial`	Non-hierarchical binomial regression. Treats groups as fixed effects rather than random. Supports main effects and interactions with optional effect coding.	`main_effects`, `interactions`, `effect_coding_for_main_effects`
`ordered_logistic_model`	Ordered logistic regression for ordinal outcomes (e.g., 0-10 ratings). Models cumulative probabilities via ordered cutpoints.	`main_effects`, `interactions`, `num_classes`, `effect_coding_for_main_effects`