Add documentation (partial)

penelopeysm · penelopeysm · commit 44d75fe64987 · 2025-04-17T10:18:17.000+01:00
diff --git a/docs/make.jl b/docs/make.jl
@@ -24,7 +24,9 @@ makedocs(;
     format=Documenter.HTML(; size_threshold=2^10 * 400),
     modules=[DynamicPPL, Base.get_extension(DynamicPPL, :DynamicPPLMCMCChainsExt)],
     pages=[
-        "Home" => "index.md", "API" => "api.md", "Internals" => ["internals/varinfo.md"]
+        "Home" => "index.md",
+        "API" => "api.md",
+        "Internals" => ["internals/varinfo.md", "internals/submodel_condition.md"],
     ],
     checkdocs=:exports,
     doctest=false,
diff --git a/docs/src/internals/submodel_condition.md b/docs/src/internals/submodel_condition.md
@@ -0,0 +1,231 @@
+# How `PrefixContext` and `ConditionContext` interact
+
+```@meta
+ShareDefaultModule = true
+```
+
+## PrefixContext
+
+`PrefixContext` is a context that, as the name suggests, prefixes all variables inside a model with a given symbol.
+Thus, for example:
+
+```@example
+using DynamicPPL, Distributions
+
+@model function f()
+    x ~ Normal()
+    return y ~ Normal()
+end
+
+@model function g()
+    return a ~ to_submodel(f())
+end
+```
+
+inside the submodel `f`, the variables `x` and `y` become `a.x` and `a.y` respectively.
+This is easiest to observe by running the model:
+
+```@example
+vi = VarInfo(g())
+keys(vi)
+```
+
+!!! note
+    
+    In this case, where `to_submodel` is called without any other arguments, the prefix to be used is automatically inferred from the name of the variable on the left-hand side of the tilde.
+    We will return to the 'manual prefixing' case later.
+
+What does it really mean to 'become' a different variable?
+We can see this from [the definition of `tilde_assume`, for example](https://github.com/TuringLang/DynamicPPL.jl/blob/60ee68e2ce28a15c6062c243019e6208d16802a5/src/context_implementations.jl#L87-L89):
+
+```
+function tilde_assume(context::PrefixContext, right, vn, vi)
+    return tilde_assume(context.context, right, prefix(context, vn), vi)
+end
+```
+
+Functionally, this means that even though the _initial_ entry to the tilde-pipeline has `vn` as `x` and `y`, once the `PrefixContext` has been applied, the later functions will see `a.x` and `a.y` instead.
+
+## ConditionContext
+
+`ConditionContext` is a context which stores values of variables that are to be conditioned on.
+These values may be stored as a `Dict` which maps `VarName`s to values, or alternatively as a `NamedTuple`.
+The latter only works correctly if all `VarName`s are 'basic', in that they have an identity optic (i.e., something like `a.x` or `a[1]` is forbidden).
+Because of this limitation, we will only use `Dict` in this example.
+
+!!! note
+    
+    If a `ConditionContext` with a `NamedTuple` encounters anything to do with a prefix, its internal `NamedTuple` is converted to a `Dict` anyway, so it is quite reasonable to ignore the `NamedTuple` case in this exposition.
+
+One can inspect the conditioning values with, for example:
+
+```@example
+@model function d()
+    x ~ Normal()
+    return y ~ Normal()
+end
+
+cond_model = d() | (@varname(x) => 1.0)
+cond_ctx = cond_model.context
+```
+
+There are several internal functions that are used to determine whether a variable is conditioned, and if so, what its value is.
+
+```@example
+DynamicPPL.hasconditioned_nested(cond_ctx, @varname(x))
+```
+
+```@example
+DynamicPPL.getconditioned_nested(cond_ctx, @varname(x))
+```
+
+These functions are in turn used by the function `DynamicPPL.contextual_isassumption`, which is largely the same as `hasconditioned_nested`, but also checks whether the value is `missing` (in which case it isn't really conditioned).
+
+```@example
+DynamicPPL.contextual_isassumption(cond_ctx, @varname(x))
+```
+
+!!! note
+    
+    Notice that (neglecting `missing` values) the return value of `contextual_isassumption` is the _opposite_ of `hasconditioned_nested`, i.e. for a variable that _is_ conditioned on, `contextual_isassumption` returns `false`.
+
+If a variable `x` is conditioned on, then the effect of this is to set the value of `x` to the given value (while still including its contribution to the log probability density).
+Since `x` is no longer a random variable, if we were to evaluate the model, we would find only one key in the `VarInfo`:
+
+```@example
+keys(VarInfo(cond_model))
+```
+
+## Joint behaviour: desiderata at the model level
+
+When paired together, these two contexts have the potential to cause substantial confusion: `PrefixContext` modifies the variable names that are seen, which may cause them to be out of sync with the values contained inside the `ConditionContext`.
+
+We begin by mentioning some high-level desiderata for their joint behaviour.
+Take these models, for example:
+
+```@example
+# We define a helper function to unwrap a layer of SamplingContext, to
+# avoid cluttering the print statements.
+unwrap_sampling_context(ctx::DynamicPPL.SamplingContext) = ctx.context
+unwrap_sampling_context(ctx::DynamicPPL.AbstractContext) = ctx
+@model function inner()
+    println("inner context: $(unwrap_sampling_context(__context__))")
+    x ~ Normal()
+    return y ~ Normal()
+end
+
+@model function outer()
+    println("outer context: $(unwrap_sampling_context(__context__))")
+    return a ~ to_submodel(inner())
+end
+
+# 'Outer conditioning'
+with_outer_cond = outer() | (@varname(a.x) => 1.0)
+
+# 'Inner conditioning'
+inner_cond = inner() | (@varname(x) => 1.0)
+@model function outer2()
+    println("outer context: $(unwrap_sampling_context(__context__))")
+    return a ~ to_submodel(inner_cond)
+end
+with_inner_cond = outer2()
+```
+
+We want that:
+
+ 1. `keys(VarInfo(outer()))` should return `[a.x, a.y]`;
+ 2. `keys(VarInfo(with_outer_cond))` should return `[a.y]`;
+ 3. `keys(VarInfo(with_inner_cond))` should return `[a.y]`,
+
+**In other words, we can condition submodels either from the outside (point (2)) or from the inside (point (3)), and the variable name we use to specify the conditioning should match the level at which we perform the conditioning.**
+
+This is an incredibly salient point because it means that submodels can be treated as individual, opaque objects, and we can condition them without needing to know what it will be prefixed with, or the context in which that submodel is being used.
+For example, this means we can reuse `inner_cond` in another model with a different prefix, and it will _still_ have its inner `x` value be conditioned, despite the prefix differing.
+
+!!! info
+    
+    In the current version of DynamicPPL, these criteria are all fulfilled. However, this was not the case in the past: in particular, point (3) was not fulfilled, and users had to condition the internal submodel with the prefixes that were used outside. (See [this GitHub issue](https://github.com/TuringLang/DynamicPPL.jl/issues/857) for more information; this issue was the direct motivation for this documentation page.)
+
+## Desiderata at the context level
+
+The above section describes how we expect conditioning and prefixing to behave from a user's perpective.
+We now turn to the question of how we implement this in terms of DynamicPPL contexts.
+We do not specify the implementation details here, but we will sketch out something resembling an API that will allow us to achieve the target behaviour.
+
+**Point (1)** does not involve any conditioning, only prefixing; it is therefore already satisfied by virtue of the `tilde_assume` method shown above.
+
+**Points (2) and (3)** are more tricky.
+As the reader may surmise, the difference between them is the order in which the contexts are stacked.
+
+For the _outer_ conditioning case (point (2)), the `ConditionContext` will contain a `VarName` that is already prefixed.
+When we enter the inner submodel, this `ConditionContext` has to be passed down and somehow combined with the `PrefixContext` that is created when we enter the submodel.
+We make the claim here that the best way to do this is to nest the `PrefixContext` _inside_ the `ConditionContext`.
+This is indeed what happens, as can be demonstrated by running the model.
+
+```@example
+with_outer_cond();
+nothing;
+```
+
+!!! info
+    
+    The `; nothing` at the end is purely to circumvent a Documenter.jl quirk where stdout is only shown if the return value of the final statement is `nothing`.
+    If these documentation pages are moved to Quarto, it will be possible to remove this.
+
+For the _inner_ conditioning case (point (3)), the outer model is not run with any special context.
+The inner model will itself contain a `ConditionContext` will contain a `VarName` that is not prefixed.
+When we run the model, this `ConditionContext` should be then nested _inside_ a `PrefixContext` to form the final evaluation context.
+Again, we can run the model to see this in action:
+
+```@example
+with_inner_cond();
+nothing;
+```
+
+Putting all of the information so far together, what it means is that if we have these two inner contexts (taken from above):
+
+```@example
+using DynamicPPL: PrefixContext, ConditionContext, DefaultContext
+
+inner_ctx_with_outer_cond = ConditionContext(
+    Dict(@varname(a.x) => 1.0), PrefixContext{:a}(DefaultContext())
+)
+inner_ctx_with_inner_cond = PrefixContext{:a}(
+    ConditionContext(Dict(@varname(x) => 1.0), DefaultContext())
+)
+```
+
+then we want both of these to be `true` (and thankfully, they are!):
+
+```@example
+DynamicPPL.hasconditioned_nested(inner_ctx_with_outer_cond, @varname(a.x))
+```
+
+```@example
+DynamicPPL.hasconditioned_nested(inner_ctx_with_inner_cond, @varname(a.x))
+```
+
+Essentially, our job is threefold:
+
+  - Firstly, given the correct arguments, we need to make sure that `hasconditioned_nested` and `getconditioned_nested` behave correctly.
+
+  - Secondly, we need to make sure that both the correct arguments are supplied. In order to do so:
+    
+      + We need to make sure that when evaluating a submodel, the context stack is arranged such that prefixes are applied _inside_ the parent model's context, but _outside_ the submodel's own context.
+      + We also need to make sure that the `VarName` passed to it is prefixed correctly. This is, in fact, _not_ handled by `tilde_assume`, because `contextual_isassumption` is much higher in the call stack than `tilde_assume` is. So, we need to explicitly prefix it.
+
+## How do we do it?
+
+`hasconditioned_nested` accomplishes this by doing the following:
+
+  - If the outermost layer is a `ConditionContext`, it checks whether the variable is contained in its values.
+  - If the outermost layer is a `PrefixContext`, it goes through the `PrefixContext`'s child context and prefixes any inner conditioned variables, before checking whether the variable is contained.
+
+We ensure that the context stack is correctly arranged by appropriately setting leaf contexts before evaluating a submodel; see the implementation of `tilde_assume!!` for more details.
+
+And finally, we ensure that the `VarName` is correctly prefixed by modifying the `@model` macro (or, technically, its subsidiary `isassumption`) to explicitly prefix the variable before passing it to `contextual_isassumption`.
+
+## FixedContext
+
+Finally, note that all of the above also applies to the interaction between `PrefixContext` and `FixedContext`, except that the functions have different names.
+(`FixedContext` behaves the same way as `ConditionContext`, except that unlike conditioned variables, fixed variables do not contribute to the log probability density.)