Searching with template expressions
Template expressions are a powerful feature in SymbolicRegression.jl that allow you to impose structure on the symbolic regression search. Rather than searching for a completely free-form expression, you can specify a template that combines multiple sub-expressions in a prescribed way.
This is particularly useful when:
- You have domain knowledge about the functional form of your solution
- You want to learn vector-valued expressions (e.g., force fields, velocity fields)
- You need to enforce constraints on which variables can appear in different parts of the expression
- You want to share sub-expressions between multiple components
For example, you might know that your system follows a pattern like: sin(f(x1, x2)) + g(x3)^2
where f
and g
are unknown functions you want to learn. With template expressions, you can encode this structure while still letting the symbolic regression search discover the optimal form of the sub-expressions.
In this tutorial, we'll walk through a complete example of using template expressions to learn the components of a particle's motion under magnetic and drag forces. We'll see how to:
- Define the structure of our template
- Specify constraints on which variables each sub-expression can access
- Set up the symbolic regression search
- Interpret and use the results
Let's get started!
using SymbolicRegression, Random
using MLJBase: machine, fit!, predict, report
The Physical Problem
We'll study a charged particle moving through a magnetic field with temperature-dependent drag. The total force on the particle will have two components:
\[\mathbf{F} = \mathbf{F}_\text{drag} + \mathbf{F}_\text{magnetic} = -\eta(T)\mathbf{v} + q \mathbf{v} \times \mathbf{B}(t)\]
where we will take $q = 1$ for simplicity.
From physics, we know:
- The magnetic force comes from a cross product with the field: $\mathbf{F}_\text{magnetic} = \mathbf{v} \times \mathbf{B}$
- The drag force opposes motion, and we'll define a simple model for it: $\mathbf{F}_\text{drag} = -\eta(T)\mathbf{v}$
Now, the parts of this model we don't know:
- The magnetic field $\mathbf{B}(t)$ varies in time throughout the experiment, but this pattern repeats for each experiment. We want to learn the components of this field, symbolically!
- The drag coefficient $\eta(T)$ depends only on temperature. We also want to figure out what this is!
We'll generate synthetic data from a known model and then try to rediscover these relationships, only knowing the total force on a particle for a given experiment, as well as the input variables: time, velocity, and temperature. We will do this with template expressions to encode the physical structure of the problem.
Let's say we run this experiment 1000 times:
n = 1000
rng = Random.MersenneTwister(0);
Each time we run the experiment, the temperature is a bit different:
T = 298.15 .+ 0.5 .* rand(rng, n)
T[1:3]
3-element Vector{Float64}:
298.3718654224738
298.1561708577222
298.1894634079026
We run the experiment, and record the velocity at a random time between 0 and 10 seconds.
t = 10 .* rand(rng, n)
t[1:3]
3-element Vector{Float64}:
4.638978857758314
5.0741312407383194
6.2470995980837385
We introduce a particle at a random velocity between -1 and 1
v = [ntuple(_ -> 2 * rand(rng) - 1, 3) for _ in 1:n]
v[1:3]
3-element Vector{Tuple{Float64, Float64, Float64}}:
(0.04597744561694972, 0.3617761091511644, -0.7622618201519469)
(0.13929410233783868, 0.6246359182930323, -0.8802795769225171)
(0.024384606418089483, -0.2764887177675295, 0.5304044608519174)
Now, let's create the true unknown model.
Let's assume the magnetic field is sinusoidal with frequency 1 Hz along axes x and y, and decays exponentially along the z-axis:
\[\mathbf{B}(t) = \begin{pmatrix} \sin(\omega t) \\ \cos(\omega t) \\ e^{-t/10} \end{pmatrix} \quad \text{where} \quad \omega = 2\pi\]
This gives us a rotating magnetic field in the x-y plane that weakens along z:
omega = 2ฯ
B = [(sin(omega * ti), cos(omega * ti), exp(-ti / 10)) for ti in t]
B[1:3]
3-element Vector{Tuple{Float64, Float64, Float64}}:
(-0.7664076828044546, -0.642354468917518, 0.6288277634479177)
(0.4491201340287294, 0.8934713790659531, 0.6020509973388409)
(0.9998339518319687, 0.01822275401986526, 0.5354166983624471)
We assume the drag force is linear in the velocity and depends on the temperature with a power law:
\[\mathbf{F}_\text{drag} = -\alpha T^{1/2} \mathbf{v} \quad \text{where} \quad \alpha = 10^{-5}\]
This creates a temperature-dependent damping effect:
F_d = [-1e-5 * Ti^(1//2) .* vi for (Ti, vi) in zip(T, v)]
F_d[1:3]
3-element Vector{Tuple{Float64, Float64, Float64}}:
(-7.941888292012911e-6, -6.249119338936066e-5, 0.00013166886815220216)
(-2.405219023122198e-5, -0.00010785712876486136, 0.00015199962873839863)
(-4.210773653883972e-6, 4.7744523262348025e-5, -9.159110839701518e-5)
Now, let's compute the true magnetic force, in 3D:
cross((a1, a2, a3), (b1, b2, b3)) = (a2 * b3 - a3 * b2, a3 * b1 - a1 * b3, a1 * b2 - a2 * b1)
F_mag = [cross(vi, Bi) for (vi, Bi) in zip(v, B)]
F_mag[1:3]
3-element Vector{Tuple{Float64, Float64, Float64}}:
(-0.262147425073388, 0.5552914209766048, 0.24773417184709556)
(1.162567285138538, -0.47921343480610845, -0.1560812736313824)
(-0.15770210642270086, 0.5172604627036361, 0.2768871620070909)
We then sum these to get the total force:
F = [fd .+ fm for (fd, fm) in zip(F_d, F_mag)]
F[1:3]
3-element Vector{Tuple{Float64, Float64, Float64}}:
(-0.26215536696168, 0.5552289297832154, 0.24786584071524775)
(1.1625432329483067, -0.47932129193487333, -0.155929274002644)
(-0.15770631719635475, 0.5173082072268985, 0.2767955708986939)
This forms our dataset!
data = (; t, v, T, F, B, F_d, F_mag)
keys(data)
(:t, :v, :T, :F, :B, :F_d, :F_mag)
Now, let's format the input variables for input to the regressor:
X = (;
t=data.t,
v_x=[vi[1] for vi in data.v],
v_y=[vi[2] for vi in data.v],
v_z=[vi[3] for vi in data.v],
T=data.T,
)
keys(X)
(:t, :v_x, :v_y, :v_z, :T)
Template expressions allow us to regress directly on a struct, so here we can define a Force
type:
struct Force{T}
x::T
y::T
z::T
end
y = [Force(F...) for F in data.F]
y[1:3]
3-element Vector{Main.Force{Float64}}:
Main.Force{Float64}(-0.26215536696168, 0.5552289297832154, 0.24786584071524775)
Main.Force{Float64}(1.1625432329483067, -0.47932129193487333, -0.155929274002644)
Main.Force{Float64}(-0.15770631719635475, 0.5173082072268985, 0.2767955708986939)
Our variable names are the keys of the struct:
variable_names = ["t", "v_x", "v_y", "v_z", "T"]
5-element Vector{String}:
"t"
"v_x"
"v_y"
"v_z"
"T"
Template expressions require you to define a structure function, which describes how to combine the sub-expressions into a single expression, numerically evaluate them, and print them.
First, let's just make a function that prints the expression:
function combine_strings(e)
# e is a named tuple of strings representing each formula
return " โญ ๐ = [ " * e.B_x * " , " * e.B_y * " , " * e.B_z * " ]\n โฐ ๐
= (" * e.F_d_scale * ") * ๐ฏ"
# (Note that string interpolation will erase the colors, so use `*` instead)
end
combine_strings (generic function with 1 method)
So, this will just print the separate B and F_d expressions we've learned.
Then, let's define an expression that takes the numerical values evaluated in the TemplateExpression, and combines them into the resultant force vector. Inside this function, we can do whatever we want.
function combine_vectors(e, X)
# This time, e is a named tuple of *vectors*, representing the batched
# evaluation of each formula.
# First, extract the 3D velocity vectors from the input matrix:
v = [(X[2, i], X[3, i], X[4, i]) for i in eachindex(axes(X, 2))]
# Use this to compute the full drag force:
F_d = [F_d_scale_i .* vi for (F_d_scale_i, vi) in zip(e.F_d_scale, v)]
# Collect the magnetic field components that we've learned into the vector:
B = [(bx, by, bz) for (bx, by, bz) in zip(e.B_x, e.B_y, e.B_z)]
# Using this, we compute the magnetic force with a cross product:
F_mag = [cross(vi, Bi) for (vi, Bi) in zip(v, B)]
# Finally, we combine the drag and magnetic forces into the total force:
return [Force((fd .+ fm)...) for (fd, fm) in zip(F_d, F_mag)]
end
combine_vectors (generic function with 1 method)
For the functions we wish to learn, we can constraint what variables each of them depends on, explicitly. Let's say B only depends on time, and the drag force scale only depends on temperature (we explicitly multiply the velocity in).
variable_constraints = (; B_x=[1], B_y=[1], B_z=[1], F_d_scale=[5])
(B_x = [1], B_y = [1], B_z = [1], F_d_scale = [5])
Now, we can create our template expression:
structure = TemplateStructure{(:B_x, :B_y, :B_z, :F_d_scale)}(;
combine_strings=combine_strings,
combine_vectors=combine_vectors,
variable_constraints=variable_constraints,
)
(::TemplateStructure{(:B_x, :B_y, :B_z, :F_d_scale), Nothing, typeof(Main.combine_vectors), typeof(Main.combine_strings), @NamedTuple{B_x::Vector{Int64}, B_y::Vector{Int64}, B_z::Vector{Int64}, F_d_scale::Vector{Int64}}}) (generic function with 4 methods)
Let's look at an example of how this would be used in a TemplateExpression, for some guess at the form of the solution:
options = Options(; binary_operators=(+, *, /, -), unary_operators=(sin, cos, sqrt, exp))
# The inner operators are an `DynamicExpressions.OperatorEnum` which is used by `Expression`:
operators = options.operators
t = Expression(Node{Float64}(; feature=1); operators, variable_names)
T = Expression(Node{Float64}(; feature=5); operators, variable_names)
B_x = B_y = B_z = 2.1 * cos(t)
F_d_scale = 1.0 * sqrt(T)
ex = TemplateExpression(
(; B_x, B_y, B_z, F_d_scale);
structure, operators, variable_names
)
โญ ๐ = [ 2.1 * cos(t) , 2.1 * cos(t) , 2.1 * cos(t) ]
โฐ ๐
= (1.0 * sqrt(T)) * ๐ฏ
So we can see that it prints the expression as we've defined it.
Now, we can create a regressor that builds template expressions which follow this structure:
model = SRRegressor(;
binary_operators=(+, -, *, /),
unary_operators=(sin, cos, sqrt, exp),
niterations=500,
maxsize=35,
expression_type=TemplateExpression,
expression_options=(; structure=structure),
# The elementwise needs to operate directly on each row of `y`:
elementwise_loss=(F1, F2) -> (F1.x - F2.x)^2 + (F1.y - F2.y)^2 + (F1.z - F2.z)^2,
batching=true,
batch_size=30,
);
Note how we also have to define the custom elementwise_loss
function. This is because our combine_vectors
function returns a Force
struct, so we need to combine it against the truth!
Next, we can set up our machine and fit:
mach = machine(model, X, y)
At this point, you would run:
fit!(mach)
which should print using your combine_strings
function during the search. The final result is accessible with:
report(mach)
which would return a named tuple of the fitted results, including the .equations
field, which is a vector of TemplateExpression
objects that dominated the Pareto front.
Show raw source code
#=
# Searching with template expressions
Template expressions are a powerful feature in SymbolicRegression.jl that allow you to impose structure
on the symbolic regression search. Rather than searching for a completely free-form expression, you can
specify a template that combines multiple sub-expressions in a prescribed way.
This is particularly useful when:
- You have domain knowledge about the functional form of your solution
- You want to learn vector-valued expressions (e.g., force fields, velocity fields)
- You need to enforce constraints on which variables can appear in different parts of the expression
- You want to share sub-expressions between multiple components
For example, you might know that your system follows a pattern like:
`sin(f(x1, x2)) + g(x3)^2`
where `f` and `g` are unknown functions you want to learn. With template expressions, you can encode
this structure while still letting the symbolic regression search discover the optimal form of the
sub-expressions.
In this tutorial, we'll walk through a complete example of using template expressions to learn
the components of a particle's motion under magnetic and drag forces. We'll see how to:
1. Define the structure of our template
2. Specify constraints on which variables each sub-expression can access
3. Set up the symbolic regression search
4. Interpret and use the results
Let's get started!
=#
using SymbolicRegression, Random
using MLJBase: machine, fit!, predict, report
#=
## The Physical Problem
We'll study a charged particle moving through a magnetic field with temperature-dependent drag.
The total force on the particle will have two components:
\```math
\mathbf{F} = \mathbf{F}_\text{drag} + \mathbf{F}_\text{magnetic} = -\eta(T)\mathbf{v} + q \mathbf{v} \times \mathbf{B}(t)
\```
where we will take ``q = 1`` for simplicity.
From physics, we know:
- The magnetic force comes from a cross product with the field: ``\mathbf{F}_\text{magnetic} = \mathbf{v} \times \mathbf{B}``
- The drag force opposes motion, and we'll define a simple model for it: ``\mathbf{F}_\text{drag} = -\eta(T)\mathbf{v}``
Now, the parts of this model we don't know:
- The magnetic field ``\mathbf{B}(t)`` varies in time throughout the experiment, but this pattern repeats for each experiment. We want to learn the components of this field, symbolically!
- The drag coefficient ``\eta(T)`` depends only on temperature. We also want to figure out what this is!
We'll generate synthetic data from a known model and then try to rediscover these relationships,
**only knowing the total force** on a particle for a given experiment, as well as the input variables:
time, velocity, and temperature.
We will do this with template expressions to encode the physical structure of the problem.
Let's say we run this experiment 1000 times:
=#
n = 1000
rng = Random.MersenneTwister(0);
#=
Each time we run the experiment, the temperature is a bit different:
=#
T = 298.15 .+ 0.5 .* rand(rng, n)
T[1:3]
#=
We run the experiment, and record the velocity at a random time
between 0 and 10 seconds.
=#
t = 10 .* rand(rng, n)
t[1:3]
#=
We introduce a particle at a random velocity between -1 and 1
=#
v = [ntuple(_ -> 2 * rand(rng) - 1, 3) for _ in 1:n]
v[1:3]
#=
**Now, let's create the true unknown model.**
Let's assume the magnetic field is sinusoidal with frequency 1 Hz along axes x and y,
and decays exponentially along the z-axis:
\```math
\mathbf{B}(t) = \begin{pmatrix}
\sin(\omega t) \\
\cos(\omega t) \\
e^{-t/10}
\end{pmatrix}
\quad \text{where} \quad \omega = 2\pi
\```
This gives us a rotating magnetic field in the x-y plane that weakens along z:
=#
omega = 2ฯ
B = [(sin(omega * ti), cos(omega * ti), exp(-ti / 10)) for ti in t]
B[1:3]
#=
We assume the drag force is linear in the velocity and
depends on the temperature with a power law:
\```math
\mathbf{F}_\text{drag} = -\alpha T^{1/2} \mathbf{v}
\quad \text{where} \quad \alpha = 10^{-5}
\```
This creates a temperature-dependent damping effect:
=#
F_d = [-1e-5 * Ti^(1//2) .* vi for (Ti, vi) in zip(T, v)]
F_d[1:3]
#=
Now, let's compute the true magnetic force, in 3D:
=#
cross((a1, a2, a3), (b1, b2, b3)) = (a2 * b3 - a3 * b2, a3 * b1 - a1 * b3, a1 * b2 - a2 * b1)
F_mag = [cross(vi, Bi) for (vi, Bi) in zip(v, B)]
F_mag[1:3]
#=
We then sum these to get the total force:
=#
F = [fd .+ fm for (fd, fm) in zip(F_d, F_mag)]
F[1:3]
#=
This forms our dataset!
=#
data = (; t, v, T, F, B, F_d, F_mag)
keys(data)
#=
Now, let's format the input variables for input to the regressor:
=#
X = (;
t=data.t,
v_x=[vi[1] for vi in data.v],
v_y=[vi[2] for vi in data.v],
v_z=[vi[3] for vi in data.v],
T=data.T,
)
keys(X)
#=
Template expressions allow us to regress directly on a struct,
so here we can define a `Force` type:
=#
struct Force{T}
x::T
y::T
z::T
end
y = [Force(F...) for F in data.F]
y[1:3]
#=
Our variable names are the keys of the struct:
=#
variable_names = ["t", "v_x", "v_y", "v_z", "T"]
#=
Template expressions require you to define a _structure_ function,
which describes how to combine the sub-expressions into a single
expression, numerically evaluate them, and print them.
First, let's just make a function that prints the expression:
=#
function combine_strings(e)
## e is a named tuple of strings representing each formula
return " โญ ๐ = [ " * e.B_x * " , " * e.B_y * " , " * e.B_z * " ]\n โฐ ๐
= (" * e.F_d_scale * ") * ๐ฏ"
## (Note that string interpolation will erase the colors, so use `*` instead)
end
#=
So, this will just print the separate B and F_d expressions we've learned.
Then, let's define an expression that takes the numerical values
evaluated in the TemplateExpression, and combines them into the resultant
force vector. Inside this function, we can do whatever we want.
=#
function combine_vectors(e, X)
## This time, e is a named tuple of *vectors*, representing the batched
## evaluation of each formula.
## First, extract the 3D velocity vectors from the input matrix:
v = [(X[2, i], X[3, i], X[4, i]) for i in eachindex(axes(X, 2))]
## Use this to compute the full drag force:
F_d = [F_d_scale_i .* vi for (F_d_scale_i, vi) in zip(e.F_d_scale, v)]
## Collect the magnetic field components that we've learned into the vector:
B = [(bx, by, bz) for (bx, by, bz) in zip(e.B_x, e.B_y, e.B_z)]
## Using this, we compute the magnetic force with a cross product:
F_mag = [cross(vi, Bi) for (vi, Bi) in zip(v, B)]
## Finally, we combine the drag and magnetic forces into the total force:
return [Force((fd .+ fm)...) for (fd, fm) in zip(F_d, F_mag)]
end
#=
For the functions we wish to learn, we can constraint what variables
each of them depends on, explicitly. Let's say B only depends on time,
and the drag force scale only depends on temperature (we explicitly
multiply the velocity in).
=#
variable_constraints = (; B_x=[1], B_y=[1], B_z=[1], F_d_scale=[5])
#=
Now, we can create our template expression:
=#
structure = TemplateStructure{(:B_x, :B_y, :B_z, :F_d_scale)}(;
combine_strings=combine_strings,
combine_vectors=combine_vectors,
variable_constraints=variable_constraints,
)
#=
Let's look at an example of how this would be used
in a TemplateExpression, for some guess at the form of
the solution:
=#
options = Options(; binary_operators=(+, *, /, -), unary_operators=(sin, cos, sqrt, exp))
## The inner operators are an `DynamicExpressions.OperatorEnum` which is used by `Expression`:
operators = options.operators
t = Expression(Node{Float64}(; feature=1); operators, variable_names)
T = Expression(Node{Float64}(; feature=5); operators, variable_names)
B_x = B_y = B_z = 2.1 * cos(t)
F_d_scale = 1.0 * sqrt(T)
ex = TemplateExpression(
(; B_x, B_y, B_z, F_d_scale);
structure, operators, variable_names
)
#=
So we can see that it prints the expression as we've defined it.
Now, we can create a regressor that builds template expressions
which follow this structure:
=#
model = SRRegressor(;
binary_operators=(+, -, *, /),
unary_operators=(sin, cos, sqrt, exp),
niterations=500,
maxsize=35,
expression_type=TemplateExpression,
expression_options=(; structure=structure),
## The elementwise needs to operate directly on each row of `y`:
elementwise_loss=(F1, F2) -> (F1.x - F2.x)^2 + (F1.y - F2.y)^2 + (F1.z - F2.z)^2,
batching=true,
batch_size=30,
);
#=
Note how we also have to define the custom `elementwise_loss`
function. This is because our `combine_vectors` function
returns a `Force` struct, so we need to combine it against the truth!
Next, we can set up our machine and fit:
=#
mach = machine(model, X, y)
#=
At this point, you would run:
\```julia
fit!(mach)
\```
which should print using your `combine_strings` function
during the search. The final result is accessible with:
\```julia
report(mach)
\```
which would return a named tuple of the fitted results,
including the `.equations` field, which is a vector
of `TemplateExpression` objects that dominated the Pareto front.
=#
which uses Literate.jl to generate this page.