Kernels#

Classes and associated functionality to use kernel functions.

A kernel is a non-negative, real-valued integrable function that can take two inputs, x and y, and returns a value that decreases as x and y move further away in space from each other. Note that further here may account for cyclic behaviour in the data, for example.

In this library, we often use kernels as a smoothing tool: given a dataset of distinct points, we can reconstruct the underlying data generating distribution through smoothing of the data with kernels.

Some kernels are parameterizable and may represent other well known kernels when given appropriate parameter values. For example, the SquaredExponentialKernel,

\[k(x,y) = \text{output_scale} * \exp (-||x-y||^2/2 * \text{length_scale}^2),\]

which if parameterized with an output_scale of \(\frac{1}{\sqrt{2\pi} \,*\, \text{length_scale}}\), yields the well known Gaussian kernel.

A Kernel must implement a compute_elementwise() method, which returns the floating point value after evaluating the kernel on two floats, x and y. Additional methods, such as Kernel.grad_x_elementwise(), can optionally be overridden to improve performance. The canonical example is when a suitable closed-form representation of a higher-order gradient can be used to avoid the expense of performing automatic differentiation.

Such an example can be seen in SteinKernel, where the analytic forms of Kernel.divergence_x_grad_y() are significantly cheaper to compute that the automatic differentiated default.

coreax.kernel.median_heuristic(x)[source]#

Compute the median heuristic for setting kernel bandwidth.

Analysis of the performance of the median heuristic can be found in [garreau2018median].

Parameters:: x (ArrayLike) – Input array of vectors
Return type:: Array
Returns:: Bandwidth parameter, computed from the median heuristic, as a zero-dimensional array

class coreax.kernel.Kernel[source]#

Abstract base class for kernels.

compute(x, y)[source]#

Evaluate the kernel on input data x and y.

The ‘data’ can be any of:

floating numbers (so a single data-point in 1-dimension)
zero-dimensional arrays (so a single data-point in 1-dimension)
a vector (a single-point in multiple dimensions)
array (multiple vectors).

Evaluation is always vectorised.

Parameters:

x (ArrayLike) – An \(n \times d\) dataset (array) or a single value (point)
y (ArrayLike) – An \(m \times d\) dataset (array) or a single value (point)

Return type: