Theorem

  • Let be a nonempty set, a positive-definite real-valued kernel on with corresponding reproducing kernel Hilbert space , and let be a differentiable regularization function. Then given a training sample and an arbitrary error function , a minimizer of the regularized empirical risk admits a representation of the form Why it’s cool
  • Representer theorems are useful from a practical standpoint because they dramatically simplify the regularized ERM problem.
    • In most interesting applications, the search domain for the minimization will be an infinite-dimensional subspace of and therefore the search (as written) does not admit implementation on finite-memory and finite-precision computers.
    • In contrast, the representation of afforded by a representer theorem reduces the original (infinite-dimensional) minimization problem to a search for the optimal -dimensional vector of coefficients ; it can then be obtained by applying any standard function minimization algorithm.