next up previous contents
Next: Linear Support Vector Machine Up: The Formulation of Support Previous: Structural Risk Minimisation

VC dimension and VC confidence

 

VC confidence is defined as the second term on the right hand side of Eqn. gif. tex2html_wrap_inline1654 is chosen such that tex2html_wrap_inline1656 , and the bound in Eqn. gif will hold with probability tex2html_wrap_inline1658 . h is the VC dimension and is a measure of the notion of capacity of a learning machine.

  equation100

VC dimension can be defined for various classes of learning machines. It is defined as the maximum number of points that can be ``shatter'' by the machine. Considering the two-class problem as defined in the beginning of this chapter, a set of l points have tex2html_wrap_inline1664 possible label assignments. If a learning machine with the mapping tex2html_wrap_inline1666 can correctly assign all possible labels, then that set of points is shattered by this learning machine. It can be shown that a machine with oriented hyper-planes in tex2html_wrap_inline1668 as its mapping function, has a VC dimension of N+1.

Figure gif gif shows that using these bound on the expected risk. It is clear that a learning machine with large capacity (hence large VC dimension) will give low empirical risk but the VC confidence interval is also large for that learning machine, i.e the machine does not generalise well. By measuring this bound, one can select a learning machine that give the lowest expected risk. The selected machine will give better generalisation.

   figure108
Figure: The bound on actual risk of a classifier or learning machine

In [Vapnik, 1995], it is stressed that the VC dimension and hence the capacity does not depend on the number of free parameters directly, i.e. a learning machine with a larger number of free parameter need not have larger capacity. And the reverse is also true. The machine with mapping function tex2html_wrap_inline1672 , where tex2html_wrap_inline1674 has an infinite VC dimension.


next up previous contents
Next: Linear Support Vector Machine Up: The Formulation of Support Previous: Structural Risk Minimisation

K.K. Chin
Thu Sep 10 11:05:30 BST 1998