From (4) we can write:
And solving the recurrent equation:
Because of the linearly separable assumption,
then we can define:
Multiplying both sides of (7) by
:
![]() |
(8) |
And applying (8):
| (9) |
Applying Cauchy-Schwarz:
| (10) |
From (6): ||w_k+1||^2 = ||w_k||^2 + ||x_k||^2 + 2w_k^tx_k for k=1,,n and x_n C_1
||w_k+1||^2 ||w_k||^2 + ||x_k||^2
||w_k+1||^2 - ||x_k||^2 ||w_k||^2 for k=1,,n
Defining
as:
= _x_k C_1 ||x_k||^2
||w_k+1||^2 _k=1^n ||x_k||^2 n
So the squared norm of the weight vector grows at most linearly with the number of iterations
, which for large
conflicts with (12), so to satisfy both (12) and (17):
(n_max)^2||w_o||^2 = n_max
n_max = ||w_o||^2^2
The adaptation of the perceptron thus, shall terminate after
steps.
A variation consists in presenting the same pattern to then network, until
has the correct sign. The choice of
just decreases or increases the number of steps required to converge.
Pedro Larroy 2005-04-29