The nearest neighbor algorithm is similar to the Euclidean distance algorithm in that the line of discrimination is computed using the following steps: - Determine the points in the current space that are equal in distance
from the nearest point of each data set.
- The points that are equal in distance from the nearest point of
each data set determine the line of discrimination that
separates the them.
- The distance between any two points in the current space is determined
by the following distance formula:
`d`= (^{2}`x2`-`x1`)^{2}+ (`y2`-`y1`)^{2}
- Here is an example of how the nearest neigbour scheme works:
First select the**Two Gaussian**data sets from the**Patterns**menu. Following that, select the**Nearest Neigbor**option under the**Algorithms**menu. Initialize this algorithm by selecting**Initialize**from the**Go**menu. In order to compute the line of discrimination select the**Next**option under the**Go**menu. This will display the first step of the process, the data sets in both the input plot (top left) and the output plot (bottom left) of the applet. Also, the process description box indicates which step you are currently on and the algorithm that is currently being used to compute the line of discrimination.
**Note**that the nearest neighbor algorithm is computationally expensive so please be patient if the process takes longer.
- The second step of the process computes the mean of the each
data set. The mean of each data sets is displayed on the output plot
as black dots near the corresponding data sets. The value of the mean
for each data set, which corresponds to the current scale, is
displayed on the process description box.
- The third step of the process displays the line of
discrimination of the given data sets as determined by the nearest
neighbor algorithm. Also, the classification error for each
data set along with the total classification error is displayed
on the process description box.
Click here to go back to the main tutorial page. |