December / Monthly / Tutorials / Software / Home

The nearest neighbor algorithm is similar to the Euclidean distance algorithm in that the line of discrimination is computed using the following steps:

Determine the points in the current space that are equal in distance from the nearest point of each data set.
The points that are equal in distance from the nearest point of each data set determine the line of discrimination that separates the them.
The distance between any two points in the current space is determined by the following distance formula:

d² = (x2 - x1)² + (y2 - y1)²
Here is an example of how the nearest neigbour scheme works:

First select the Two Gaussian data sets from the Patterns menu. Following that, select the Nearest Neigbor option under the Algorithms menu. Initialize this algorithm by selecting Initialize from the Go menu. In order to compute the line of discrimination select the Next option under the Go menu. This will display the first step of the process, the data sets in both the input plot (top left) and the output plot (bottom left) of the applet. Also, the process description box indicates which step you are currently on and the algorithm that is currently being used to compute the line of discrimination.

Note that the nearest neighbor algorithm is computationally expensive so please be patient if the process takes longer.
The second step of the process computes the mean of the each data set. The mean of each data sets is displayed on the output plot as black dots near the corresponding data sets. The value of the mean for each data set, which corresponds to the current scale, is displayed on the process description box.
The third step of the process displays the line of discrimination of the given data sets as determined by the nearest neighbor algorithm. Also, the classification error for each data set along with the total classification error is displayed on the process description box.

Click here to go back to the main tutorial page.