K-Nearest Neighbors

Classifying based on what's closest to you.

Simple Yet Powerful

K-Nearest Neighbors (KNN) is one of the simplest ML algorithms. It doesn't learn an explicit model — instead, it stores the entire training dataset and makes predictions by looking at the K closest data points. It's instance-based learning: the algorithm remembers everything and classifies new points by majority vote of their neighbors.

How KNN Works


  K=3: Look at 3 nearest neighbors

        ? ← New point to classify

  ○ ○ ● ◆ ● ○ ○
  ○ ● ● ◆ ● ● ○
  ● ● ● ◆ ● ● ●

  Nearest 3 neighbors of ◆:
  ● ● ●

  Majority: ● → New point classified as ●

  ──────────────────────────────

  K=5: Look at 5 nearest neighbors

  Nearest 5 neighbors of ◆:
  ● ● ● ○ ○

  Majority: ● → Still classified as ●

Choosing K

Small K (e.g., K=1) — Very sensitive to noise. A single outlier can change the prediction. Low bias, high variance.
Large K (e.g., K=20) — Smoother decision boundaries, less sensitive to noise. But may miss local patterns. High bias, low variance.
Rule of thumb — Start with K = √n (square root of the number of training samples) and tune from there.

Distance Metrics

KNN relies on a distance metric to find "nearest" neighbors:

Euclidean Distance — Straight-line distance (most common)
Manhattan Distance — Sum of absolute differences (city blocks)
Minkowski Distance — Generalization of both

Important: Features must be normalized/scaled before using KNN, or features with larger ranges will dominate the distance calculation.

Pros and Cons

Pros: Simple to understand and implement, no training phase (lazy learning), naturally handles multi-class problems, adapts to new data easily.

Cons: Slow at prediction time (must compute distances to all training points), memory-intensive (stores entire dataset), struggles with high-dimensional data, sensitive to irrelevant features and unscaled data.

When to Use KNN

KNN works well for small to medium-sized datasets with clear local structure. It's commonly used for recommendation systems, pattern recognition, and as a baseline for more complex models. For large datasets, consider approximate nearest neighbor algorithms (like KD-Trees or Ball Trees) to speed things up.

🧪 Quick Quiz

How does KNN classify a new data point?

← Previous Decision Trees & Random Forests

Next → Support Vector Machines