$$d(\mathbf{A}, \mathbf{B}) = \sqrt{\sum_{i=1}^{n} (A_i - B_i)^2}$$
$$d\big((x_1, x_2), (y_1, y_2)\big) = \sqrt{(x_1 - y_1)^2 + (x_2 - y_2)^2}$$
$$\mathcal{N}(\mathbf{x})=\left\{\mathbf{x}_i \;:\;\|\mathbf{x}_i-\mathbf{x}\|\le h\right\}$$
$$\mathbf{x}^{(t+1)} = \frac{1}{\left|\mathcal{N}\!\left(\mathbf{x}^{(t)}\right)\right|} \sum_{\mathbf{x}_i\in\mathcal{N}\!\left(\mathbf{x}^{(t)}\right)} \mathbf{x}_i$$
Với bandwidth = 3.5
Dự đoán thuộc cluster (cụm) nào?
$$x_1 = 2, x_2 = 4$$
$$x_1 = 5, x_2 = 7$$
| Index | x1 | x2 |
|---|---|---|
| 0 | 2 | 10 |
| 1 | 2 | 5 |
| 2 | 8 | 4 |
| 3 | 5 | 8 |
| 4 | 7 | 5 |
| 5 | 6 | 4 |
| 6 | 1 | 2 |
| 7 | 4 | 9 |