- #1
getUsername
- 3
- 0
Homework Statement
A dissimilarity measure d(x, y) for two data points x and y typically satisfy the following three properties:
1. [itex]d(x, y) ≥ 0[/itex] and [itex]d(x, y) = 0[/itex] if and only if [itex]x = y[/itex]
2. [itex]d(x, y) = d(y, x)[/itex]
3. [itex]d(x,z) ≤ d(x, y) + d(y,z)[/itex]The following method has been proposed for measuring the dissimilarity[read: distance] between two sets of data points A = {xa1, xa2, . . . , xam} and B = {xb1, xb2, . . . , xbn}Average Linkage(UPGMA):
[tex]d(A,B) = {\frac{Ʃ_{x∈A} Ʃ_{y∈B} d(x, y)}{|A||B|}} [/tex]Show that the linkage method satisfies each of the properties or provide a counter example (a visual/diagram representation of any counter example is sufficient if appropriate).
2. Specific Problem
I am having difficulty proving that the Average Linkage method does/does not satisfy the third property of metric space, the triangle inequality.
The Euclidean distance formula, the Absolute distance formula(Manhattan) or any other distance formula may be used to show this.