> Hi,
>
> I have two questions regarding the distance printed in the labels file.
>
> a. Is the distance in the label file correspond to the Euclidean
> distance between vestors?
It is the Euclidean distance between that vector and its cluster center.
> b. Is the distance calculated between the centroid of the cluster and
> other vectors in the cluster?
Yes.
> I believe in k-means the centroid need not map to an actual point in the
> data set. Some clusters did not have a zero distance value so I guessed
> the distance would have been calculated with respect to the centroid
> chosen by k-means.
That's right.
- greg