Question d’entretien chez Microsoft

How to measure distance between data point?

Réponses aux questions d'entretien

Utilisateur anonyme

29 mars 2016

It depends on whether the data point contains numeric, categorical or mixed data, and also on the purpose of the distance function. For numeric, Euclidian is simple, but cosine is robust to scaling and dtw is good for time series. For categorical, a basic one will be Jaccard similarity, which local sensitive hashing might apply.

5

Utilisateur anonyme

12 juin 2015

Euclidean distance, manhattan distance

1