Published April 23, 2008 | Version Published
Journal Article Open

The Metric Nearness Problem

Abstract

Metric nearness refers to the problem of optimally restoring metric properties to distance measurements that happen to be nonmetric due to measurement errors or otherwise. Metric data can be important in various settings, for example, in clustering, classification, metric-based indexing, query processing, and graph theoretic approximation algorithms. This paper formulates and solves the metric nearness problem: Given a set of pairwise dissimilarities, find a "nearest" set of distances that satisfy the properties of a metric—principally the triangle inequality. For solving this problem, the paper develops efficient triangle fixing algorithms that are based on an iterative projection method. An intriguing aspect of the metric nearness problem is that a special case turns out to be equivalent to the all pairs shortest paths problem. The paper exploits this equivalence and develops a new algorithm for the latter problem using a primal-dual method. Applications to graph clustering are provided as an illustration. We include experiments that demonstrate the computational superiority of triangle fixing over general purpose convex programming software. Finally, we conclude by suggesting various useful extensions and generalizations to metric nearness.

Additional Information

Received by the editors March 2, 2006; accepted for publication (in revised form) by R. Nabben September 11, 2007; published electronically April 23, 2008. This research was supported by NSF grant CCF-0431257, NSF Career Award ACI-0093404, and NSF-ITR award IIS-0325116. A preliminary version of this work appeared at NIPS 2004, Vancouver, Canada.

Attached Files

Published - 4FC5Cd01.pdf

Files

4FC5Cd01.pdf

Files (314.2 kB)

Name Size Download all
md5:fc5a78584610425c817a846e889ad63b
314.2 kB Preview Download

Additional details

Identifiers

Eprint ID
23672
Resolver ID
CaltechAUTHORS:20110513-152152857

Funding

NSF
CCF-0431257
NSF
ACI-0093404
NSF
IIS-0325116

Dates

Created
2011-05-13
Created from EPrint's datestamp field
Updated
2021-11-09
Created from EPrint's last_modified field