A hierarchical semantic-based distance for nominal histogram comparison

Abstract: We propose a new distance, called Hierarchical Semantic-Based Distance (HSBD), for comparing nominal histograms equipped with a dissimilarity matrix that encodes the semantic correlations between the bins. The distance is computed with a hierarchical strategy that progressively merges the considered instances (and their bins) according to their semantic proximity. At each level of this hierarchy, a standard bin-to-bin distance is computed between the corresponding pair of histograms. These bin-to-bin distances are then fused into the proposed distance by taking into account the semantic coherency of their associated level. As a result, the proposed distance can handle histograms that are usually compared using cross-bin distances. It preserves the advantages of such cross-bin distances (namely robustness to histogram translation and to histogram bin size issues) while inheriting the low computational cost of bin-to-bin distances. Validation experiments on geographical data classification support the relevance and usefulness of the proposed distance.
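
The strategy outlined in the abstract (merge bins by semantic proximity, compute a bin-to-bin distance at each level, then fuse the per-level distances) can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the merging criterion, the bin-to-bin distance, or the fusion weights. The sketch assumes average-linkage merging, the L1 (Manhattan) distance, and level weights equal to the increase in merge dissimilarity between consecutive levels. The names `manhattan` and `hierarchical_distance_sketch` are hypothetical.

```python
from itertools import combinations


def manhattan(h1, h2):
    """Bin-to-bin (L1) distance between two histograms of equal length."""
    return sum(abs(a - b) for a, b in zip(h1, h2))


def hierarchical_distance_sketch(h1, h2, dissim):
    """Illustrative hierarchical fusion of bin-to-bin distances.

    Assumptions (not taken from the paper): average-linkage merging,
    L1 as the bin-to-bin distance, and level weights equal to the
    increase in merge dissimilarity between consecutive levels.
    """

    def linkage(pair):
        # Average semantic dissimilarity between two clusters of bins.
        a, b = pair
        return sum(dissim[i][j] for i in a for j in b) / (len(a) * len(b))

    clusters = [[i] for i in range(len(h1))]  # each bin starts alone
    prev_height = 0.0
    total = 0.0
    while len(clusters) > 1:
        # Semantically closest pair of clusters at the current level.
        a, b = min(combinations(clusters, 2), key=linkage)
        height = linkage((a, b))
        # Histograms at the current level: sum the bins of each cluster.
        m1 = [sum(h1[i] for i in c) for c in clusters]
        m2 = [sum(h2[i] for i in c) for c in clusters]
        # Weight this level's bin-to-bin distance by how long it "lasts"
        # in the hierarchy before the next merge happens.
        total += max(height - prev_height, 0.0) * manhattan(m1, m2)
        prev_height = max(prev_height, height)
        # Merge the selected pair to build the next, coarser level.
        clusters = [c for c in clusters if c is not a and c is not b] + [a + b]
    return total


# Bins 0 and 1 are semantically close; bin 2 is far from both.
dissim = [[0.0, 0.1, 1.0],
          [0.1, 0.0, 1.0],
          [1.0, 1.0, 0.0]]
d_close = hierarchical_distance_sketch([1, 0, 0], [0, 1, 0], dissim)
d_far = hierarchical_distance_sketch([1, 0, 0], [0, 0, 1], dissim)
```

In this toy example, mass shifted between the two semantically close bins yields a smaller distance (`d_close`) than mass shifted to the distant bin (`d_far`), although a plain L1 distance rates both pairs equally. Only inexpensive bin-to-bin comparisons are evaluated at each level.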
Document type: Journal article

https://hal.univ-reims.fr/hal-01719116
Contributor: Nicolas Passat
Submitted on: Wednesday, February 28, 2018 - 1:29:16 AM
Last modification on: Tuesday, August 20, 2019 - 3:44:03 PM
Long-term archiving on: Monday, May 28, 2018 - 2:02:23 PM

Citation

Camille Kurtz, Pierre Gançarski, Nicolas Passat, Anne Puissant. A hierarchical semantic-based distance for nominal histogram comparison. Data and Knowledge Engineering, Elsevier, 2013, 87, pp.206-225. ⟨10.1016/j.datak.2013.06.002⟩. ⟨hal-01719116⟩
