Data Matching

Deciding whether two data elements are the "same" (a.k.a. a match) or not

Data matching generally refers to the process of deciding whether two data elements are the “same” (a.k.a. a match) or not, where each data element may be a string, a tuple, a column, and so on. Data matching is a key concept in data integration and data preparation, and it covers a wide spectrum of tasks. We consider seven common data matching tasks: entity matching, entity linking, entity alignment, string matching, column type annotation, schema matching, and ontology matching.
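As a rough illustration of the match / non-match decision described above, the sketch below compares two strings, or two tuples attribute by attribute, and thresholds a similarity score. It is not the method of any paper listed here: the function names, the use of Python's standard `difflib.SequenceMatcher` as the similarity measure, and the 0.8 threshold are all assumptions chosen for illustration.

```python
from difflib import SequenceMatcher


def similarity(a: str, b: str) -> float:
    """Return a similarity ratio in [0, 1], ignoring case and outer whitespace."""
    return SequenceMatcher(None, a.strip().lower(), b.strip().lower()).ratio()


def is_match(a: str, b: str, threshold: float = 0.8) -> bool:
    """String matching: declare a match when similarity reaches the threshold.

    The threshold of 0.8 is an arbitrary, hypothetical default.
    """
    return similarity(a, b) >= threshold


def tuple_is_match(t1: tuple, t2: tuple, threshold: float = 0.8) -> bool:
    """Tuple-level matching: average the attribute-wise similarities.

    Assumes both tuples have the same, non-empty list of attributes.
    """
    if len(t1) != len(t2) or len(t1) == 0:
        raise ValueError("tuples must be non-empty and of equal length")
    scores = [similarity(str(x), str(y)) for x, y in zip(t1, t2)]
    return sum(scores) / len(scores) >= threshold


if __name__ == "__main__":
    print(is_match("Apple Inc.", "apple inc."))
    print(is_match("Apple Inc.", "Microsoft Corp."))
    print(tuple_is_match(("iPhone 13", "Apple"), ("iphone 13", "apple")))
```

A fixed threshold over a hand-picked similarity function is only the simplest baseline. The learned approaches in the publications below replace it with trained models.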

References

2023

  1. Unicorn: A Unified Multi-tasking Model for Supporting Matching Tasks in Data Integration
    Jianhong Tu, Ju Fan, Nan Tang, Peng Wang, and 4 more authors
    Proc. ACM Manag. Data, 2023

2022

  1. DADER: Hands-Off Entity Resolution with Domain Adaptation
    Jianhong Tu, Xiaoyue Han, Ju Fan, Nan Tang, and 3 more authors
    Proc. VLDB Endow., 2022
  2. Domain Adaptation for Deep Entity Resolution
    Jianhong Tu, Ju Fan, Nan Tang, Peng Wang, and 4 more authors
    In SIGMOD ’22: International Conference on Management of Data, Philadelphia, PA, USA, June 12-17, 2022

2021

  1. Deep Learning for Blocking in Entity Matching: A Design Space Exploration
    Saravanan Thirumuruganathan, Han Li, Nan Tang, Mourad Ouzzani, and 4 more authors
    Proc. VLDB Endow., 2021

2018

  1. Distributed Representations of Tuples for Entity Resolution
    Muhammad Ebraheem, Saravanan Thirumuruganathan, Shafiq R. Joty, Mourad Ouzzani, and 1 more author
    Proc. VLDB Endow., 2018

2017

  1. Synthesizing Entity Matching Rules by Examples
    Rohit Singh, Venkata Vamsikrishna Meduri, Ahmed K. Elmagarmid, Samuel Madden, and 4 more authors
    Proc. VLDB Endow., 2017
  2. Generating Concise Entity Matching Rules
    Rohit Singh, Venkata Vamsikrishna Meduri, Ahmed K. Elmagarmid, Samuel Madden, and 4 more authors
    In Proceedings of the 2017 ACM International Conference on Management of Data, SIGMOD Conference 2017, Chicago, IL, USA, May 14-19, 2017