Incremental Clustering on Linked Data

« (…) In this paper, we propose and evaluate new scalable approaches for incremental entity clustering that support the continuous addition of new entities and data sources. The implementation is based on the distributed processing framework Apache Flink. A detailed performance evaluation with real and synthetically customized datasets shows the effectiveness and scalability of the incremental clustering approaches. (…) »

Source >, Nentwig, Markus; Rahm, Erhard, Accepted for publication: IEEE International Conference on Data Mining Workshop, ICDMW 2018, Singapore 2018-11