Live show

Παρουσίαση της Μεταπτυχιακής Εργασίας του μεταπτυχιακού φοιτητή του Τμήματος Επιστήμης Υπολογιστών κ. Καρδουλάκη Νικόλαου

Παρουσίαση της Μεταπτυχιακής Εργασίας του μεταπτυχιακού φοιτητή του Τμήματος Επιστήμης Υπολογιστών κ. Καρδουλάκη Νικόλαου με θέμα: "HInT: Hybrid and  Incremental Type Discovery for Large RDF Data Sources ".

29 Οκτωβρίου 2020, 10:00-12:00

Περιγραφή:  The rapid explosion of linked data has resulted into many weakly structured and incomplete data sources, where type declarations are completely or partially missing. On the other hand, type information is essential for a number of tasks such as query answering, integration, summarization and partitioning. Existing approaches for type discovery, either completely ignore type declarations available in the dataset (implicit type discovery approaches), or have to rely on partial availability of those types, in order to complement them (explicit type enrichment approaches). Implicit type discovery approaches are based on instance grouping, which requires an exhaustive comparison between the instances. This process is expensive and not incremental. Explicit type enrichment approaches on the other hand, can not process data sources that have little or no schema information.

In this thesis, we present HInT, the first incremental and hybrid type discovery system for RDF datasets. It enables type discovery in datasets where type declarations are either partially available or completely missing. To achieve this goal, we incrementally identify the patterns of the various instances, we index and then group them to identify the types. During the processing of an instance, our approach exploits its type information, if available, to improve the quality of the discovered types by guiding the classification of the new instance in the correct group and by refining the groups already built. We analytically and experimentally show that our approach dominates in terms of effectiveness and most importantly efficiency, competitors from both worlds, implicit type discovery and explicit type enrichment.

Eπιβλέπων :  Καθηγητής, Δ. Πλεξουσάκης
Not enabled

Κάλυψη

Έναρξη:
29-10-2020 10:00


Λήξη:
29-10-2020 12:00

Συνδέσεις

Μέγιστες:
4