Managing the Evolution and Preservation of the Data Web

Debattista J,Fernández JD,Vidal ME,Umbrich J

Publication

Managing the Evolution and Preservation of the Data Web

The 4th edition of this workshop has targeted one of the emerging and fundamental problems in the Semantic Web, specifically the preservation of evolving linked datasets. This topic is of particular relevance to the Semantic Web community since it raises awareness of the many research challenges for preserving and managing dynamic linked datasets. Fostering active usage of such evolving datasets requires further research advances on topics such as storage, synchronisation, change representation and querying over evolving graphs. This year, we accepted three papers, we invited a keynote speaker, and we discussed on future steps of the community, which we describe in brief.

In this year’s contributions we see a focus on the management of data versioning and the preservation of evolving knowledge. Singh et al. [4] present DELTA-LD, a change detection mechanisms for linked datasets. DELTA-LD focuses on detecting changes at both resource level (creation, removal, update, movement, or renewal of a resource) and triple level (deleting or adding a triple). To do so, the approach considers (i) the extraction of features from the linked datasets in order to detect changes and identify similar representations in different versions (i.e. moved resources), and (ii) a classification of the changes and a representation of the change model using a provided ontology. Pandit et al. [3] investigate on how to represent changes in consents and activities regarding the novel General Data Protection Regulation (GDPR).

In their position paper, they first discuss the use of PROV to represent the provenance of activities and ODRL to represent the consent, and identify the influence of consent changes. Then, they discuss on detecting and representing change in activities and how to link and use the changes to demonstrate the compliance w.r.t DDPR obligations. Laajimi et al. [2] focus on evaluating the performance of archiving engines. In particular, they propose and evaluate the use of the SPARK distributed system to archive RDF data. Thus, authors represent RDF data and changes in SPARK dataframes, while archiving queries are resolved via SPARK SQL. Then, the performance of different versioning approaches (e.g. fully materialized version versus representing only the changing triples in each version) are evaluated, with particular attention to measuring the different performance of starand chain queries

J. Debattista, J. D. Fernández, M. Vidal, J. Umbrich, Managing the Evolution and Preservation of the Data Web, J. Web Semant. 54 (2019) 1-3

Cookie	Duration	Description
cookielawinfo-checkbox-analytics	1 year	Set by the GDPR Cookie Consent plugin, this cookie records the user consent for the cookies in the "Analytics" category.
cookielawinfo-checkbox-functional	1 year	The GDPR Cookie Consent plugin sets the cookie to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	1 year	Set by the GDPR Cookie Consent plugin, this cookie records the user consent for the cookies in the "Necessary" category.
CookieLawInfoConsent	1 year	CookieYes sets this cookie to record the default button state of the corresponding category and the status of CCPA. It works only in coordination with the primary cookie.
PHPSESSID	session	This cookie is native to PHP applications. The cookie stores and identifies a user's unique session ID to manage user sessions on the website. The cookie is a session cookie and will be deleted when all the browser windows are closed.
viewed_cookie_policy	1 year	The GDPR Cookie Consent plugin sets the cookie to store whether or not the user has consented to use cookies. It does not store any personal data.

Cookie	Duration	Description
mec_cart	1 month	Provides functionality for our ticket shop
VISITOR_INFO1_LIVE	6 months	YouTube sets this cookie to measure bandwidth, determining whether the user gets the new or old player interface.
VISITOR_PRIVACY_METADATA	6 months	YouTube sets this cookie to store the user's cookie consent state for the current domain.
YSC	session	Youtube sets this cookie to track the views of embedded videos on Youtube pages.
yt-remote-connected-devices	never	YouTube sets this cookie to store the user's video preferences using embedded YouTube videos.
yt-remote-device-id	never	YouTube sets this cookie to store the user's video preferences using embedded YouTube videos.
yt.innertube::nextId	never	YouTube sets this cookie to register a unique ID to store data on what videos from YouTube the user has seen.
yt.innertube::requests	never	YouTube sets this cookie to register a unique ID to store data on what videos from YouTube the user has seen.

Cookie	Duration	Description
_ga	1 year	Google Analytics sets this cookie to calculate visitor, session and campaign data and track site usage for the site's analytics report. The cookie stores information anonymously and assigns a randomly generated number to recognise unique visitors.
_ga_*	1 year	Google Analytics sets this cookie to store and count page views.
_gat_gtag_UA_*	1 min	Google Analytics sets this cookie to store a unique user ID.
_gid	1 day	Google Analytics sets this cookie to store information on how visitors use a website while also creating an analytics report of the website's performance. Some of the collected data includes the number of visitors, their source, and the pages they visit anonymously.

Managing the Evolution and Preservation of the Data Web

CSH Newsletter