Jump to content

GeoSPARQL: Difference between revisions

From Wikipedia, the free encyclopedia
Content deleted Content added
No edit summary
No edit summary
Line 69: Line 69:
The former can reveal whether a triplestore gives a timely answer to a GeoSPARQL query and may or may not check the answer for correctness. The latter checks whether a triplestore gives compliant answers with respect to the definitions of the GeoSPARQL 1.0 standard irrespective of the time the query takes for execution.
The former can reveal whether a triplestore gives a timely answer to a GeoSPARQL query and may or may not check the answer for correctness. The latter checks whether a triplestore gives compliant answers with respect to the definitions of the GeoSPARQL 1.0 standard irrespective of the time the query takes for execution.


Well-known geospatial performance benchmarks include the Geographica<ref>Garbis G., Kyzirakos K., Koubarakis M. (2013) Geographica: A Benchmark for Geospatial RDF Stores (Long Version). In: Alani H. et al. (eds) The Semantic Web – ISWC 2013. ISWC 2013. Lecture Notes in Computer Science, vol 8219. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-41338-4_22</ref> and Geographica 2 <ref>Ioannidis, T., Garbis, G., Kyzirakos, K. et al. Evaluating Geospatial RDF Stores Using the Benchmark Geographica 2. J Data Semant (2021). https://doi.org/10.1007/s13740-021-00118-x</ref> benchmarks which track the performance of predefined sets of queries on synthetic and real-world datasets. They each test a subset of GeoSPARQL query functions for performance.
Well-known geospatial performance benchmarks include the Geographica<ref>https://doi.org/10.1007/978-3-642-41338-4_22</ref> and Geographica 2<ref>https://doi.org/10.1007/s13740-021-00118-x</ref> benchmarks which track the performance of predefined sets of queries on synthetic and real-world datasets. They each test a subset of GeoSPARQL query functions for performance.
Another performance benchmark by Huang et.al. <ref>Huang, Weiming, Syed A. Raza, Oleg Mirzov, and Lars Harrie. 2019. "Assessment and Benchmarking of Spatially Enabled RDF Stores for the Next Generation of Spatial Data Infrastructure" ISPRS International Journal of Geo-Information 8, no. 7: 310. https://doi.org/10.3390/ijgi8070310</ref> assessed the performance of GeoSPARQL-enabled triple stores as part of a spatial data infrastructure.
Another performance benchmark by Huang et.al. <ref>https://doi.org/10.3390/ijgi8070310</ref> assessed the performance of GeoSPARQL-enabled triple stores as part of a spatial data infrastructure.


Compliance benchmarking of OGC standards is usually conducted as part of the [https://cite.opengeospatial.org/teamengine/ OGC Team Engine Test Suite] which allows companies to get certified for implementing certain OGC specifications correctly.
Compliance benchmarking of OGC standards is usually conducted as part of the [https://cite.opengeospatial.org/teamengine/ OGC Team Engine Test Suite] which allows companies to get certified for implementing certain OGC specifications correctly.
As of 2021, however, the OGC Team Engine does not provide a set of compliance tests to test GeoSPARQL compliance.
As of 2021, however, the OGC Team Engine does not provide a set of compliance tests to test GeoSPARQL compliance.
Nevertheless, in 2021, Jovanovik et.al. <ref>Jovanovik, Milos, Timo Homburg, and Mirko Spasić. 2021. "A GeoSPARQL Compliance Benchmark" ISPRS International Journal of Geo-Information 10, no. 7: 487. https://doi.org/10.3390/ijgi10070487</ref> developed the first comprehensive, reproducible GeoSPARQL Compliance benchmark in which nine different triple stores were initially tested.
Nevertheless, in 2021, Jovanovik et.al. <ref>https://doi.org/10.3390/ijgi10070487</ref> developed the first comprehensive, reproducible GeoSPARQL Compliance benchmark in which nine different triple stores were initially tested.
The results of these first compliance tests along with the software <ref>Software for the GeoSPARQL compliance benchmark. Jovanovik, M.; Homburg, T.; and Spasić, M. Software Impacts, 8: 100071. March 2021 https://www.sciencedirect.com/science/article/pii/S2665963821000191</ref> are available on [https://github.com/OpenLinkSoftware/GeoSPARQLBenchmark Github].
The results of these first compliance tests along with the software <ref>https://doi.org/10.1016/j.simpa.2021.100071</ref> are available on [https://github.com/OpenLinkSoftware/GeoSPARQLBenchmark Github].


==Submission==
==Submission==

Revision as of 16:49, 20 July 2021

GeoSPARQL is a standard for representation and querying of geospatial linked data for the Semantic Web from the Open Geospatial Consortium (OGC).[1] The definition of a small ontology based on well-understood OGC standards is intended to provide a standardized exchange basis for geospatial RDF data which can support both qualitative and quantitative spatial reasoning and querying with the SPARQL database query language.[2]

The Ordnance Survey Linked Data Platform uses OWL mappings for GeoSPARQL equivalent properties in its vocabulary.[3][4] The LinkedGeoData data set is a work of the Agile Knowledge Engineering and Semantic Web (AKSW) research group at the University of Leipzig,[5] a group mostly known for DBpedia, that uses the GeoSPARQL vocabulary to represent OpenStreetMap data.

In particular, GeoSPARQL provides for:

Example

The following example SPARQL query could help model the question "What is within the bounding box defined by 38°54′49″N 77°05′20″W / 38.913574°N 77.089005°W / 38.913574; -77.089005 and 38°53′11″N 77°01′48″W / 38.886321°N 77.029953°W / 38.886321; -77.029953?"[6]

PREFIX geo: <http://www.opengis.net/ont/geosparql#>
PREFIX geof: <http://www.opengis.net/def/function/geosparql/>

SELECT ?what
WHERE {
  ?what geo:hasGeometry ?geometry .

  FILTER(geof:sfWithin(?geometry,
     "POLYGON((-77.089005 38.913574,-77.029953 38.913574,-77.029953 38.886321,-77.089005 38.886321,-77.089005 38.913574))"^^geo:wktLiteral))
}

RCC8 use in GeoSPARQL

RCC8 has been implemented in GeoSPARQL as described below:

A graphical representation of Region Connection Calculus (RCC: Randell, Cui and Cohn, 1992) and the links to the equivalent naming by the Open Geospatial Consortium (OGC) with their equivalent URIs.
A graphical representation of Region Connection Calculus (RCC: Randell, Cui and Cohn, 1992) and the links to the equivalent naming by the Open Geospatial Consortium (OGC) with their equivalent URIs.

Implementations

There are (almost) no complete implementations of GeoSPARQL, there are, however partial or vendor implementations of GeoSPARQL. Currently there are the following implementations:

Apache Marmotta
GeoSPARQL was implemented in the context of the Google Summer of Code 2015.[7] on Apache Marmotta; it uses PostGIS, and it is available just for PostgreSQL.
Apache Jena
Since version 2.11 Apache Jena has a GeoSPARQL extension.[8]
Parliament
Parliament has an almost complete implementation of GeoSPARQL by using JENA and a modified ARQ query processor.[9]
Eclipse RDF4J
Eclipse RDF4J is an open-source Java framework for scalable RDF processing, storage, reasoning and SPARQL querying. It offers support for a large subset of GeoSPARQL functionality.[10]
Strabon
Strabon is an open-source semantic spatiotemporal RDF store that supports two popular extensions of SPARQL: stSPARQL and GeoSPARQL. Strabon is built by extending the well-known RDF store Sesame and extends Sesame's components to manage thematic, spatial and temporal data that is stored in the backend RDBMS. It has been fully tested with PostgreSQL (with PostGIS and PostgreSQL-Temporal extensions[11]) and MonetDB (with geom[12] module).
OpenSahara uSeekM IndexingSail Sesame Sail plugin
uSeekM IndexingSail uses a PostGIS installation to deliver GeoSPARQL. They deliver partial implementation of GeoSPARQL along with some vendor prefixes.[13][14]
Oracle Spatial and Graph
GraphDB
GraphDB is an enterprise ready Semantic Graph Database, compliant with W3C Standards. Semantic graph databases (also called RDF triplestores) provide the core infrastructure for solutions where modelling agility, data integration, relationship exploration and cross-enterprise data publishing and consumption are important.
Stardog
Stardog is an enterprise data unification platform built on smart graph technology: query, search, inference, and data virtualization.
Virtuoso Universal Server
Virtuoso Universal Server is a middleware and database engine hybrid that combines the functionality of a traditional Relational database management system (RDBMS), Object-relational database (ORDBMS), virtual database, RDF, XML, free-text, web application server and file server functionality in a single system.[15]

Performance and Compliance Benchmarking

Benchmarking GeoSPARQL 1.0 and geospatial-enabled triplestores, in general, has been conducted using several approaches. One can distinguish between performance and compliance benchmarks. The former can reveal whether a triplestore gives a timely answer to a GeoSPARQL query and may or may not check the answer for correctness. The latter checks whether a triplestore gives compliant answers with respect to the definitions of the GeoSPARQL 1.0 standard irrespective of the time the query takes for execution.

Well-known geospatial performance benchmarks include the Geographica[16] and Geographica 2[17] benchmarks which track the performance of predefined sets of queries on synthetic and real-world datasets. They each test a subset of GeoSPARQL query functions for performance. Another performance benchmark by Huang et.al. [18] assessed the performance of GeoSPARQL-enabled triple stores as part of a spatial data infrastructure.

Compliance benchmarking of OGC standards is usually conducted as part of the OGC Team Engine Test Suite which allows companies to get certified for implementing certain OGC specifications correctly. As of 2021, however, the OGC Team Engine does not provide a set of compliance tests to test GeoSPARQL compliance. Nevertheless, in 2021, Jovanovik et.al. [19] developed the first comprehensive, reproducible GeoSPARQL Compliance benchmark in which nine different triple stores were initially tested. The results of these first compliance tests along with the software [20] are available on Github.

Submission

The GeoSPARQL standard was submitted to the OGC by:

Future development

With regards to future work, the GeoSPARQL standard states:

Obvious extensions are to define new conformance classes for other standard serializations of geometry data (e.g. KML, GeoJSON). In addition, significant work remains in developing vocabularies for spatial data, and expanding the GeoSPARQL vocabularies with OWL axioms to aid in logical spatial reasoning would be a valuable contribution. There are also large amounts of existing feature data represented in either a GML file (or similar serialization) or in a datastore supporting the general feature model. It would be beneficial to develop standard processes for converting (or virtually converting and exposing) this data to RDF.

In 2019, the OGC's GeoSemantics Domain Working Group set out to assess the current usage of GeoSPARQL in different domains in the White Paper "OGC Benefits of Representing Spatial Data Using Semantic and Graph Technologies"[21] and collected initial feature requests to extend GeoSPARQL.

This led to the re-establishment of the GeoSPARQL Standards Working Group with a newly formed working group charter, in September 2020. The group is working towards a new release of the GeoSPARQL standard, with non-breaking changes - GeoSPARQL 1.1 - in the summer of 2021, the development of which can be followed on Github.

At the GeoLD workshop 2021, held as part of the Extended Semantic Web Conference 2021, an outline of the additions which are likely to be present in GeoSPARQL 1.1 have been presented [22].

See also

References

  1. ^ Battle & Kolas 2012, p. 355.
  2. ^ Battle & Kolas 2012, p. 358.
  3. ^ Goodwin, John (26 April 2013). "GeoSPARQL and Ordnance Survey Linked Data". John’s Weblog. {{cite web}}: External link in |website= (help)
  4. ^ Gemma (3 June 2013). "New Linked Data service launches". Ordnance Survey Blog. {{cite web}}: External link in |website= (help)
  5. ^ "Imprint". AKSW. 2012-05-18.
  6. ^ Battle & Kolas 2012, p. 363.
  7. ^ https://wiki.apache.org/marmotta/GSoC/2015/MARMOTTA-584
  8. ^ [https://jena.apache.org/documentation/query/spatial-query.html
  9. ^ http://parliament.semwebcentral.org/
  10. ^ http://docs.rdf4j.org/programming/#_geosparql
  11. ^ https://github.com/jeff-davis/PostgreSQL-Temporal
  12. ^ https://www.monetdb.org/Documentation/Extensions/GIS
  13. ^ "Archived copy". Archived from the original on 2014-04-15. Retrieved 2012-12-16.{{cite web}}: CS1 maint: archived copy as title (link)
  14. ^ "Archived copy". Archived from the original on 2014-04-15. Retrieved 2014-04-14.{{cite web}}: CS1 maint: archived copy as title (link)
  15. ^ Williams, Hugh (October 29, 2018). "Virtuoso GeoSPARQL Demo Server". OpenLink Software Community Forum. Retrieved 2021-02-02.
  16. ^ https://doi.org/10.1007/978-3-642-41338-4_22
  17. ^ https://doi.org/10.1007/s13740-021-00118-x
  18. ^ https://doi.org/10.3390/ijgi8070310
  19. ^ https://doi.org/10.3390/ijgi10070487
  20. ^ https://doi.org/10.1016/j.simpa.2021.100071
  21. ^ OGC Benefits of Representing Spatial Data Using Semantic and Graph Technologies. Abhayaratna, J.; van den Brink, L.; Car, N.; Atkinson, R.; Homburg, T.; Knibbe, F.; McGlinn, K.; Wagner, A.; Bonduel, M.; Holten Rasmussen, M.; and Thiery, F., OGC White Paper, http://docs.ogc.org/wp/19-078r1/19-078r1.html, October 2020.
  22. ^ GeoSPARQL 1.1: an almost decadal update to the most important geospatial LOD standard. Car, N. J.; and Homburg, T. GeoLD Workshop at ESWC 2021, https://github.com/surroundaustralia/geosparql11-geold-paper/blob/master/manuscript.pdf, May 2021