Abstract
Nowadays, RDF data becomes more and more popular on the Web due to the advances of the Semantic Web and the Linked Open Data initiatives. Several works are focused on transforming relational databases to RDF by storing related data in N-Triple serialization format. However, these approaches do not take into account the existing normalization of their databases since N-Triple format allows data redundancy and does not control any normalization by itself. Moreover, the mostly used and recommended serialization formats, such as RDF/XML, Turtle, and HDT, have either high human-readability but waste storage capacity, or focus further on storage capacities while providing low human-readability. To overcome these limitations, we propose here a new serialization format, called S-RDF. By considering the structure (graph) and values of the RDF data separately, S-RDF reduces the duplicity of values by using unique identifiers. Results show an important improvement over the existing serialization formats in terms of storage (up to 71,66% w.r.t. N-Triples) and human readability.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
Document Type Definition (DTD) defines the structure and the legal elements and attributes of an XML document.
- 2.
Centrality identifies the most related nodes within a graph, which have a high number of relations.
- 3.
It is one of the four normalization forms, which consists on a Canonical Decomposition, followed by a Canonical Composition -http://www.unicode.org/reports/tr15/.
- 4.
S-RDF: http://rdf-sequence.sigappfr.org.
- 5.
Jena is a Java framework for building Semantic Web applications. It provides a extensive Java libraries for helping developers develop code that handles RDF, RDFS, RDFa, OWL and SPARQL in line with published W3C recommendations - https://jena.apache.org/about_jena/about.html.
- 6.
Information about persons extracted from the English and Germany Wikipedia, represented by the FOAF vocabulary - http://wiki.dbpedia.org/Downloads2015-10.
- 7.
Geographic coordinates extracted from Wikipedia - https://wiki.dbpedia.org/downloads-2016-10.
- 8.
Easy-Converte: http://www.easyrdf.org/converter.
- 9.
RDF-Translator: https://rdf-translator.appspot.com.
- 10.
The form is available here: https://forms.gle/DNMfsp5LL3nw1hW9A.
References
Microdata to RDF - Second Edition - Transformation from HTML+Microdata to RDF. https://www.w3.org/TR/microdata-rdf/ (2014). Accessed 01 July 2019
Phillips, M.D.A.: Tags for identifying languages. https://tools.ietf.org/html/bcp47. Accessed 01 July 2019
Bornea, M.A., et al.: Building an efficient RDF store over a relational database. In: Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data, SIGMOD 2013, pp. 121–132. ACM, New York (2013)
Būmans, G., Čerāns, K.: RDB2OWL: A practical approach for transforming RDB data into RDF/OWL. In: Proceedings of the 6th International Conference on Semantic Systems, I-SEMANTICS 2010, pp. 25:1–25:3. ACM, New York (2010)
Chantrapornchai, C., Makpaisit, P.: TripleiD-C: low cost compressed representation for RDF query processing in GPUs. In: Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region, HPC Asia 2018, pp. 261–270. ACM, New York (2018)
Cyganiak, R., Wood, D., Lanthaler, M.: RDF 1.1 concepts and abstract syntax. Technical report (2014). Accessed 06 Dec 2016
Duerst, M., Suignard, M.: Internationalized resource identifiers (IRIs). Technical report, Microsoft Corporation (2004)
Fernández, J.D.: Binary RDF for scalable publishing, exchanging and consumption in the web of data. In: Proceedings of the 21st International Conference on World Wide Web, WWW 2012 Companion, pp. 133–138. ACM, New York (2012)
Goasdoué, F., Manolescu, I., Roatiş, A.: Getting more RDF support from relational databases. In: Proceedings of the 21st International Conference on World Wide Web, WWW 2012 Companion, pp. 515–516. ACM, New York (2012)
Hausenblas, M., Ding, L., Peristeras, V.: Linked open government data. IEEE Intell. Syst. 27, 11–15 (2012)
Hernández-Illera, A., Martínez-Prieto, M.A., Fernández, J.D.: Serializing RDF in compressed space. In: 2015 Data Compression Conference, pp. 363–372, April 2015
Huang, J.-Y., Lange, C., Auer, S.: Streaming transformation of XML to RDF using XPath-based mappings. In: Proceedings of the 11th International Conference on Semantic Systems, SEMANTICS 2015, pp. 129–136. ACM, New York (2015)
Konstantinou, N., Kouis, D., Mitrou, N.: Incremental export of relational database contents into RDF graphs. In: Proceedings of the 4th International Conference on Web Intelligence, Mining and Semantics (WIMS14), WIMS 2014, pp. 33:1–33:8. ACM, New York (2014)
Lacoste, D., Sawant, K.P., Roy, S.: An efficient XML to OWL converter. In: Proceedings of the 4th India Software Engineering Conference, ISEC 2011, pp. 145–154. ACM, New York (2011)
Lassila, O., Swick, R.R., Wide, W., Consortium, W.: Resource description framework (RDF) model and syntax specification (1998)
Kellogg, G., Lanthaler, M., Lindström, N., Sporny, M., Longley, D.: JSON-LD 1.0, A JSON-based Serialization for Linked Data, W3C Recommendation 16 January 2014 (2014). https://www.w3.org/TR/json-ld/. Accessed 27 Oct 2017
O’Connor, M.J., Das, A.: Acquiring OWL ontologies from XML documents. In: Proceedings of the Sixth International Conference on Knowledge Capture, K-CAP 2011, pp. 17–24. ACM, New York (2011)
Patel-Schneider, P.F., Hayes, P.J.: RDF 1.1 Semantics, W3C Recommendation 25 February 2014 (2014). https://www.w3.org/TR/rdf11-mt/#literals-and-datatypes. Accessed 01 July 2019
Salas, P.E., Marx, E., Mera, A., Viterbo, J.: RDB2RDF plugin: relational databases to RDF plugin for eclipse. In: Proceedings of the 1st Workshop on Developing Tools As Plug-ins, TOPI 2011, pp. 28–31. ACM, New York (2011)
Sandro Hawke, P.A., Herman, I.: W3C semantic web activity (2001). https://www.w3c.org/2001/sw/. Accessed 06 Dec 2018
Sequeda, J.F., Arenas, M., Miranker, D.P.: On directly mapping relational databases to RDF and OWL. In: Proceedings of the 21st International Conference on World Wide Web, WWW 2012, pp. 649–658. ACM, New York (2012)
Stefanova, S., Risch, T.: Scalable reconstruction of RDF-archived relational databases. In: Proceedings of the Fifth Workshop on Semantic Web Information Management, SWIM 2013, pp. 5:1–5:4. ACM, New York (2013)
Thuy, P.T.T., Lee, Y.-K., Lee, S.: DTD2OWL: automatic transforming XML documents into OWL ontology. In: Proceedings of the 2nd International Conference on Interaction Sciences: Information Technology, Culture and Human, ICIS 2009, pp. 125–131. ACM, New York (2009)
Thuy, P.T.T., Thuan, N.D., Han, Y., Park, K., Lee, Y.-K.: RDB2RDF: completed transformation from relational database into RDF ontology. In: Proceedings of the 8th International Conference on Ubiquitous Information Management and Communication, ICUIMC 2014, pp. 88:1–88:7. ACM, New York (2014)
Ticona-Herrera, R., Tekli, J., Chbeir, R., Laborie, S., Dongo, I., Guzman, R.: Toward RDF normalization. In: Johannesson, P., Lee, M.L., Liddle, S.W., Opdahl, A.L., López, Ó.P. (eds.) ER 2015. LNCS, vol. 9381, pp. 261–275. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-25264-3_19
Vion-Dury, J.-Y.: Using RDFS/OWL to ease semantic integration of structured documents. In: Proceedings of the 2013 ACM Symposium on Document Engineering, DocEng 2013, pp. 189–192. ACM, New York (2013)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Dongo, I., Chbeir, R. (2019). S-RDF: A New RDF Serialization Format for Better Storage Without Losing Human Readability. In: Panetto, H., Debruyne, C., Hepp, M., Lewis, D., Ardagna, C., Meersman, R. (eds) On the Move to Meaningful Internet Systems: OTM 2019 Conferences. OTM 2019. Lecture Notes in Computer Science(), vol 11877. Springer, Cham. https://doi.org/10.1007/978-3-030-33246-4_16
Download citation
DOI: https://doi.org/10.1007/978-3-030-33246-4_16
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-33245-7
Online ISBN: 978-3-030-33246-4
eBook Packages: Computer ScienceComputer Science (R0)