Abstract
Exploiting the complex structure of relational data enables to build better models by taking into account the additional information provided by the links between objects. We extend this idea to the Semantic Web by introducing our novel SPARQL-ML approach to perform data mining for Semantic Web data. Our approach is based on traditional SPARQL and statistical relational learning methods, such as Relational Probability Trees and Relational Bayesian Classifiers.
We analyze our approach thoroughly conducting three sets of experiments on synthetic as well as real-world data sets. Our analytical results show that our approach can be used for any Semantic Web data set to perform instance-based learning and classification. A comparison to kernel methods used in Support Vector Machines shows that our approach is superior in terms of classification accuracy.
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Bloehdorn, S., Sure, Y.: Kernel Methods for Mining Instance Data in Ontologies. In: ISWC, pp. 58–71 (2007)
Dzeroski, S.: Multi-Relational Data Mining: An Introduction. SIGKDD 5(1), 1–16 (2003)
Edwards, P., Grimnes, G.A., Preece, A.: An Empirical Investigation of Learning from the Semantic Web. In: ECML/PKDD Workshop, pp. 71–89 (2002)
Gilardoni, L., Biasuzzi, C., Ferraro, M., Fonti, R., Slavazza, P.: Machine Learning for the Semantic Web: Putting the user into the cycle. In: Dagstuhl Seminar (2005)
Hartmann, J.: A Knowledge Discovery Workbench for the Semantic Web. In: SIGKDD Workshop (2004)
Hess, A., Kushmerick, N.: Machine Learning for Annotating Semantic Web Services. In: AAAI (2004)
Joachims, T.: SVM light—Support Vector Machine (2004), Software, available at http://svmlight.joachims.org/
Kiefer, C., Bernstein, A., Stocker, M.: The Fundamentals of iSPARQL: A Virtual Triple Approach for Similarity-Based Semantic Web Tasks. In: ISWC, pp. 295–309 (2007)
Kochut, K., Janik, M.: SPARQLeR: Extended SPARQL for Semantic Association Discovery. In: ESWC, pp. 145–159 (2007)
Neville, J., Jensen, D., Friedland, L., Hay, M.: Learning Relational Probability Trees. In: KDD, pp. 625–630 (2003)
Neville, J., Jensen, D., Gallagher, B.: Simple Estimators for Relational Bayesian Classifiers. In: ICDM, pp. 609–612 (2003)
Pérez, J., Arenas, M., Gutierrez, C.: The Semantics and Complexity of SPARQL. In: ISWC, pp. 30–43 (2006)
Provost, F., Domingos, P.: Well-Trained PETs: Improving Probability Estimation Trees. CeDER Working Paper #IS-00-04 (2000)
Provost, F.J., Fawcett, T.: Robust Classification for Imprecise Environments. Machine Learning 42(3), 203–231 (2001)
Prud’hommeaux, E., Seaborne, A.: SPARQL Query Language for RDF. Technical report, W3C Recommendation (January 15, 2008)
Valiente, G.: Algorithms on Trees and Graphs. Springer, Berlin (2002)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kiefer, C., Bernstein, A., Locher, A. (2008). Adding Data Mining Support to SPARQL Via Statistical Relational Learning Methods. In: Bechhofer, S., Hauswirth, M., Hoffmann, J., Koubarakis, M. (eds) The Semantic Web: Research and Applications. ESWC 2008. Lecture Notes in Computer Science, vol 5021. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-68234-9_36
Download citation
DOI: https://doi.org/10.1007/978-3-540-68234-9_36
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-68233-2
Online ISBN: 978-3-540-68234-9
eBook Packages: Computer ScienceComputer Science (R0)