Abstract
Currently, a good portion of datasets on the Internet are accessed through data services, where users' queries are answered by composing multiple data services. Defining the semantics of data services is the first step towards automating their composition. An interesting approach is to describe data services as semantic views over a domain ontology. However, such semantic views cannot always be defined with certainty, especially when the data returned by a service are too complex. In such cases, a data service is associated with several possible semantic views. In addition, complex correlations may exist among these possible semantic views, mainly when data services encapsulate the same data sources. In this paper, we propose a probabilistic approach to model the semantic uncertainty of data services. Services, along with their possible semantic views, are represented in a probabilistic service registry. The correlations among service semantics are modeled through a directed probabilistic graphical model (Bayesian network). Based on this model, we study the problem of composing correlated data services to answer a user query, and propose an efficient method to compute the different possible compositions and their probabilities.
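To make the idea of correlated semantic views concrete, the following is a minimal toy sketch, not the method proposed in the paper. It assumes two hypothetical services, S1 and S2, each with two possible semantic views, and a two-node Bayesian network S1 -> S2. All view names (`v1a`, `v2a`, ...), probability values, and the set of view pairs that answer the query are invented for illustration. The sketch compares the probability of a composition computed with the correlation against the value obtained by wrongly assuming the two services are independent.

```python
from itertools import product

# Hypothetical prior over the possible semantic views of service S1.
p_s1 = {"v1a": 0.7, "v1b": 0.3}

# Hypothetical conditional table P(view of S2 | view of S1). The two services
# are assumed to wrap the same data source, so their semantics are correlated.
p_s2_given_s1 = {
    "v1a": {"v2a": 0.9, "v2b": 0.1},
    "v1b": {"v2a": 0.2, "v2b": 0.8},
}

S2_VIEWS = ["v2a", "v2b"]


def joint(v1, v2):
    """P(S1=v1, S2=v2) by the chain rule of the network S1 -> S2."""
    return p_s1[v1] * p_s2_given_s1[v1][v2]


def marginal_s2(v2):
    """P(S2=v2), obtained by summing the joint over the views of S1."""
    return sum(joint(v1, v2) for v1 in p_s1)


def composition_probability(valid_pairs):
    """Probability that the composition is valid: sum of the joint
    probabilities of the view pairs under which it answers the query."""
    return sum(joint(v1, v2) for v1, v2 in valid_pairs)


# Assumption: the composition answers the query only when S1 has view v1a
# and S2 has view v2a.
valid = {("v1a", "v2a")}

p_correlated = composition_probability(valid)
p_independent = p_s1["v1a"] * marginal_s2("v2a")
total = sum(joint(v1, v2) for v1, v2 in product(p_s1, S2_VIEWS))

print("with correlation:     ", round(p_correlated, 3))
print("assuming independence:", round(p_independent, 3))
print("joint sums to:        ", round(total, 3))
```

In this toy setting, ignoring the correlation underestimates the probability of the composition, which is one way to see why the dependencies among service semantics need to be modeled explicitly.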
Cite this article
Malki, A., Benslimane, D., Benslimane, SM. et al. Data Services with uncertain and correlated semantics. World Wide Web 19, 157–175 (2016). https://doi.org/10.1007/s11280-014-0317-x
DOI: https://doi.org/10.1007/s11280-014-0317-x