Abstract
Metric space searching is an emerging technique to address the problem of efficient similarity searching in many applications, including multimedia databases and other repositories handling complex objects. Although promising, the metric space approach is still immature in several aspects that are well established in traditional databases. In particular, most indexing schemes are not dynamic. From the few dynamic indexes, even fewer work well in secondary memory. That is, most of them need the index in main memory in order to operate efficiently. In this paper we introduce two different secondary-memory versions of the Dynamic Spatial Approximation Tree with Clusters (DSACL-tree from Barroso et al.) which has shown to be competitive in main memory. These two indexes handle well the secondary memory scenario and are competitive with the state of the art. But in particular the innovations proposed by the version DSACL+-tree lead to significant performance improvements.The resulting data structures can be useful in a wide range of database application.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Barroso, M., Reyes, N., Paredes, R.: Enlarging nodes to improve dynamic spatial approximation trees. In: Proceedings of the 3rd International Conference on Similarity Search and Applications (SISAP 2010), pp. 41–48. ACM Press (2010), doi: http://doi.acm.org/10.1145/1862344.1862351
Brin, S.: Near neighbor search in large metric spaces. In: Proc. 21st Conference on Very Large Databases (VLDB 1995), pp. 574–584 (1995)
Burkhard, W., Keller, R.: Some approaches to best-match file searching. Comm. of the ACM 16(4), 230–236 (1973)
Chávez, E., Navarro, G., Baeza-Yates, R., Marroquín, J.: Searching in metric spaces. ACM Comput. Surv. 33(3), 273–321 (2001)
Ciaccia, P., Patella, M., Zezula, P.: M-tree: an efficient access method for similarity search in metric spaces. In: Proc. 23rd Conf. on Very Large Databases (VLDB 1997), pp. 426–435 (1997)
Dehne, F., Noltemeier, H.: Voronoi trees and clustering problems. Information Systems 12(2), 171–175 (1987)
Dohnal, V.: An Access Structure for Similarity Search in Metric Spaces. In: Lindner, W., Fischer, F., Türker, C., Tzitzikas, Y., Vakali, A.I. (eds.) EDBT 2004. LNCS, vol. 3268, pp. 133–143. Springer, Heidelberg (2004)
Hetland, M.L.: The Basic Principles of Metric Indexing. In: Coello, C.A.C., Dehuri, S., Ghosh, S. (eds.) Swarm Intelligence for Multi-objective Problems in Data Mining. SCI, vol. 242, pp. 199–232. Springer, Heidelberg (2009)
Hjaltason, G., Samet, H.: Incremental similarity search in multimedia databases. Tech. Rep. CS-TR-4199, University of Maryland, Computer Science Dept. (2000)
Hjaltason, G., Samet, H.: Index-driven similarity search in metric spaces. ACM Trans. on Database Systems 28(4), 517–580 (2003)
Knuth, D.E.: The Art of Computer Programming, Volume I: Fundamental Algorithms, 2nd edn. Addison-Wesley (1973)
Lokoc, J., Skopal, T.: On reinsertions in m-tree. In: SISAP 2008: Proceedings of the First International Workshop on Similarity Search and Applications (SISAP 2008), pp. 121–128. IEEE Computer Society, Washington, DC (2008)
Navarro, G.: Searching in metric spaces by spatial approximation. The Very Large Databases Journal 11(1), 28–46 (2002)
Navarro, G., Reyes, N.: Dynamic spatial approximation trees. ACM Journal of Experimental Algorithmics 12, article 1.5, 68 pages (2008)
Navarro, G., Reyes, N.: Dynamic spatial approximation trees for massive data. In: Proc. 2nd International Workshop on Similarity Search and Applications (SISAP), pp. 81–88. IEEE CS Press (2009)
Navarro, G., Uribe, R.: Fully dynamic metric access methods based on hyperplane partitioning. Information Systems 36(4), 734–747 (2011)
Samet, H.: Foundations of Multidimensional and Metric Data Structures (The Morgan Kaufmann Series in Computer Graphics and Geometric Modeling). Morgan Kaufmann Publishers Inc., San Francisco (2005)
Skopal, T., Pokorný, J., Snásel, V.: PM-tree: Pivoting metric tree for similarity search in multimedia databases. In: ADBIS (Local Proceedings) (2004)
Uhlmann, J.: Satisfying general proximity/similarity queries with metric trees. Information Processing Letters 40, 175–179 (1991)
Yianilos, P.: Data structures and algorithms for nearest neighbor search in general metric spaces. In: Proc. 4th ACM-SIAM Symposium on Discrete Algorithms (SODA 1993), pp. 311–321 (1993)
Yianilos, P.: Excluded middle vantage point forests for nearest neighbor search. In: DIMACS Implementation Challenge, ALENEX 1999, Baltimore, MD (1999)
Zezula, P., Amato, G., Dohnal, V., Batko, M.: Similarity Search: The Metric Space Approach. Advances in Database Systems, vol. 32. Springer (2006)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Britos, L., Printista, A.M., Reyes, N. (2012). DSACL+-tree: A Dynamic Data Structure for Similarity Search in Secondary Memory. In: Navarro, G., Pestov, V. (eds) Similarity Search and Applications. SISAP 2012. Lecture Notes in Computer Science, vol 7404. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-32153-5_9
Download citation
DOI: https://doi.org/10.1007/978-3-642-32153-5_9
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-32152-8
Online ISBN: 978-3-642-32153-5
eBook Packages: Computer ScienceComputer Science (R0)