Abstract
Classification is one of the basic tasks of data mining in modern database applications including molecular biology, astronomy, mechanical engineering, medical imaging or meteorology. The underlying models have to consider spatial properties such as shape or extension as well as thematic attributes. We introduce 3D shape histograms as an intuitive and powerful similarity model for 3D objects. Particular flexibility is provided by using quadratic form distance functions in order to account for errors of measurement, sampling, and numerical rounding that all may result in small displacements and rotations of shapes. For query processing, a general filter-refinement architecture is employed that efficiently supports similarity search based on quadratic forms. An experimental evaluation in the context of molecular biology demonstrates both, the high classification accuracy of more than 90% and the good performance of the approach.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Ankerst M., Braunmüller B., Kriegel H.-P., Seidl T.: Improving Adaptable Similarity Query Processing by Using Approximatios. Proc. 24th Int. Conf. on Very Large Databases (VLDB’98), New York, USA. Morgan Kaufmann (1998) 206–217
Ankerst M., Kriegel H.-P., Seidl T.: A Multi-Step Approach for Shape Similarity Search in Image Databases.IEEE Transactions on Knowledge and Data Engineering, Vol. 10, No. 6 (1998) 996–1004
Berchtold S., Böhm C., Braunmüller B., Keim D., Kriegel H.-P.: Fast Parallel Similarity Search in Multimedia Databases. Proc. ACM SIGMOD Int. Conf. on Management of Data, Tucson, AZ. ACM Press (1997) 1–12, Best Paper Award
Berchtold S., Böhm C., Keim D., Kriegel H.-P.: A Cost Model for Nearest Neighbor Search in High-Dimensional Data Spaces. Proc. 16th ACM SIGACT-SIGMODSIGART Symp. on Principles of Database Systems (PODS), Tucson, AZ (1997) 78–86
Berchtold S.: Geometry Based Search of Similar Mechanical Parts. Ph.D. Thesis, Institute for Computer Science, University of Munich.Shaker Verlag, Aachen (1997) in German
Bernstein F.C., Koetzle T.F., Williams G.J., Meyer E.F., Brice M.D., Rodgers J.R., Kennard O., Shimanovichi T., Tasumi M.: The Protein Data Bank: a Computer-based Archival File for Macromolecular Structures. Journal of Molecular Biology, Vol. 112 (1977) 535–542
Berchtold S., Keim D., Kriegel H.-P.: The X-tree: An Index Structure for High-Dimensional Data.Proc. 22nd Int. Conf. on Very Large Data Bases (VLDB‘96), Mumbai, India. Morgan Kaufmann (1996) 28–39
Berchtold S., Kriegel H.-P.: S3: Similarity Search in CAD Database Systems. Proc. ACM SIGMOD Int. Conf. on Management of Data. ACM Press (1997) 564–567
Berchtold S., Keim D.A., Kriegel H.-P.: Using Extended Feature Objects for Partial Similarity Retrieval. VLDB Journal, Vol. 6, No. 4. Springer Verlag, Berlin Heidelberg New York (1997) 333–348
Berchtold S., Keim D.A., Kriegel H.-P.: Section Coding: A Method for Similarity Search in CAD Databases. Proc. German Conf. on Databases for Office Automation, Technology, and Science (BTW). Series Informatik Aktuell. Springer Verlag, Berlin Heidelberg New York (1997) 152–171; in German
Beckmann N., Kriegel H.-P., Schneider R., Seeger B.: The R*-tree: An Efficient and Robust Access Method for Points and Rectangles.Proc. ACM SIGMOD Int. Conf. on Management of Data, Atlantic City, NJ. ACM Press (1990) 322–331
Chen M.-S., Han J. and Yu P.S.: Data Mining: An Overview from a Database Perspective.IEEE Transactions on Knowledge and Data Engineering, Vol. 8, No. 6 (1996) 866–883
Faloutsos C., Barber R., Flickner M., Hafner J., Niblack W., Petkovic D., Equitz W.: Efficient and Effective Querying by Image Content.Journal of Intelligent Information Systems, Vol. 3 (1994) 231–262
Gaede V., Günther O.: Multidimensional Access Methods.ACM Computing Surveys, Vol. 30, No. 2 (1998) 170–231
Gary J.E., Mehrotra R.: Similar Shape Retrieval Using a Structural Feature Index. Information Systems, Vol. 18, No. 7 (1993) 525–537
Guttman A.: R-trees: A Dynamic Index Structure for Spatial Searching.Proc. ACM SIGMOD Int. Conf. on Management of Data, Boston, MA. ACM Press (1984) 47–57
Holm L., Sander C.: The FSSP Database of Structurally Aligned Protein Fold Families. Nucleic Acids Research, Vol. 22 (1994) 3600–3609
Hjaltason G.R., Samet H.: Ranking in Spatial Databases. Proc. 4th Int. Symposium on Large Spatial Databases (SSD’95). Lecture Notes in Computer Science, Vol. 951. Springer Verlag, Berlin Heidelberg New York (1995) 83–95
Holm L., Sander C.: Touring Protein Fold Space with Dali/FSSP.Nucleic Acids Research, Vol. 26 (1998) 316–319
Hafner J., Sawhney H.S., Equitz W., Flickner M., Niblack W.: Efficient Color Histogram Indexing for Quadratic Form Distance Functions. IEEE Trans. on Pattern Analysis and Machine Intelligence, Vol. 17, No. 7. IEEE Press (1995) 729–736
Jagadish H.V.: A Retrieval Technique for Similar Shapes.Proc. ACM SIGMOD Int. Conf. on Management of Data. ACM Press (1991) 208–217
Kastenmüller G.: Shape-oriented Similarity Search in 3D Protein Database Systems. Diploma Thesis, Institute for Computer Science, University of Munich (1998) in German
Kriegel H.-P., Seidl T.: Approximation-Based Similarity Search for 3-D Surface Segments. GeoInformatica Journal, Vol. 2, No. 2. Kluwer Academic Publishers (1998) 113–147
Korn F., Sidiropoulos N., Faloutsos C., Siegel E., Protopapas Z.: Fast Nearest Neighbor Search in Medical Image Databases. Proc. 22nd VLDB Conference, Mumbai, India. Morgan Kaufmann (1996) 215–226
Kriegel H.-P., Schmidt T., Seidl T.: 3D Similarity Search by Shape Approximation. Proc. Fifth Int. Symposium on Large Spatial Databases (SSD’97), Berlin, Germany. Lecture Notes in Computer Science, Vol. 1262. Springer Verlag, Berlin Heidelberg New York (1997) 11–28
Lamdan Y., Wolfson H.J.: Geometric Hashing: A General and Efficient Model-Based Recognition Scheme. Proc. IEEE Int. Conf. on Computer Vision, Tampa, Florida, 1988 238–249
Mitchell T.M.: Machine Learning. McCraw-Hill, (1997)
Michie D., Spiegelhalter D.J., Taylor C.C.: Machine Learning, Neural and Statistical Classification. Ellis Horwood (1994)
Orengo C.A., Michie A.D., Jones S., Jones D.T. Swindells M.B., Thornton, J.M.: CATH-A Hierarchic Classification of Protein Domain Structures. Structure, Vol. 5, No. 8 (1997)1093–1108
Samet H.: The Design and Analysis of Spatial Data Structures. Addison Wesley (1990)
Seidl T.: Adaptable Similarity Search in 3-D Spatial Database Systems. Ph.D. Thesis, Institute for Computer Science, University of Munich (1997). Herbert Utz Verlag, Munich, http://utzverlag.com, ISBN: 3-89675-327-4
Seidl T., Kriegel H.-P.: A 3D Molecular Surface Representation Supporting Neighborhood Queries. Proc. 4th Int. Symposium on Large Spatial Databases (SSD’95), Portland, Maine, USA. Lecture Notes in Computer Science, Vol. 951. Springer Verlag, Berlin Heidelberg New York (1995)240–258
Seidl T., Kriegel H.-P.: Efficient User-Adaptable Similarity Search in Large Multimedia Databases. Proc. 23rd Int. Conf. on Very Large Databases (VLDB’97), Athens, Greece. Morgan Kaufmann (1997) 506–515
Seidl T., Kriegel H.-P.: Optimal Multi-Step k-Nearest Neighbor Search. Proc. ACM SIGMOD Int. Conf. on Management of Data, Seattle, Washington (1998)154–165
Sellis T., Roussopoulos N., Faloutsos C.: The R+-Tree: A Dynamic Index for Multi-Dimensional Objects. Proc. 13th Int. Conf. on Very Large Databases, Brighton, England (1987) 507–518
Taubin G., Cooper D.B.: Recognition and Positioning of Rigid Objects Using Algebraic Moment Invariants. in Geometric Methods in Computer Vision, Vol. 1570, SPIE (1991) 175–186
Weiss S.M., Kulikowski C.A.: Computer Systems that Learn: Classification and Prediction Methods from Statistics, Neural Nets, Machine Learning, and Expert Systems. Morgan Kaufmann, San Francisco (1991)
Weber R., Schek H.-J., Blott S.: A Quantitative Analysis and Performance Study for Similarity-Search Methods in High-Dimensional Spaces. Proc. 24th Int. Conf. on Very Large Databases (VLDB’98), New York, USA. Morgan Kaufmann (1998) 194–205
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1999 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ankerst, M., Kastenmüller, G., Kriegel, HP., Seidl, T. (1999). 3D Shape Histograms for Similarity Search and Classification in Spatial Databases. In: Güting, R.H., Papadias, D., Lochovsky, F. (eds) Advances in Spatial Databases. SSD 1999. Lecture Notes in Computer Science, vol 1651. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48482-5_14
Download citation
DOI: https://doi.org/10.1007/3-540-48482-5_14
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-66247-1
Online ISBN: 978-3-540-48482-0
eBook Packages: Springer Book Archive