research-article

Semi-supervised Hashing with Semantic Confidence for Large Scale Visual Search

Authors:

Tao Mei

SIGIR '15: Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval

Pages 53 - 62

https://doi.org/10.1145/2766462.2767725

Published: 09 August 2015

Abstract

Similarity search is one of the fundamental problems in large-scale multimedia applications. Hashing techniques, a popular strategy, have been intensively investigated owing to their speed and memory efficiency. Recent research has shown that leveraging supervised information can lead to high-quality hashing. However, most existing supervised methods learn hash functions by treating every training example equally, ignoring the degree to which each example is semantically related to its label, i.e., its semantic confidence. In this paper, we propose a novel semi-supervised hashing framework that leverages semantic confidence. Specifically, a confidence factor is first assigned to each example, by neighbor voting when labels are available and by click count when click-through data are available. The factor is then incorporated into pairwise and triplet relationship learning for hashing. Furthermore, the two learnt relationships are seamlessly encoded into semi-supervised hashing methods with pairwise and listwise supervision, respectively, which are formulated as minimizing the empirical error on the labeled data while maximizing the variance of the hash bits or minimizing the quantization loss over both labeled and unlabeled data. In addition, a kernelized variant of semi-supervised hashing is presented. We have conducted experiments on both the CIFAR-10 (with labels) and Clickture (with click data) image benchmarks (up to one million image examples), demonstrating that our approaches outperform state-of-the-art hashing techniques.
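The abstract gives no formulas, so the following is only a rough sketch of how a neighbor-voting confidence factor might weight pairwise supervision in a linear semi-supervised hash. It is not the paper's formulation. The function names, the choice of `k` and `eta`, the use of the product of two confidences as a pair weight, and the relaxed eigen-decomposition solution are all assumptions made for illustration. The toy data contain one deliberately mislabeled example to show how voting can down-weight it.

```python
import numpy as np

def neighbor_voting_confidence(X, y, k=5):
    """Fraction of each example's k nearest neighbors (Euclidean) that share its label."""
    sq = np.sum(X ** 2, axis=1)
    d2 = sq[:, None] + sq[None, :] - 2.0 * X @ X.T   # squared pairwise distances
    np.fill_diagonal(d2, np.inf)                      # an example does not vote for itself
    nn = np.argsort(d2, axis=1)[:, :k]                # indices of the k nearest neighbors
    return (y[nn] == y[:, None]).mean(axis=1)

def train_weighted_ssh(X, labeled_idx, y_l, conf, n_bits=4, eta=1.0):
    """Assumed objective: confidence-weighted pairwise fit on labeled data
    plus a variance term over all data, solved by relaxed eigen-decomposition."""
    mean = X.mean(axis=0)
    Xc = X - mean
    Xl = Xc[labeled_idx]
    same = y_l[:, None] == y_l[None, :]
    # +1 for same-label pairs, -1 otherwise, scaled by both confidences (assumption).
    S = np.where(same, 1.0, -1.0) * np.outer(conf, conf)
    M = Xl.T @ S @ Xl + eta * (Xc.T @ Xc)
    M = (M + M.T) / 2.0                               # guard against round-off asymmetry
    vals, vecs = np.linalg.eigh(M)
    W = vecs[:, np.argsort(vals)[::-1][:n_bits]]      # top eigenvectors as projections
    return mean, W

def hash_codes(X, mean, W):
    """Binary codes from the sign of each centered projection."""
    return ((X - mean) @ W > 0).astype(np.uint8)

# Toy data: two well-separated Gaussian classes in 16 dimensions.
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(+2.0, 1.0, (100, 16)), rng.normal(-2.0, 1.0, (100, 16))])
y_true = np.repeat([0, 1], 100)

labeled_idx = np.r_[0:20, 100:120]                    # 40 labeled examples
y_l = y_true[labeled_idx].copy()
y_l[0] = 1                                            # inject one wrong label

conf = neighbor_voting_confidence(X[labeled_idx], y_l, k=5)
mean, W = train_weighted_ssh(X, labeled_idx, y_l, conf, n_bits=4)
codes = hash_codes(X, mean, W)

# Mean Hamming distance within and across the true classes.
ham = (codes[:, None, :] != codes[None, :, :]).sum(axis=-1)
same_class = y_true[:, None] == y_true[None, :]
same_mean = ham[same_class].mean()
diff_mean = ham[~same_class].mean()
print("confidence of mislabeled example:", conf[0])
print("mean Hamming distance (same / different class):", same_mean, diff_mean)
```

In this sketch the mislabeled example receives zero confidence, so every pair it participates in drops out of the supervised term, while the unlabeled data still shape the projections through the variance term.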

Index Terms

Semi-supervised Hashing with Semantic Confidence for Large Scale Visual Search
1. Information systems
  1. Information retrieval
    1. Retrieval models and ranking

Information & Contributors

Information

Published In

SIGIR '15: Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval

August 2015

1198 pages

ISBN:9781450336215

DOI:10.1145/2766462

General Chair:
Ricardo Baeza-Yates (Yahoo Labs, USA)

Program Chairs:
Mounia Lalmas (Yahoo Labs, UK)
Alistair Moffat (University of Melbourne, Australia)
Berthier Ribeiro-Neto (Google, Brazil, and UFMG, Brazil)

Copyright © 2015 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org.

Sponsors

SIGIR: ACM Special Interest Group on Information Retrieval

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

Research-article

Funding Sources

The 973 Programme
National Natural Science Foundation of China
The 863 Programme

Conference

SIGIR '15

Sponsor:

SIGIR

SIGIR '15: The 38th International ACM SIGIR conference on research and development in Information Retrieval

August 9 - 13, 2015

Santiago, Chile

Acceptance Rates

SIGIR '15 Paper Acceptance Rate 70 of 351 submissions, 20%;

Overall Acceptance Rate 792 of 3,983 submissions, 20%

Bibliometrics & Citations

Article Metrics

Total Citations: 25
Total Downloads: 551
Downloads (Last 12 months): 5
Downloads (Last 6 weeks): 1

Reflects downloads up to 22 Feb 2025

Citations

Cited By

Zhang Z, Wang J, Zhu L, Lu G (2022). Discriminative Visual Similarity Search with Semantically Cycle-consistent Hashing Networks. ACM Transactions on Multimedia Computing, Communications, and Applications 18:2s (1-21). Online publication date: 20-Apr-2022.
https://dl.acm.org/doi/10.1145/3532519
Shi Y, Nie X, Liu X, Yang L, Yin Y (2022). Zero-shot Hashing via Asymmetric Ratio Similarity Matrix. IEEE Transactions on Knowledge and Data Engineering (1-1). Online publication date: 2022.
https://doi.org/10.1109/TKDE.2022.3150790
Zheng C, Zhu L, Zhang Z, Li J, Yu X (2022). Efficient Semi-Supervised Multimodal Hashing With Importance Differentiation Regression. IEEE Transactions on Image Processing 31 (5881-5892). Online publication date: 2022.
https://doi.org/10.1109/TIP.2022.3203216
Cheng S, Zhou Y, Zhang W, Wu D, Yang C, Li B, Wang W (2022). Uncertainty-Aware and Multigranularity Consistent Constrained Model for Semi-Supervised Hashing. IEEE Transactions on Circuits and Systems for Video Technology 32:10 (6914-6926). Online publication date: Oct-2022.
https://doi.org/10.1109/TCSVT.2022.3174577
Dubey S (2022). A Decade Survey of Content Based Image Retrieval Using Deep Learning. IEEE Transactions on Circuits and Systems for Video Technology 32:5 (2687-2704). Online publication date: May-2022.
https://doi.org/10.1109/TCSVT.2021.3080920
Pan Y, Chen Y, Bao Q, Zhang N, Yao T, Liu J, Mei T (2021). Smart Director: An Event-Driven Directing System for Live Broadcasting. ACM Transactions on Multimedia Computing, Communications, and Applications 17:4 (1-18). Online publication date: 30-Nov-2021.
https://dl.acm.org/doi/10.1145/3448981
Shi Y, Nie X, Chen M, Lian L, Yin Y (2021). Deep Hashing With Weighted Spatial Importance. IEEE Transactions on Multimedia 23 (3778-3792). Online publication date: 2021.
https://doi.org/10.1109/TMM.2020.3031092
Qin Q, Huang L, Wei Z, Xie K, Zhang W (2021). Unsupervised Deep Multi-Similarity Hashing With Semantic Structure for Image Retrieval. IEEE Transactions on Circuits and Systems for Video Technology 31:7 (2852-2865). Online publication date: Jul-2021.
https://doi.org/10.1109/TCSVT.2020.3032402
Li Y, Yao T, Pan Y, Chao H, Mei T (2020). Deep Metric Learning With Density Adaptivity. IEEE Transactions on Multimedia 22:5 (1285-1297). Online publication date: May-2020.
https://doi.org/10.1109/TMM.2019.2939711
He S, Wang B, Wang Z, Yang Y, Shen F, Huang Z, Shen H (2020). Bidirectional Discrete Matrix Factorization Hashing for Image Search. IEEE Transactions on Cybernetics 50:9 (4157-4168). Online publication date: Sep-2020.
https://doi.org/10.1109/TCYB.2019.2941284
