skip to main content
10.1145/2766462.2767725acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
research-article

Semi-supervised Hashing with Semantic Confidence for Large Scale Visual Search

Published: 09 August 2015 Publication History

Abstract

Similarity search is one of the fundamental problems for large scale multimedia applications. Hashing techniques, as one popular strategy, have been intensively investigated owing to the speed and memory efficiency. Recent research has shown that leveraging supervised information can lead to high quality hashing. However, most existing supervised methods learn hashing function by treating each training example equally while ignoring the different semantic degree related to the label, i.e. semantic confidence, of different examples. In this paper, we propose a novel semi-supervised hashing framework by leveraging semantic confidence. Specifically, a confidence factor is first assigned to each example by neighbor voting and click count in the scenarios with label and click-through data, respectively. Then, the factor is incorporated into the pairwise and triplet relationship learning for hashing. Furthermore, the two learnt relationships are seamlessly encoded into semi-supervised hashing methods with pairwise and listwise supervision respectively, which are formulated as minimizing empirical error on the labeled data while maximizing the variance of hash bits or minimizing quantization loss over both the labeled and unlabeled data. In addition, the kernelized variant of semi-supervised hashing is also presented. We have conducted experiments on both CIFAR-10 (with label) and Clickture (with click data) image benchmarks (up to one million image examples), demonstrating that our approaches outperform the state-of-the-art hashing techniques.

References

[1]
C. F. Cadieu, H. Hong, D. Yamins, N. Pinto, N. J. Majaj, and J. J. DiCarlo. The neural representation benchmark and its evaluation on brain and machine. In ICLR, 2013.
[2]
J. Donahue, Y. Jia, O. Vinyals, J. Hoffman, N. Zhang, E. Tzeng, and T. Darrell. Decaf: A deep convolutional activation feature for generic visual recognition. arXiv preprint arXiv:1310.1531, 2013.
[3]
A. Gionis, P. Indyk, and R. Motwani. Similarity search in high dimensions via hashing. In VLDB, 1999.
[4]
Y. Gong, S. Lazebnik, A. Gordo, and F. Perronnin. Iterative quantization: A procrustean approach to learning binary codes for large-scale image retrieval. IEEE Trans. PAMI, 35(12):2916--2929, 2013.
[5]
R. Herbrich, T. Graepel, and K. Obermayer. Large Margin Rank Boundaries for Ordinal Regression. MIT Press, January 2000.
[6]
G. E. Hinton and R. R. Salakhutdinov. Reducing the dimensionality of data with neural networks. Science, 313(5786):504--507, 2006.
[7]
X.-S. Hua, L. Yang, J. Wang, J. Wang, M. Ye, K. Wang, Y. Rui, and J. Li. Clickage: Towards bridging semantic and intent gaps via mining click logs of search engines. In ACM MM, 2013.
[8]
S. Kim and S. Choi. Semi-supervised discriminant hashing. In ICDM, 2011.
[9]
A. Krizhevsky, I. Sutskever, and G. E. Hinton. Imagenet classification with deep convolutional neural networks. In NIPS, 2012.
[10]
B. Kulis and T. Darrell. Learning to hash with binary reconstructive embeddings. In NIPS, 2009.
[11]
B. Kulis and K. Grauman. Kernelized locality-sensitive hashing. IEEE Trans. PAMI, 34(6):1092--1104, 2012.
[12]
W. Liu, C. Mu, S. Kumar, and S.-F. Chang. Discrete graph hashing. In NIPS. 2014.
[13]
W. Liu, J. Wang, R. Ji, Y.-G. Jiang, and S.-F. Chang. Supervised hashing with kernels. In CVPR, 2012.
[14]
W. Liu, J. Wang, S. Kumar, and S.-F. Chang. Hashing with graphs. In ICML, 2011.
[15]
Y. Mu, J. Shen, and S. Yan. Weakly-supervised hashing in kernel space. In CVPR, 2010.
[16]
M. Norouzi and D. M. Blei. Minimal loss hashing for compact binary codes. In ICML, 2011.
[17]
M. Norouzi, D. M. Blei, and R. Salakhutdinov. Hamming distance metric learning. In NIPS, 2012.
[18]
A. Oliva and A. Torralba. Modeling the shape of the scene: A holistic representation of the spatial envelope. IJCV, 42(3):145--175, 2001.
[19]
Y. Pan, T. Yao, T. Mei, H. Li, C. W. Ngo, and Y. Rui. Click-through-based cross-view learning for image search. In SIGIR, 2014.
[20]
P. Ram, D. Lee, H. Ouyang, and A. G. Gray. Rank-approximate nearest neighbor search: Retaining meaning and speed in high dimensions. In NIPS, 2009.
[21]
M. Rastegari, J. Choi, S. Fakhraei, D. Hal, and L. Davis. Predictable dual-view hashing. In ICML, 2013.
[22]
J. P. Romano. On the behavior of randomization tests without a group invariance assumption. Journal of the American Statistical Association, 85(411):686--692, 1990.
[23]
R. Salakhutdinov and G. Hinton. Semantic hashing. International Journal of Approximate Reasoning, 50(7):969--978, 2006.
[24]
C. Strecha, A. M. Bronstein, M. M. Bronstein, and P. Fua. Ldahash: Improved matching with smaller descriptors. IEEE Trans. PAMI, 34(1):66--78, 2012.
[25]
A. Torralba, R. Fergus, and W. Freeman. 80 million tiny images: A large data set for nonparametric object and scene recognition. IEEE Trans. PAMI, 30(11):1958--1970, 2008.
[26]
J. Wang, S. Kumar, and S.-F. Chang. Semi-supervised hashing for large-scale search. IEEE Trans. PAMI, 34(12):2393--2406, 2012.
[27]
J. Wang, W. Liu, A. X. Sun, and Y.-G. Jiang. Learning hash codes with listwise supervision. In ICCV, 2013.
[28]
Q. Wang, L. Si, and D. Zhang. Learning to hash with partial tags: Exploring correlation between tags and hashing bits for large scale image retrieval. In ECCV. 2014.
[29]
Q. Wang, D. Zhang, and L. Si. Semantic hashing using tags and topic modeling. In SIGIR, 2013.
[30]
Y. Weiss, A. Torralba, and R. Fergus. Spectral hashing. In NIPS, 2008.
[31]
H. Xia, P. Wu, S. C. Hoi, and R. Jin. Boosting multi-kernel locality-sensitive hashing for scalable image retrieval. In SIGIR, 2012.
[32]
T. Yao, T. Mei, C.-W. Ngo, and S. Li. Annotation for free: Video tagging by mining user search behavior. In ACM MM, 2013.
[33]
L. Zelnik-Manor and P. Perona. Self-tuning spectral clustering. In NIPS, 2004.
[34]
X. Zhu, W. Nejdl, and M. Georgescu. An adaptive teleportation random walk model for learning social tag relevance. In SIGIR, 2014.

Cited By

View all
  • (2022)Discriminative Visual Similarity Search with Semantically Cycle-consistent Hashing NetworksACM Transactions on Multimedia Computing, Communications, and Applications10.1145/353251918:2s(1-21)Online publication date: 20-Apr-2022
  • (2022)Zero-shot Hashing via Asymmetric Ratio Similarity MatrixIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2022.3150790(1-1)Online publication date: 2022
  • (2022)Efficient Semi-Supervised Multimodal Hashing With Importance Differentiation RegressionIEEE Transactions on Image Processing10.1109/TIP.2022.320321631(5881-5892)Online publication date: 2022
  • Show More Cited By

Index Terms

  1. Semi-supervised Hashing with Semantic Confidence for Large Scale Visual Search

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    SIGIR '15: Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval
    August 2015
    1198 pages
    ISBN:9781450336215
    DOI:10.1145/2766462
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 09 August 2015

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. click-through data
    2. hashing
    3. neighbor voting
    4. semi-supervised hashing
    5. similarity learning

    Qualifiers

    • Research-article

    Funding Sources

    Conference

    SIGIR '15
    Sponsor:

    Acceptance Rates

    SIGIR '15 Paper Acceptance Rate 70 of 351 submissions, 20%;
    Overall Acceptance Rate 792 of 3,983 submissions, 20%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)5
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 22 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2022)Discriminative Visual Similarity Search with Semantically Cycle-consistent Hashing NetworksACM Transactions on Multimedia Computing, Communications, and Applications10.1145/353251918:2s(1-21)Online publication date: 20-Apr-2022
    • (2022)Zero-shot Hashing via Asymmetric Ratio Similarity MatrixIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2022.3150790(1-1)Online publication date: 2022
    • (2022)Efficient Semi-Supervised Multimodal Hashing With Importance Differentiation RegressionIEEE Transactions on Image Processing10.1109/TIP.2022.320321631(5881-5892)Online publication date: 2022
    • (2022)Uncertainty-Aware and Multigranularity Consistent Constrained Model for Semi-Supervised HashingIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2022.317457732:10(6914-6926)Online publication date: Oct-2022
    • (2022)A Decade Survey of Content Based Image Retrieval Using Deep LearningIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2021.308092032:5(2687-2704)Online publication date: May-2022
    • (2021)Smart Director: An Event-Driven Directing System for Live BroadcastingACM Transactions on Multimedia Computing, Communications, and Applications10.1145/344898117:4(1-18)Online publication date: 30-Nov-2021
    • (2021)Deep Hashing With Weighted Spatial ImportanceIEEE Transactions on Multimedia10.1109/TMM.2020.303109223(3778-3792)Online publication date: 2021
    • (2021)Unsupervised Deep Multi-Similarity Hashing With Semantic Structure for Image RetrievalIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2020.303240231:7(2852-2865)Online publication date: Jul-2021
    • (2020)Deep Metric Learning With Density AdaptivityIEEE Transactions on Multimedia10.1109/TMM.2019.293971122:5(1285-1297)Online publication date: May-2020
    • (2020)Bidirectional Discrete Matrix Factorization Hashing for Image SearchIEEE Transactions on Cybernetics10.1109/TCYB.2019.294128450:9(4157-4168)Online publication date: Sep-2020
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media

    pFad - Phonifier reborn

    Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

    Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


    Alternative Proxies:

    Alternative Proxy

    pFad Proxy

    pFad v3 Proxy

    pFad v4 Proxy