


default search action
CVPR 2018: Salt Lake City, UT, USA - Workshops
- 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, CVPR Workshops 2018, Salt Lake City, UT, USA, June 18-22, 2018. Computer Vision Foundation / IEEE Computer Society 2018
Disguised Faces in the Wild
- Vineet Kushwaha, Maneet Singh, Richa Singh
, Mayank Vatsa
, Nalini K. Ratha
, Rama Chellappa:
Disguised Faces in the Wild. 1-9 - Ankan Bansal, Rajeev Ranjan, Carlos Domingo Castillo, Rama Chellappa:
Deep Features for Recognizing Disguised Faces in the Wild. 10-16 - Naman Kohli, Daksha Yadav, Afzel Noore:
Face Verification With Disguise Variations via Deep Disguise Recognizer. 17-24 - Skand Vishwanath Peri, Abhinav Dhall:
DisguiseNet: A Contrastive Approach for Disguised Face Verification in the Wild. 25-31 - Kaipeng Zhang, Ya-Liang Chang, Winston H. Hsu:
Deep Disguised Faces Recognition. 32-36 - Evgeny Smirnov
, Aleksandr Melnikov, Andrei Oleinik
, Elizaveta Ivanova, Ilya Kalinovskiy, Eugene Luckyanets:
Hard Example Mining With Auxiliary Embeddings. 37-46 - Jun Liu, Ajay Kumar:
Detecting Presentation Attacks From 3D Face Masks Under Multispectral Imaging. 47-52
NVIDIA AI City Challenge
- Milind Naphade, Ming-Ching Chang, Anuj Sharma, David C. Anastasiu
, Vamsi Jagarlamudi, Pranamesh Chakraborty, Tingting Huang, Shuo Wang, Ming-Yu Liu, Rama Chellappa, Jenq-Neng Hwang, Siwei Lyu:
The 2018 NVIDIA AI City Challenge. 53-60 - Ming-Ching Chang, Yi Wei, Nenghui Song, Siwei Lyu:
Video Analytics in Smart Transportation for the AIC'18 Challenge. 61-68 - Weitao Feng, Deyi Ji, Yiru Wang, Shuorong Chang, Hansheng Ren, Weihao Gan:
Challenges on Large Scale Surveillance Video Analysis. 69-76 - Jakub Sochor, Jakub Spanhel
, Roman Juránek
, Petr Dobes
, Adam Herout
:
Graph@FIT Submission to the NVIDIA AI City Challenge 2018. 77-84 - Tingyu Mao, Wei Zhang, Haoyu He, Yanjun Lin, Vinay Kale, Alexander Stein, Zoran Kostic:
AIC2018 Report: Traffic Surveillance Research. 85-92 - Panagiotis Giannakeris, Vagia Kaltsa, Konstantinos Avgerinakis, Alexia Briassouli
, Stefanos Vrochidis
, Ioannis Kompatsiaris:
Speed Estimation and Abnormality Detection From Surveillance Cameras. 93-99 - Minh-Triet Tran
, Tung Dinh Duy, Thanh-Dat Truong, Vinh Ton-That, Thanh-Nhon Do, Quoc-An Luong, Thanh-An Nguyen
, Vinh-Tiep Nguyen, Minh N. Do
:
Traffic Flow Analysis With Multiple Adaptive Vehicle Detectors and Velocity Estimation With Landmark-Based Scanlines. 100-107 - Zheng Tang, Gaoang Wang, Hao Xiao
, Aotian Zheng, Jenq-Neng Hwang:
Single-Camera and Inter-Camera Vehicle Tracking and 3D Speed Estimation Based on Fusion of Visual and Semantic Features. 108-115 - Honghui Shi
, Zhonghao Wang, Yang Zhang, Xinchao Wang
, Thomas S. Huang:
Geometry-Aware Traffic Flow Analysis by Detection and Tracking. 116-120 - Chih-Wei Wu, Chih-Ting Liu, Cheng-En Chiang, Wei-Chih Tu, Shao-Yi Chien:
Vehicle Re-Identification With the Space-Time Prior. 121-128 - Jiayi Wei, Jianfei Zhao, Yanyun Zhao, Zhicheng Zhao:
Unsupervised Anomaly Detection for Traffic Surveillance Based on Background Modeling. 129-136 - Amit Kumar, Pirazh Khorramshahi, Wei-An Lin, Prithviraj Dhar, Jun-Cheng Chen
, Rama Chellappa:
A Semi-Automatic 2D Solution for Vehicle Speed Estimation From Monocular Videos. 137-144 - Yan Xu, Xi Ouyang, Yu Cheng, Shining Yu, Lin Xiong, Choon-Ching Ng, Sugiri Pranata, Shengmei Shen, Junliang Xing:
Dual-Mode Vehicle Motion Pattern Learning for High Performance Road Traffic Anomaly Detection. 145-152 - Shuai Hua, Manika Kapoor, David C. Anastasiu
:
Vehicle Tracking and Speed Estimation From Traffic Videos. 153-160 - Tingting Huang:
Traffic Speed Estimation From Surveillance Video Data. 161-165 - Pedro A. Marín-Reyes, Andrea Palazzi
, Luca Bergamini, Simone Calderara, Javier Lorenzo-Navarro
, Rita Cucchiara:
Unsupervised Vehicle Re-Identification Using Triplet Networks. 166-171
DeepGlobe: A Challenge for Parsing the Earth through Satellite Images
- Ilke Demir
, Krzysztof Koperski, David Lindenbaum, Guan Pang, Jing Huang
, Saikat Basu, Forest Hughes, Devis Tuia, Ramesh Raskar:
DeepGlobe 2018: A Challenge to Parse the Earth Through Satellite Images. 172-181 - Lichen Zhou, Chuang Zhang, Ming Wu:
D-LinkNet: LinkNet With Pretrained Encoder and Dilated Convolution for High Resolution Satellite Imagery Road Extraction. 182-186 - Ryuhei Hamaguchi
, Shuhei Hikosaka:
Building Detection From Satellite Imagery Using Ensemble of Size-Specific Detectors. 187-191 - Chao Tian, Cong Li, Jianping Shi:
Dense Fusion Classmate Network for Land Cover Classification. 192-196 - Shubhra Aich, William van der Kamp, Ian Stavness:
Semantic Binary Segmentation Using Convolutional Networks Without Decoders. 197-201 - Tao Sun, Zehui Chen, Wenxiang Yang, Yin Wang:
Stacked U-Nets With Multi-Output for Road Extraction. 202-206 - Alexander Buslaev, Selim S. Seferbekov, Vladimir Iglovikov, Alexey Shvets:
Fully Convolutional Network for Automatic Road Extraction From Satellite Imagery. 207-210 - Oleksandr Filin, Anton Zapara, Serhii Panchenko:
Road Detection With EOSResUNet and Post Vectorizing Algorithm. 211-215 - Jigar Doshi:
Residual Inception Skip Network for Binary Segmentation. 216-219 - Dragos Costea, Alina Marcu, Emil Slusanschi
, Marius Leordeanu:
Roadmap Generation Using a Multi-Stage Ensemble of Deep Neural Networks With Smoothing-Based Optimization. 220-224 - Matt Dickenson, Lionel Gueguen:
Rotated Rectangles for Symbolized Building Footprint Extraction. 225-228 - Sergey Golovanov, Rauf Kurbanov
, Aleksey Artamonov, Alex Davydow, Sergey I. Nikolenko
:
Building Detection From Satellite Imagery Using a Composite Loss Function. 229-232 - Vladimir Iglovikov, Selim S. Seferbekov, Alexander Buslaev, Alexey Shvets:
TernausNetV2: Fully Convolutional Network for Instance Segmentation. 233-237 - Weijia Li, Conghui He, Jiarui Fang
, Haohuan Fu:
Semantic Segmentation Based Building Extraction Method Using Multi-Source GIS Map Datasets and Satellite Imagery. 238-241 - Rémi Delassus, Romain Giot:
CNNs Fusion for Building Detection in Aerial Images for the Building Detection Challenge. 242-246 - Kang Zhao, Jungwon Kang, Jaewook Jung, Gunho Sohn:
Building Extraction From Satellite Images Using Mask R-CNN With Building Boundary Regularization. 247-251 - Tzu-Sheng Kuo, Keng-Sen Tseng, Jia-Wei Yan, Yen-Cheng Liu, Yu-Chiang Frank Wang:
Deep Aggregation Net for Land Cover Classification. 252-256 - Arthita Ghosh, Max Ehrlich, Sohil Shah, Larry S. Davis, Rama Chellappa:
Stacked U-Nets for Ground Material Segmentation in Remote Sensing Imagery. 257-261 - Alexander Rakhlin, Alex Davydow, Sergey I. Nikolenko
:
Land Cover Classification From Satellite Imagery With U-Net and Lovasz-Softmax Loss. 262-266 - Mohamed Samy, Karim Amer, Kareem Eissa, Mahmoud Shaker, Mohamed ElHelw:
NU-Net: Deep Residual Wide Field of View Convolutional Neural Network for Semantic Segmentation. 267-271 - Selim S. Seferbekov, Vladimir Iglovikov, Alexander Buslaev, Alexey Shvets:
Feature Pyramid Network for Multi-Class Land Segmentation. 272-275 - Guillem Pascual, Santi Seguí
, Jordi Vitrià
:
Uncertainty Gated Network for Land Cover Segmentation. 276-279 - Alex Davydow, Sergey I. Nikolenko
:
Land Cover Classification With Superpixels and Jaccard Index Post-Optimization. 280-284
Visual Understanding of Humans in Crowd Scene and Look Into Person Challenge
- Yu-Jhe Li, Fu-En Yang, Yen-Cheng Liu, Yu-Ying Yeh, Xiaofei Du, Yu-Chiang Frank Wang:
Adaptation and Re-Identification Network: An Unsupervised Deep Transfer Learning Approach to Person Re-Identification. 172-178 - Aske R. Lejbølle, Benjamin Krogh, Kamal Nasrollahi, Thomas B. Moeslund
:
Attention in Multimodal Neural Networks for Person Re-Identification. 179-187 - Girum G. Demisse, Konstantinos Papadopoulos, Djamila Aouada
, Björn E. Ottersten:
Pose Encoding for Robust Skeleton-Based Action Recognition. 188-194 - Diptodip Deb, Jonathan Ventura:
An Aggregated Multicolumn Dilated Convolution Network for Perspective-Free Counting. 195-204 - Mihai Fieraru, Anna Khoreva, Leonid Pishchulin, Bernt Schiele
:
Learning to Refine Human Pose Estimation. 205-214 - Meng Yang, Lida Rashidi, Sutharshan Rajasegarar
, Christopher Leckie
, Aravinda S. Rao
, Marimuthu Palaniswami
:
Crowd Activity Change Point Detection in Videos via Graph Stream Mining. 215-223
Deep Learning for Visual SLAM
- Daniel DeTone, Tomasz Malisiewicz, Andrew Rabinovich:
SuperPoint: Self-Supervised Interest Point Detection and Description. 224-236 - Emilio Parisotto, Devendra Singh Chaplot, Jian Zhang, Ruslan Salakhutdinov:
Global Pose Estimation With an Attention-Based Recurrent Network. 237-246 - Stefan Milz, Georg Arbeiter, Christian Witt, Bassam Abdallah, Senthil Kumar Yogamani:
Visual SLAM for Automated Driving: Exploring the Applications of Deep Learning. 247-257 - Masaya Kaneko, Kazuya Iwami, Toru Ogawa, Toshihiko Yamasaki, Kiyoharu Aizawa:
Mask-SLAM: Robust Feature-Based Monocular SLAM by Masking Using Semantic Segmentation. 258-266 - Ganesh Iyer, J. Krishna Murthy, Gunshi Gupta, K. Madhava Krishna, Liam Paull:
Geometric Consistency for Self-Supervised End-to-End Visual Odometry. 267-275 - Sungil Choi, Seungryong Kim, Kihong Park, Kwanghoon Sohn:
Learning Descriptor, Confidence, and Depth Estimation in Multi-View Stereo. 276-282 - Arun C. S. Kumar, Suchendra M. Bhandarkar, Mukta Prasad:
DepthNet: A Recurrent Neural Network Architecture for Monocular Depth Prediction. 283-291 - Luis Contreras, Walterio W. Mayol-Cuevas:
Towards CNN Map Representation and Compression for Camera Relocalisation. 292-299 - Arun C. S. Kumar, Suchendra M. Bhandarkar, Mukta Prasad:
Monocular Depth Prediction Using Generative Adversarial Networks. 300-308 - Bo Yang
, Zihang Lai, Xiaoxuan Lu
, Shuyu Lin, Hongkai Wen, Andrew Markham, Niki Trigoni
:
Learning 3D Scene Semantics and Structure From a Single Depth Image. 309-312 - Lachlan Nicholson, Michael Milford
, Niko Sünderhauf
:
QuadricSLAM: Dual Quadrics As SLAM Landmarks. 313-314
Diff-CVML: Differential Geometry in Computer Vision and Machine Learning
- Hang Shao, Abhishek Kumar, P. Thomas Fletcher:
The Riemannian Geometry of Deep Generative Models. 315-323 - Kyungmin Ahn, J. Derek Tucker, Wei Wu
, Anuj Srivastava
:
Elastic Handling of Predictor Phase in Functional Regression Models. 324-331 - Maxime Louis, Benjamin Charlier, Stanley Durrleman:
Geodesic Discriminant Analysis for Manifold-Valued Data. 332-340 - Rudrasis Chakraborty, Chun-Hao Yang
, Baba C. Vemuri:
A Mixture Model for Aggregation of Multiple Pre-Trained Weak Classifiers. 341-348 - Hongjun Choi
, Qiao Wang, Meynard John Toledo, Pavan K. Turaga
, Matthew P. Buman
, Anuj Srivastava
:
Temporal Alignment Improves Feature Quality: An Experiment on Activity Recognition With Accelerometer Data. 349-357 - Justin D. Strait
, Sebastian Kurtek, Steven N. MacEachern:
Locally-Weighted Elastic Comparison of Planar Shapes. 358-366 - Dinesh Acharya, Zhiwu Huang, Danda Pani Paudel
, Luc Van Gool:
Covariance Pooling for Facial Expression Recognition. 367-374 - Mehran Javanmardi, Ricardo Bigolin Lanfredi, Müjdat Çetin, Tolga Tasdizen:
Image Segmentation by Deep Learning of Disjunctive Normal Shape Model Shape Representation. 375-382 - Suhas Lohit, Ankan Bansal, Nitesh Shroff, Jaishanker K. Pillai, Pavan K. Turaga
, Rama Chellappa:
Predicting Dynamical Evolution of Human Activities From a Single Image. 383-392 - Ioana Ilea
, Lionel Bombrun
, Salem Said, Yannick Berthoumieu:
Covariance Matrices Encoding Based on the Log-Euclidean and Affine Invariant Riemannian Metrics. 393-402 - Somenath Das, Suchendra M. Bhandarkar:
Principal Curvature Guided Surface Geometry Aware Global Shape Representation. 403-412
Biometrics
- Yoanna Martínez-Díaz, Heydi Méndez-Vázquez
, Leyanis López-Avila
, Leonardo Chang
, Luis Enrique Sucar, Massimo Tistarelli:
Toward More Realistic Face Recognition Evaluation Protocols for the YouTube Faces Database. 413-421 - Yefei Chen, Jianbo Su
:
Dict Layer: A Structured Dictionary Layer. 422-431 - Elias N. Zois
, Marianna Papagiannopoulou, Dimitrios Tsourounis
, George Economou
:
Hierarchical Dictionary Learning and Sparse Coding for Static Signature Verification. 432-442 - Mohsen Jenadeleh
, Marius Pedersen, Dietmar Saupe:
Realtime Quality Assessment of Iris Biometrics Under Visible Light. 443-452 - Narsi Reddy, Dewan Fahim Noor, Zhu Li, Reza Derakhshani:
Multi-Frame Super Resolution for Ocular Biometrics. 453-461 - Arun Kumar Jindal, Srinivas Chalamala, Santosh Kumar Jami:
Face Template Protection Using Deep Convolutional Neural Network. 462-470 - Ruben Tolosana
, Rubén Vera-Rodríguez, Julian Fiérrez, Javier Ortega-Garcia
:
Incorporating Touch Biometrics to Mobile One-Time Passwords: Exploration of Digits. 471-478 - Maneet Singh, Shruti Nagpal, Mayank Vatsa
, Richa Singh
, Angshul Majumdar:
Identity Aware Synthesis for Cross Resolution Face Recognition. 479-488 - Sivaram Prasad Mudunuri, Soubhik Sanyal, Soma Biswas:
GenLR-Net: Deep Framework for Very Low Resolution Face and Object Recognition With Generalization to Unseen Categories. 489-498 - Hadi Kazemi, Sobhan Soleymani, Ali Dabouei, Seyed Mehdi Iranmanesh, Nasser M. Nasrabadi:
Attribute-Centered Loss for Soft-Biometrics Guided Face Sketch-Photo Recognition. 499-507 - Jude Ezeobiejesi, Bir Bhanu
:
Latent Fingerprint Image Quality Assessment Using Deep Learning. 508-516 - Shaan Chopra, Aakarsh Malhotra
, Mayank Vatsa
, Richa Singh
:
Unconstrained Fingerphoto Database. 517-525 - Mustafa Berkay Yilmaz
, Kagan Öztürk:
Hybrid User-Independent and User-Dependent Offline Signature Verification With a Two-Channel CNN. 526-534 - Siqi Yang, Arnold Wiliem, Brian C. Lovell
:
It Takes Two to Tango: Cascading Off-the-Shelf Face Detectors. 535-543 - Javier Hernandez-Ortega, Julian Fiérrez, Aythami Morales, Pedro Tome:
Time Analysis of Pulse-Based Face Anti-Spoofing in Visible and NIR. 544-552 - Fariborz Taherkhani
, Nasser M. Nasrabadi, Jeremy M. Dawson:
A Deep Face Identification Network Enhanced by Facial Attributes Prediction. 553-560 - Yasushi Makihara, Daisuke Adachi, Chi Xu
, Yasushi Yagi:
Gait Recognition by Deformable Registration. 561-571 - Daksha Yadav, Naman Kohli, Akshay Agarwal
, Mayank Vatsa
, Richa Singh
, Afzel Noore:
Fusion of Handcrafted and Deep Learning Features for Large-Scale Multiple Iris Presentation Attack Detection. 572-579 - Gee-Sern Jison Hsu, Wen-Fong Huang, Jiunn-Horng Kang:
Hierarchical Network for Facial Palsy Detection. 580-586
Embedded Vision
- Mennatullah Siam, Mostafa Gamal, Moemen Abdel-Razek, Senthil Kumar Yogamani, Martin Jägersand, Hong Zhang:
A Comparative Study of Real-Time Semantic Segmentation for Autonomous Driving. 587-597 - Nikitha Vallurupalli, Sriharsha Annamaneni, Girish Varma, C. V. Jawahar
, Manu Mathew, Soyeb Nagori:
Efficient Semantic Segmentation Using Gradual Grouping. 598-606 - Hongxing Gao, Wei Tao, Dongchao Wen
, Tse-Wei Chen, Kinya Osa, Masami Kato:
IFQ-Net: Integrated Fixed-Point Quantization Networks for Embedded Vision. 607-615 - Takayuki Ujiie, Masayuki Hiromoto, Takashi Sato
:
Interpolation-Based Object Detection Using Motion Vectors for Embedded Real-Time Tracking Systems. 616-624 - Cevahir Çigla, Rohan Thakker, Larry H. Matthies:
Onboard Stereo Vision for Drone Pursuit or Sense and Avoid. 625-633 - Andre Ivan, Williem
, In Kyu Park:
Light Field Depth Estimation on Off-the-Shelf Mobile GPU. 634-643 - Nicholas F. Y. Chen:
Pseudo-Labels for Supervised Learning on Dynamic Vision Sensor Data, Applied to Object Detection Under Ego-Motion. 644-653 - Cevahir Çigla, Kemal E. Sahin, Fikret Alim:
GPU Based Video Object Tracking on PTZ Cameras. 654-662 - Alexandre Briot, Prashanth Viswanath, Senthil Kumar Yogamani:
Analysis of Efficient CNN Design Techniques for Semantic Segmentation. 663-672 - Pankaj Bhowmik, Md Jubaer Hossain Pantho, Marjan Asadinia, Christophe Bobda:
Design of a Reconfigurable 3D Pixel-Parallel Neuromorphic Architecture for Smart Image Sensor. 673-681 - Paolo Di Febbo, Carlo Dal Mutto, Kinh Tieu, Stefano Mattoccia
:
KCNN: Extremely-Efficient Hardware Keypoint Detection With a Compact Convolutional Neural Network. 682-690
New Trends in Image Restoration and Enhancement
- Andrey Ignatov, Nikolay Kobyshev, Radu Timofte
, Kenneth Vanhoey, Luc Van Gool:
WESPE: Weakly Supervised Photo Enhancer for Digital Cameras. 691-700 - Yuan Yuan, Siyuan Liu, Jiawei Zhang, Yongbing Zhang, Chao Dong, Liang Lin:
Unsupervised Image Super-Resolution Using Cycle-in-Cycle Generative Adversarial Networks. 701-710 - Honggang Chen, Xiaohai He, Linbo Qing, Shuhua Xiong, Truong Q. Nguyen:
DPW-SDNet: Dual Pixel-Wavelet Domain Deep CNNs for Soft Decoding of JPEG-Compressed Images. 711-720 - Cheng-Han Lee, Kaipeng Zhang, Hu-Cheng Lee, Chia-Wen Cheng, Winston H. Hsu:
Attribute Augmented Convolutional Neural Network for Face Hallucination. 721-729 - Yixin Du, Xin Li
:
Recursive Deep Residual Learning for Single Image Dehazing. 730-737 - S. Alireza Golestaneh, Lina J. Karam
:
Synthesized Texture Quality Assessment via Multi-Scale Spatial and Statistical Texture Attributes of Image and Gradient Magnitude Coefficients. 738-744 - Meiguang Jin, Michael Hirsch, Paolo Favaro:
Learning Face Deblurring Fast and Wide. 745-753 - Codruta O. Ancuti, Cosmin Ancuti, Radu Timofte
, Christophe De Vleeschouwer:
O-HAZE: A Dehazing Benchmark With Real Hazy and Haze-Free Outdoor Images. 754-762 - George Seif, Dimitrios Androutsos:
Large Receptive Field Networks for High-Scale Image Super-Resolution. 763-772 - Pengju Liu, Hongzhi Zhang, Kai Zhang
, Liang Lin, Wangmeng Zuo:
Multi-Level Wavelet-CNN for Image Restoration. 773-782 - Asha Anoosheh, Eirikur Agustsson, Radu Timofte
, Luc Van Gool:
ComboGAN: Unrestrained Scalability for Image Domain Translation. 783-790 - Namhyuk Ahn, Byungkon Kang, Kyung-Ah Sohn:
Image Super-Resolution via Progressive Cascading Residual Network. 791-799 - Jun-Hyuk Kim
, Jong-Seok Lee:
Deep Residual Network With Enhanced Upscaling Module for Super-Resolution. 800-808 - Rong Chen
, Yanyun Qu, Kun Zeng, Jinkang Guo, Cuihua Li, Yuan Xie:
Persistent Memory Residual Network for Single Image Super Resolution. 809-816 - Sehwan Ki, Hyeonjun Sim, Jae-Seok Choi, Saehun Kim, Munchurl Kim:
Fully End-to-End Learning Based Conditional Boundary Equilibrium GAN With Receptive Field Sizes Enlarged for Single Ultra-High Resolution Image Dehazing. 817-824 - Deniz Engin, Anil Genç, Hazim Kemal Ekenel
:
Cycle-Dehaze: Enhanced CycleGAN for Single Image Dehazing. 825-833 - Manoj Sharma, Rudrabha Mukhopadhyay, Avinash Upadhyay
, Sriharsha Koundinya, Ankit Shukla, Santanu Chaudhury:
IRGUN: Improved Residue Based Gradual Up-Scaling Network for Single Image Super Resolution. 834-843 - Sriharsha Koundinya, Himanshu Sharma, Manoj Sharma, Avinash Upadhyay
, Raunak Manekar, Rudrabha Mukhopadhyay, Abhijit Karmakar
, Santanu Chaudhury:
2D-3D CNN Based Architectures for Spectral Reconstruction From RGB Images. 844-851 - Radu Timofte
, Shuhang Gu, Jiqing Wu, Luc Van Gool:
NTIRE 2018 Challenge on Single Image Super-Resolution: Methods and Results. 852-863 - Yifan Wang, Federico Perazzi, Brian McWilliams, Alexander Sorkine-Hornung, Olga Sorkine-Hornung, Christopher Schroers:
A Fully Progressive Approach to Single-Image Super-Resolution. 864-873 - Yijie Bei, Alexandru Damian, Shijia Hu, Sachit Menon, Nikhil Ravi, Cynthia Rudin:
New Techniques for Preserving Global Structure and Denoising With Low Information Loss in Single-Image Super-Resolution. 874-881 - Dongwon Park, Kwan-Young Kim, Se Young Chun:
Efficient Module Based Single Image Super Resolution for Multiple Problems. 882-890 - Cosmin Ancuti, Codruta O. Ancuti, Radu Timofte
:
NTIRE 2018 Challenge on Image Dehazing: Methods and Results. 891-901 - He Zhang, Vishwanath Sindagi, Vishal M. Patel:
Multi-Scale Single Image Dehazing Using Perceptual Pyramid Deep Network. 902-911 - Hyeonjun Sim, Sehwan Ki, Jae-Seok Choi, Soomin Seo, Saehun Kim, Munchurl Kim:
High-Resolution Image Dehazing With Respect to Training Losses and Receptive Field Sizes. 912-919 - Ranjan Mondal, Sanchayan Santra
, Bhabatosh Chanda:
Image Dehazing by Joint Estimation of Transmittance and Airlight Using Bi-Directional Consistency Loss Minimized FCN. 920-928 - Boaz Arad, Ohad Ben-Shahar
, Radu Timofte
:
NTIRE 2018 Challenge on Spectral Reconstruction From RGB Images. 929-938 - Zhan Shi, Chang Chen, Zhiwei Xiong, Dong Liu, Feng Wu:
HSCNN+: Advanced CNN-Based Hyperspectral Recovery From RGB Images. 939-947 - Tarek Stiebel, Simon Koppers, Philipp Seltsam, Dorit Merhof:
Reconstructing Spectral Images From RGB-Images Using a Convolutional Neural Network. 948-953
Autonomous Driving
- Xinyu Huang, Xinjing Cheng, Qichuan Geng, Binbin Cao, Dingfu Zhou, Peng Wang, Yuanqing Lin, Ruigang Yang
:
The ApolloScape Dataset for Autonomous Driving. 954-960 - JeongYeol Baek, Ioana Veronica Chelu, Livia Iordache, Vlad Paunescu
, HyunJoo Ryu, Alexandru Ghiuta, Andrei Petreanu, YunSung Soh, Andrei Leica, ByeongMoon Jeon:
Scene Understanding Networks for Autonomous Driving Based on Around View Monitoring System. 961-968 - Jonathan Tremblay, Aayush Prakash, David Acuna, Mark Brophy, Varun Jampani, Cem Anil, Thang To, Eric Cameracci, Shaad Boochoon, Stan Birchfield:
Training Deep Networks With Synthetic Data: Bridging the Reality Gap by Domain Randomization. 969-977 - Arantxa Casanova, Guillem Cucurull, Michal Drozdzal, Adriana Romero, Yoshua Bengio:
On the Iterative Refinement of Densely Connected Representation Levels for Semantic Segmentation. 978-987 - Satoshi Tsutsui, Tommi Kerola, Shunta Saito, David J. Crandall:
Minimizing Supervision for Free-Space Segmentation. 988-997 - Yu-Hui Huang
, Xu Jia, Stamatios Georgoulis, Tinne Tuytelaars
, Luc Van Gool:
Error Correction for Dense Semantic Image Labeling. 998-1006 - Nikolai Smolyanskiy, Alexey Kamenev, Stan Birchfield:
On the Importance of Stereo for Accurate Depth Estimation: An Efficient Semi-Supervised Deep Neural Network Approach. 1007-1015 - Shaohui Sun, Ramesh Sarukkai, Jack Kwok, Vinay D. Shet:
Accurate Deep Direct Geo-Localization From Ground Imagery and Phone-Grade GPS. 1016-1023 - Ernest Cheung, Aniket Bera, Dinesh Manocha:
Efficient and Safe Vehicle Navigation Based on Driver Behavior Classification. 1024-1031 - Bhakti Baheti, Suhas S. Gajre
, Sanjay N. Talbar
:
Detection of Distracted Driver Using Convolutional Neural Network. 1032-1038 - Aniket Bera, Tanmay Randhavane, Austin Wang, Dinesh Manocha, Emily Kubin, Kurt Gray
:
Classifying Group Emotions for Socially-Aware Autonomous Vehicle Navigation. 1039-1047 - Andrew Best, Sahil Narang, Lucas Pasqualin, Daniel Barber, Dinesh Manocha:
AutonoVi-Sim: Autonomous Vehicle Simulation Platform With Weather, Sensing, and Traffic Control. 1048-1056 - Arun C. S. Kumar, Suchendra M. Bhandarkar, Mukta Prasad:
Learning Hierarchical Models for Class-Specific Reconstruction From Natural Data. 1057-1065 - Pratik Prabhanjan Brahma, Adrienne Othon:
Subset Replay Based Continual Learning for Scalable Improvement of Autonomous Systems. 1066-1074
Human Pose, Motion, Activities and Shape in 3D
- Endri Dibra, Silvan Melchior, Ali Balkis, Thomas Wolf, Cengiz Öztireli, Markus H. Gross:
Monocular RGB Hand Pose Inference From Unsupervised Refinable Nets. 1075-1085 - Maren Awiszus
, Stella Grasshof
, Felix Kuhnke, Jörn Ostermann:
Unsupervised Features for Facial Expression Intensity Estimation Over Time. 1086-1094 - Nolan Lunscher, John S. Zelek:
Deep Learning Whole Body Point Cloud Scans From a Single Depth Map. 1095-1102 - Akshay Rangesh, Mohan M. Trivedi:
HandyNet: A One-Stop Solution to Detect, Segment, Localize & Analyze Driver Hands. 1103-1110
Brave New Ideas for Video Understanding
- Debidatta Dwibedi, Pierre Sermanet, Jonathan Tompson:
Temporal Reasoning in Videos Using Convolutional Gated Recurrent Units. 1111-1116 - Ali Diba, Mohsen Fayyaz, Vivek Sharma, Amir Hossein Karami, Mohammad Mahdi Arzani, Rahman Yousefzadeh, Luc Van Gool:
Temporal 3D ConvNets Using Temporal Transition Layer. 1117-1121 - Wonmin Byeon, Qin Wang, Rupesh Kumar Srivastava, Petros Koumoutsakos:
ContextVP: Fully Context-Aware Video Prediction. 1122-1126 - Michael Wray, Davide Moltisanti, Dima Damen:
Towards an Unequivocal Representation of Actions. 1127-1131 - Suman Saha, Rajitha Navarathna, Leonhard Helminger, Romann M. Weber:
Unsupervised Deep Representations for Learning Audience Facial Behaviors. 1132-1137 - Shweta Bhardwaj, Mitesh M. Khapra:
I Have Seen Enough: A Teacher Student Network for Video Classification Using Fewer Frames. 1138-1142
Perception Beyond the Visible Spectrum
- Amanda Berg, Jörgen Ahlberg, Michael Felsberg
:
Generating Visible Spectrum Images From Thermal Infrared. 1143-1152 - Shuo Liu, Vijay John, Erik Blasch, Zheng Liu, Ying Huang:
IR2VI: Enhanced Night Environmental Perception by Unsupervised Thermal Image Translation. 1153-1160 - Timothy Doster, Tegan Emerson, Colin C. Olson:
Path Orthogonal Matching Pursuit for Sparse Reconstruction and Denoising of SWIR Maritime Imagery. 1161-1168 - Patricia L. Suarez, Angel Domingo Sappa, Boris Xavier Vintimilla, Riad I. Hammoud:
Deep Learning Based Single Image Dehazing. 1169-1176 - Kin Gwn Lore
, Kishore K. Reddy, Michael Giering, Edgar A. Bernal:
Generative Adversarial Networks for Depth Map Estimation From RGB Video. 1177-1185 - Michael Loveday, Toby P. Breckon
:
On the Impact of Parallax Free Colour and Infrared Image Co-Registration to Fused Illumination Invariant Adaptive Background Modelling. 1186-1195 - Anthony Ortiz, Alonso Granados, Olac Fuentes, Christopher Kiekintveld, Dalton S. Rosario, Zachary Bell:
Integrated Learning and Feature Selection for Deep Neural Networks in Multispectral Images. 1196-1205 - Jiahang Che, Yuxiang Xing, Li Zhang:
A Comprehensive Solution for Deep-Learning Based Cargo Inspection to Discriminate Goods in Containers. 1206-1213 - Jin-Fu Lin, Yen-Liang Lin, Erh-Kan King, Hung-Ting Su, Winston H. Hsu:
Cross-Domain Hallucination Network for Fine-Grained Object Recognition. 1214-1221 - Brian Millikan, Hassan Foroosh, Qiyu Sun:
Deep Convolutional Neural Networks With Integrated Quadratic Correlation Filters for Automatic Target Recognition. 1222-1229 - Xingyu Wan, Jinjun Wang, Sanping Zhou:
An Online and Flexible Multi-Object Tracking Framework Using Long Short-Term Memory. 1230-1238 - Mark W. Koch, R. Derek West, Robert Riley, Tu-Thach Quach:
Polarimetric Synthetic-Aperture-Radar Change-Type Classification With a Hyperparameter-Free Open-Set Classifier. 1239-1246 - Marcel Sheeny
, Andrew M. Wallace
, Mehryar Emambakhsh, Sen Wang
, Barry Connor:
POL-LWIR Vehicle Detection: Convolutional Neural Networks Meet Polarised Infrared Sensors. 1247-1253
Computer Vision for Physiological Measurement
- Christian S. Pilz, Sebastian Zaunseder, Jarek Krajewski, Vladimir Blazek:
Local Group Invariance for Heart Rate Estimation From Face Videos in the Wild. 1254-1262 - Genki Okada, Kenta Masui, Norimichi Tsumura:
Advertisement Effectiveness Estimation Based on Crowdsourced Multimodal Affective Responses. 1263-1271 - Ewa Magdalena Nowara, Tim K. Marks, Hassan Mansour, Ashok Veeraraghavan:
SparsePPG: Towards Driver Monitoring Using Camera-Based Vital Signs Estimation in Near-Infrared. 1272-1281 - Gregory F. Lewis, Maria I. Davila, Stephen W. Porges:
Novel Algorithms to Monitor Continuous Cardiac Activity With a Video Camera. 1282-1290 - Emmett Kerr, Sonya A. Coleman
, T. Martin McGinnity, Andrea Shepherd:
Measurement of Capillary Refill Time (CRT) in Healthy Subjects Using a Robotic Hand. 1291-1298 - Changchen Zhao, Chun-Liang Lin, Weihai Chen, Zhengguo Li:
A Novel Framework for Remote Photoplethysmography Pulse Extraction on Compressed Videos. 1299-1308 - Chuanxiang Tang, Jiwu Lu, Jie Liu:
Non-Contact Heart Rate Monitoring by Combining Convolutional Neural Network Skin Detection and Remote Photoplethysmography via a Low-Cost Camera. 1309-1315 - Puneet Gupta
, Brojeshwar Bhowmick, Arpan Pal
:
Exploring the Feasibility of Face Video Based Instantaneous Heart-Rate for Micro-Expression Spotting. 1316-1323 - Munenori Fukunishi, Kouki Kurita, Shoji Yamamoto, Norimichi Tsumura:
Video Based Measurement of Heart Rate and Heart Rate Variability Spectrogram From Estimated Hemoglobin Information. 1324-1331 - Richard Macwan, Serge Bobbia, Yannick Benezeth, Julien Dubois, Alamin Mansouri:
Periodic Variance Maximization Using Generalized Eigenvalue Decomposition Applied to Remote Photoplethysmography Estimation. 1332-1340 - Serge Bobbia, Duncan Luguern, Yannick Benezeth, Keisuke Nakamura, Randy Gomez, Julien Dubois:
Real-Time Temporal Superpixels for Unsupervised Remote Photoplethysmography. 1341-1348 - Tom Vogels, Mark van Gastel, Wenjin Wang, Gerard de Haan:
Fully-Automatic Camera-Based Pulse-Oximetry During Sleep. 1349-1357 - Andreia Vieira Moco, Sander Stuijk
, Mark van Gastel, Gerard de Haan:
Impairing Factors in Remote-PPG Pulse Transit Time Measurements on the Face. 1358-1366 - Daniel McDuff:
Deep Super Resolution for Recovering Physiological Information From Videos. 1367-1374 - Jaehee Park, Ashutosh Sabharwal, Ashok Veeraraghavan:
Direct-Global Separation for Improved Imaging Photoplethysmography. 1375-1384
Automated Analysis of Marine Video for Environmental Monitoring
- Deborah Levy, Yuval Belfer, Elad Osherov, Eyal Bigal
, Aviad P. Scheinin, Hagai Nativ, Dan Tchernov
, Tali Treibitz
:
Automated Analysis of Marine Video With Limited Data. 1385-1393 - Andrew King, Suchendra M. Bhandarkar, Brian M. Hopkinson:
A Comparison of Deep Learning Methods for Semantic Segmentation of Coral Reef Survey Images. 1394-1402 - Yi-Min Chou, Chien-Hung Chen, Keng-Hao Liu, Chu-Song Chen:
Stingray Detection of Aerial Images Using Augmented Training Images Generated by a Conditional Generative Model. 1403-1409 - Malte Pedersen
, Stefan Bengtson
, Rikke Gade, Niels Madsen, Thomas B. Moeslund
:
Camera Calibration for Underwater 3D Reconstruction Based on Ray Tracing Using Snell's Law. 1410-1417
Joint Detection, Tracking, and Prediction in the Wild
- Emad Barsoum, John R. Kender, Zicheng Liu:
HP-GAN: Probabilistic 3D Human Motion Prediction via GAN. 1418-1427 - Roberto Henschel, Laura Leal-Taixé, Daniel Cremers
, Bodo Rosenhahn:
Fusion of Head and Full-Body Detectors for Multi-Object Tracking. 1428-1437 - Neeti Narayan, Nishant Sankaran, Srirangaraj Setlur
, Venu Govindaraju:
Re-Identification for Online Person Tracking by Modeling Space-Time Continuum. 1438-1447 - Mehran Khodabandeh, Hamid Reza Vaezi Joze, Ilya Zharkov, Vivek Pradeep:
DIY Human Action Dataset Generation. 1448-1458 - Hilke Kieritz, Wolfgang Hübner, Michael Arens:
Joint Detection and Online Multi-Object Tracking. 1459-1467 - Nachiket Deo, Mohan M. Trivedi:
Convolutional Social Pooling for Vehicle Trajectory Prediction. 1468-1476
Visual Odometry and Computer Vision Applications Based on Location Clues
- Chun-Wei Chen, Yin-Hsi Kuo, Tang Lee, Cheng-Han Lee, Winston H. Hsu:
Drone-View Building Identification by Cross-View Visual Learning and Relative Spatial Estimation. 1477-1485 - Silvio Giancola
, Jens Schneider
, Peter Wonka, Bernard Ghanem
:
Integration of Absolute Orientation Measurements in the KinectFusion Reconstruction Pipeline. 1486-1495 - Xue Iuan Wong, Taewook Lee, Puneet Singla, Manoranjan Majji:
Optimal Linear Attitude Estimator for Alignment of Point Clouds. 1496-1504 - Yi Xu, Yuzhang Wu, Hui Zhou:
Multi-Scale Voxel Hashing and Efficient 3D Representation for Mobile Augmented Reality. 1505-1512 - Ahmed Nassar, Karim Amer, Reda ElHakim
, Mohamed ElHelw:
A Deep CNN-Based Framework for Enhanced Aerial Imagery Registration With Applications to UAV Geolocalization. 1513-1523 - Qiong Wu, Ambrose Li:
Automated Virtual Navigation and Monocular Localization of Indoor Spaces From Videos. 1524-1532 - Tristan Swedish, Ramesh Raskar:
Deep Visual Teach and Repeat on Path Networks. 1533-1542 - Liang Yang, Bing Li, Wei Li, Biao Jiang, Jizhong Xiao:
Semantic Metric 3D Reconstruction for Concrete Inspection. 1543-1551
Bright and Dark Sides of Computer Vision: Challenges and Opportunities for Privacy and Security
- Aniket Roy, Diangarti Bhalang Tariang
, Rajat Subhra Chakraborty, Ruchira Naskar
:
Discrete Cosine Transform Residual Feature Based Filtering Forgery and Splicing Detection in JPEG Images. 1552-1560 - Noa Privman-Horesh, Azmi Haider, Hagit Hel-Or:
Forgery Detection in 3D-Sensor Images. 1561-1569 - Jiawei Chen, Janusz Konrad
, Prakash Ishwar
:
VGAN-Based Image Representation Learning for Privacy-Preserving Facial Expression Recognition. 1570-1579 - Jinyuan Zhao, Natalia Frumkin, Janusz Konrad
, Prakash Ishwar
:
Privacy-Preserving Indoor Localization via Active Scene Illumination. 1580-1589 - Yifang Li
, Wyatt Troutman, Bart P. Knijnenburg, Kelly Caine:
Human Perceptions of Sensitive Content in Photos. 1590-1596 - Jamie Hayes:
On Visible Adversarial Perturbations & Digital Watermarking. 1597-1604 - Mahmood Sharif, Lujo Bauer
, Michael K. Reiter:
On the Suitability of Lp-Norms for Creating and Preventing Adversarial Examples. 1605-1613 - Hossein Hosseini, Radha Poovendran
:
Semantic Adversarial Examples. 1614-1619 - Steven Hoffman, Renu Sharma, Arun Ross:
Convolutional Neural Networks for Iris Presentation Attack Detection: Toward Cross-Dataset and Cross-Sensor Generalization. 1620-1628
Efficient Deep Learning for Computer Vision
- Amarjot Singh, Devendra Patil, S. N. Omkar:
Eye in the Sky: Real-Time Drone Surveillance System (DSS) for Violent Individuals Identification Using ScatterNet Hybrid Deep Learning Network. 1629-1637 - Amir Gholami, Kiseok Kwon, Bichen Wu, Zizheng Tai, Xiangyu Yue, Peter H. Jin, Sicheng Zhao, Kurt Keutzer:
SqueezeNext: Hardware-Aware Neural Network Design. 1638-1647 - Lane McIntosh, Niru Maheswaranathan, David Sussillo, Jonathon Shlens:
Recurrent Segmentation for Variable Computational Budgets. 1648-1657 - Oyebade K. Oyedotun
, Abd El Rahman Shabayek
, Djamila Aouada
, Björn E. Ottersten:
Highway Network Block With Gates Constraints for Training Very Deep Networks. 1658-1667 - Dae Ha Kim, Seung Hyun Lee, Byung Cheol Song:
MUNet: Macro Unit-Based Convolutional Neural Network for Mobile Devices. 1668-1676 - Baohua Sun, Lin Yang, Patrick Dong, Wenhan Zhang, Jason Dong, Charles Young:
Ultra Power-Efficient CNN Domain Specific Accelerator With 9.3TOPS/Watt for Mobile and Embedded Applications. 1677-1685 - Yi-Min Chou, Yi-Ming Chan, Jia-Hong Lee
, Chih-Yi Chiu
, Chu-Song Chen:
Merging Deep Neural Networks for Mobile Devices. 1686-1694 - Qing Zhang, Mengru Zhang, Mengdi Wang
, Wanchen Sui, Chen Meng, Jun Yang, Weidan Kong, Xiaoyuan Cui, Wei Lin:
Efficient Deep Learning Inference Based on Model Compression. 1695-1702 - Michael T. Chan, Daniel Scarafoni, Ronald Duarte, Jason Thornton, Luke J. Skelly:
Learning Network Architectures of Deep CNNs Under Resource Constraints. 1703-1710
Computer Vision in Sports
- Silvio Giancola
, Mohieddine Amine, Tarek Dghaily, Bernard Ghanem
:
SoccerNet: A Scalable Dataset for Action Spotting in Soccer Videos. 1711-1721 - Tharindu Fernando
, Sridha Sridharan, Clinton Fookes, Simon Denman:
Deep Decision Trees for Discriminative Dictionary Learning With Adversarial Multi-Agent Trajectories. 1722-1731 - Arda Senocak, Tae-Hyun Oh, Junsik Kim, In So Kweon:
Part-Based Player Identification Using Deep Convolutional Representation and Multi-Scale Pooling. 1732-1739 - A. J. Piergiovanni, Michael S. Ryoo:
Fine-Grained Activity Recognition in Baseball Videos. 1740-1748 - Rajkumar Theagarajan, Federico Pala, Xiu Zhang, Bir Bhanu
:
Soccer: Who Has the Ball? Generating Visual Analytics and Player Statistics. 1749-1757 - Vito Renò
, Nicola Mosca, Roberto Marani
, Massimiliano Nitti, Tiziana D'Orazio
, Ettore Stella
:
Convolutional Neural Networks Based Ball Detection in Tennis Games. 1758-1764 - Anthony Cioppa, Adrien Deliège, Marc Van Droogenbroeck:
A Bottom-Up Approach Based on Semantics for the Interpretation of the Main Camera Stream in Soccer Games. 1765-1774 - Kosuke Takahashi, Dan Mikami
, Mariko Isogawa, Hideaki Kimata:
Human Pose As Calibration Pattern; 3D Human Pose Estimation With Multiple Unsynchronized and Uncalibrated Cameras. 1775-1782 - Gen Li, Shikun Xu, Xiang Liu, Lei Li, Changhu Wang:
Jersey Number Recognition With Semi-Supervised Spatial Transformer Network. 1783-1790 - Dan Zecha, Moritz Einfalt, Christian Eggert, Rainer Lienhart
:
Kinematic Pose Rectification for Performance Analysis and Retrieval in Sports. 1791-1799 - Pushkar Shukla, Hemant Sadana, Apaar Bansal, Deepak Verma
, Carlos E. L. Elmadjian, Balasubramanian Raman, Matthew Turk:
Automatic Cricket Highlight Generation Using Event-Driven and Excitement-Based Features. 1800-1808 - Tomoya Kaichi, Shohei Mori
, Hideo Saito, Kosuke Takahashi, Dan Mikami, Mariko Isogawa, Hideaki Kimata:
Estimation of Center of Mass for Sports Scene Using Weighted Visual Hull. 1809-1815 - Mohib Ullah, Faouzi Alaya Cheikh:
A Directed Sparse Graphical Model for Multi-Target Tracking. 1816-1823 - Noor ul Huda, Kasper H. Jensen, Rikke Gade, Thomas B. Moeslund
:
Estimating the Number of Soccer Players Using Simulation-Based Occlusion Handling. 1824-1833
Computational Cameras and Displays
- Hajime Nagahara, Toshiki Sonoda, Dengyu Liu, Jinwei Gu:
Space-Time-Brightness Sampling Using an Adaptive Pixel-Wise Coded Exposure. 1834-1842 - Avinash Kumar, Manjula Gururaj, Kalpana Seshadrinathan, Ramkumar Narayanswamy:
Multi-Capture Dynamic Calibration of Multi-Camera Systems. 1843-1851 - Nianyi Li, Scott McCloskey, Jingyi Yu:
Jittered Exposures for Image Super-Resolution. 1852-1859
Women in Computer Vision
- Ilke Demir
, Dena Bazazian, Adriana Romero, Viktoriia Sharmanska
, Lyne P. Tchapmi:
WiCV 2018: The Fourth Women in Computer Vision Workshop. 1860-1862 - Kumar Rohit Malhotra, Anis Davoudi
, Scott Siegel, Azra Bihorac
, Parisa Rashidi
:
Autonomous Detection of Disruptions in the Intensive Care Unit Using Deep Mask R-CNN. 1863-1865 - Avantika Singh, Aditya Nigam
:
Encapsulating the Impact of Transfer Learning, Domain Knowledge and Training Strategies in Deep-Learning Based Architecture: A Biometric Based Case Study. 1866-1868 - Bojana Gajic, Ramón Baldrich:
Cross-Domain Fashion Image Retrieval. 1869-1871 - Dena Bazazian, Dimosthenis Karatzas
, Andrew D. Bagdanov:
Word Spotting in Scene Images Based on Character Recognition. 1872-1874 - Ilke Demir
:
A Holistic Framework for Addressing the World Using Machine Learning. 1875-1877 - Ivona Tautkute, Tomasz Trzcinski
, Adam Bielski:
I Know How You Feel: Emotion Recognition With Facial Landmarks. 1878-1880 - Jyoti Islam, Yanqing Zhang:
Early Diagnosis of Alzheimer's Disease: A Neuroimaging Study With Deep Learning Architectures. 1881-1883 - Kanami Yamagishi, Shintaro Yamamoto, Takuya Kato, Shigeo Morishima
:
Cosmetic Features Extraction by a Single Image Makeup Decomposition. 1884-1886 - Ksenia Bittner, Marco Körner:
Automatic Large-Scale 3D Building Shape Refinement Using Conditional Generative Adversarial Networks. 1887-1889 - Marcella Cornia
, Lorenzo Baraldi
, Giuseppe Serra, Rita Cucchiara:
SAM: Pushing the Limits of Saliency Prediction Models. 1890-1892 - Meng Zheng, Srikrishna Karanam, Richard J. Radke
:
RPIfield: A New Dataset for Temporally Evaluating Person Re-Identification. 1893-1895 - Mikayla Timm, Subhransu Maji, Todd Fuller:
Large-Scale Ecological Analyses of Animals in the Wild Using Computer Vision. 1896-1898 - Murium Iqbal, Adair Kovac, Kamelia Aryafar:
Discovering Style Trends Through Deep Visually Aware Latent Item Embeddings. 1899-1901 - Nezihe Merve Gürel:
Towards More Accurate Radio Telescope Images. 1902-1904 - Sima Behpour:
ARC: Adversarial Robust Cuts for Semi-Supervised and Multi-Label Classification. 1905-1907
Mutual Benefits of Cognitive and Computer Vision: How Can We Use One to Understand the Other?
- Vandit Gajjar, Yash Khandhediya, Ayesha Gurnani, Viraj Mavani, Mehul S. Raval
:
ViS-HuD: Using Visual Saliency to Improve Human Detection With Convolutional Neural Networks. 1908-1916 - Masaki Nakada, Honglin Chen, Demetri Terzopoulos:
Learning Biomimetic Perception for Human Sensorimotor Control. 1917-1922 - Hossein Hosseini, Baicen Xiao, Mayoore Jaiswal, Radha Poovendran
:
Assessing Shape Bias Property of Convolutional Neural Networks. 1923-1931 - Hossein Adeli, Gregory J. Zelinsky:
Deep-BCN: Deep Networks Meet Biased Competition to Create a Brain-Inspired Model of Attention Control. 1932-1942 - Mahmoud Khademi, Oliver Schulte:
Image Caption Generation With Hierarchical Contextual Visual Spatial Attention. 1943-1951 - Ravi Kant Kumar, Jogendra Garain, Dakshina Ranjan Kisku
, Goutam Sanyal:
Estimating Attention of Faces Due to Its Growing Level of Emotions. 1952-1960 - Amir Rosenfeld, Markus D. Solbach, John K. Tsotsos
:
Totally Looks Like - How Humans Compare, Compared to Machines. 1961-1964 - Lin Qi, Ying Xu, Xiaowei Shang, Junyu Dong:
Fusing Visual Saliency for Material Recognition. 1965-1968 - Mikhail Startsev, Michael Dorr
:
Increasing Video Saliency Model Generalizability by Training for Smooth Pursuit Prediction. 1969-1972 - Katerina Malakhova:
Representation of Categories in Filters of Deep Neural Networks. 1973-1975 - Tian Xu, Oliver G. B. Garrod, H. Steven Scholte, Robin A. A. Ince, Philippe G. Schyns
:
Using Psychophysical Methods to Understand Mechanisms of Face Identification in a Deep Neural Network. 1976-1984 - Tao Tu, Jonathan Koss, Paul Sajda:
Relating Deep Neural Network Representations to EEG-fMRI Spatiotemporal Dynamics in a Perceptual Decision-Making Task. 1985-1991 - Akram Bayat, Do Hyong Koh
, Anubhaw Kumar Nand, Marta Pereira, Marc Pomplun:
Scene Grammar in Human and Machine Recognition of Objects and Scenes. 1992-1999 - Petros Koutras
, Georgia Panagiotaropoulou
, Antigoni Tsiami
, Petros Maragos:
Audio-Visual Temporal Saliency Modeling Validated by fMRI Data. 2000-2010 - Amir Rosenfeld, Mahdi Biparva, John K. Tsotsos
:
Priming Neural Networks. 2011-2020
Real World Challenges and New Benchmarks for Deep Learning in Robotic Vision
- Xingchao Peng, Ben Usman, Neela Kaushik, Dequan Wang, Judy Hoffman
, Kate Saenko
:
VisDA: A Synthetic-to-Real Benchmark for Visual Domain Adaptation. 2021-2026 - Xavier Roynard, Jean-Emmanuel Deschaud, François Goulette:
Paris-Lille-3D: A Point Cloud Dataset for Urban Scene Segmentation and Classification. 2027-2030 - Tyler L. Hayes, Ronald Kemker, Nathan D. Cahill
, Christopher Kanan:
New Metrics and Experimental Paradigms for Continual Learning. 2031-2034 - Alan Wu, A. J. Piergiovanni, Michael S. Ryoo:
Action-Conditioned Convolutional Future Regression Models for Robot Imitation Learning. 2035-2037 - Jonathan Tremblay, Thang To, Stan Birchfield:
Falling Things: A Synthetic Dataset for 3D Object Detection and Pose Estimation. 2038-2041 - Deepak Pathak, Yide Shentu, Dian Chen, Pulkit Agrawal, Trevor Darrell, Sergey Levine, Jitendra Malik:
Learning Instance Segmentation by Interaction. 2042-2045 - Phil Ammirato, Alexander C. Berg, Jana Kosecka:
Active Vision Dataset Benchmark. 2046-2049 - Deepak Pathak, Parsa Mahmoudieh, Guanghao Luo, Pulkit Agrawal, Dian Chen, Yide Shentu, Evan Shelhamer, Jitendra Malik, Alexei A. Efros
, Trevor Darrell:
Zero-Shot Visual Imitation. 2050-2053 - Abhishek Das, Samyak Datta, Georgia Gkioxari, Stefan Lee, Devi Parikh, Dhruv Batra:
Embodied Question Answering. 2054-2063
Analysis and Modeling of Faces and Gestures
- Yuancheng Ye, Yingli Tian
, Matt Huenerfauth, Jingya Liu:
Recognizing American Sign Language Gestures From Within Continuous Videos. 2064-2073 - Nataniel Ruiz, Eunji Chong, James M. Rehg
:
Fine-Grained Head Pose Estimation Without Keypoints. 2074-2083 - Sveinn Palsson, Eirikur Agustsson, Radu Timofte
, Luc Van Gool:
Generative Adversarial Style Transfer Networks for Face Aging. 2084-2092 - Adam Kortylewski, Bernhard Egger, Andreas Schneider, Thomas Gerig, Andreas Morel-Forster, Thomas Vetter:
Empirically Analyzing the Effect of Dataset Biases on Deep Face Recognition Systems. 2093-2102 - Okan Köpüklü, Neslihan Kose
, Gerhard Rigoll:
Motion Fused Frames: Data Level Fusion Strategy for Hand Gesture Recognition. 2103-2111 - Jia Xue, Zibo Meng, Karthik Katipally, Haibo Wang, Kees van Zon:
Clothing Change Aware Person Identification. 2112-2120 - Chieh-Ming Kuo, Shang-Hong Lai
, Michel Sarkis:
A Compact Deep Learning Model for Robust Facial Expression Recognition. 2121-2129 - Itir Önal Ertugrul, László A. Jeni, Jeffrey F. Cohn:
FACSCaps: Pose-Independent Facial Action Coding With Capsules. 2130-2139 - Daksha Yadav, Naman Kohli, Ekampreet Kalsi, Mayank Vatsa
, Richa Singh
, Afzel Noore:
Unraveling Human Perception of Facial Aging Using Eye Gaze. 2140-2147 - Dário Augusto Borges Oliveira
, Andréa Britto Mattos, Edmilson Da Silva Morais:
Improving Viseme Recognition Using GAN-Based Frontal View Mapping. 2148-2155 - Rajeev Ranjan, Shalini De Mello, Jan Kautz:
Light-Weight Head Pose Invariant Gaze Tracking. 2156-2164 - Esube Bekele, Wallace E. Lawson, Zachary Horne
, Sangeet Khemlani:
Implementing a Robust Explanatory Bias in a Person Re-Identification Network. 2165-2172 - Puspita Majumdar, Saheb Chhabra, Richa Singh
, Mayank Vatsa
:
On Detecting Domestic Abuse via Faces. 2173-2179
Vision With Biased or Scarce Data
- Maren Awiszus
, Bodo Rosenhahn:
Markov Chain Neural Networks. 2180-2187 - Ashish Mishra, M. Shiva Krishna Reddy, Anurag Mittal, Hema A. Murthy:
A Generative Model for Zero Shot Learning Using Conditional Variational Autoencoders. 2188-2196 - Liang Qiu, Hongliang Ren:
Endoscope Navigation and 3D Reconstruction of Oral Cavity by Visual SLAM With Mitigated Data Scarcity. 2197-2204
Computer Vision for Microscopy Image Analysis
- Yuki Hiramatsu, Kazuhiro Hotta, Ayako Imanishi, Michiyuki Matsuda, Kenta Terai
:
Cell Image Segmentation by Integrating Multiple CNNs. 2205-2211 - Dongnan Liu
, Donghao Zhang
, Yang Song
, Chaoyi Zhang, Heng Huang, Mei Chen, Weidong Cai
:
Large Kernel Refine Fusion Net for Neuron Membrane Segmentation. 2212-2220 - Chichen Fu, Soonam Lee
, David Joon Ho
, Shuo Han, Paul Salama
, Kenneth W. Dunn, Edward J. Delp
:
Three Dimensional Fluorescence Microscopy Image Synthesis and Segmentation. 2221-2229 - Abdul Aziz, Harshit Pande, Bharath Cheluvaraju, Tathagato Rai Dastidar:
Improved Extraction of Objects From Urine Microscopy Images With Unsupervised Thresholding and Supervised U-Net Techniques. 2230-2238 - Mina Khoshdeli, Garrett Winkelmaier, Bahram Parvin:
Multilayer Encoder-Decoder Network for 3D Nuclear Segmentation in Spheroid Models of Human Mammary Epithelial Cell Lines. 2239-2245 - Cheng Yang, Haowen Ma, Xu Cao, Xia Hua, Xiaofeng Bu, Limin Zhang, Tao Yue, Feng Yan:
Resolution-Enhanced Lensless Color Shadow Imaging Microscopy Based on Large Field-of-View Submicron-Pixel Imaging Sensors. 2246-2253 - Vibha Gupta, Arnav Bhavsar
:
Sequential Modeling of Deep Features for Breast Cancer Histopathological Image Classification. 2254-2261 - Romain Mormont, Pierre Geurts, Raphaël Marée:
Comparison of Deep Transfer Learning Strategies for Digital Pathology. 2262-2271 - Alexandr A. Kalinin
, Ari Allyn-Feuer, Alexander S. Ade, Gordon-Victor Fon, Walter Meixner, David Dilworth, Jeffrey R. de Wet, Gerald A. Higgins
, Gen Zheng, Amy Creekmore, John W. Wiley, James E. Verdone, Robert W. Veltri, Kenneth J. Pienta, Donald S. Coffey, Brian D. Athey, Ivo D. Dinov
:
3D Cell Nuclear Morphology: Microscopy Imaging Dataset and Voxel-Based Morphometry Classification Results. 2272-2280 - Sreetama Basu, Elton Rexhepaj, Nathalie Spassky, Auguste Genovesio, Rasmus Reinhold Paulsen, A. S. M. Shihavuddin
:
FastSME: Faster and Smoother Manifold Extraction From 3D Stack. 2281-2289 - Shahira Abousamra, Shai Adar, Natalie Elia, Roy Shilkrot:
Localization and Tracking in 4D Fluorescence Microscopy Imagery. 2290-2298 - Karan Dewan, Tathagato Rai Dastidar, Maroof Ahmad:
Estimation of Sperm Concentration and Total Motility From Microscopic Videos of Human Semen Samples. 2299-2306
Computational Models for Learning Systems and Educational Assessment
- Vijay Rowtula, Varun Bhargavan, Mohan Kumar, C. V. Jawahar:
Scaling Handwritten Student Assessments With a Document Image Workflow System. 2307-2314 - Ömer Sümer, Patricia Goldberg, Kathleen Stürmer, Tina Seidel, Peter Gerjets, Ulrich Trautwein, Enkelejda Kasneci:
Teachers' Perception in the Classroom. 2315-2324
Visual Understanding of Subjective Attributes of Data
- Bo Pang, Kaiwen Zha, Cewu Lu:
Human Action Adverb Recognition: ADHA Dataset and a Three-Stream Hybrid Model. 2325-2334 - Adam Bielski, Tomasz Trzcinski
:
Pay Attention to Virality: Understanding Popularity of Social Media Videos With the Attention Mechanism. 2335-2337 - Eli Alshan, Sharon Alpert, Assaf Neuberger, Nathaniel Bubis, Eduard Oks:
Learning Fashion by Simulated Human Supervision. 2338-2344 - Amir Sadovnik
, Wassim Gharbi, Thanh Vu, Andrew C. Gallagher:
Finding Your Lookalike: Measuring Face Similarity Rather Than Face Identity. 2345-2353 - Dario Dotti, Mirela Popa, Stylianos Asteriadis
:
Behavior and Personality Analysis in a Nonsocial Context Dataset. 2354-2362 - Gülcan Can, Yassir Benkhedda, Daniel Gatica-Perez
:
Ambiance in Social Media Venues: Visual Cue Interpretation by Machines and Crowds. 2363-2372 - Albert Clapés
, Ozan Bilici, Dariia Temirova, Egils Avots, Gholamreza Anbarjafari, Sergio Escalera
:
From Apparent to Real Age: Gender, Age, Ethnic, Makeup, and Expression Bias Analysis in Real Age Estimation. 2373-2382
Sight and Sound
- Ruohan Gao, Rogério Schmidt Feris, Kristen Grauman:
Learning to Separate Object Sounds by Watching Unlabeled Video. 2496-2499 - Yipin Zhou, Zhaowen Wang, Chen Fang, Trung Bui, Tamara L. Berg:
Visual to Sound: Generating Natural Sound for Videos in the Wild. 2500-2503 - Vinicius Signori Furlan, Ruzena Bajcsy, Erickson R. Nascimento:
Fast Forwarding Egocentric Videos by Listening and Watching. 2504-2507 - Arda Senocak, Tae-Hyun Oh, Junsik Kim, Ming-Hsuan Yang, In So Kweon:
On Learning Association of Sound Source and Visual Scenes. 2508-2509 - Yue Qiu, Hirokatsu Kataoka:
Image Generation Associated With Music Data. 2510-2513 - Herman Kamper, Gregory Shakhnarovich, Karen Livescu:
Semantic Speech Retrieval With a Visually Grounded Model of Untranscribed Speech. 2514-2517 - Sanjeel Parekh, Slim Essid, Alexey Ozerov, Ngoc Q. K. Duong, Patrick Pérez, Gaël Richard:
Weakly Supervised Representation Learning for Unsynchronized Audio-Visual Events. 2518-2519 - Michele Merler, Dhiraj Joshi, Khoi-Nguyen C. Mac, Quoc-Bao Nguyen, Stephen Hammer, John Kent, Jinjun Xiong, Minh N. Do, John R. Smith, Rogério Schmidt Feris:
The Excitement of Sports: Automatic Highlights Using Audio/Visual Cues. 2520-2523 - Tawfiq Salem, Menghua Zhai, Scott Workman, Nathan Jacobs:
A Multimodal Approach to Mapping Soundscapes. 2524-2527 - Chiori Hori, Takaaki Hori, Gordon Wichern, Jue Wang, Teng-Yok Lee, Anoop Cherian, Tim K. Marks:
Multimodal Attention for Fusion of Audio and Spatiotemporal Features for Video Description. 2528-2531 - Abe Davis, Maneesh Agrawala:
Visual Rhythm and Beat. 2532-2535 - Zhoutong Zhang, Jiajun Wu, Qiujia Li, Zhengjia Huang, Joshua B. Tenenbaum, William T. Freeman:
Inverting Audio-Visual Simulation for Shape and Material Perception. 2536-2538
Workshop and Challenge on Learned Image Compression
- David Alexandre, Chih-Peng Chang, Wen-Hsiao Peng, Hsueh-Ming Hang:
An Autoencoder-based Learned Image Compressor: Description of Challenge Proposal by NCTU. 2539-2542 - Ming Li, Jianhua Hu, Changsheng Xia, Yundong Zhang:
An Implementation of Picture Compression with A CNN-based Auto-encoder. 2543-2546 - Alekh Karkada Ashok, Nagaraju Palani:
Autoencoders with Variable Sized Latent Vector for Image Compression. 2547-2550 - Çaglar Aytekin, Xingyang Ni, Francesco Cricri, Jani Lainema, Emre Aksu, Miska M. Hannuksela:
Block-optimized Variable Bit Rate Neural Image Compression. 2551-2554 - Danial Maleki, Soheila Nadalian, Mohammad Mahdi Derakhshani, Mohammad Amin Sadeghi:
BlockCNN: A Deep Network for Artifact Removal and Image Compression. 2555-2558 - Zhenzhong Chen, Yiming Li, Feiyang Liu, Zizheng Liu, Xiang Pan, Wanjie Sun, Yingbin Wang, Yan Zhou, Han Zhu, Shan Liu:
CNN-Optimized Image Compression with Uncertainty based Resource Allocation. 2559-2562 - Jianhua Hu, Ming Li, Changsheng Xia, Yundong Zhang:
Combine Traditional Compression Method With Convolutional Neural Networks. 2563-2566 - Zhimin Tang, Linkai Luo:
Compression artifact removal using multi-scale reshuffling convolutional network. 2567-2570 - Kai Cui, Eckehard G. Steinbach:
Decoder Side Image Quality Enhancement exploiting Inter-channel Correlation in a 3-stage CNN: Submission to CLIC 2018. 2571-2574 - Haojie Liu, Tong Chen, Qiu Shen, Tao Yue, Zhan Ma:
Deep Image Compression via End-to-End Learning. 2575-2578 - Dang-Khoa Le Tan, Huu Le, Tuan Hoang, Thanh-Toan Do, Ngai-Man Cheung:
DeepVQ: A Deep Network Architecture for Vector Quantization. 2579-2582 - Tamar Rott Shaham, Tomer Michaeli:
Deformation Aware Image Compression. 2583-2586 - Eirikur Agustsson, Michael Tschannen, Fabian Mentzer, Radu Timofte, Luc Van Gool:
Extreme Learned Image Compression with GANs. 2587-2590 - Aupendu Kar, Sri Phani Krishna Karri, Nirmalya Ghosh, Ramanathan Sethuraman, Debdoot Sheet:
Fully Convolutional Model for Variable Bit Length and Lossy High Density Compression of Mammograms. 2591-2594 - Jonatan Samuelsson, Per Hermansson:
Image compression with xvc. 2595-2597 - Mario González, Javier Preciozzi, Pablo Musé, Andrés Almansa:
Joint denoising and decompression using CNN regularization. 2598-2601 - Ogun Kirmemis, Gonca Bakar, A. Murat Tekalp:
Learned Compression Artifact Removal by Deep Residual Networks. 2602-2605 - Yu-Chuan Su, Kristen Grauman:
Learning Compressible 360deg Video Isomers. 2606-2609 - Eli Ben-David, Sharon Carmel, Boris Filippov, Dror Gill, Alexey Martemyanov, Tamar Shoham, Nikolay Terterov, Pavel Tiktov, Tom Vaughan, Alexander Zheludkov:
Perceptually optimized low bit-rate image encoding. 2610-2612 - Zhengxue Cheng, Heming Sun, Masaru Takeuchi, Jiro Katto:
Performance Comparison of Convolutional AutoEncoders, Generative Adversarial Networks and Super-Resolution for Image Compression. 2613-2616 - Lei Zhou, Chunlei Cai, Yue Gao, Sanbao Su, Junmin Wu:
Variational Autoencoder for Low Bit-rate Image Compression. 2617-2620 - Yuchen Fan, Jiahui Yu, Thomas S. Huang:
Wide-activated Deep Residual Networks based Restoration for BPG-compressed Images. 2621-2624 - Dong Wei, Mei Yang:
YASO. 2625-2628

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.