| Scale-Aware Fast R-CNN for Pedestrian Detection |
99 |
| Arbitrary-Oriented Scene Text Detection via Rotation Proposals |
90 |
| A Fast Uyghur Text Detector for Complex Background Images |
63 |
| Cross-Modality Bridging and Knowledge Transferring for Image Understanding |
57 |
| Two-Stream 3-D convNet Fusion for Action Recognition in Videos With Arbitrary Size and Length |
50 |
| Where-and-When to Look: Deep Siamese Attention Networks for Video-Based Person Re-Identification |
49 |
| Deep Learning for Single Image Super-Resolution: A Brief Review |
46 |
| Hybrid Deep-Learning-Based Anomaly Detection Scheme for Suspicious Flow Detection in SDN: A Social Multimedia Perspective |
42 |
| Speech Emotion Recognition Using Deep Convolutional Neural Network and Discriminant Temporal Pyramid Matching |
40 |
| CCL: Cross-modal Correlation Learning With Multigrained Fusion by Hierarchical Network |
33 |
| Blind Quality Assessment Based on Pseudo-Reference Image |
33 |
| Content Popularity Prediction Towards Location-Aware Mobile Edge Caching |
32 |
| PROVID: Progressive and Multimodal Vehicle Reidentification for Large-Scale Urban Surveillance |
31 |
| Edge Computing Framework for Cooperative Video Processing in Multimedia IoT Systems |
30 |
| Group-Sensitive Triplet Embedding for Vehicle Reidentification |
28 |
| Cache Less for More: Exploiting Cooperative Video Caching and Delivery in D2D Communications |
25 |
| QoE-Driven Mobile Edge Caching Placement for Adaptive Video Streaming |
25 |
| Exploiting Recurrent Neural Networks and Leap Motion Controller for the Recognition of Sign Language and Semaphoric Hand Gestures |
21 |
| Spatio-Temporal Saliency Networks for Dynamic Saliency Prediction |
21 |
| A Two-Stage Clustering Based 3D Visual Saliency Model for Dynamic Scenarios |
21 |
| AENet: Learning Deep Audio Features for Video Analysis |
21 |
| Single Image Dehazing Using Ranking Convolutional Neural Network |
20 |
| Parallax-Tolerant Image Stitching Based on Robust Elastic Warping |
20 |
| Unified Spatio-Temporal Attention Networks for Action Recognition in Videos |
20 |
| Fusing Geometric Features for Skeleton-Based Action Recognition Using Multilayer LSTM Networks |
20 |
| Joint Optimization of Radio and Virtual Machine Resources With Uncertain User Demands in Mobile Cloud Computing |
20 |
| Separable and Reversible Data Hiding in Encrypted Images Using Parametric Binary Tree Labeling |
20 |
| Depth Pooling Based Large-Scale 3-D Action Recognition With Convolutional Neural Networks |
19 |
| Optimizing Multistage Discriminative Dictionaries for Blind Image Quality Assessment |
19 |
| Multisensor Image Fusion and Enhancement in Spectral Total Variation Domain |
19 |
| GLA: Global-Local Attention for Image Description |
19 |
| Robust Coverless Image Steganography Based on DCT and LDA Topic Classification |
19 |
| Deep Alignment Network Based Multi-Person Tracking With Occlusion and Motion Reasoning |
18 |
| Distribution-Oriented Aesthetics Assessment With Semantic-Aware Hybrid Network |
18 |
| Quality Evaluation of Image Dehazing Methods Using Synthetic Hazy Images |
18 |
| Visual Sentiment Prediction Based on Automatic Discovery of Affective Regions |
18 |
| Facial Expression Recognition Using Hierarchical Features With Deep Comprehensive Multipatches Aggregation Convolutional Neural Networks |
18 |
| Sparse Coding Guided Spatiotemporal Feature Learning for Abnormal Event Detection in Large Videos |
18 |
| EgoGesture: A New Dataset and Benchmark for Egocentric Hand Gesture Recognition |
17 |
| Predicting Visual Features From Text for Image and Video Caption Retrieval |
17 |
| GLAD: Global-Local-Alignment Descriptor for Scalable Person Re-Identification |
17 |
| Quality Assessment of DIBR-Synthesized Images by Measuring Local Geometric Distortions and Global Sharpness |
17 |
| Bilevel Feature Learning for Video Saliency Detection |
16 |
| Reduced-Reference Image Quality Assessment in Free-Energy Principle and Sparse Representation |
15 |
| SeaShips: A Large-Scale Precisely Annotated Dataset for Ship Detection |
15 |
| Learning Multi-View Representation With LSTM for 3-D Shape Recognition and Retrieval |
15 |
| Stylized Aesthetic QR Code |
15 |
| HSCS: Hierarchical Sparsity Based Co-saliency Detection for RGBD Images |
14 |
| Learning a Joint Affinity Graph for Multiview Subspace Clustering |
14 |
| Improving Video Saliency Detection via Localized Estimation and Spatiotemporal Refinement |
14 |