ECCV2020将于2020年8月23-28日在线上举行,今年共接受了1361篇论文,本文是接收论列表的第二部分,第一部见链接
Paper ID | Paper Title | Category |
---|---|---|
2515 | Temporal Aggregate Representations for Long Term Video Understanding | Poster |
2527 | Stochastic Fine-grained Labeling of Multi-state Sign Glosses for Continuous Sign Language Recognition | Poster |
2530 | General 3D Room Layout from a Single View by Render-and-Compare | Poster |
2532 | Neural Dense Non-Rigid Structure from Motion with Latent Space Constraints | Poster |
2535 | Multimodal Memorability: Modeling Effects of Semantics and Decay on Video Memorability | Poster |
2538 | Yet Another Intermediate-Level Attack | Poster |
2540 | Topology-Change-Aware Volumetric Fusion for Dynamic Scene Reconstruction | Poster |
2544 | Early Exit Or Not: Resource-Efficient Blind Quality Enhancement for Compressed Images | Poster |
2547 | PatchNets: Patch-based Generalizable Deep Implicit 3D Shape Representations | Poster |
2548 | How does Lipschitz Regularization Influence GAN Training? | Poster |
2550 | Infrastructure-based Multi-Camera Calibration using Radial Projections | Poster |
2553 | MotionSqueeze: Neural Motion Feature Learning for Video Understanding | Poster |
2559 | Polarized optical-flow gyroscope | Poster |
2561 | Online Meta-Learning for Multi-Source and Semi-Supervised Domain Adaptation | Poster |
2562 | An Ensemble of Epoch-wise Empirical Bayes for Few-shot Learning | Poster |
2568 | On the Effectiveness of Image Rotation for Open Set Domain Adapation | Poster |
2569 | Combining Task Predictors via Enhancing Joint Predictability | Poster |
2581 | Multi-Scale Positive Sample Refinement for Few-Shot Object Detection | Poster |
2583 | Single-Image Depth Prediction Makes Feature Matching Easier | Poster |
2586 | Deep Reinforced Attention Learning for Quality-Aware Visual Recognition | Poster |
2588 | CFAD: Coarse-to-Fine Action Detector for Spatiotemporal Action Localization | Poster |
2590 | Learning Joint Spatial-Temporal Transformations for Video Inpainting | Poster |
2593 | Single Path One-Shot Neural Architecture Search with Uniform Sampling | Poster |
2595 | Learning to Generate Novel Domains for Domain Generalization | Poster |
2599 | Continuous Adaptation for Interactive Object Segmentation by Learning from Corrections | Poster |
2601 | Impact of base dataset design on few-shot image classification | Poster |
2605 | Invertible Zero-Shot Recognition Flows | Poster |
2606 | GeoLayout: Geometry Driven Room Layout Estimation Based on Depth Maps of Planes | Poster |
2607 | Location Sensitive Image Retrieval and Tagging | Poster |
2608 | Joint 3D Layout and Depth Prediction from a Single Indoor Panorama Image | Poster |
2612 | Guessing State Tracking for Visual Dialogue | Poster |
2614 | Memory-Efficient Incremental Learning Through Feature Adaptation | Poster |
2619 | Neural Voice Puppetry: Audio-Driven Facial Reenactment | Poster |
2621 | One-Shot Unsupervised Cross-Domain Detection | Poster |
2629 | Stochastic Frequency Masking to Improve Super-Resolution and Denoising Networks | Poster |
2630 | Probabilistic Future Prediction for Video Scene Understanding | Poster |
2633 | Suppressing Mislabeled Data via Grouping and Self-Attention | Poster |
2638 | Class-wise Dynamic Graph Convolution for Semantic Segmentation | Poster |
2639 | Character-Preserving Coherent Story Visualization | Poster |
2640 | GINet: Graph Interaction Network for Scene Parsing | Poster |
2662 | Tensor Low-Rank Reconstruction for Semantic Segmentation | Poster |
2668 | Attentive Normalization | Poster |
2678 | Count- and Similarity-aware Pedestrian Detection | Poster |
2682 | TRADI: Tracking deep neural network weight distributions | Poster |
2686 | Spatiotemporal Attacks for Embodied Agents | Poster |
2697 | Caption-Supervised Face Recognition: Training a State-of-the-Art Face Model without Manual Annotation | Poster |
2701 | Unselfie: Translating Selfies to Neutral-pose Portraits in the Wild | Poster |
2709 | Design and Interpretation of Universal Adversarial Patches in Face Detection | Poster |
2712 | Few-Shot Object Detection and Viewpoint Estimation for Objects in the Wild | Poster |
2715 | Weakly Supervised 3D Hand Pose Estimation via Biomechanical Constraints | Poster |
2716 | Dynamic Dual-Attentive Aggregation Learning for Visible-Infrared Person Re-Identification | Poster |
2718 | Contextual Heterogeneous Graph for Human-Object Interaction Detection | Poster |
2721 | Zero-Shot Image Super-Resolution with Depth Guided Internal Degradation Learning | Poster |
2724 | A Closest Point Proposal for MCMC-based Probabilistic Surface Registration | Poster |
2729 | Interactive Video Object Segmentation Using Global and Local Transfer Modules | Poster |
2749 | End-to-end interpretable learning of non-blind image deblurring | Poster |
2756 | Employing Multi-Estimations for Weakly-Supervised Semantic Segmentation | Poster |
2760 | Learning Noise-Aware Encoder-Decoder from Noisy Labels by Alternating Back-Propagation for Saliency Detection | Poster |
2761 | Rethinking Image Deraining via Rain Streaks and Vapors | Poster |
2775 | Finding Non-Uniform Quantization Schemes using Multi-Task Gaussian Processes | Poster |
2781 | Is Sharing of Egocentric Video Giving Away Your Biometric Signature? | Poster |
2783 | Captioning Images for a Real Use Case | Poster |
2800 | Improving Semantic Segmentation via Decoupled Body and Edge Supervision | Poster |
2805 | Conditional Entropy Coding for Efficient Video Compression | Poster |
2810 | Differentiable Feature Aggregation Search for Knowledge Distillation | Poster |
2813 | Attention Guided Anomaly Localization in Images | Poster |
2819 | Self-supervised Video Representation Learning by Pace Reasoning | Poster |
2820 | Full-Body Awareness from Partial Observations | Poster |
2822 | Reinforced Axial Refinement Network for Monocular 3D Object Detection | Poster |
2830 | Self-Supervised Procedure Learning from Instructional Videos using DNNs | Poster |
2838 | Multi-view multi-object 6D pose estimation via robust scene consistency optimization | Poster |
2839 | In-Domain GAN Inversion for Real Image Editing | Poster |
2841 | Key Frame Proposal Network for Efficient Pose Estimation in Videos | Poster |
2844 | Exchangeable Deep Neural Networks for Set-to-Set Matching and Learning | Poster |
2861 | Making Sense of CNNs: Interpreting Deep Representations & Their Invariances with INNs | Poster |
2864 | Cross-Modal Weighting Network for RGB-D Salient Object Detection | Poster |
2865 | Open-set Adversarial Defense | Poster |
2866 | Deep Image Compression using Decoder Side Information | Poster |
2874 | Bridging the Sim-to-Real Gap: Unsupervised Learning of Scene Structure for Synthetic Data Generation | Poster |
2883 | L2 Norm: A Generic Visualization Approach for Convolutional Neural Networks | Poster |
2888 | Interactive Annotation of 3D Object Geometry using 2D Scribbles | Poster |
2889 | Hierarchical Kinematic Human Mesh Recovery | Poster |
2890 | Multi-Loss Rebalancing Algorithm for Monocular Depth Estimation | Poster |
2897 | 3D Bird Reconstruction: a Dataset, Model, and Shape Recovery from a Single View | Poster |
2903 | We Have So Much In Common: Modeling Semantic Relational Set Abstractions in Videos | Poster |
2908 | Joint Optimization for Multi-Person Shape Models from Markerless 3D-Scans | Poster |
2916 | Accurate RGB-D Salient Object Detection via Collaborative Learning | Poster |
2919 | Finding Your (3D) Center: 3D Object Detection Using a Learned Loss | Poster |
2920 | Collaborative Training between Region Proposal Localization and Classification for Domain Adaptive Object Detection | Poster |
2924 | Two-Stream Active Query Suggestion for Large-Scale Object Detection in Connectomics | Poster |
2941 | Pix2Surf: Learning Parametric 3D Surface Models of Objects from Images | Poster |
2942 | Continuous Multimodal 6D Camera Relocalization | Poster |
2943 | Modeling Artistic Workflows for Image Generation and Editing | Poster |
2945 | A Large-scale Annotated Mechanical Components Benchmark for Classification and Retrieval Tasks with Deep Neural Networks | Poster |
2946 | Hidden Footprints: Learning Contextual Walkability from 3D Human Trails | Poster |
2957 | Self-supervised learning of audio-visual objects from video | Poster |
2959 | GAN-based Garment Generation Using Sewing Pattern Images | Poster |
2962 | Style Transfer for Co-Speech Gesture Animation: A Multi-Speaker Conditional-Mixture Approach | Poster |
2966 | An LSTM Approach to Temporal 3D Object Detection in LiDAR Point Clouds | Poster |
2970 | Montonicity Prior for Cloud Tomography | Poster |
2971 | Learning Trailer Moments in Full-Length Movies with Co-Contrastive Attention | Poster |
2976 | Preserving Semantic Neighborhoods for Robust Cross-modal Retrieval | Poster |
2979 | Large-scale Pretraining for Visual Dialog: A Simple State-of-the-Art Baseline | Poster |
2981 | Learning to Generate Grounded Visual Captions without Localization Supervision | Poster |
2985 | Neural Hair Rendering | Poster |
2989 | JNR: Joint-based Neural Rig Representation for Compact 3D Face Modeling | Poster |
3004 | On Disentangling Spoof Traces for Generic Face Anti-Spoofing | Poster |
3005 | Streaming Object Detection for 3-D Point Clouds | Poster |
3006 | NAS-DIP: Learning Deep Image Prior with Neural Architecture Search | Poster |
3007 | Learning to Learn in a Semi-Supervised Fashion | Poster |
3009 | FeatMatch: Feature-Based Augmentation for Semi-Supervised Learning | Poster |
3017 | Exploiting Radar for Robust Perception of Dynamic Objects | Poster |
3023 | Seeing the Un-Scene: Learning Amodal Semantic Maps for Room Navigation | Poster |
3024 | Learning to Separate: Detecting Heavily-Occluded Objects in Urban Scenes | Poster |
3037 | Towards Causal Benchmarking of Algorithm Bias with Counterfactual Synthesis | Poster |
3039 | Learning and Memorizing Representative Prototypes for 3D Point Cloud Semantic and Instance Segmentation | Poster |
3056 | Knowledge-Based Video Question Answering with Unsupervised Scene Descriptions | Poster |
3066 | Transformation Consistency Regularization - A Semi-Supervised Paradigm for Image-to-Image Translation | Poster |
3072 | LIRA: Lifelong Image Restoration from Unknown Blended Distortions | Poster |
3074 | HDNet: Human Depth Estimation for Multi-Person Camera-Space Localization | Poster |
3082 | SOLO: Segmenting Objects by Locations | Poster |
3093 | Learning to See in the Dark with Events | Poster |
3094 | Trajectron++: Dynamically-Feasible Trajectory Forecasting With Heterogeneous Data | Poster |
3098 | Context-Gated Convolution | Poster |
3100 | Polynomial Regression Network for Variable-Number Lane Detection | Poster |
3108 | Structural Deep Metric Learning for Room Layout Estimation | Poster |
3122 | Adaptive Task Sampling for Meta-Learning | Poster |
3124 | Deep Complementary Joint Model for Complex Scene Registration and Few-shot Segmentation on Medical Images | Poster |
3128 | Improving Multispectral Pedestrian Detection by Addressing Modality Imbalance Problems | Poster |
3135 | High-Resolution Image Inpainting with Iterative Confidence Feedback and Guided Upsampling | Poster |
3136 | Online Ensemble Model Compression using Knowledge Distillation | Poster |
3137 | Deep Learning-based Pupil Center Detection for Fast and Accurate Eye Tracking System | Poster |
3149 | Efficient Residue Number System Based Winograd Convolution | Poster |
3150 | Robust Tracking against Adversarial Attacks | Poster |
3151 | Single-Shot Neural Relighting and SVBRDF Estimation | Poster |
3152 | Unsupervised Human 3D Pose Representation with Viewpoint and Pose Disentanglement | Poster |
3155 | Angle-based Search Space Shrinking for Neural Architecture Search | Poster |
3160 | RobustScanner: Dynamically Enhancing Positional Clues for Robust Text Recognition | Poster |
3162 | Towards Fast, Accurate and Stable 3D Dense Face Alignment | Poster |
3170 | Iterative Feature Transformation for Fast and Versatile Universal Style Transfer | Poster |
3177 | CATCH: Context-based Meta Reinforcement Learning for Transferrable Architecture Search | Poster |
3182 | Toward Faster and Simpler Matrix Normalization via Rank-1 Update | Poster |
3186 | Accurate polarimetric BRDF for real polarization scene rendering | Poster |
3188 | Lensless Imaging with Focusing Sparse URA Masks in Long-Wave Infrared and its Application for Human Detection | Poster |
3190 | Topology-Preserving Class-Incremental Learning | Poster |
3199 | Inter-Image Communication for Weakly Supervised Localization | Poster |
3205 | UFO$^2$: A Unified Framework Towards Omni-supervised Object Detection | Poster |
3215 | iCaps: An Interpretable Classifier via Disentangled Capsule Networks | Poster |
3220 | Detecting natural disasters, damage, and incidents in the wild | Poster |
3223 | Dynamic ReLU | Poster |
3224 | Acquiring Dynamic Light Fields through Coded Aperture Camera | Poster |
3238 | Gait Recognition from a Single Image using a Phase-Aware Gait Cycle Reconstruction Network | Poster |
3240 | Informative Sample Mining Network for Multi-Domain Image-to-Image Translation | Poster |
3242 | Spherical Feature Transform for Deep Metric Learning | Poster |
3245 | Semantic Equivalent Adversarial Data Augmentation for Visual Question Answering | Poster |
3254 | Unsupervised Multi-View CNN for Salient View Selection of 3D Objects and Scenes | Poster |
3266 | FDTS: Fast Diverse-Transformation Search for Object Detection and Beyond | Poster |
3268 | Peeking into occluded joints: A novel framework for crowd pose estimation | Poster |
3271 | RubiksNet: Learnable 3D-Shift for Efficient Video Action Recognition | Poster |
3281 | Deep Hashing with Active Pairwise Supervision | Poster |
3293 | Graph Edit Distance Reward: Learning to Edit Scene Graph | Poster |
3295 | Malleable 2.5D Convolution: Learning Receptive Fields along the Depth-axis for RGB-D Scene Parsing | Poster |
3301 | Feature-metric Loss for Self-supervised Learning of Depth and Egomotion | Poster |
3304 | Propagating Over Phrase Relations for One-Stage Visual Grounding | Poster |
3307 | Adversarial Semantic Data Augmentation for Human Pose Estimation | Poster |
3314 | Deep Novel View Synthesis from Unstructured Input | Poster |
3315 | Face Anti-Spoofing via disentangled representation learning | Poster |
3317 | Prime-Aware Adaptive Distillation | Poster |
3318 | Meta-Learning with Network Pruning | Poster |
3323 | Spiral Generative Network for Image Extrapolation | Poster |
3324 | Scene Sketcher: Fine-grained Image Retrieval with Scene Sketch | Poster |
3337 | Few-shot Compositional Font Generation with Dual Memory | Poster |
3338 | PUGeo-Net: A Geometry-centric Network for 3D Point Cloud Upsampling | Poster |
3339 | Content-aware Video Summarization | Poster |
3348 | Handcrafted Outlier Detection Revisited | Poster |
3359 | The Average Mixing Kernel Signature | Poster |
3361 | BCNet: Learning Body and Cloth Shape from A Single Image | Poster |
3372 | Self-supervised Keypoint Correspondences for Multi-Person Pose Estimation and Tracking in Videos | Poster |
3375 | Interactive Multi-Dimension Modulation with Dynamic Controllable Residual Learning for Image Restoration | Poster |
3382 | Polysemy Deciphering Network for Human-Object Interaction Detection | Poster |
3384 | Small-Task Incremental Learning | Poster |
3386 | Learning Graph-Convolutional Representations for Point Cloud Denoising | Poster |
3397 | Semantic Line Detection Using Mirror Attention and Comparative Ranking and Matching | Poster |
3398 | A Differentiable Recurrent Surface for Asynchronous Event-Based Data | Poster |
3399 | Fine-Grained Visual Classification via Progressive Multi-Granularity Training of Jigsaw Patches | Poster |
3400 | LiteFlowNet3: Resolving Correspondence Ambiguity for More Accurate Optical Flow Estimation | Poster |
3405 | Microscopy Image Restoration with Deep Wiener-Kolmogorov filters | Poster |
3408 | ScanRefer: 3D Object Localization in RGB-D Scans using Natural Language | Poster |
3411 | JSENet: Joint Semantic Segmentation and Edge Detection Network for 3D Point Clouds | Poster |
3412 | Motion-Excited Sampler: Video Adversarial Attack with Sparked Prior | Poster |
3414 | An Inference Algorithm for Multi-Label MRF-MAP Problems with Clique Size 100 | Poster |
3425 | Dual refinement underwater object detection network | Poster |
3429 | Learning to Visually Localize Multiple Sound Sources via A Two-stage Manner | Poster |
3457 | Task-Aware Quantization Network for JPEG Image Compression | Poster |
3472 | Learning Deep Conditional Target Densities for Accurate Regression | Poster |
3478 | CLOTH3D: Clothed 3D Humans | Poster |
3484 | Encoding Structure-Texture Relation with P-Net for Anomaly Detection in Retinal Images | Poster |
3485 | CLNet: A Compact Latent Network for Fast Adjusting Siamese Tracker | Poster |
3488 | Occlusion-Aware Siamese Network for Human Pose Estimation | Poster |
3492 | Learning to Predict Salient Faces: A Novel Audio-Visual Saliency Model | Poster |
3495 | NormalGAN: Learning Detailed 3D Human from a Single RGB-D Image | Poster |
3498 | Model-based disentanglement of lens occlusions | Poster |
3506 | Rotation-robust Intersection over Union for 3D Object Detection | Poster |
3508 | New Threats against Object Detector with Non-Local Block | Poster |
3516 | Self-Supervised CycleGAN for Object-Preserving Image-to-Image Domain Adaptation | Poster |
3533 | On the Usage of the Trifocal Tensor in Motion Segmentation | Poster |
3539 | 3D-Rotation-Equivariant Quaternion Neural Networks | Poster |
3540 | InterHand2.6M: A New Large-scale Dataset and Baseline for 3D Single and Interacting Hand Pose Estimation from a Single RGB Image | Poster |
3548 | Active Crowd Counting with Limited Supervision | Poster |
3551 | Self-Supervised Monocular Depth Estimation: Solving the Dynamic Object Problem by Semantic Guidance | Poster |
3563 | Hierarchical Visual-Textual Graph for Temporal Activity Localization via Language | Poster |
3568 | Do Not Mask What You Do Not Need to Mask: a Parser Free Virtual Try-On | Poster |
3577 | NODIS: Neural Ordinary Differential Scene Understanding | Poster |
3586 | Assembling Modality Representations via Attention Connections | Poster |
3588 | Learning Propagation Rules for Attribution Map Generation | Poster |
3590 | Reparameterizing Convolutions for Incremental Multi-Task Learning Without Task Interference | Poster |
3606 | Learning Predictive Models from Observation and Interaction | Poster |
3607 | Unifying Deep Local and Global Features for Image Search | Poster |
3610 | Human Body Model Fitting by Learned Gradient Descent | Poster |
3611 | DDGCN: A Dynamic Directed Graph Convolutional Network for Action Recognition | Poster |
3615 | Learning latent representions across multiple data domains using Lifelong VAEGAN | Poster |
3620 | DVI: Depth Guided Video Inpainting for Autonomous Driving | Poster |
3627 | Incorporating Reinforced Adversarial Learning in Autoregressive Image Generation | Poster |
3632 | APRICOT: A Dataset of Physical Adversarial Attacks on Object Detection | Poster |
3640 | Visual Question Answering on Image Sets | Poster |
3643 | Object as Hotspots: An Anchor-Free 3D Object Detection Approach via Firing of Hotspots | Poster |
3644 | Placepedia: Comprehensive Place Understanding with Multi-Faceted Annotations | Poster |
3649 | Depth Estimation by Learning Triangulation and Densification of Sparse Points for Multi-view Stereo | Poster |
3654 | Dynamic Low-light Imaging with Quanta Image Sensors | Poster |
3668 | Disambiguating Monocular Depth Estimation with a Single Transient | Poster |
3672 | DSDNet: Deep Structured self-Driving Network | Poster |
3679 | QUEST: Quantized embedding space for transferring knowledge | Poster |
3685 | EGDCL: An Adaptive Curriculum Learning Framework for Unbiased Glaucoma Diagnosis | Poster |
3689 | Backpropagated Gradient Representations for Anomaly Detection | Poster |
3694 | Dense RepPoints: Representing Visual Objects with Dense Point Sets | Poster |
3696 | On Dropping Clusters to Regularize Graph Convolutional Neural Networks | Poster |
3702 | Adaptive Video Highlight Detection by Learning from User History | Poster |
3705 | Automated Data Augmentation Significantly Improves 3D Object Detection | Poster |
3719 | DR-KFS: A Differentiable Visual Similarity Metric for 3D Shape Reconstruction | Poster |
3720 | SPAN: Spatial Pyramid Attention Network for Image Manipulation Detection | Poster |
3721 | Transferring Domain Shift Across Tasks for Zero-shot Domain adaptation | Poster |
3723 | YOLO in the Dark - Domain Adaptation Method for Merging Multiple Models - | Poster |
3739 | Identity-Aware Multi-Sentence Video Description | Poster |
3742 | VQA-LOL: Visual Question Answering under the Lens of Logic | Poster |
3751 | Piggyback GAN: Efficient Lifelong Learning for Image Conditioned Generation | Poster |
3752 | TRRNet: Tree Relation Reasoning for Compositional Visual Question Answering | Poster |
3764 | Mining Inter-Video Proposal Relations for Video Object Detection | Poster |
3768 | TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval | Poster |
3769 | Minimum Class Confusion for Versatile Domain Adaptation | Poster |
3790 | Large Batch Optimization for Object Detection: Training COCO in 12 Minutes | Poster |
3792 | Towards Practical and Efficient High-Resolution HDR Deghosting with CNN | Poster |
3794 | Self-Supervised Differentiable Rendering for Monocular 3D Object Detection | Poster |
3796 | Shape Prior Deformation for Categorical 6D Object Pose and Size Estimation | Poster |
3801 | Dynamic and Static Context-aware LSTM for Multi-agent Motion Prediction | Poster |
3802 | Image-based table recognition: data, model, and evaluation | Poster |
3803 | Group Activity Prediction with Sequential Relational Anticipation Model | Poster |
3805 | PiP: Planning-informed Trajectory Prediction for Autonomous Driving | Poster |
3807 | PSConv: Squeezing Feature Pyramid into One Compact Poly-Scale Convolutional Layer | Poster |
3819 | Hierarchical Context Embedding for Region-based Object Detection | Poster |
3822 | Attention-Driven Dynamic Graph Convolutional Network for Multi-Label Image Recognition | Poster |
3830 | Gen-LaneNet: A Generalized and Scalable Approach for 3D Lane Detection | Poster |
3833 | Sparse-to-Dense Depth Completion Revisited: Sampling Strategy and Graph Construction | Poster |
3837 | MEAD: A Large-scale Audio-visual Dataset for Emotional Talking Face Generation | Poster |
3850 | Detecting Human-Object Interactions with Action Co-occurrence Priors | Poster |
3853 | Learning Connectivity of Neural Networks from a Topological Perspective | Poster |
3867 | JSTASR: Joint Size and Transparency-Aware Snow Removal Algorithm Based on Modified Partial Convolution and Veiling Effect Removal | Poster |
3872 | Learning Object-aware Anchor-free Networks for Real-time Object Tracking | Poster |
3884 | Object Tracking using Spatio-Temporal Networks for Future Prediction Location | Poster |
3892 | Pillar-based Object Detection for Autonomous Driving | Poster |
3902 | Sparse Adversarial Attack via Perturbation Factorization | Poster |
3925 | 3D Scene Reconstruction from a Single Viewport | Poster |
3935 | Learning to Optimize Domain Specific Normalization for Domain Generalization | Poster |
3937 | Self-supervised Outdoor Scene Relighting | Poster |
3947 | LC-VSLAM: Real-time Tracking and Bundle Adjustment in Line-Cloud | Poster |
3951 | Leveraging Acoustic Images for Effective Self-Supervised Audio Representation Learning | Poster |
3960 | Learning Joint Visual Semantic Matching Embeddings for Language-guided Retrieval | Poster |
3990 | Globally Optimal and Efficient Vanishing Point Estimation in Atlanta World | Poster |
3992 | StyleGAN2 Distillation for Feed-forward Image Manipulation | Poster |
3997 | Self-Prediction for Joint Instance and Semantic Segmentation of Point Clouds | Poster |
3999 | Learning Disentangled Representations via Mutual Information Estimation | Poster |
4010 | Challenge-Aware RGBT Tracking | Poster |
4019 | Fully Trainable and Interpretable Non-Local Sparse Models for Image Restoration | Poster |
4034 | AutoSimulate: (Quickly) Learning Synthetic Data Generation | Poster |
4035 | LatticeNet: Towards Lightweight Image Super-resolution with Lattice Block | Poster |
4042 | Learning from Scale-Invariant Examples for Domain Adaptation in Semantic Segmentation | Poster |
4046 | Active Visual Information Gathering for Vision-Language Navigation | Poster |
4061 | Deep Hough-Transform Line Priors | Poster |
4065 | Unsupervised Shape and Pose Disentanglement for 3D Meshes | Poster |
4066 | CLAWS: Clustering Assisted Weakly Supervised Learning with Normalcy Suppression for Anomalous Event Detection | Poster |
4072 | Inclusive GAN: Improving Data and Minority Coverage in Generative Models | Poster |
4076 | SESAME: Semantic Editing of Scenes by Adding, Manipulating or Erasing Objects | Poster |
4095 | Dive Deeper Into Box for Object Detection | Poster |
4097 | PG-Net: Pixel to Global Matching Network for Visual Tracking | Poster |
4098 | Why Are Deep Representations Good Perceptual Quality Features? | Poster |
4101 | Geometric Estimation via Robust Subspace Recovery | Poster |
4102 | Latent Embedding Feedback and Discriminative Features for Zero-Shot Classification | Poster |
4107 | Human Correspondence Consensus for 3D Object Semantic Understanding | Poster |
4111 | Learning Memory Augmented Cascading Network for Compressed Sensing of Images | Poster |
4112 | Least squares surface reconstruction on arbitrary domains | Poster |
4116 | Task-conditioned Domain Adaptation for Pedestrian Detection in Thermal Imagery | Poster |
4118 | Improving the Transferability of Adversarial Examples with Resized-Diverse-Inputs, Diversity-Ensemble and Region Fitting | Poster |
4120 | DADA: Differentiable Automatic Data Augmentation | Poster |
4123 | SceneCAD: Predicting Object Alignments and Layouts in RGB-D Scans | Poster |
4125 | Kinship Identification through Joint Learning Using Kinship Verification Ensemble | Poster |
4152 | Kernelized Memory Network for Video Object Segmentation | Poster |
4160 | A Single Stream Network for Robust and Real-time RGB-D Salient Object Detection | Poster |
4165 | Splitting vs. Merging: Mining Object Regions with Discrepancy and Intersection Loss for Weakly Supervised Semantic Segmentation | Poster |
4167 | Temporal Keypoint Matching and Refinement Network for Pose Estimation and Tracking | Poster |
4168 | Neural Point-Based Graphics | Poster |
4171 | FHDe$^2$Net: Full High Definition Demoireing Network | Poster |
4172 | Learning Structural Similarity of User Interface Layouts using Graph Networks | Poster |
4174 | NAS-Count: Counting-by-Density with Neural Architecture Search | Poster |
4185 | Towards Generalization Across Depth for Monocular 3D Object Detection | Poster |
4197 | Margin-Mix: Semi--Supervised Learning for Face Expression Recognition | Poster |
4198 | Principal Feature Visualisation in Convolutional Neural Networks | Poster |
4211 | Progressive Refinement Network for Occluded Pedestrian Detection | Poster |
4214 | MonoPort: Monocular Real-Time Volumetric Teleportation | Poster |
4217 | The Mapillary Traffic Sign Dataset for Detection and Classification on a Global Scale | Poster |
4220 | Measuring Generalisation to Unseen Viewpoints, Articulations, Shapes and Objects for 3D Hand Pose Estimation under Hand-Object Interaction | Poster |
4234 | Disentangling Multiple Features in Video Sequences using Gaussian Processes in Variational Autoencoders | Poster |
4238 | SEN: A Novel Dissimilarity Measure for Prototypical Few-Shot Learning Networks | Poster |
4241 | Kinematic 3D Object Detection in Monocular Video | Poster |
4257 | Describing Unseen Videos via Multi-Modal Cooperative Dialog Agents | Poster |
4270 | SACA Net: Cybersickness Assessment of Individual Viewers for VR Content via Graph-based Symptom Relation Embedding | Poster |
4272 | End-to-End Low Cost Compressive Spectral Imaging with Spatial-Spectral Self-Attention | Poster |
4297 | Know Your Surroundings: Exploiting Scene Information for Object Tracking | Poster |
4298 | Practical Detection of Trojan Neural Networks: Data-Limited and Data-Free Cases | Poster |
4300 | Anatomy-Aware Siamese Network: Exploiting Semantic Asymmetry for Accurate Pelvic Fracture Detection | Poster |
4302 | DeepLandscape: Adversarial Modeling of Landscape Videos | Poster |
4304 | GANwriting: Content-Conditioned Generation of Styled Handwritten Word Images | Poster |
4306 | Spatial-Angular Interaction for Light Field Image Super-Resolution | Poster |
4314 | BATS: Binary ArchitecTure Search | Poster |
4319 | A Closer Look at Local Aggregation Operators in Point Cloud Analysis | Poster |
4322 | Look here! A parametric learning based approach to redirect visual attention | Poster |
4324 | Variational Diffusion Autoencoders with Random Walk Sampling | Poster |
4328 | Adaptive Variance Based Label Distribution Learning For Facial Age Estimation | Poster |
4334 | Connecting the Dots: Detecting Adversarial Perturbations Using Context Inconsistency | Poster |
4342 | Perceive, Predict, and Plan: Safe Motion Planning Through Interpretable Semantic Representations | Poster |
4350 | VarSR: Variational Super-Resolution Network for Very Low Resolution Images | Poster |
4353 | Co-Heterogeneous and Adaptive Segmentation from Multi-Source and Multi-Phase CT Imaging Data: A Study on Pathological Liver and Lesion Segmentation | Poster |
4355 | Towards Recognizing Unseen Categories in Unseen Domains | Poster |
4362 | Square Attack: a query-efficient black-box adversarial attack via random search | Poster |
4363 | You Are Here: Geolocation by Embedding Maps and Images | Poster |
4364 | Segmentations-Leak: Membership Inference Attacks and Defenses in Semantic Image Segmentation | Poster |
4366 | From Video to Stability: Learning Dynamics from Kinematics of Human Motion | Poster |
4368 | LevelSet R-CNN: A Deep Variational Method for Instance Segmentation | Poster |
4374 | Efficient Scale-permuted Backbone with Learned Resource Distribution | Poster |
4375 | Bridging Multiple Distant Domains by Learning Transferable Shapes from Sketch | Poster |
4377 | Bridging Knowledge Graphs to Generate Scene Graphs | Poster |
4386 | Implicit Latent Variable Model for Scene-Consistent Motion Forecasting | Poster |
4387 | Learning Visual Commonsense for Robust Scene Graph Generation | Poster |
4396 | MPCC: Matching Priors and Conditionals for Clustering | Poster |
4405 | PointAR: Efficient Lighting Estimation for Mobile Augmented Reality | Poster |
4408 | Discrete Point Flow Networks for Efficient Point Cloud Generation | Poster |
4410 | Accelerating Deep Learning with Millions of Classes | Poster |
4416 | Password-conditioned Anonymization and Deanonymization with Face Identity Transformers | Poster |
4421 | Inertial Safety from Structured Light | Poster |
4424 | PointTriNet: Learned Triangulation of 3D Point Sets | Poster |
4433 | Toward unsupervised, multi-object discovery in large-scale image collections | Poster |
4474 | Deep View Synthesis From Colored 3D PointClouds | Poster |
4495 | Consensus-Aware Visual-Semantic Embedding for Image-Text Matching | Poster |
4499 | Spatial Hierarchy Aware Residual Pyramid Network for Time-of-Flight Depth Denoising | Poster |
4510 | Sat2Graph: Road Graph Extraction through Graph-Tensor Encoding | Poster |
4513 | Cross-task Transfer for Multimodal Aerial Scene Classification | Poster |
4522 | Polarimetric Multi-View Inverse Rendering | Poster |
4524 | SideInfNet: A Deep Neural Network for Semi-Automatic Semantic Segmentation with Side Information | Poster |
4531 | Improving Recognition with Unlabeled Faces in the Wild | Poster |
4532 | NeuRoRA: Neural Robust Rotation Averaging | Poster |
4535 | SG-VAE: Scene Grammar Variational Autoencoder to generate new indoor scenes | Poster |
4544 | Unsupervised Learning of Optical Flow with Deep Feature Similarity | Poster |
4548 | Blended Grammar Network for Human Parsing | Poster |
4549 | A Crisis is an Opportunity: Discriminative Patch-based and Piece-wise Planar-based Unsupervised Depth Estimation in Indoor Environments | Poster |
4553 | Efficient Attention Mechanism for Visual Dialog that can Handle All the Interactions between Multiple Inputs | Poster |
4582 | Adaptive Mixture Regression Network with Local Counting Map for Crowd Counting | Poster |
4583 | BIRNAT: Bidirectional Recurrent Neural Networks with Adversarial Training for Video Snapshot Compressive Imaging | Poster |
4584 | Ultra Fast Structure-aware Deep Lane Detection | Poster |
4585 | Cross-Identity Motion Transfer for Arbitrary Objects through Pose-Attentive Video Reassembling | Poster |
4600 | Domain Adaptive Object Detection via Asymmetric Tri-way Faster-RCNN | Poster |
4614 | Exclusivity-Consistency Regularized Knowledge Distillation for Face Recognition | Poster |
4617 | Learning Camera-Aware Noise Models | Poster |
4619 | The Whole is greater than the sum of its Nonrigid Parts | Poster |
4625 | Iterative Distance-Aware Similarity Matrix Convolution with Mutual-Supervised Point Elimination for Efficient Point Cloud Registration | Poster |
4628 | In Defense of Graph Inference Algorithms for Weakly Supervised Object Localization | Poster |
4629 | Environment-agnostic Multitask Learning for Natural Language Grounded Navigation | Poster |
4631 | TPFN: Apply Outer Product along Time for Multimodal Sentiment Analysis Fusion on Imperfect Data | Poster |
4637 | ProxyNCA++: Revisiting and Revitalizing Proxy Neighborhood Component Analysis | Poster |
4644 | Learning with Privileged Information for Efficient Image Super-Resolution | Poster |
4652 | Joint Visual and Temporal Consistency for Unsupervised Domain Adaptive Person Re-Identification | Poster |
4655 | Autoencoder-based Graph Construction for Semi-supervised Learning | Poster |
4670 | Virtual Multi-view Fusion for 3D Semantic Segmentation | Poster |
4672 | Decoupling GCN with DropGraph Module for Skeleton-Based Action Recognition | Poster |
4676 | Deep Shape from Polarization | Poster |
4682 | A Boundary Based Out-Of-Distribution Classifier for Generalized Zero-Shot Learning | Poster |
4690 | Mind the Discriminability: Asymmetric Adversarial Domain Adaptation | Poster |
4694 | SeqXY2SeqZ: Structure Learning for 3D Shapes by Sequentially Predicting 1D Occupancy Segments From 2D Coordinates | Poster |
4729 | Simultaneous Detection and Tracking with Motion Modelling for Multiple Object Tracking | Poster |
4736 | Deep FusionNet for Point Cloud Semantic Segmentation | Poster |
4750 | Deep Material Recognition in Light-Fields via Disentanglement of Spatial and Angular Information | Poster |
4757 | Dual Adversarial Network for Deep Active Learning | Poster |
4763 | Fully Convolutional Networks for Continuous Sign Language Recognition | Poster |
4771 | Self-adapting confidence estimation for stereo | Poster |
4793 | Deep Surface Normal Estimation on the 2-Sphere with Confidence Guided Semantic Attention | Poster |
4796 | AutoSTR: Efficient Backbone Search for Scene Text Recognition | Poster |
4802 | Pretraining Matters: A Two-Stage Design for Unsupervised Image Classification | Poster |
4810 | Adversarial Training with Bi-directional Likelihood Regularization for Visual Classification | Poster |
4830 | Faster AutoAugment: Learning Augmentation Strategies using Backpropagation | Poster |
4836 | Hand-Transformer: Non-Autoregressive Structured Modeling for 3D Hand Pose Estimation | Poster |
4845 | Boundary-Aware Cascade Networks for Temporal Action Segmentation | Poster |
4865 | Towards Content-independent Multi-Reference Super-Resolution: Adaptive Pattern Matching and Feature Aggregation | Poster |
4871 | Inference Graphs for CNN Interpretation | Poster |
4879 | An End-to-End OCR Text Re-organization Sequence Learning for Rich-text Detail Image Comprehension | Poster |
4889 | Improving Query Efficiency of Black-box Adversarial Attack | Poster |
4890 | Self-similarity Student for Partial Label Histopathology Image Segmentation | Poster |
4912 | BioMetricNet: deep unconstrained face verification through learning of metrics regularized onto Gaussian distributions | Poster |
4913 | A Decoupled Learning Scheme for Real-world Burst Denoising from Raw Images | Poster |
4920 | Global-and-Local Relative Position Embedding for Unsupervised Video Summarization | Poster |
4924 | Real-World Blur Dataset for Learning and Benchmarking Deblurring Algorithms | Poster |
4927 | SPARK: Spatial-aware Online Incremental Attack Against Visual Tracking | Poster |
4943 | CenterNet Heatmap Propagation for Real-time Video Object Detection | Poster |
4959 | Hierarchical Dynamic Filtering Network for RGB-D Salient Object Detection | Poster |
4963 | SOLAR: Second-Order Loss and Attention for Image Retrieval | Poster |
4964 | Fixing Localization Errors to Improve Image Classification | Poster |
4968 | PatchPerPix for Instance Segmentation | Poster |
4997 | Attend and Segment | Poster |
5004 | Accelerating CNN Training by Pruning Activation Gradients | Poster |
5010 | Global and Local Enhancement Networks For Paired and Unpaired Image Enhancement | Poster |
5041 | Probabilistic Anchor Assignment with IoU Prediction for Object Detection | Poster |
5056 | Eyeglasses 3D shape reconstruction from a single face image | Poster |
5061 | Temporal Complementary Learning for Video Person Re-Identification | Poster |
5063 | HoughNet: Integrating near and long-range evidence for bottom-up object detection | Poster |
5066 | Graph Wasserstein Correlation Analysis for Movie Retrieval | Poster |
5068 | Revisiting RCNN for Action Detection in Videos | Poster |
5090 | Full-Time Monocular Road Detection Using Zero-Distribution Prior of Angle of Polarization | Poster |
5095 | A Flexible Recurrent Residual Pyramid Network for Video Frame Interpolation | Poster |
5099 | Learning Enriched Features for Real Image Restoration and Enhancement | Poster |
5105 | Detail Preserved Point Cloud Completion via Separated Feature Aggregation | Poster |
5115 | LabelEnc: A New Intermediate Supervision Method for Object Detection | Poster |
5118 | Unsupervised Learning of Category-Specific Symmetric 3D Keypoints from Point Sets | Poster |
5130 | PAMS: Quantized Super-Resolution via Parameterized Max Scale | Poster |
5131 | SSN: Shape Signature Networks for Multi-class Object Detection from Point Clouds | Poster |
5134 | OID: Outlier Identifying and Discarding in Blind Image Deblurring | Poster |
5140 | Few-Shot Single-View 3-D Object Reconstruction with Compositional Priors | Poster |
5150 | Enhanced Sparse Model for Blind Deblurring | Poster |
5155 | SumGraph: Video Summarization via Recursive Graph Modeling | Poster |
5164 | Feature Normalized Knowledge Distillation for Image Classification | Poster |
5170 | A Metric Learning Reality Check | Poster |
5190 | FTL: A universal framework for training low-bit DNNs via Feature Transfer | Poster |
5192 | XingGAN for Person Image Generation | Poster |
5203 | GATCluster: Self-Supervised Gaussian-Attention Network for Image Clustering | Poster |
5204 | VCNet: A Robust Approach to Blind Image Inpainting | Poster |
5205 | Learning to Predict Context-adaptive Convolution for Semantic Segmentation | Poster |
5211 | EfficientFCN: Holistically-guided Decoding for Semantic Segmentation | Poster |
5227 | GroSS: Group-Size Series Decomposition for Grouped Architecture Search | Poster |
5291 | Efficient Adversarial Attacks for Visual Object Tracking | Poster |
5299 | Globally-Optimal Event Camera Motion Estimation | Poster |
5301 | Weakly-supervised Learning of Human Dynamics | Poster |
5305 | Journey Towards Tiny Perceptual Super-Resolution | Poster |
5308 | What makes fake images detectable? Understanding properties that generalize | Poster |
5313 | Embedding Propagation: Smoother Manifold for Few-Shot Classification | Poster |
5315 | Category Level Object Pose Estimation via Neural Analysis-by-Synthesis | Poster |
5320 | High-Fidelity Synthesis with Disentangled Representation | Poster |
5323 | PL1P - Point-line Minimal Problems under Partial Visibility in Three Views | Poster |
5327 | Prediction, Recovery and Identification: Adaptive Low-Resolution Person Re-Identification | Poster |
5328 | Learning Canonical Representations for Scene Graph to Image Generation | Poster |
5331 | Adversarial Robustness on In- and Out-Distribution Improves Explainability | Poster |
5333 | Deformable Style Transfer | Poster |
5336 | Aligning Videos in Space and Time | Poster |
5346 | Neural Wireframe Renderer: Learning Wireframe to Image Translations | Poster |
5351 | RBF-Softmax: Learning Deep Representative Prototypes with Radial Basis Function Softmax | Poster |
5368 | Testing the Safety of Self-driving Vehicles by Simulating Perception and Prediction | Poster |
5369 | Determining the Relevance of Features for Deep Neural Networks | Poster |
5372 | Weakly Supervised Semantic Segmentation with Boundary Exploration | Poster |
5381 | GANhopper: Multi-Hop GAN for Unsupervised Image-to-Image Translation | Poster |
5385 | DOPE: Distillation Of Part Experts for whole-body 3D pose estimation in the wild | Poster |
5394 | Multi-view adaptive graph convolutions for graph classification | Poster |
5406 | Universal Self-Training for Unsupervised Domain Adaptation | Poster |
5409 | Weight Decay Scheduling and Knowledge Distillation for Active Learning | Poster |
5414 | HMQ: Hardware Friendly Mixed Precision Quantization Block for CNNs | Poster |
5423 | Truncated Inference for Latent Variable Optimization Problems: Application to Robust Estimation and Learning | Poster |
5424 | Geometry Constrained Weakly Supervised Object Localization | Poster |
5445 | Duality Diagram Similarity: a generic framework for initialization selection in task transfer learning | Poster |
5448 | OneGAN: Simultaneous Unsupervised Learning of Conditional Image Generation, Foreground Segmentation, and Fine-Grained Clustering | Poster |
5450 | Mining self-similarity: Label super-resolution with epitomic representations | Poster |
5480 | AE-OT-GAN: Training GANs from data specific latent distribution | Poster |
5488 | Null-sampling for Invariant and Interpretable Representations | Poster |
5491 | Guiding Monocular Depth Estimation Using Depth Attention-Volume | Poster |
5494 | Tracking Emerges by Looking Around Static Scenes, with Neural 3D Mapping | Poster |
5495 | Boosting Weakly Supervised Object Detection with Progressive Knowledge Transfer | Poster |
5496 | BézierSketch: A generative model for scalable vector sketches | Poster |
5530 | Semantic Relation Preserving Knowledge Distillation for Image-to-Image Translation | Poster |
5551 | Domain Adaptation through Task Distillation | Poster |
5563 | PatchAttack: A Black-box Texture-based Attack with Reinforcement Learning | Poster |
5564 | More Classifiers, Less Forgetting: A Generic Multi-classifier Paradigm for Incremental Learning | Poster |
5568 | Extending and Analyzing Self-Supervised Learning Across Domains | Poster |
5573 | Multi-Source Open-Set Deep Adversarial Domain Adaptation | Poster |
5576 | Neural Batch Sampling with Reinforcement Learning for Semi-Supervised Anomaly Detection | Poster |
5581 | LEMMA: A Multiview Dataset for Learning Multi-agent Multi-task Activities | Poster |
5589 | Teaching Cameras to Feel: Estimating Tactile Physical Properties of Surfaces From Images | Poster |
5592 | Accurate Optimization of Weighted Nuclear Norm for Non-Rigid Structure from Motion | Poster |
5605 | Proposal based Video Completion | Poster |
5608 | HGNet: Hybrid Generative Network for Zero-shot Domain Adaptation | Poster |
5622 | Beyond Monocular Deraining: Paired Rain Removal Networks via Unpaired Semantic Understanding | Poster |
5625 | DBQ: A Differentiable Branch Quantizer for Lightweight Deep Neural Networks | Poster |
5635 | All at Once: Temporally Adaptive Multi-Frame Interpolation with Advanced Motion Modeling | Poster |
5643 | A Broader Study of Cross-Domain Few-Shot Learning | Poster |
5645 | Practical Poisoning Attacks on Neural Networks | Poster |
5669 | Unsupervised Domain Adaptation in the Dissimilarity Space for Person Re-identification | Poster |
5671 | Learn distributed GAN with Temporary Discriminators | Poster |
5673 | SemifreddoNets: Partially Frozen Neural Networks for Efficient Computer Vision Systems | Poster |
5686 | Improving Adversarial Robustness by Enforcing Local and Global Compactness | Poster |
5687 | TopoGAN: A Generative Adversarial Approach to Topology-Aware Road Segmentation | Poster |
5695 | Channel selection using Gumbel softmax | Poster |
5696 | Exploiting Temporal Coherence for Self-Supervised One-shot Video Re-identification | Poster |
5698 | An Efficient Training Framework for Reversible Neural Architectures | Poster |
5717 | Box2Seg: Attention Weighted Loss and Discriminative Feature Learning for Weakly Supervised Segmentation | Poster |
5744 | Freeform Structured Light | Poster |
5750 | One-pixel Signature: Characterizing CNN Classifiers for Backdoor Detection | Poster |
5752 | Learning to Transfer Learn: Reinforcement Learning-Based Selection for Adaptive Transfer Learning | Poster |
5757 | Structure-Aware Generation Network for Recipe Generation from Images | Poster |
5769 | A Simple and Effective Framework for Pairwise Deep Metric Learning | Poster |
5772 | Meta-rPPG: Remote Heart Rate Estimation Using a Transductive Meta-Learner | Poster |
5775 | A Recurrent Transformer Network for Novel View Action Synthesis | Poster |
5777 | Multi-view Action Recognition using Cross-view Video Prediction | Poster |
5794 | Learning Discriminative Feature with CRF for Unsupervised Video Object Segmentation | Poster |
5809 | SMART: Simultaneous Multi-Agent Recurrent Trajectory Prediction | Poster |
5818 | Label-Driven Reconstruction for Domain Adaptation in Semantic Segmentation | Poster |
5831 | Efficient Outdoor 3D Point Cloud Semantic Segmentation for Critical Road Objects and Distributed Contexts | Poster |
5849 | Attributional Robustness Training using Input-Gradient Spatial Alignment | Poster |
5855 | How to Train Your Event Camera Neural Network | Poster |
5863 | Spatial Geometric Reasoning for Room Layout Estimation via Deep Reinforcement Learning | Poster |
5865 | On the Importance of Data Augmentation for Object Detection | Poster |
5875 | DA-NAS: Data Adapted Pruning for Efficient Neural Architecture Search | Poster |
5879 | A Closer Look at Generalisation in RAVEN | Poster |
5884 | Supervised Edge Attention Network for Accurate Image Instance Segmentation | Poster |
5888 | Discriminative Partial Domain Adversarial Network | Poster |
5893 | Differentiable Programming for Hyperspectral Unmixing using a Physics-based Dispersion Model | Poster |
5894 | Deep Cross-species Feature Learning for Animal Face Recognition via Residual Interspecies Equivariant Network | Poster |
5897 | Guidance and Evaluation: Semantic-Aware Image Inpainting for Mixed Scenes | Poster |
5906 | Sound2Sight: Generating Visual Dynamics from Sound and Context | Poster |
5913 | 3D-CVF: Generating Joint Camera and LiDAR Features Using Cross-View Spatial Feature Fusion for 3D Object Detection | Poster |
5921 | NoiseRank: Unsupervised Label Noise Reduction with Dependence Models | Poster |
5930 | Fast Adaptation to Super-Resolution Networks via Meta-Learning | Poster |
5931 | TP-LSD: Tri-Points Based Line Segment Detector | Poster |
5940 | Spatially-Adaptive Convolution for Efficient Point-Cloud Segmentation | Poster |
5955 | An Attention-driven Two-stage Clustering Method for Unsupervised Person Re-Identification | Poster |
5989 | Toward Fine-grained Facial Expression Manipulation | Poster |
5992 | Adaptive Object Detection with Dual Multi-Label Prediction | Poster |
6007 | Table Structure Recognition using Top-Down and Bottom-Up Cues | Poster |
6013 | Novel View Synthesis on Unpaired Data by Conditional Deformable Variational Auto-Encoder | Poster |
6018 | Beyond the Nav-Graph: Vision-and-Language Navigation in Continuous Environments | Poster |
6021 | Boundary Content Graph Neural Network for Temporal Action Proposal Generation | Poster |
6037 | Pose Augmentation: Class-agnostic Object Pose Transformation for Object Recognition | Poster |
6051 | VLANet: Video-Language Alignment Network for Weakly-Supervised Video Moment Retrieval | Poster |
6054 | Attention-Based Query Expansion Learning | Poster |
6055 | Interpretable Foreground Object Search As Knowledge Distillation | Poster |
6056 | Improving Knowledge Distillation via Category Structure | Poster |
6059 | High Resolution Zero-Shot Domain Adaptation of Synthetically Rendered Face Images | Poster |
6066 | Attentive Prototype Few-shot Learning with Capsule Network-based Embedding | Poster |
6083 | Weakly Supervised Instance Segmentation by Learning Annotation Consistent Instances | Poster |
6091 | DA4AD: End-to-end Deep Attention Aware Features Aided Visual Localization for Autonomous Driving | Poster |
6109 | Visual-Relation Conscious Image Generation from Structured-Text | Poster |
6114 | Patch-wise Attack for Fooling Deep Neural Network | Poster |
6141 | Feature Pyramid Transformer | Poster |
6153 | MABNet: A Lightweight Stereo Network Based on Multibranch Adjustable Bottleneck Module | Poster |
6159 | Guided Saliency Feature Learning for Person Re-identification in Crowded Scenes | Poster |
6188 | Asymmetric Two-Stream Architecture for Accurate RGB-D Saliency Detection | Poster |
6192 | Lightweight Statistical Explanations for Deep Neural Networks | Poster |
6207 | Deep Graph Matching via Blackbox Differentiation of Combinatorial Solvers | Poster |
6215 | Video Representation Learning by Learning to Tell Motions Apart | Poster |
6231 | Unsupervised Monocular Depth Estimation for Night-time Images using Adversarial Domain Feature Adaptation | Poster |
6236 | Variational Connectionist Temporal Classification | Poster |
6258 | End-to-end Dynamic Matching Network for Multi-view Multi-person 3d Pose Estimation | Poster |
6259 | Orderly Disorder in Point Cloud Domain | Poster |
6272 | Deep Decomposition Learning for Inverse Imaging Problems | Poster |
6287 | FLOT: Scene Flow Estimation by Learned Optimal Transport on Point Clouds | Poster |
6294 | Accurate Reconstruction of Oriented 3D Points using Affine Correspondences | Poster |
6316 | Volumetric Transformer Networks | Poster |
6332 | 360º Camera Alignment via Segmentation | Poster |
6334 | A Novel Line Integral Transform for 2D Affine Invariant Shape Retrieval | Poster |
6336 | Explainable Graph Networks for Weakly-supervised Learning of Visual Relations | Poster |
6345 | Guided Semantic Flow | Poster |
6393 | Document Structure Extraction using Prior Based HighResolution Hierarchical Semantic Segmentation | Poster |
6416 | Measuring the importance of temporal features in video saliency | Poster |
6421 | Searching Efficient 3D Architectures with Sparse Point-Voxel Convolution | Poster |
6424 | Towards Reliable Evaluation of Algorithms for Road Network Reconstruction from Aerial Images | Poster |
6425 | Online Continual Learning under Extreme Memory Constraints | Poster |
6436 | Learning to Cluster under Domain Shift | Poster |
6438 | Defense Against Adversarial Attacks via Controlling Gradient Leaking on Embedded Manifolds | Poster |
6440 | Improving Optical Flow on a Pyramid Level | Poster |
6446 | Procrustean Regression Networks: Learning 3D Structure of Non-Rigid Objects from 2D Annotations | Poster |
6474 | Learning to Learn Parameterized Classification Networks for Scalable Input Images | Poster |
6476 | Stereo Event-based Particle Tracking Velocimetry for 3D Fluid Flow Reconstruction | Poster |
6515 | Simplicial Complex based Point Correspondence between Images warped onto Manifolds | Poster |
6535 | Neural Message Passing on Hybrid Spatio-Temporal Visual and Symbolic Graphs for Video Understanding | Poster |
6559 | Distance-Normalized Unified Representation for Monocular 3D Object Detection | Poster |
6576 | Sequential Deformation for Accurate Scene Text Detection | Poster |
6579 | Where to Explore Next? ExHistCNN for History-aware Autonomous 3D Exploration | Poster |
6591 | Semi-Supervised Segmentation based on Error-Correcting Supervision | Poster |
6621 | Quantum-soft QUBO Suppression for Accurate Object Detection | Poster |
6624 | Label-similarity Curriculum Learning | Poster |
6627 | Recurrent Image Annotation With Explicit Inter-Label Dependencies | Poster |
6628 | Cross-Attention in Coupled Unmixing Nets for Unsupervised Hyperspectral Super-Resolution | Poster |
6637 | SimPose: Effectively Learning DensePose and Surface Normal of People from Simulated Data | Poster |
6639 | ByeGlassesGAN: Identity Preserving Eyeglasses Removal for Face Images | Poster |
6693 | Differentiable Joint Pruning and Quantization for Hardware Efficiency | Poster |
6697 | Learning to Generate Customized Dynamic 3D Facial Expressions | Poster |
6698 | LandscapeAR: Large Scale Outdoor Augmented Reality by Matching Photographs with Terrain Models Using Learned Descriptors | Poster |
6709 | Mirrored Autoencoders with Simplex Interpolation for Unsupervised Anomaly Detection | Poster |
6717 | Learning Disentangled Feature Representation for Hybrid-distorted Image Restoration. | Poster |
6719 | Jointly De-biasing Face Recognition and Demographic Attribute Estimation | Poster |
6721 | Regularized Loss for Weakly Supervised Single Class Segmentation | Poster |
6736 | Spike-FlowNet: Event-based Optical Flow Estimation with Energy-Efficient Hybrid Neural Networks | Poster |
6746 | Forgetting Outside the Box: Scrubbing Deep Networks of Information Accessible from Input-Output Observations | Poster |
6748 | Inherent Adversarial Robustness of Deep Spiking Neural Networks: Effects of Discrete Input Encoding and Non-Linear Activations | Poster |
6753 | Synthesizing Coupled 3D Face Modalities by Trunk-Branch Generative Adversarial Networks | Poster |
6754 | Learning to Learn Words from Visual Scenes | Poster |
6765 | On Transferability of Histological Tissue Labels in Computational Pathology | Poster |
6770 | Learning actionness via long-range temporal order verification | Poster |
6773 | Fully Embedding Fast Convolutional Networks on Pixel Processor Arrays | Poster |
6775 | Character Region Attention For Text Spotting | Poster |
6795 | Stable Low-rank Tensor Decomposition for Compression of Convolutional Neural Network | Poster |
6796 | Dual Mixup Regularized Learning for Adversarial Domain Adaptation | Poster |
6814 | Robust and On-the-fly Dataset Denoising for Image Classification | Poster |
6833 | Imaging Behind Occluders Using Two-Bounce Light | Poster |
6837 | Improving Object Detection with Selective Self-supervised Self-training | Poster |
6873 | Deep Local Shapes: Learning Local SDF Priors for Detailed 3D Reconstruction | Poster |
6884 | Info3D: Representation Learning on 3D Objects using Mutual Information Maximization and Contrastive Learning | Poster |
6895 | Adversarial Data Augmentation via Deformation Statistics | Poster |
6926 | Neural Predictor for Neural Architecture Search | Poster |
6927 | Learning Permutation Invariant Representations using Memory Networks | Poster |
6936 | Feature Space Augmentation for Long-Tailed Data | Poster |
6940 | Laying the Foundations of Deep Long-Term Crowd Flow Prediction | Poster |
6965 | Weakly-Supervised Action Localization with Expectation-Maximization Multi-Instance Learning | Poster |
6967 | Fairness by Learning Orthogonal Disentangled Representations | Poster |
6977 | Self-Supervision with Superpixels: Training Few-shot Medical Image Segmentation without Annotation | Poster |
6979 | On Diverse Asynchronous Activity Anticipation | Poster |
6994 | Representative-Discriminative Learning for Open-set Land Cover Classification of Satellite Imagery | Poster |
7020 | Structure-Aware Human-Action Generation | Poster |
7035 | Towards Efficient Coarse-to-Fine Networks for Action and Gesture Recognition | Poster |
7036 | $S^3$Net: Semantic-Aware Self-Supervised Depth Estimation with Monocular Videos and Synthetic Data | Poster |
7037 | Leveraging Seen and Unseen Semantic Relationships for Generative Zero-Shot Learning | Poster |
7039 | Weight Excitation: Built-in Attention Mechanisms in Convolutional Neural Networks | Poster |
7093 | UNITER: UNiversal Image-TExt Representation Learning | Poster |
7133 | $Oscar$: Object-Semantics Aligned Pre-training for Vision-and-Language Tasks | Poster |
7177 | Improving Face Recognition from Hard Samples via Distribution Distillation Loss | Poster |
7198 | Extract and Merge: Superpixel Segmentation with Regional Attributes | Poster |
7202 | Spatial-Adaptive Network for Single Image Denoising | Poster |
7263 | Physics-based Feature Dehazing Networks | Poster |
7352 | Master-Slave Interaction Model: An Asymmetric Modelling for Action Assessment | Poster |
7358 | High-quality Single-model Deep Video Compression with Frame-Conv3D and Multi-frame Differential Modulation | Poster |
7362 | Instance-Aware Embedding for Point Cloud Instance Segmentation | Poster |
7424 | Self-Paced Deep Regression Forests with Consideration on Underrepresented Samples | Poster |
7451 | Manifold Projection for Adversarial Defense on Face Recognition | Poster |
7467 | Weakly-Supervised Learning with Side Information for Noisy Labeled Images | Poster |
7476 | Not only Look, but also Listen: Learning Multimodal Violence Detection under Weak Supervision | Poster |
7513 | SNE-RoadSeg: Incorporating Surface Normal Information into Semantic Segmentation for Accurate Freespace Detection | Poster |
7548 | Modeling the Space of Point Landmark Constrained Diffeomorphisms | Poster |
7579 | PieNet: Personalized Image Enhancement Network | Poster |
7614 | Statistical Outlier Identification in Pose Graphs Using Cycles | Poster |
7625 | Speech-driven Facial Animation using Cascaded GANs for Learning of Motion and Texture | Poster |
7627 | Solving phase retrieval with a learned reference | Poster |
7644 | Dual Grid Net: Hand Mesh VertexRegression from Single Depth Maps | Poster |
pictrue from pexels.com