Phoenixtree_Zhao

2020 ICML Oral 论文

Oral Papers

38 - ShapeCaptioner: Generative Caption Network for 3D Shapes by Learning a Mapping from Parts Detected in Multiple Views to Sentences

Zhizhong Han (University of Maryland, College Park); Chao Chen (Tsinghua University); Yu-Shen Liu (Tsinghua University)*; Matthias Zwicker (University of Maryland)

53 - Image Inpainting Based on Multi-frequency Probabilistic Inference Model

Jin Wang (Beijing University of Technology)*; Chen Wang (Beijing University of Technology); Qingming Huang (University of Chinese Academy of Sciences); Yunhui Shi (Beijing University of Technology); Jian-Feng Cai (The Hong Kong University of Science and Technology); Qing Zhu (Beijing University of Technology); Baocai Yin (Beijing University of Technology)

60 - Learning from the Past: Meta-Continual Learning with Knowledge Embedding for Jointly Sketch, Cartoon, and Caricature Face Recognition

Wenbo Zheng (School of Software Engineering, Xi'an Jiaotong University); Lan Yan (The State Key Laboratory for Management and Control of Complex Systems, Institute of Automation, Chinese Academy of Sciences); Chao Gou (School of Intelligent Systems Engineering, Sun Yat-sen University)*; Fei-Yue Wang (The State Key Laboratory for Management and Control of Complex Systems, Institute of Automation, Chinese Academy of Sciences)

63 - Dual Adversarial Network for Unsupervised Ground/Satellite-to-Aerial Scene Adaptation

jianzhe peter lin (University of British Columbia)*; Lichao Mou (DLR&TUM); tianze yu (University of British Columbia); Xiaoxiang Zhu (Technical University of Munich (TUM); German Aerospace Center (DLR)); Z. Jane Wang (University of British Columbia)

68 - Scene-Aware Background Music Synthesis

Yujia Wang (Beijing Institute of Technology)*; Wei Liang (Beijing Institute of Technology); Wanwan Li (George Mason University); Dingzeyu Li (Adobe Research); Lap-Fai Yu (George Mason University)

75 - Adversarial Bipartite Graph Learning for Video Domain Adaptation

Yadan Luo (University of Queensland)*; Zi Huang (University of Queensland); Zijian Wang (University of Queensland); Zheng Zhang (Harbin Institute of Technology, Shenzhen); Mahsa Baktashmotlagh (University of Queensland)

111 - Domain Adaptive Person Re-Identification via Coupling Optimization

Xiaobin Liu (Peking University); Shiliang Zhang (Peking University)*

118 - Give Me Something to Eat: Referring Expression Comprehension with Commonsense Knowledge

Peng Wang (Northwestern Polytechnical University); Dongyang Liu (Northwestern Polytechnical University); Hui Li (the University of Adelaide)*; Qi Wu (University of Adelaide)

142 - Controllable Video Captioning with an Exemplar Sentence

Yitian Yuan (Tsinghua University)*; Lin Ma (Tencent AI Lab); Jingwen Wang (Tencent AI Lab); Wenwu Zhu (Tsinghua University)

143 - MEmoR: A Dataset for Multimodal Emotion Reasoning in Videos

Guangyao Shen (Tsinghua University)*; Xin Wang (Tsinghua University); Xuguang Duan (Tsinghua University); Hongzhi Li (Microsoft Research); Wenwu Zhu (Tsinghua University)

162 - Single Image De-noising via Staged Memory Network

Weijiang Yu (SUN YAT-SEN UNIVERSITY)*; Jian Liang (Nanchang University); Lu Li (Zhejiang University); Nong Xiao (Sun Yat-sen University)

193 - Dual-Structure Disentangling Variational Generation for Data-Limited Face Parsing

Peipei Li ( Institute of Automation Chinese Academy of Sciences)*; Yinglu Liu (JD AI); Hailin Shi (JD AI); Xiang Wu (Reconova); Yibo Hu (Institute of Automation, Chinese Academy of Sciences); Ran He (Institute of Automation, Chinese Academy of Sciences); Zhenan Sun (Chinese of Academy of Sciences)

197 - A Human-Computer Duet System for Music Performance

Yuen-Jen Lin (Academia Sinica)*; Hsuan-Kai Kao (Academia Sinica); Yih-Chih Tseng (Academia Sinica); Ming Tsai (KoKo Lab); Li Su (Academia Sinica)

202 - Invisible: Federated Learning over Non-Informative Intermediate Updates against Multimedia Privacy Leakages

Qiushi Li (Tsinghua University)*; Wenwu Zhu (Tsinghua University); Chao Wu (Tsinghua University); xinglin pan (University of Electronic Science and Technology of China); Fan Yang (Tsinghua University); Yuezhi Zhou (Tsinghua University); Yaoxue Zhang (Tsinghua University)

221 - Every Moment Matters: Detail-Aware Networks to Bring a Blurry Image Alive

Kaihao Zhang (Australian National University)*; Wenhan Luo (Tencent AI Lab); Bjorn Stenger (Rakuten Institute of Technology); Wenqi Ren (Institute of Information Engineering, Chinese Academy of Sciences); Lin Ma (Tencent AI Lab); HONGDONG LI (Australian National University, Australia)

230 - Self-supervised Dance Video Synthesis Conditioned on Music

Xuanchi Ren (HKUST); Haoran Li (The Hong Kong University of Science and Technology); Zijian HUANG (the Hong Kong University of Science and Technology); Qifeng Chen (HKUST)*

232 - Co-Attentive Lifting for Infrared-Visible Person Re-Identification

Xing Wei (Xi'an Jiaotong University)*; Diangang Li (Xi'an Jiaotong University); Xiaopeng Hong (Xi'an Jiaotong University); Wei Ke (Xi'an Jiaotong University); Yihong Gong (Xi'an Jiaotong University)

291 - Dynamic GCN: Context-enriched Topology Learning for Skeleton-based Action Recognition

Fanfan Ye ( Hikvision Research Institute); Shiliang Pu (Hikvision Research Institute); Qiaoyong Zhong (Hikvision Research Institute)*; Chao Li (Hikvision Research Institute); Di Xie (Hikvision Research Institute); Huiming Tang (Zhejiang University)

304 - Boosting Visual Question Answering with Context-aware Knowledge Aggregation

Guohao Li (Tsinghua University)*; Xin Wang (Tsinghua University); Wenwu Zhu (Tsinghua University)

306 - Meta Parsing Networks: Towards Generalized Few-shot Scene Parsing with Adaptive Metric Learning

Peike Li (UTS)*; Yunchao Wei (University of Technology Sydney); Yi Yang (UTS)

312 - CODAN: Counting-driven Attention Network for Vehicle Detection in Congested Scenes

Wei Li (Southwest Jiaotong University); Zhenting Wang (Southwest Jiaotong University); Xiao Wu (Southwest Jiaotong University)*; Ji Zhang (Southwest Jiaotong University); Qiang Peng (Southwest Jiaotong University); Hongliang Li (University of Electronic Science and Technology of China)

352 - Modeling both Intra- and Inter-modal Influence for Real-Time Emotion Detection in Conversations

Dong Zhang (Soochow University)*; Weisheng Zhang (Soochow University); Shoushan Li (Soochow University); Zhu Qiaoming (Soochow University); Zhou Guodong (Soochow University)

355 - WIKI Food-500: A dataset for Large-Scale Food Recognition via Stacked Global-Local Attention Network

Weiqing Min (Institute of Computing Technology, Chinese Academy of Sciences)*; Linhu Liu (ICT); Zhiling Wang (Institute of Computing Technology, Chinese Academy of Sciences); Zhengdong Luo (University of Chinese Academy of Sciences); Xiaoming Wei (MeituanDianping group ); Xiaolin Wei (MeituanDianping group ); Shuqiang Jiang (ICT, China Academy of Science)

358 - Learning Image Classifier from Only Web Labels and Metadata: Automatic Label Correction through Graph

Jingkang Yang (Sensetime Research)*; Weirong Chen (SenseTime Research); Litong Feng (Sensetime Research); Xiaopeng Yan (SenseTime Research); Huabin Zheng (SenseTime Research); Wayne Zhang (SenseTime Research)

373 - Photo Stand-Out: Photography with Virtual Character

Yujia Wang (Beijing Institute of Technology)*; Sifan Hou (Beijing Institute of Technology); Wei Liang (Beijing Institute of Technology); Bing Ning (Beijing Institute of Fashion Technology)

378 - Accurate UAV Tracking with Distance-Injected Overlap Maximization

Chunhui Zhang (Chinese Academy of Sciences); Shiming Ge (Chinese Academy of Sciences)*; Kangkai Zhang (Chinese Academy of Sciences); Dan Zeng (Shanghai University)

383 - Context-Aware Multi-View Summarization Network for Image-Text Matching

Leigang Qu (Shandong University); Meng Liu (Shandong Jianzhu University); Da Cao (Hunan University); Liqiang Nie (Shandong University )*; Qi Tian (Huawei Cloud & AI)

391 - PiRhDy: Learning Pitch-, Rhythm-, and Dynamics-aware Embeddings for Symbolic Music

Hongru Liang (Nankai University); Wenqiang Lei (National University of Singapore)*; Paul Yaozhu Chan (A∗STAR); Zhenglu Yang (Nankai University); Maosong Sun (Tsinghua University); Tat-Seng Chua (National Univ. of Singapore)

395 - An Egocentric Action Anticipation Framework via Fusing Intuition and Analysis

Tianyu Zhang (ICT)*; Weiqing Min (Institute of Computing Technology, Chinese Academy of Sciences); Ying Zhu (University of Chinese Academy of Sciences); Yong Rui (Lenovo); Shuqiang Jiang (ICT, China Academy of Science)

436 - Label Embedding Online Hashing for Cross-Modal Retrieval

Yongxin Wang (Shandong University); Xin Luo (Shandong University); Xin-Shun Xu (Shandong University)*

444 - Cloze Test Helps: Effective Video Anomaly Detection via Learning to Complete Video Events

Guang Yu (National University of Defense Technology)*; Siqi Wang (National University of Defense Technology); Zhiping Cai (NUDT); En Zhu (National University of Defense Technology); Chuanfu Xu (National University of Defense Technology); Jianping Yin (National University of Defense Technology); Marius Kloft (TU Kaiserslautern)

484 - CRSSC: Salvage Reusable Samples from Noisy Data for Robust Learning

Zeren Sun (Nanjing University of Science and Technology ); Xian-Sheng Hua (Alibaba Group); Yazhou Yao (Nanjing University of Science and Technology)*; Xiu-Shen Wei (Nanjing University of Science and Technology); Guosheng Hu (AnyVision); Jian Zhang (UTS)

519 - MMFL: Multimodal Fusion Learning for Text-Guided Image Inpainting

Qing Lin (Fudan University); Bo Yan (Fudan University)*; Jichun Li (Fudan University); Weimin Tan (Fudan University)

531 - Vision Meets Wireless Positioning: Effective Person Re-identification with Recurrent Context Propagation

Yiheng Liu (University of Science and Technology of China)*; Wengang Zhou (University of Science and Technology of China); Mao Xi (University of Science and Technology of China); Sanjing Shen (University of Science and Technology of China); Houqiang Li (University of Science and Technology of China)

541 - Learning From Music to Visual Storytelling of Shots: A Deep Interactive Learning Mechanism

Jen-Chun Lin (Academia Sinica)*; Wen-Li Wei (Academia Sinica); Yen-Yu Lin (National Chiao Tung University); Tyng-Luh Liu (Academia Sinica); Hong-Yuan Mark Liao (Institute of Information Science, Academia Sinica, Taiwan)

553 - Asymmetric Deep Hashing for Efficient Hash Code Compression

Shu Zhao (Institute of Information Engineering, Chinese Academy of Sciences); Dayan Wu (Institute of Information Engineering, Chinese Academy of Sciences)*; Wanqian Zhang (Institute of Information Engineering, Chinese Academy of Sciences); Yu Zhou (Institute of Information Engineering, CAS); Bo Li ( Institute of Information Engineering, Chinese Academy of Sciences); Weiping Wang (Institute of Information Engineering, CAS, China)

588 - Quaternion-Based Knowledge Graph Network for Recommendation

Zhaopeng Li (State Key Laboratory of Information Security, Institute of Information Engineering, Chinese Academy of Sciences; University of Chinese Academy of Sciences)*; Qianqian Xu (Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences); Yangbangyan Jiang (Institute of Information Engineering, Chinese Academy of Sciences; University of Chinese Academy of Sciences); Xiaochun Cao (Chinese Academy of Sciences); Qingming Huang (University of Chinese Academy of Sciences)

601 - Multi-Person Action Recognition in Microwave Sensors

Diangang Li (Xi'an Jiaotong University); Jianquan Liu (NEC Corporation)*; Shoji Nishimura (NEC Corporation); Yuka Hayashi (NEC Corporation); Jun Suzuki (NEC Corporation); Yihong Gong (Xi'an Jiaotong University)

612 - Norm-in-Norm Loss with Faster Convergence and Better Performance for Image Quality Assessment

Dingquan Li (Peking University); Tingting Jiang (Peking University)*; Ming Jiang (Peking University)

639 - Coupling deep textural and shape features for sketch retrieval

Qi Jia (Dalian University of Technology); Xin Fan (Dalian University of Technology)*; Meiyu Yu (Didi Chuxing); Yuqing Liu (Dalian University of Technology); Dingrong Wang (Dalian University of Technology); Longin Jan Latecki (Temple University)

647 - Memory-Augmented Relation Network for Few-Shot Learning

He Jun (Hefei University of Technology)*; Richang Hong (Hefei University of Technology); Xueliang Liu (Hefei University of Technology); Mingliang Xu (Zhengzhou University); Zheng-Jun Zha (University of Science and Technology of China); Meng Wang (Hefei University of Technology)

668 - Performance Optimization of Federated Person Re-identification via Benchmark Analysis

Weiming Zhuang (Nanyang Technological University)*; Yonggang Wen (Nanyang Technological University); Xuesen Zhang (SenseTime); Xin Gan (SenseTime); Daiying Yin (SenseTime); Dongzhan Zhou (The University of Sydney); shuai zhang (Sensetime Ltd); Shuai Yi (SenseTime Group Limited)

691 - Guided Attention Network for Object Detection and Counting on Drones

CAI YuanQiang (UCAS); Dawei Du (University of Chinese Academy of Sciences); Libo Zhang (Institute of Software Chinese Academy of Sciences)*; Longyin Wen (JD Digit); Weiqiang Wang (University of Chinese Academy of Sciences); Yanjun Wu (Institute of Software Chinese Academy of Sciences ); Siwei Lyu (University at Albany)

696 - K-armed Bandit based Multi-Modal Network Architecture Search for Visual Question Answering

Yiyi Zhou (Xiamen University); Rongrong Ji (Xiamen University, China)*; Xiaoshuai Sun ( Xiamen University); Gen Luo (Xiamen University); Xiaopeng Hong (Xi'an Jiaotong University); Jinsong Su (Xiamen University); Xinghao Ding (Xiamen University); Ling Shao (Inception Institute of Artificial Intelligence)

701 - TextRay: Contour-based Geometric Modeling for Arbitrary-shaped Scene Text Detection

Fangfang Wang (Zhejiang University)*; Yifeng Chen (Zhejiang University); Fei Wu (Zhejiang University, China); Xi Li (Zhejiang University)

704 - Class-Aware Modality Mix and Center-Guided Metric Learning for Visible-Thermal Person Re-Identification

Yongguo Ling (Xiamen University)*; Zhun Zhong (University of Trento); Zhiming Luo (Xiamen University); Paolo Rota (University of Trento); Shaozi Li (Xiamen University, China); Nicu Sebe (University of Trento)

707 - Adversarial Graph Representation Adaptation for Cross-Domain Facial Expression Recognition

Yuan Xie (DarkMatter AI); Tianshui Chen (DarkMatter AI)*; Tao Pu (Sun Yat-sen University); Hefeng Wu (Sun Yat-sen University); Liang Lin (DarkMatter AI)

710 - Weakly Supervised Real-time Image Cropping based on Aesthetic Distributions

Peng Lu (Beijing University of Posts and Telecommunications)*; Jiahui Liu (Beijing University of Posts and Telecommunications); Xujun Peng (Information Sciences Institute, University of Southern California); Xiaojie Wang (Beijing University of Posts and Telecommunications)

732 - Towards Unsupervised Crowd Counting via Regression-Detection Bi-knowledge Transfer

Yuting Liu (Sichuan University)*; Zheng Wang (National Institute of Informatics); Miaojing Shi (King's College London); Shin'ichi Satoh (National Institute of Informatics); Qijun Zhao (Sichuan University); hongyu yang (sichuan university)

734 - KBGN: Knowledge-Bridge Graph Network for Adaptive Vision-Text Reasoning in Visual Dialogue

Xiaoze Jiang (Intelligent Computing & Machine Learning Lab, School of ASEE, Beihang University); Siyi Du (Intelligent Computing & Machine Learning Lab, School of ASEE, Beihang University); Zengchang Qin (Intelligent Computing & Machine Learning Lab, School of ASEE, Beihang University)*; Yajing Sun (Institute of Information Engineering,Chinese Academy of Sciences); Jing Yu ( Institute of Information Engineering,Chinese Academy of Sciences)

737 - Occluded Prohibited Items Detection: An X-ray Security Inspection Benchmark and De-occlusion Attention Module

Yanlu Wei (Beihang University); Renshuai Tao (Beihang University)*; Zhangjie Wu (Beihang University); Yuqing Ma (Beihang University); Libo Zhang (Institute of Software Chinese Academy of Sciences); Xianglong Liu (BUAA)

765 - Context-aware Attention Network for Predicting Image Aesthetic Subjectivity

Munan Xu (Shenzhen Graduate School, Peking University); Jia-Xing Zhong (School of Electronic and Computer Engineering, Peking University); Yurui Ren (Shenzhen Graduate School, Peking University); Shan Liu (Tencent America); Ge Li (SECE, Shenzhen Graduate School, Peking University)*

783 - PIDNet: An Efficient Network for Dynamic Pedestrian Intrusion Detection

Jingchen Sun (Zhejiang University); Jiming Chen (Zhejiang University); Tao Chen (Fudan University); jiayuan fan (Fudan University); Shibo He (Zhejiang University)*

787 - ChoreoNet: Torwards Music to Dance Synthesis with Choreographic Action Unit

Zijie Ye (Tsinghua University)*; Haozhe Wu (Tsinghua University); Jia Jia (Tsinghua University); Yaohua Bu (Tsinghua University); Wei Chen (Beijing Sougou Science and Technology Development Co., Ltd); Fanbo Meng (Sogou Corporation, Beijing, China); Yanfeng Wang ( Beijing Sougou Science and Technology Development Co., Ltd)

794 - Adversarial Video Moment Retrieval by Jointly Modeling Ranking and Localization

Da Cao (Hunan University)*; Yawen Zeng (Hunan University); Xiaochi Wei (Baidu Inc.); Liqiang Nie (Shandong University ); Richang Hong (Hefei University of Technology); Zheng Qin (Hunan University)

795 - Pose-native Network Architecture Search for Multi-person Human Pose Estimation

Qian Bao (AI Research of JD.com); Wu Liu (AI Research of JD.com)*; Jun Hong (AI Research of JD.com); Lingyu Duan (Peking University); Tao Mei (AI Research of JD.com)

814 - Cascade Grouped Attention Network for Referring Expression Segmentation

Gen Luo (Xiamen University); Rongrong Ji (Xiamen University, China)*; Yiyi Zhou (Xiamen University); Xiaoshuai Sun ( Xiamen University); Jinsong Su (Xiamen University); Chia-Wen Lin (National Tsing Hua University); Qi Tian (Huawei Cloud & AI)

816 - Temporally Guided Music-to-Body-Movement Generation

Hsuan-Kai Kao (Academia Sinica); Li Su (Academia Sinica)*

818 - Compositional Few-Shot Recognition with Primitive Discovery and Enhancing

Yixiong Zou (Peking University)*; Shanghang Zhang (UC Berkeley); Ke Chen (South China University of Technology); José M. F. Moura (Carnegie Mellon University); Yaowei Wang (PengCheng Laboratory); Yonghong Tian (Peking University)

830 - InteractGAN: Learning to Generate Human-Object Interaction

Chen Gao (Institute of Information Engineering, CAS)*; si liu (Beihang University); Defa Zhu (Institute of Information Engineering, CAS); Quan Liu (Beihang University); Jie Cao (Institute of Automation, Chinese Academy of Sciences); Haoqian He (Beihang University); Ran He (Institute of Automation, Chinese Academy of Sciences); Shuicheng Yan (YITU Tech)

867 - Reinforcement Learning for Weakly Supervised Temporal Grounding of Natural Language in Untrimmed Videos

Jie Wu (Sun Yat-sen University)*; Guanbin Li (Sun Yat-sen University); Xiaoguang Han (Shenzhen Research Institute of Big Data, the Chinese University of Hong Kong (Shenzhen)); Liang Lin (DarkMatter AI)

876 - Traffic-Aware Multi-Camera Tracking of Vehicles Based on ReID and Camera Link Model

Hung-Min Hsu (UW)*; Yizhou Wang (University of Washington); Jenq-Neng Hwang (University of WA�)

893 - VONAS: Network Design in Visual Odometry using Neural Architecture Search

Xing Cai (Peking University); Lanqing Zhang (Peking University); Chengyuan Li (Peking University); Ge Li (SECE, Shenzhen Graduate School, Peking University); Thomas H Li (Advanced Institute of Information Technology, Peking University)*

921 - Category-specific Semantic Coherency Learning for Fine-grained Image Recognition

Shijie Wang (Dalian University of Technology); zhihui wang (Dalian University of Technology); Haojie Li (Dalian University of Technology)*; Wanli Ouyang (The University of Sydney)

961 - Poet: Product-oriented Video Captioner for E-commerce

Shengyu Zhang (Zhejiang University)*; Ziqi Tan (Zhejiang University); Jin Yu (Alibaba Group); Zhou Zhao (Zhejiang University); Kun Kuang (Zhejiang University); jie liu (Alibaba); Jingren Zhou (Alibaba Group); Hongxia Yang (Alibaba Group); Fei Wu (Zhejiang University, China)

976 - Beyond the Attention: Distinguish the Discriminative and Confusable Features For Fine-grained Image Classification

Xiruo Shi (Beijing University of Posts and Telecommunications ); Liutong Xu (Beijing University of Posts and Telecommunications); Pengfei Wang (School of Computer Science, Beijing University of Posts and Telecommunications); Yuanyuan Gao (Beihang Univeristy); Haifang Jian (Institute of Semiconductors, Chinese Academy of Sciences); Wu Liu (AI Research of JD.com)*

977 - BlockMix: Meta Regularization and Self-Calibrated Inference for Metric-Based Meta-Learning

Hao Tang (Nanjing University of Science and Technology); Zechao Li (Nanjing University of Science and Technology)*; Zhimao Peng (Nanjing University of Science and Technology); Jinhui Tang (Nanjing University of Science and Technology)

980 - Structural Semantic Adversarial Active Learning for Image Captioning

Beichen Zhang (University of Chinese Academy of Sciences)*; liang li (Institute of Computing Technology, Chinese Academy of Sciences); Li Su (University of Chinese Academy of Sciences); Shuhui Wang (VIPL,ICT,Chinese academic of science); Jincan Deng (Institute of Computing Technology, Chinese Academy of Sciences); Zheng-Jun Zha (University of Science and Technology of China); Qingming Huang (University of Chinese Academy of Sciences)

988 - Scene-Aware Context Reasoning for Unsupervised Abnormal Event Detection in Videos

Che Sun (Beijing Institute of Technology); Yunde Jia (Beijing Institute of Technology); Yao Hu (Alibaba Youku Cognitive and Intelligent Lab); Yuwei WU (Beijing Institute of Technology (BIT), China)*

1002 - Active Object Search

Jie Wu (Sun Yat-sen University)*; Tianshui Chen (DarkMatter AI); Lishan Huang (Sun Yat-Sen University); Hefeng Wu (Sun Yat-sen University); Guanbin Li (Sun Yat-sen University); Ling Tian (University of Electronic Science and Technology of China); Liang Lin (DarkMatter AI)

1009 - Deep-Modal: Real-Time Impact Sound Synthesis for Arbitrary Shapes

Xutong Jin (Peking University); Sheng Li (Peking University)*; Tianshu Qu (Peking University); Dinesh Manocha (UMD); Guoping Wang (Peking University)

1011 - Fine-grained Feature Alignment with Part Perspective Transformation for Vehicle ReID

Dechao Meng (vipl,ict,Chinese academic of science)*; Liang Li (Chinese Academy of Sciences); Shuhui Wang (VIPL,ICT,Chinese academic of science); Xingyu Gao (Chinese Academy of Sciences); Zheng-Jun Zha (University of Science and Technology of China); Qingming Huang (University of Chinese Academy of Sciences)

1035 - Transformer-based Label Set Generation for Multi-modal Multi-label Emotion Detection

xincheng Ju (Soochow University)*; Dong Zhang (Soochow University); Junhui Li (Soochow University); Zhou Guodong (Soochow University)

1038 - Beyond the Parts: Learning Multi-view Cross-part Correlation for Vehicle Re-identification

Xinchen Liu (AI Research of JD.com); Wu Liu (AI Research of JD.com)*; Jinkai Zheng (Hangzhou Dianzi University); Chenggang Yan (Hangzhou Dianzi University); Tao Mei (AI Research of JD.com)

1064 - Look, Read and Feel: Benchmarking Ads Understanding with Multimodal Multitask Learning

Huaizheng Zhang (Nanyang Technological University)*; YONG LUO (Nanyang Technological University); Qiming Ai (Nanyang Technological University); Han Hu (Beijing Institute of Technology, China); Yonggang Wen (Nanyang Technological University)

1075 - Light Field Super-resolution via Attention-Guided Fusion of Hybrid Lenses

Jing Jin (City University of Hong Kong); Junhui Hou (City University of Hong Kong, Hong Kong)*; Jie Chen (Hong Kong Baptist University); Sam Kwong (City Univeristy of Hong Kong); Jingyi Yu (Shanghai Tech University)

1147 - Compact Bilinear Augmented Query Structured Attention for Sport Highlights Classification

Yanbin Hao (City University of Hong Kong); Hao Zhang (City University of Hong Kong)*; Chong-Wah Ngo (City University of Hong Kong); Qiang Liu (DeepAIT (Hong Kong) Limited); Xiaojun Hu (DeepAIT (Hong Kong) Limited)

1195 - Semantic Image Analogy with a Conditional Single-Image GAN

Jiacheng Li (University of Science and Technology of China); Zhiwei Xiong (University of Science and Technology of China)*; Dong Liu (University of Science and Technology of China); Xuejin Chen (University of Science and Technology of China); Zheng-Jun Zha (University of Science and Technology of China)

1196 - Trajectory Prediction in Heterogeneous Environment via Attended Ecology Embedding

Wei-Cheng Lai (National Chiao Tung University); Zi-Xiang Xia (National Chiao Tung University); Hao-Siang Lin (National Chiao Tung University); Lien-Feng Hsu (National Chiao Tung University); Hong-Han Shuai (National Chiao Tung University); I-Hong Jhuo (IBM); Wen-Huang Cheng (EE, NCTU)*

1214 - A Structured Graph Attention Network for Vehicle Re-Identification

Yangchun Zhu (University of Science and Technology of China)*; Zheng-Jun Zha (University of Science and Technology of China); Tianzhu Zhang (University of Science and Technology of China); Jiawei Liu (University of Science and Technology of China); Jiebo Luo (U. Rochester)

1224 - Scoring High: Analysis and Prediction of Viewer Behavior and Engagement in the Context of 2018 FIFA WC Live Streaming

Nikolas Wehner (University of Würzburg)*; Michael Seufert (University of Würzburg); Sebastian Egger-Lampl (AIT Austrian Institute of Technology GmbH); Bruno Gardlo (AIT Austrian Institute of Technology GmbH); Pedro Casas (AIT Austrian Institute of Technology GmbH); Raimund Schatz (AIT)

1275 - Text-Guided Neural Image Inpainting

Lisai Zhang (Harbin Institute of Technology, Shenzhen)*; Qingcai Chen ( Harbin Institute of Technology, Shenzhen); Baotian Hu (Harbin Institute of Technology, Shenzhen); Shuoran Jiang (Harbin Institute of Technology, Shenzhen)

1319 - Weakly-supervised Image Hashing through Masked Visual Semantic Graph Reasoning

Lu Jin (Nanjing University of Science and Technology ); Zechao Li (Nanjing University of Science and Technology)*; Yonghua Pan (Nanjing University of Science and Technology); Jinhui Tang (Nanjing University of Science and Technology)

1344 - Semantic Consistency Guided Instance Feature Alignment for 2D Image-Based 3D Shape Retrieval

Heyu Zhou (Tianjin University, China); Weizhi Nie (Tianjin University)*; Dan Song (Tianjin University); Nian Hu (Tianjin University); Xuanya Li (Baidu); An-An Liu (Tianjin University)

1347 - Performance over Random: A robust evaluation protocol for video summarization methods

Evlampios Apostolidis (QMUL & CERTH-ITI)*; Eleni Adamantidou (CERTH); Alexandros I Metsai (CERTH-ITI); Vasileios Mezaris (Information Technologies Institute, Centre for Research and Technology Hellas, Greece); Ioannis Patras (Queen Mary University of London)

1355 - ARSketch: Sketch-Based User Interface for Augmented Reality Glasses

Zhaohui Zhang (Rokid); Haichao Zhu (The Chinese University of Hong Kong)*; Qian Zhang (California University, Los Angeles)

1367 - Text-Embedded Bilinear Model for Fine-Grained Visual Recognition

Liang Sun (University of Electronic Science and Technology of China); Xiang Guan (University of Electronic Science and Technology of China); Yang Yang (University of Electronic Science and Technology of China)*; Lei Zhang (Chongqing University)

1384 - Learning Scales from Points: A Scale-aware Probabilistic Model for Crowd Counting

Zhiheng Ma (Xi'an Jiaotong University)*; Xing Wei (Xi'an Jiaotong University); Xiaopeng Hong (Xi'an Jiaotong University); Yihong Gong (Xi'an Jiaotong University)

1394 - Learning Global Structure Consistency for Robust Object Tracking

Bi Li (Huazhong University of Science and Technology); Chengquan Zhang (Baidu Inc); Zhibin Hong (Baidu Inc.); Xu Tang (Baidu); jingtuo liu (baidu); Junyu Han (Baidu Inc.); Errui Ding (Baidu Inc.); Wenyu Liu (Huazhong University of Science and Technology)*

1399 - RGB2LIDAR: Towards Solving Large-Scale Cross-Modal Visual Localization

Niluthpol c Mithun (SRI International)*; Karan Sikka (SRI International); Han-Pang Chiu (SRI International); Supun Samarasekera (SRI International); Rakesh Kumar (SRI International)

1418 - Multimodal Representation with Embedded Visual Guiding Objects for Named Entity Recognition in Social Media Posts

Zhiwei Wu (School of Software Engineering, South China University of Technology); Changmeng Zheng (South China University of Technology); Yi Cai (School of Software Engineering, South China University of Technology)*; Junying Chen (South China University of Technology); Ho-fung Leung (The Chinese University of Hong Kong); Qing Li (The Hong Kong Polytechnic University)

1453 - Contextual Multi-Scale Feature Learning for Person Re-Identification

Baoyu Fan (Inspur Electronic Information Industry Co.,Ltd.); Li Wang (inspur)*; Runze Zhang (Inspur Electronic Information Industry Co.,Ltd.); Zhenhua Guo (Inspur Electronic Information Industry Co.,Ltd.); Yaqian Zhao (Inspur); Rengang Li (Inspur); Weifeng Gong ( Inspur Electronic Information Industry Co.,Ltd.)

1456 - Campus3D: A Photogrammetry Point Cloud Benchmark for Hierarchical Understanding of Outdoor Scene

Xinke Li (National University of Singapore); Chongshou Li (National University of Singapore)*; Zekun Tong (National University of Singapore); Andrew Lim (National University of Singapore); Junsong Yuan ("State University of New York at Buffalo, USA"); Yuwei Wu (National University of Singapore); Jing Tang (National University of Singapore); Raymond Huang (National University of Singapore)

1473 - Space-Time Video Super-Resolution using Temporal Profiles

Zeyu Xiao (University of Science and Technology of China); Zhiwei Xiong (University of Science and Technology of China)*; Xueyang Fu (University of Science and Technology of China); Dong Liu (University of Science and Technology of China); Zheng-Jun Zha (University of Science and Technology of China)

1493 - Pop Music Transformer: Beat-based Modeling and Generation of Expressive Pop Piano Compositions

Yu-Siang Huang (Academia Sinica)*; Yi-Hsuan Yang (Academia Sinica)

1541 - MISA: Modality-Invariant and -Specific Representations for Multimodal Sentiment Analysis

Devamanyu Hazarika (NUS, Singapore)*; Roger Zimmermann (NUS); Soujanya Poria (Singapore University of Technology and Design)

1549 - Instability of Successive Deep Image Compression

Jun-Hyuk Kim (Yonsei University); Soobeom Jang (Yonsei University); Jun-Ho Choi (Yonsei University); Jong-Seok Lee ("Yonsei University, Korea")*

1570 - DeepFacePencil: Creating Face Images from Freehand Sketches

Yuhang Li (University of Science and Technology of China); Xuejin Chen (University of Science and Technology of China)*; Binxin Yang (University of Science and Technology of China); Zihan Chen (University of Science and Technology of China); Zhihua Cheng (University of Science and Technology of China); Zheng-Jun Zha (University of Science and Technology of China)

1576 - ALANET: Adaptive Latent Attention Network for Joint Video Deblurring and Interpolation

Akash Gupta (University of California, Riverside)*; Abhishek Aich (University of California, Riverside); Amit K. Roy-Chowdhury (University of California, Riverside)

1595 - CM-BERT: Cross-Modal BERT for Text-Audio Sentiment Analysis

Kaicheng Yang (Hebei University Of Science and Technology); Hua Xu (State Key Laboratory of Intelligent Technology and Systems, Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China)*; kai gao (Hebei University Of Science and Technology)

1598 - Single-Shot Two-Pronged Detector with Rectified IoU Loss

Keyang Wang (chongqing university); Lei Zhang (Chongqing University)*

1612 - Object-level Attention for Aesthetic Rating Distribution Prediction

Jingwen Hou (Nanyang Technological University)*; Sheng Yang (Nanyang Technological University); Weisi Lin (Nanyang Technological University, Singapore)

1633 - Not made for each other - Audio-Visual Dissonance-based Deepfake Detection and Localization

Komal Chugh (Indian Institute of Technology Ropar); Parul Gupta (Indian Institute of Technology Ropar); Abhinav Dhall (Monash University)*; Ramanathan Subramanian (Indian Institute of Technology Ropar)

1656 - Make your favorite music curative: music style transfer for anxiety reduction

Zhejing Hu (The Hong Kong Polytechnic University); Yan Liu (The Hong Kong Polytechnic University)*; Gong Chen (The Hong Kong Polytechnic University); Sheng-hua Zhong (Shenzhen University); Aiwei Zhang (St. Paul’s Co-educational College)

1685 - Hearing like Seeing: Improving Voice-Face Interactions and Associations via Adversarial Deep Semantic Matching Network

Kai Cheng (Huaqiao University); Xin Liu (Huaqiao University)*; Yiu-ming CHEUNG (Hong Kong Baptist University); Rui Wang (Huaqiao University); Xing Xu (University of Electronic Science and Technology of China); Bineng Zhong (Huaqiao University)

1702 - Concept Drift Detection for Multivariate Data Streams and Temporal Segmentation of Daylong Egocentric Videos

Pravin Nagar (IIIT Delhi)*; Mansi Khemka (Columbia University); Chetan Arora (Indian Institute of Technology Delhi)

1708 - Dynamic Context-guided Capsule Network for Multimodal Machine Translation

Huan Lin (Xiamen University)*; Fandong Meng (Tencent WeChat AI - Pattern Recognition Center Tencent Inc.); Jinsong Su (Xiamen University); Yongjing Yin (Xiamen University); Zhengyuan Yang (University of Rochester); Yubin Ge (University of Illinois at Urbana-Champaign); Jie Zhou (Tencent); Jiebo Luo (U. Rochester)

1710 - DeepSonar: Towards Effective and Robust Detection of AI-Synthesized Fake Voices

Run Wang (Nanyang Technological University)*; Felix Juefei-Xu (Alibaba Group); Yihao Huang (East China Normal University); Qing Guo (Nanyang Technological University); Xiaofei Xie (Nanyang Technological University); Lei Ma (Kyushu University); Yang Liu (Nanyang Technology University, Singapore)

1717 - RIRNet: Recurrent-In-Recurrent Network for Video Quality Assessment

Pengfei Chen (Xidian University / China University of Mining and Technology); Leida Li (Xidian University)*; Lei Ma (Hangzhou Multi-Color Optoelctronics Co., Ltd.); Jinjian Wu (Xidian University); Guangming Shi (Xidian University)

1719 - Black Re-ID: A Head-shoulder Descriptor for the Challenging Problem of Person Re-Identification

BOQIANG XU (University of Chinese Academy of Sciences；Institute of Automation，Chinese Academy of Sciences)*; Lingxiao He (AI Research of JD.com); Xingyu Liao (AI Research of JD.com); Wu Liu (AI Research of JD.com); Zhenan Sun (Chinese of Academy of Sciences); Tao Mei (AI Research of JD.com)

1722 - PopMAG: Pop Music Accompaniment Generation

Yi Ren (Zhejiang University)*; Jinzheng He (Zhejiang University); Xu Tan (Microsoft Research Asia); Tao Qin (Microsoft Research Asia); Zhou Zhao (Zhejiang University); Tie-Yan Liu (Microsoft)

1729 - PCPL: Predicate-Correlation Perception Learning for Unbiased Scene Graph Generation

Shaotian Yan (Zhejiang University)*; Chen Shen (Alibaba Group); Zhongming Jin (Alibaba Group); Jianqiang Huang (Alibaba Group); Rongxin Jiang (Zhejiang University); Yaowu Chen (Zhejiang University); Xian-Sheng Hua (Alibaba Group)

1761 - Differentiable Manifold Reconstruction for Point Cloud Denoising

Shitong Luo (Peking University)*; Wei Hu (Peking University)

1775 - Discriminative Spatial Feature Learning for Person Re-Identification

Peixi Peng (Peking University)*; Yonghong Tian (Peking University); Yangru Huang (Beijing University); Xiangqian Wang (Huawei); Huilong An (AI Application Research Center)

1781 - FakePolisher: Making DeepFakes More Detection-Evasive by Shallow Reconstruction

Yihao Huang (East China Normal University)*; Felix Juefei-Xu (Alibaba Group); Run Wang (Nanyang Technological University); Qing Guo (Nanyang Technological University); Lei Ma (Kyushu University); Xiaofei Xie (Nanyang Technological University); Jianwen Li (East China Normal University); Weikai Miao (East China Normal University); Yang Liu (Nanyang Technology University, Singapore); Geguang Pu (East China Normal University)

1784 - SalGCN: Saliency Prediction for 360-Degree Images Based on Spherical Graph Convolutional Networks

Haoran Lv (Shanghai Jiao Tong University)*; Qin Yang (Shanghai Jiao Tong University); Chenglin Li (Shanghai Jiao Tong University); Wenrui Dai (Shanghai Jiao Tong University); Junni Zou (Shanghai Jiao Tong University); Hongkai Xiong (Shanghai Jiao Tong University)

1800 - AdaHGNN: Adaptive Hypergraph Neural Networks for Multi-Label Image Classification

Xiangping Wu (Harbin Institute of Technology, Shenzhen); Qingcai Chen ( Harbin Institute of Technology, Shenzhen)*; Wei Li (Harbin Institute of Technology, Shenzhen); Yulun Xiao (Harbin Institute of Technology, Shenzhen); Baotian Hu (University of Massachusetts)

1828 - Reinforced Similarity Learning: Siamese Relation Networks for Robust Object Tracking

Dawei Zhang (Zhejiang Normal University)*; Zhonglong Zheng (Zhejiang Normal University); Minglu Li (Zhejiang Normal University); Xiaowei He (Zhejiang Normal University); Tianxiang Wang (Zhejiang Normal University); Liyuan Chen (Zhejiang Normal University); Riheng Jia (Zhejiang Normal University); Feilong Lin (Zhejiang Normal University)

1832 - AffectI: A Game for Diverse, Reliable, and Efficient Affective Image Annotation

xingkun zuo (University of Yamanashi); Jiyi Li (University of Yamanashi / RIKEN AIP); qili zhou (hangzhou dianzi university); jianjun li (HangZhou Dianzi University); Xiaoyang mao (University of Yamanashi)*

1837 - Cognitive Representation Learning of Self-Media Online Article Quality

Yiru Wang (Tencent Inc.; Tsinghua University)*; Shen Huang (Tencent Inc.); Gongfu Li (Tencent Inc.); Qiang Deng (Tencent Inc.); Dongliang Liao (Data Quality Team, WeChat, Tencent Inc., China); Pengda Si (Tsinghua University); Yujiu Yang (Tsinghua University); Jin Xu (Tencent Inc.)

1852 - Describing Subjective Experiment Consistency by p-value qq-plot

Jakub Nawała (AGH University of Science and Technology)*; Lucjan Janowski (AGH University of Science and Technology); Bogdan Ćmiel (); Krzysztof Rusek (AGH University of Science and Technology)

1859 - Deep Structural Contour Detection

Ruoxi Deng (Central South University)*; Shengjun Liu (Central South University)

1874 - Multimodal Multi-Task Financial Risk Forecasting

Ramit Sawhney (Netaji Subhas Institute of Technology)*; Puneet Mathur (University of Maryland, College Park); Ayush Mangal (IIT Roorkee); Piyush Khanna (Delhi Technological University); Rajiv Ratn Shah ("Indraprastha Institute of Information Technology, Delhi"); Roger Zimmermann (NUS)

1893 - Cross-modal Non-linear Guided Attention and TemporalCoherence in Multi-modal Deep Video Models

Saurabh Sahu (); Palash Goyal (Samsung Research); Shalini Ghosh (Samsung Research)*; Chul Lee (Samsung Research America)

1946 - Multi-modal Cooking Workflow Construction for Food Recipes

Liang-Ming Pan (National University of Singapore)*; Jingjing Chen (Fudan University); Jianlong Wu (Fudan University); Shaoteng Liu (Xi'an Jiaotong University); Chong-Wah Ngo (City University of Hong Kong); Min-Yen Kan (National University of Singapore); Yu-Gang Jiang (Fudan University); Tat-Seng Chua (National university of Singapore)

1950 - Distributed Multi-agent Video Fast-forwarding

Shuyue Lan (Northwestern University)*; Zhilu Wang (Northwestern University); Amit K. Roy-Chowdhury (University of California, Riverside); Ermin Wei (); Zhu Qi (Northwestern University)

1988 - IR-GAN: Image Manipulation with Linguistic Instruction by Increment Reasoning

Zhenhuan Liu (Institute of Computing Technology, Chinese Academy of Sciences); liang li (Institute of Computing Technology, Chinese Academy of Sciences)*; Shaofei Cai (Institute of Computing Technology, Chinese Academy of Sciences); Jincan Deng (Institute of Computing Technology, Chinese Academy of Sciences); Qianqian Xu (Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences); Shuhui Wang (VIPL,ICT,Chinese academic of science); Qingming Huang (University of Chinese Academy of Sciences)

1994 - LIGHTEN: Learning Interactions with Graph and Heirarchical TEmporal Networks for HOI in videos

Sai Praneeth Reddy Sunkesula (Indian Institute of Technology, Bombay)*; Rishabh Dabral (IIT Bombay); Ganesh Ramakrishnan (IIT Bombay)

2014 - BS-MCVR: Binary-sensing based Mobile-cloud Visual Recognition

Hongyi Zheng (The Hong Kong Polytechnic University); Lei Zhang ("Hong Kong Polytechnic University, Hong Kong, China")*

2017 - Depth Guided Adaptive Meta-Fusion Network for Few-shot Video Recognition

Yuqian Fu (Fudan University)*; Yanwei Fu (Fudan University); junke wang (Fudan University); Li Zhang (University of Oxford); Xing Zhang (Fudan University); Yu-Gang Jiang (Fudan University)

2030 - Learning Modality-Invariant Latent Representations for Generalized Zero-shot Learning

Jingjing Li (University of Electronic Science and Technology of China)*; Mengmeng Jing (University of Electronic Science and Technology of China); Lei Zhu (Shandong Normal Unversity); Zhengming Ding (Indiana University-Purdue University Indianapolis); Ke Lu (University of Electronic Science and Technology of China); Yang Yang (University of Electronic Science and Technology of China)

2032 - When Bitstream Prior Meets Deep Prior: Compressed Video Super-resolution with Learning from Decoding

Peilin Chen (City University of Hong Kong)*; Wenhan Yang (City University of Hong Kong); Long Sun (Huawei); Shiqi Wang (CityU)

2035 - Describe What to Change: A Text-guided Unsupervised Image-to-image Translation Approach

Yahui Liu (University of Trento); Marco De Nadai (Fondazione Bruno Kessler)*; Deng Cai (The Chinese University of Hong Kong); Huayang Li (Tencent AI Lab); Xavier Alameda-Pineda (INRIA); Nicu Sebe (University of Trento); Bruno Lepri (FBK, Trento, Italy)

2052 - Increasing Video Perceptual Quality with GANs and Semantic Coding

Leonardo Galteri (University of Florence); Marco Bertini (University of Florence)*; Lorenzo Seidenari (University of Florence); Tiberio Uricchio (University of Florence); Alberto Del Bimbo (University of Florence)

2053 - Attentive One-Dimensional Heatmap Regression for Facial Landmark Detection and Tracking

Shi Yin (University of Science and Technology of China); Shangfei Wang (University of Science and Technology of China)*; Xiaoping Chen (University of Science and Technology of China); Enhong Chen (University of Science and Technology of China); Cong Liang (University of Science and Technology of China)

2071 - Fine-Grained Similarity Measurement between Educational Videos and Exercises

Xin Wang (University of Science and Technology of China); Wei Huang (University of Science and Technology of China); Qi Liu (" University of Science and Technology of China, China")*; Yu Yin (University of Science and Technology of China); Zhenya Huang (University of Science and Technology of China ); Le Wu (Hefei University of Technology); Jianhui Ma (University of Science and Technology of China); Xue Wang (Nankai University)

2073 - One-shot Text Field labeling using Attention and Belief Propagation for Structure information extraction

Mengli Cheng (Alibaba Group)*; Minghui Qiu (Alibaba)

2081 - GRAD: Learning for Overhead-aware Adaptive Video Streaming with Scalable Video Coding

Yunzhuo Liu (Shanghai Jiao Tong University); Bo Jiang (Shanghai Jiao Tong University)*; Tian Guo (Worcester Polytechnic Institute); Ramesh K. Sitaraman (UMass Amherst & Akamai Technologies); Don Towsley (University of Massachusetts Amherst); Xinbing Wang (Shanghai Jiao Tong University)

2088 - Down to the Last Detail: Virtual Try-on with Fine-grained Details

Jiahang Wang (Huazhong University of Science and Technology)*; Tong Sha (Beihang University); Wei Zhang (JD AI Research); Zhoujun Li (Beihang University); Tao Mei (AI Research of JD.com)

2151 - Reduce the Influence of Stability in Content Delivery Network via Learning-Based Caching Algorithm

Gang Yan (Binghamton University-SUNY); Jian Li (Binghamton University-SUNY )*

2158 - Temporal Denoising Mask Synthesis Network for Learning Blind Video Temporal Consistency

Yifeng Zhou (University of Electronic Science and Technology of China); Xing Xu (University of Electronic Science and Technology of China)*; Fumin Shen (UESTC); Lianli Gao (The University of Electronic Science and Technology of China); Huimin Lu (Kyushu Institute of Technology); Heng Tao Shen (University of Electronic Science and Technology of China (UESTC))

2174 - INCLUDE: A Large Scale Dataset for Indian Sign Language Recognition

Advaith Sridhar (IIT Madras)*; Rohith Gandhi G (IIT Madras); Pratyush Kumar (IIT Madras); Mitesh Khapra (IIT Madras)

2205 - A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild

Prajwal K R (International Institute of Information Technology, Hyderabad)*; Rudrabha Mukhopadhyay (IIIT Hyderabad); Vinay Namboodiri (University of Bath); C.V. Jawahar (IIIT-Hyderabad)

2237 - Efficient adaptation of neural network filter for video compression

Yat-Hong Lam (Nokia Technologies)*; Alireza Zare (Nokia Technologies); Francesco Cricri (Nokia Technologies); Jani Lainema (Nokia); Miska Hannuksela (Nokia Technologies)

2246 - An Analysis of Delay in Live 360° Video Streaming Systems

Jun Yi (Georgia State University)*; Md Reazul Islam (Georgia State University); Shivang Aggarwal (University at Buffalo, The State University of New York); Dimitrios Koutsonikolas (SUNY Buffalo); Y. Charlie Hu (Purdue University); Zhisheng Yan (Georgia State University)

2249 - Adaptive Temporal Triplet-loss for Cross-modal Embedding Learning

David Semedo (Universidade NOVA de Lisboa)*; Joao Magalhaes (Universidade NOVA Lisboa)

2257 - SonoSpace: Visual Feedback of Timbre with Unsupervised Learning

Naoki Kimura (The University of Tokyo)*; Keisuke Shiro (The University of Tokyo); Yota Takakura (Innoqua Inc.); Hiromi Nakamura (The University of Tokyo); Jun Rekimoto (The Univertsity of Tokyo)

2264 - Amora: Black-box Adversarial Morphing Attack

Run Wang (Nanyang Technological University)*; Felix Juefei-Xu (Alibaba Group); Qing Guo (Nanyang Technological University); Yihao Huang (East China Normal University); Xiaofei Xie (Nanyang Technological University); Lei Ma (Kyushu University); Yang Liu (Nanyang Technology University, Singapore)

2323 - Single Image Deraining via Scale-space Invariant Attention Neural Network

Bo Pang (Harbin Institute of Technology); Deming Zhai (Harbin Institute of Technolgy); Junjun Jiang (Harbin Institute of Technology); Xianming Liu (Harbin Institute of Technology)*

2342 - Concept-based Explanation for Fine-grained Images and Its Application in Infectious Keratitis Classification

Zhengqing Fang (Zhejiang University)*; Kun Kuang (Zhejiang University); Yuxiao Lin (Zhejiang University); Fei Wu (Zhejiang University); Yufeng Yao (Zhejiang University)

2448 - Visual Relation of Interest Detection

Fan Yu (Nanjing University); Haonan Wang (Nanjing University); Tongwei Ren (Nanjing University)*; Jinhui Tang (Nanjing University of Science and Technology); Gangshan Wu (Nanjing University)

你可能感兴趣的:(随笔,deep,learning,深度学习,机器学习)

机器学习与深度学习间关系与区别 ℒℴѵℯ心·动ꦿ໊ོ꫞ 人工智能学习深度学习 python
一、机器学习概述定义机器学习（MachineLearning,ML）是一种通过数据驱动的方法，利用统计学和计算算法来训练模型，使计算机能够从数据中学习并自动进行预测或决策。机器学习通过分析大量数据样本，识别其中的模式和规律，从而对新的数据进行判断。其核心在于通过训练过程，让模型不断优化和提升其预测准确性。主要类型1.监督学习（SupervisedLearning）监督学习是指在训练数据集中包含输入
随笔 | 仙一般的灵气海思沧海
仙岛今天，我看了你全部，似乎已经进入你的世界我不知道，这是否是梦幻，还是你仙一般的灵气吸引了我也许每一个人都要有一份属于自己的追求，这样才能够符合人生的梦想，生活才能够充满着阳光与快乐我不知道，我为什么会这样的感叹，是在感叹自己的人生，还是感叹自己一直没有孜孜不倦的追求只感觉虚度了光阴，每天活在自己的梦中，活在一个不真实的世界是在逃避自己，还是在逃避周围的一切有时候我嘲笑自己，嘲笑自己如此的虚无，
将cmd中命令输出保存为txt文本文件落难Coder Windows cmd window
最近深度学习本地的训练中我们常常要在命令行中运行自己的代码，无可厚非，我们有必要保存我们的炼丹结果，但是复制命令行输出到txt是非常麻烦的，其实Windows下的命令行为我们提供了相应的操作。其基本的调用格式就是：运行指令>输出到的文件名称或者具体保存路径测试下，我打开cmd并且ping一下百度：pingwww.baidu.com>./data.txt看下相同目录下data.txt的输出：如果你再
数字里的世界17期：2021年全球10大顶级数据中心，中国移动榜首张三叨
你知道吗？2016年，全球的数据中心共计用电4160亿千瓦时，比整个英国的发电量还多40％！前言每天，我们都会创造超过250万TB的数据。并且随着物联网（IOT）的不断普及，这一数据将持续增长。如此庞大的数据被存储在被称为“数据中心”的专用设施中。虽然最早的数据中心建于20世纪40年代，但直到1997-2000年的互联网泡沫期间才逐渐成为主流。当前人类的技术，比如人工智能和机器学习，已经将我们推向
nosql数据库技术与应用知识点皆过客，揽星河 NoSQL nosql 数据库大数据数据分析数据结构非关系型数据库
Nosql知识回顾大数据处理流程数据采集(flume、爬虫、传感器)数据存储(本门课程NoSQL所处的阶段)Hdfs、MongoDB、HBase等数据清洗(入仓)Hive等数据处理、分析(Spark、Flink等)数据可视化数据挖掘、机器学习应用(Python、SparkMLlib等)大数据时代存储的挑战(三高)高并发(同一时间很多人访问)高扩展(要求随时根据需求扩展存储)高效率(要求读写速度快)
Python开发常用的三方模块如下：换个网名有点难 python 开发语言
Python是一门功能强大的编程语言，拥有丰富的第三方库，这些库为开发者提供了极大的便利。以下是100个常用的Python库，涵盖了多个领域：1、NumPy，用于科学计算的基础库。2、Pandas，提供数据结构和数据分析工具。3、Matplotlib，一个绘图库。4、Scikit-learn，机器学习库。5、SciPy，用于数学、科学和工程的库。6、TensorFlow，由Google开发的开源机
Python实现简单的机器学习算法 master_chenchengg python python 办公效率 python开发 IT
Python实现简单的机器学习算法开篇：初探机器学习的奇妙之旅搭建环境：一切从安装开始必备工具箱第一步：安装Anaconda和JupyterNotebook小贴士：如何配置Python环境变量算法初体验：从零开始的Python机器学习线性回归：让数据说话数据准备：从哪里找数据编码实战：Python实现线性回归模型评估：如何判断模型好坏逻辑回归：从分类开始理论入门：什么是逻辑回归代码实现：使用skl
祭坛随笔阿门不热
街角右拐，便是北宋的祠堂。平日里冉冉的佛香被雨水打湿了，一地枯黄的银杏显得平静哀伤，如同一地被踩碎的阳光。我喜欢在这样的阴暗里吞噬古代的讯息，那遥远的来自过去的历史风潮。谢却茶扉，轻轻地抚上墙壁，寒风不御，无数深浅的纹路交织在心底，如同一把古琴不堪重负的尾音。寂寞锁朱门，香客们已是三三两两，巨大的雨帘让天空失掉了颜色，灰蒙蒙掉在阁楼一角，沉稳不惊地暗下去，再暗下去......古树上红色的挂牌像一块
《吹牛大王历险记》读书随笔赵炳森
这本书的作者是埃·拉斯伯戈·毕尔格。（没查到相关内容，好像他只写过《吹牛大王历险记》。）最让人百思不得其解的是他居然能自己拉自己的辫子出泥潭？！我觉得自己拉自己的辫子只会把自己的辫子拉断，而不会飞出泥潭。（问:图片中底下的屁股为什么插了一根钢针？）屁股底下居然有根钢针？在泥潭应该是滑滑的吧，可是他怎么能夹紧马肚呢？马肚子应该是在马的下方。还有如果能从泥潭里把连人带马都给拽出来的话，他力气肯定很大，
JavaScript 中，深拷贝（Deep Copy）和浅拷贝（Shallow Copy）跳房子的前端前端面试 javascript 开发语言 ecmascript
在JavaScript中，深拷贝（DeepCopy）和浅拷贝（ShallowCopy）是用于复制对象或数组的两种不同方法。了解它们的区别和应用场景对于避免潜在的bugs和高效地处理数据非常重要。以下是对深拷贝和浅拷贝的详细解释，包括它们的概念、用途、优缺点以及实现方式。1.浅拷贝（ShallowCopy）概念定义：浅拷贝是指创建一个新的对象或数组，其中包含了原对象或数组的基本数据类型的值和对引用数
樵夫随笔 NO.1146吓了公交司机一大跳痴信不改一书生
傍晚，我把公交司机吓了一大跳！下班回家路上，先在公交车上读了会儿书，又写了篇文章，还有大约10分钟才到站，于是，靠在座椅上小眯一会儿。这一眯不要紧，直接眯到了终点站！而且，除我以外的所有人都下车后，司机直接关掉车厢内的灯，紧接着下车，关门儿，准备去厕所。这时，我被惊醒，拍打着玻璃，大喊“师傅……师傅……”司机师傅打开车门后的第一句话就是：“你可把我吓得够呛！”说说当时的情景：终点站设在一破旧的小院
遥感影像的切片处理 sand&wich 计算机视觉 python 图像处理
在遥感影像分析中，经常需要将大尺寸的影像切分成小片段，以便于进行详细的分析和处理。这种方法特别适用于机器学习和图像处理任务，如对象检测、图像分类等。以下是如何使用Python和OpenCV库来实现这一过程，同时确保每个影像片段保留正确的地理信息。准备环境首先，确保安装了必要的Python库，包括numpy、opencv-python和xml.etree.ElementTree。这些库将用于图像处理
随笔（探悟）杰语唱响
亲兄弟姐妹之间不来往，其实不就是吃亏的人不想吃亏了，或者是占便宜的人占不到便宜了，从此就断绝了来往，互不搭理了。那些极度自私的人，是最不讲道理的，无论他们遇到什么事，都要让他占到便宜。顺着他的心情才行。否则那就是别人的不对。有这种想法的人，一看就是个穷命，那些个处处想着占便宜的人，老是想多要点，原因就是我没有吗，什么都没有，德行也没有，财富也没有，你说他的命怎么可能会好。家庭不和，兄弟姐妹断绝来往
推荐3家毕业AI论文可五分钟一键生成！文末附免费教程！小猪包333 写论文人工智能 AI写作深度学习计算机视觉
在当前的学术研究和写作领域，AI论文生成器已经成为许多研究人员和学生的重要工具。这些工具不仅能够帮助用户快速生成高质量的论文内容，还能进行内容优化、查重和排版等操作。以下是三款值得推荐的AI论文生成器：千笔-AIPassPaper、懒人论文以及AIPaperPass。千笔-AIPassPaper千笔-AIPassPaper是一款基于深度学习和自然语言处理技术的AI写作助手，旨在帮助用户快速生成高质
AI大模型的架构演进与最新发展季风泯灭的季节 AI大模型应用技术二人工智能架构
随着深度学习的发展，AI大模型（LargeLanguageModels,LLMs）在自然语言处理、计算机视觉等领域取得了革命性的进展。本文将详细探讨AI大模型的架构演进，包括从Transformer的提出到GPT、BERT、T5等模型的历史演变，并探讨这些模型的技术细节及其在现代人工智能中的核心作用。一、基础模型介绍：Transformer的核心原理Transformer架构的背景在Transfo
ai绘画工具midjourney怎么下载？附作品管理教程设计师早上好
Midjourney是一款功能强大的AI绘画工具，它使用机器学习技术和深度神经网络等算法，可以生成各种艺术风格的绘画作品。在创意设计、广告宣传等方面有着广泛的应用前景。那么，ai绘画工具midjourney怎么下载？本文将为您介绍Midjourney的下载以及作品的相关管理。一、Midjourney下载Midjourney的下载非常简单，只需打开Midjourney官网（点击“GetMidjour
[实践应用] 深度学习之模型性能评估指标 YuanDaima2048 深度学习工具使用深度学习人工智能损失函数性能评估 pytorch python 机器学习
文章总览：YuanDaiMa2048博客文章总览深度学习之模型性能评估指标分类任务回归任务排序任务聚类任务生成任务其他介绍在机器学习和深度学习领域，评估模型性能是一项至关重要的任务。不同的学习任务需要不同的性能指标来衡量模型的有效性。以下是对一些常见任务及其相应的性能评估指标的详细解释和总结。分类任务分类任务是指模型需要将输入数据分配到预定义的类别或标签中。以下是分类任务中常用的性能指标：准确率(
[实践应用] 深度学习之优化器 YuanDaima2048 深度学习工具使用 pytorch 深度学习人工智能机器学习 python 优化器
文章总览：YuanDaiMa2048博客文章总览深度学习之优化器1.随机梯度下降（SGD）2.动量优化（Momentum）3.自适应梯度（Adagrad）4.自适应矩估计（Adam）5.RMSprop总结其他介绍在深度学习中，优化器用于更新模型的参数，以最小化损失函数。常见的优化函数有很多种，下面是几种主流的优化器及其特点、原理和PyTorch实现：1.随机梯度下降（SGD）原理:随机梯度下降通过
机器学习-聚类算法不良人龍木木机器学习机器学习算法聚类
机器学习-聚类算法1.AHC2.K-means3.SC4.MCL仅个人笔记，感谢点赞关注！1.AHC2.K-means3.SC传统谱聚类：个人对谱聚类算法的理解以及改进4.MCL目前仅专注于NLP的技术学习和分享感谢大家的关注与支持！
生成式地图制图 Bwywb_3 深度学习机器学习深度学习生成对抗网络
生成式地图制图（GenerativeCartography）是一种利用生成式算法和人工智能技术自动创建地图的技术。它结合了传统的地理信息系统（GIS）技术与现代生成模型（如深度学习、GANs等），能够根据输入的数据自动生成符合需求的地图。这种方法在城市规划、虚拟环境设计、游戏开发等多个领域具有应用前景。主要特点：自动化生成：通过算法和模型，系统能够根据输入的地理或空间数据自动生成地图，而无需人工逐
【新教育-教师随笔】读《做最好的英语老师》有感 164c5aca7b79
伊川县直中学王素平《做最好的英语老师》这本书是作者这些年在他教学中得与失的总结。里面给我们提供了听力，单词，句子，阅读，作文等模块的教学方法，让我受益匪浅，现总结如下：一.语文教学给了我们什么启示？（1）：现有的英语教材内容简单，枯燥，与学生的心智发展水平严重脱节。我们要给学生补中一些贴近学生生活，能感动和影响他们的经典作品。让学生学习知识的同时，有所感悟和思考，同时享受审美的乐趣！如AWiseO
深度 Qlearning：在直播推荐系统中的应用 AGI通用人工智能之禅程序员提升自我硅基计算碳基计算认知计算生物计算深度学习神经网络大数据 AIGC AGI LLM Java Python 架构设计 Agent 程序员实现财富自由
深度Q-learning：在直播推荐系统中的应用关键词：深度Q-learning,强化学习,直播推荐系统,个性化推荐1.背景介绍1.1问题的由来随着互联网技术的飞速发展,直播平台如雨后春笋般涌现。面对海量的直播内容,用户很难快速找到自己感兴趣的内容。因此,个性化推荐系统在直播平台中扮演着越来越重要的角色。1.2研究现状目前,主流的个性化推荐算法包括协同过滤、基于内容的推荐等。这些方法在一定程度上缓
夏日随笔日记夏天的夜住在城里的庄户孩子
浅聊微信朋友圈及其它文/王立虎（一）又是一个深夜了，夏天的夜显得有些浮躁有些闷热，透过窗户外面街道上街灯依旧明亮，照着匆忙的车与人回家。关上电脑，打开，还是先完成日更，一直坚持着努力着写着，虽没有什么优秀的大作出现，但有时候还是佩服自己对文学的执着和爱好，佩服自己的自律。写点吧，在这夜深人静的时候，独处着，习惯着，随笔写下自己一天的心情，有感悟，有事件，有温度，我想写下总是好的。也有人喜欢这个点来
随笔 Csar_NFBC
别再奋战在凌晨四点半不断留存的我们的遗憾一般流汗不呐喊的伪善到了虚荣心的年纪都眼馋那么多浮夸的浮华都摞在每个人的肩头献丑般的伎俩扩大了每个人的心愁个个都能说会道到最后却难免想上帝苦苦的祷告，越光鲜的就越阴险着，人血馒头吃在嘴里拿在手里从不做冒险者谁又想阴暗呢，现实多残酷，上班族碌碌，面对现实谁又是无辜的？那天空气有些浑浊，办公室中气氛紧张影响脉搏，明明有些事情很清楚还要说上三遍到处传达着加班到凌晨
未来软件市场是怎么样的？做开发的生存空间如何？ cesske 软件需求
目录前言一、未来软件市场的发展趋势二、软件开发人员的生存空间前言未来软件市场是怎么样的？做开发的生存空间如何？一、未来软件市场的发展趋势技术趋势：人工智能与机器学习：随着技术的不断成熟，人工智能将在更多领域得到应用，如智能客服、自动驾驶、智能制造等，这将极大地推动软件市场的增长。云计算与大数据：云计算服务将继续普及，大数据技术的应用也将更加广泛。企业将更加依赖云计算和大数据来优化运营、提升效率，并
吴恩达深度学习笔记(30)-正则化的解释极客Array
正则化（Regularization）深度学习可能存在过拟合问题——高方差，有两个解决方法，一个是正则化，另一个是准备更多的数据，这是非常可靠的方法，但你可能无法时时刻刻准备足够多的训练数据或者获取更多数据的成本很高，但正则化通常有助于避免过拟合或减少你的网络误差。如果你怀疑神经网络过度拟合了数据，即存在高方差问题，那么最先想到的方法可能是正则化，另一个解决高方差的方法就是准备更多数据，这也是非常
个人学习笔记7-6：动手学深度学习pytorch版-李沐浪子L 深度学习深度学习笔记计算机视觉 python 人工智能神经网络 pytorch
#人工智能##深度学习##语义分割##计算机视觉##神经网络#计算机视觉13.11全卷积网络全卷积网络（fullyconvolutionalnetwork，FCN）采用卷积神经网络实现了从图像像素到像素类别的变换。引入l转置卷积（transposedconvolution）实现的，输出的类别预测与输入图像在像素级别上具有一一对应关系：通道维的输出即该位置对应像素的类别预测。13.11.1构造模型下
来时空去时空如是万般皆空一航文学与艺术
关注一航文学与艺术每一天都与众不同更漏子文/宁静致远（阑珊）北风寒，梅蕊落，陌上花开花落。观世间，月儿圆，来回皆是缘。来时空，去时空，如是万般皆空。花儿落，水东流，落花人长留。2018.2.4给我一支笔文／宁静致远（阑珊）给我一支笔绘一幅水墨丹青绿水青山给我一支笔写一曲岁月如歌曲水流觞给我一支笔写下风花雪月前世今生给我一支笔写下灿烂的诗篇吟古诵今2018.2.3游无为寺随笔文/宁静致远光阴一去不复
python中zeros用法_Python中的numpy.zeros()用法江平舟 python中zeros用法
numpy.zeros()函数是最重要的函数之一,广泛用于机器学习程序中。此函数用于生成包含零的数组。numpy.zeros()函数提供给定形状和类型的新数组,并用零填充。句法numpy.zeros(shape,dtype=float,order='C'参数形状：整数或整数元组此参数用于定义数组的尺寸。此参数用于我们要在其中创建数组的形状,例如(3,2)或2。dtype：数据类型(可选)此参数用于
深度学习-点击率预估-研究论文2024-09-14速读 sp_fyf_2024 深度学习人工智能
深度学习-点击率预估-研究论文2024-09-14速读1.DeepTargetSessionInterestNetworkforClick-ThroughRatePredictionHZhong,JMa,XDuan,SGu,JYao-2024InternationalJointConferenceonNeuralNetworks,2024深度目标会话兴趣网络用于点击率预测摘要：这篇文章提出了一种新
Java实现的简单双向Map，支持重复Value superlxw1234 java 双向map
关键字：Java双向Map、DualHashBidiMap 有个需求，需要根据即时修改Map结构中的Value值，比如，将Map中所有value=V1的记录改成value=V2，key保持不变。数据量比较大，遍历Map性能太差，这就需要根据Value先找到Key，然后去修改。即：既要根据Key找Value，又要根据Value
PL/SQL触发器基础及例子百合不是茶 oracle数据库触发器 PL/SQL编程
触发器的简介; 触发器的定义就是说某个条件成立的时候，触发器里面所定义的语句就会被自动的执行。因此触发器不需要人为的去调用，也不能调用。触发器和过程函数类似过程函数必须要调用, 一个表中最多只能有12个触发器类型的,触发器和过程函数相似触发器不需要调用直接执行, 触发时间：指明触发器何时执行，该值可取： before：表示在数据库动作之前触发
[时空与探索]穿越时空的一些问题 comsci 问题
我们还没有进行过任何数学形式上的证明,仅仅是一个猜想..... 这个猜想就是; 任何有质量的物体(哪怕只有一微克)都不可能穿越时空,该物体强行穿越时空的时候,物体的质量会与时空粒子产生反应,物体会变成暗物质,也就是说,任何物体穿越时空会变成暗物质..(暗物质就我的理
easy ui datagrid上移下移一行商人shang js 上移下移 easyui datagrid
/** * 向上移动一行 * * @param dg * @param row */ function moveupRow(dg, row) { var datagrid = $(dg); var index = datagrid.datagrid("getRowIndex", row); if (isFirstRow(dg, row)) {
Java反射 oloz 反射
本人菜鸟，今天恰好有时间，写写博客，总结复习一下java反射方面的知识，欢迎大家探讨交流学习指教首先看看java中的Class package demo; public class ClassTest { /*先了解java中的Class*/ public static void main(String[] args) { //任何一个类都
springMVC 使用JSR-303 Validation验证杨白白 spring mvc
JSR-303是一个数据验证的规范，但是spring并没有对其进行实现，Hibernate Validator是实现了这一规范的，通过此这个实现来讲SpringMVC对JSR-303的支持。 JSR-303的校验是基于注解的，首先要把这些注解标记在需要验证的实体类的属性上或是其对应的get方法上。登录需要验证类 public class Login { @NotEmpty
log4j 香水浓 log4j
log4j.rootCategory=DEBUG, STDOUT, DAILYFILE, HTML, DATABASE #log4j.rootCategory=DEBUG, STDOUT, DAILYFILE, ROLLINGFILE, HTML #console log4j.appender.STDOUT=org.apache.log4j.ConsoleAppender log4
使用ajax和history.pushState无刷新改变页面URL agevs jquery 框架 Ajax html5 chrome
表现如果你使用chrome或者firefox等浏览器访问本博客、github.com、plus.google.com等网站时，细心的你会发现页面之间的点击是通过ajax异步请求的，同时页面的URL发生了了改变。并且能够很好的支持浏览器前进和后退。是什么有这么强大的功能呢？ HTML5里引用了新的API，history.pushState和history.replaceState，就是通过
centos中文乱码 AILIKES centos OS ssh
一、CentOS系统访问 g.cn ，发现中文乱码。于是用以前的方式：yum -y install fonts-chinese CentOS系统安装后，还是不能显示中文字体。我使用 gedit 编辑源码，其中文注释也为乱码。后来，终于找到以下方法可以解决，需要两个中文支持的包： fonts-chinese-3.02-12.
触发器 baalwolf 触发器
触发器(trigger)：监视某种情况，并触发某种操作。触发器创建语法四要素：1.监视地点(table) 2.监视事件(insert/update/delete) 3.触发时间(after/before) 4.触发事件(insert/update/delete) 语法： create trigger triggerName after/before
JS正则表达式的i m g bijian1013 JavaScript 正则表达式
g:表示全局（global)模式，即模式将被应用于所有字符串，而非在发现第一个匹配项时立即停止。 i:表示不区分大小写（case-insensitive）模式，即在确定匹配项时忽略模式与字符串的大小写。 m:表示
HTML5模式和Hashbang模式 bijian1013 JavaScript AngularJS Hashbang模式 HTML5模式
我们可以用$locationProvider来配置$location服务（可以采用注入的方式，就像AngularJS中其他所有东西一样）。这里provider的两个参数很有意思，介绍如下。 html5Mode 一个布尔值，标识$location服务是否运行在HTML5模式下。 ha
[Maven学习笔记六]Maven生命周期 bit1129 maven
从mvn test的输出开始说起当我们在user-core中执行mvn test时，执行的输出如下： /software/devsoftware/jdk1.7.0_55/bin/java -Dmaven.home=/software/devsoftware/apache-maven-3.2.1 -Dclassworlds.conf=/software/devs
【Hadoop七】基于Yarn的Hadoop Map Reduce容错 bit1129 hadoop
运行于Yarn的Map Reduce作业，可能发生失败的点包括 Task Failure Application Master Failure Node Manager Failure Resource Manager Failure 1. Task Failure 任务执行过程中产生的异常和JVM的意外终止会汇报给Application Master。僵死的任务也会被A
记一次数据推送的异常解决端口解决 ronin47 记一次数据推送的异常解决
　　需求：从db获取数据然后推送到B 程序开发完成，上jboss,刚开始报了很多错，逐一解决，可最后显示连接不到数据库。机房的同事说可以ping 通。　　自已画了个图，逐一排除，把linux 防火墙　和　setenforce　设置最低。　　　service iptables stop
巧用视错觉-UI更有趣 brotherlamp UI ui视频 ui教程 ui自学 ui资料
我们每个人在生活中都曾感受过视错觉（optical illusion）的魅力。视错觉现象是双眼跟我们开的一个玩笑，而我们往往还心甘情愿地接受我们看到的假象。其实不止如此，视觉错现象的背后还有一个重要的科学原理——格式塔原理。格式塔原理解释了人们如何以视觉方式感觉物体，以及图像的结构，视角，大小等要素是如何影响我们的视觉的。在下面这篇文章中，我们首先会简单介绍一下格式塔原理中的基本概念，
线段树-poj1177-N个矩形求边长（离散化+扫描线） bylijinnan 数据结构算法线段树
package com.ljn.base; import java.util.Arrays; import java.util.Comparator; import java.util.Set; import java.util.TreeSet; /** * POJ 1177 (线段树+离散化+扫描线)，题目链接为http://poj.org/problem?id=1177
HTTP协议详解 chicony http协议
引言
Scala设计模式 chenchao051 设计模式 scala
Scala设计模式我的话：在国外网站上看到一篇文章，里面详细描述了很多设计模式，并且用Java及Scala两种语言描述，清晰的让我们看到各种常规的设计模式，在Scala中是如何在语言特性层面直接支持的。基于文章很nice，我利用今天的空闲时间将其翻译，希望大家能一起学习，讨论。翻译
安装mysql daizj mysql 安装
安装mysql (1)删除linux上已经安装的mysql相关库信息。rpm -e xxxxxxx --nodeps (强制删除) 执行命令rpm -qa |grep mysql 检查是否删除干净 (2)执行命令 rpm -i MySQL-server-5.5.31-2.el
HTTP状态码大全 dcj3sjt126com http状态码
完整的 HTTP 1.1规范说明书来自于RFC 2616，你可以在http://www.talentdigger.cn/home/link.php?url=d3d3LnJmYy1lZGl0b3Iub3JnLw%3D%3D在线查阅。HTTP 1.1的状态码被标记为新特性，因为许多浏览器只支持 HTTP 1.0。你应只把状态码发送给支持 HTTP 1.1的客户端，支持协议版本可以通过调用request
asihttprequest上传图片 dcj3sjt126com ASIHTTPRequest
NSURL *url =@"yourURL"; ASIFormDataRequest*currentRequest =[ASIFormDataRequest requestWithURL:url]; [currentRequest setPostFormat:ASIMultipartFormDataPostFormat];[currentRequest se
C语言中，关键字static的作用 e200702084 C++c C#
在C语言中，关键字static有三个明显的作用： 1)在函数体，局部的static变量。生存期为程序的整个生命周期，（它存活多长时间）；作用域却在函数体内（它在什么地方能被访问（空间））。一个被声明为静态的变量在这一函数被调用过程中维持其值不变。因为它分配在静态存储区，函数调用结束后并不释放单元，但是在其它的作用域的无法访问。当再次调用这个函数时，这个局部的静态变量还存活，而且用在它的访
win7/8使用curl geeksun win7
1. WIN7/8下要使用curl，需要下载curl-7.20.0-win64-ssl-sspi.zip和Win64OpenSSL_Light-1_0_2d.exe。下载地址： http://curl.haxx.se/download.html 请选择不带SSL的版本，否则还需要安装SSL的支持包 2. 可以给Windows增加c
Creating a Shared Repository; Users Sharing The Repository hongtoushizi git
转载自： http://www.gitguys.com/topics/creating-a-shared-repository-users-sharing-the-repository/ Commands discussed in this section: git init –bare git clone git remote git pull git p
Java实现字符串反转的8种或9种方法 Josh_Persistence 异或反转递归反转二分交换反转 java字符串反转栈反转
注：对于第7种使用异或的方式来实现字符串的反转，如果不太看得明白的，可以参照另一篇博客： http://josh-persistence.iteye.com/blog/2205768 /** * */ package com.wsheng.aggregator.algorithm.string; import java.util.Stack; /**
代码实现任意容量倒水问题 home198979 PHP 算法倒水
形象化设计模式实战 HELLO!架构 redis命令源码解析倒水问题：有两个杯子，一个A升，一个B升，水有无限多，现要求利用这两杯子装C
Druid datasource zhb8015 druid
推荐大家使用数据库连接池 DruidDataSource. http://code.alibabatech.com/wiki/display/Druid/DruidDataSource DruidDataSource经过阿里巴巴数百个应用一年多生产环境运行验证，稳定可靠。它最重要的特点是：监控、扩展和性能。下载和Maven配置看这里： http
两种启动监听器ApplicationListener和ServletContextListener spjich java spring 框架
引言:有时候需要在项目初始化的时候进行一系列工作，比如初始化一个线程池，初始化配置文件，初始化缓存等等，这时候就需要用到启动监听器，下面分别介绍一下两种常用的项目启动监听器 ServletContextListener 特点: 依赖于sevlet容器，需要配置web.xml 使用方法: public class StartListener implements
JavaScript Rounding Methods of the Math object 何不笑 JavaScript Math
The next group of methods has to do with rounding decimal values into integers. Three methods — Math.ceil(), Math.floor(), and Math.round() — handle rounding in differen