Today's arXiv Picks | 34 Top-Conference Papers: CIKM / ACL / Interspeech / ICCV / ACM MM

About #Today's arXiv Picks

This is a column run by「AI 学术前沿」, in which the editors hand-pick high-quality papers from arXiv every day and deliver them to readers.

DESYR: Definition and Syntactic Representation Based Claim Detection on the Web

Comment: 10 pages, Accepted at CIKM 2021

Link: http://arxiv.org/abs/2108.08759

Abstract

The formulation of a claim rests at the core of argument mining. To demarcate between a claim and a non-claim is arduous for both humans and machines, owing to latent linguistic variance between the two and the inadequacy of extensive definition-based formalization. Furthermore, the increase in the usage of online social media has resulted in an explosion of unsolicited information on the web presented as informal text. To account for the aforementioned, in this paper we propose DESYR, a framework that intends to annul the said issues for informal web-based text by leveraging a combination of hierarchical representation learning (dependency-inspired Poincare embedding), definition-based alignment, and feature projection. We do away with fine-tuning compute-heavy language models in favor of fabricating a more domain-centric but lighter approach. Experimental results indicate that DESYR builds upon the state-of-the-art system across four benchmark claim datasets, most of which were constructed with informal texts. We see an increase of 3 claim-F1 points on the LESA-Twitter dataset, an increase of 1 claim-F1 point and 9 macro-F1 points on the Online Comments (OC) dataset, an increase of 24 claim-F1 points and 17 macro-F1 points on the Web Discourse (WD) dataset, and an increase of 8 claim-F1 points and 5 macro-F1 points on the Micro Texts (MT) dataset. We also perform an extensive analysis of the results. We make a 100-D pre-trained version of our Poincare variant available along with the source code.

Fine-Grained Element Identification in Complaint Text of Internet Fraud

Comment: 5 pages, 5 figures, 3 tables; accepted as a short paper to CIKM 2021

Link: http://arxiv.org/abs/2108.08676

Abstract

Existing systems dealing with online complaints provide a final decision without explanations. We propose to analyse the complaint text of internet fraud in a fine-grained manner. Considering that the complaint text includes multiple clauses with various functions, we propose to identify the role of each clause and classify them into different types of fraud elements. We construct a large labeled dataset originating from a real finance service platform. We build an element identification model on top of BERT and propose two additional modules that utilize the context of the complaint text for better element label classification, namely a global context encoder and a label refiner. Experimental results show the effectiveness of our model.

Language Model Augmented Relevance Score

Comment: In ACL 2021

Link: http://arxiv.org/abs/2108.08485

Abstract

Although automated metrics are commonly used to evaluate NLG systems, they often correlate poorly with human judgements. Newer metrics such as BERTScore have addressed many weaknesses in prior metrics such as BLEU and ROUGE, which rely on n-gram matching. These newer methods, however, are still limited in that they do not consider the generation context, so they cannot properly reward generated text that is correct but deviates from the given reference. In this paper, we propose Language Model Augmented Relevance Score (MARS), a new context-aware metric for NLG evaluation. MARS leverages off-the-shelf language models, guided by reinforcement learning, to create augmented references that consider both the generation context and available human references, which are then used as additional references to score generated text. Compared with seven existing metrics in three common NLG tasks, MARS not only achieves higher correlation with human reference judgements, but also differentiates well-formed candidates from adversarial samples to a larger degree.

QUEACO: Borrowing Treasures from Weakly-labeled Behavior Data for Query Attribute Value Extraction

Comment: The 30th ACM International Conference on Information and Knowledge  Management (CIKM 2021, Applied Research Track)

Link: http://arxiv.org/abs/2108.08468

Abstract

We study the problem of query attribute value extraction, which aims to identify named entities from user queries as diverse surface-form attribute values and afterward transform them into formally canonical forms. Such a problem consists of two phases: named entity recognition (NER) and attribute value normalization (AVN). However, existing works only focus on the NER phase but neglect the equally important AVN. To bridge this gap, this paper proposes a unified query attribute value extraction system for e-commerce search named QUEACO, which involves both phases. Moreover, by leveraging large-scale weakly-labeled behavior data, we further improve the extraction performance with less supervision cost. Specifically, for the NER phase, QUEACO adopts a novel teacher-student network, where a teacher network trained on the strongly-labeled data generates pseudo-labels to refine the weakly-labeled data for training a student network. Meanwhile, the teacher network can be dynamically adapted by the feedback of the student's performance on the strongly-labeled data to maximally denoise the noisy supervision from the weak labels. For the AVN phase, we also leverage the weakly-labeled query-to-attribute behavior data to normalize surface-form attribute values from queries into canonical forms from products. Extensive experiments on a real-world large-scale e-commerce dataset demonstrate the effectiveness of QUEACO.
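
To make the teacher-student idea above concrete, here is a minimal, hedged sketch of one student update that mixes strongly-labeled supervision with teacher-refined pseudo-labels. It assumes both networks output token-level tag logits of shape (batch, sequence, classes); the function and variable names are illustrative, and the paper's dynamic teacher-adaptation step is omitted.

```python
import torch
import torch.nn.functional as F

def student_step(student, teacher, weak_batch, strong_batch, alpha=0.5):
    """One student update mixing strong labels and teacher-refined pseudo-labels.

    weak_batch / strong_batch: (inputs, labels) tuples; inputs are token-level
    features and labels are entity-tag ids. All names here are illustrative.
    """
    x_weak, _ = weak_batch            # the noisy weak labels are discarded and re-estimated
    x_strong, y_strong = strong_batch

    with torch.no_grad():
        pseudo = teacher(x_weak).argmax(dim=-1)   # teacher refines the weak batch

    # cross_entropy expects (B, C, T) logits for (B, T) targets
    loss_strong = F.cross_entropy(student(x_strong).transpose(1, 2), y_strong)
    loss_pseudo = F.cross_entropy(student(x_weak).transpose(1, 2), pseudo)
    return loss_strong + alpha * loss_pseudo
```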

Augmenting Slot Values and Contexts for Spoken Language Understanding with Pretrained Models

Comment: Accepted by Interspeech2021

Link: http://arxiv.org/abs/2108.08451

Abstract

Spoken Language Understanding (SLU) is an essential step in building a dialogue system. Due to the expensive cost of obtaining labeled data, SLU suffers from the data scarcity problem. Therefore, in this paper, we focus on data augmentation for the slot filling task in SLU. To achieve that, we aim at generating more diverse data based on existing data. Specifically, we try to exploit the latent language knowledge of pretrained language models by finetuning them. We propose two strategies for the finetuning process: value-based and context-based augmentation. Experimental results on two public SLU datasets show that, compared with existing data augmentation methods, our proposed method can generate more diverse sentences and significantly improve the performance on SLU.
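
As a rough illustration of value-based augmentation, the snippet below masks a slot value in an utterance and lets a masked language model propose replacements. Note that the paper finetunes the pretrained model on SLU data, whereas this sketch uses an off-the-shelf model via the Hugging Face `transformers` pipeline purely to show the substitution step; the model name and utterance are illustrative.

```python
from transformers import pipeline

# Off-the-shelf masked LM; the paper would finetune such a model on SLU data first.
fill_mask = pipeline("fill-mask", model="bert-base-uncased")

# Mask the slot value (here, a destination city) and generate diverse replacements.
utterance = "book a flight to [MASK] tomorrow morning"
candidates = fill_mask(utterance, top_k=5)
augmented_utterances = [c["sequence"] for c in candidates]
print(augmented_utterances)
```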

Graph-to-3D: End-to-End Generation and Manipulation of 3D Scenes Using Scene Graphs

Comment: accepted to ICCV 2021

Link: http://arxiv.org/abs/2108.08841

Abstract

Controllable scene synthesis consists of generating 3D information that satisfies underlying specifications. Thereby, these specifications should be abstract, i.e. allowing easy user interaction, whilst providing enough interface for detailed control. Scene graphs are representations of a scene, composed of objects (nodes) and inter-object relationships (edges), and have proven to be particularly suited for this task, as they allow for semantic control over the generated content. Previous works tackling this task often rely on synthetic data and retrieve object meshes, which naturally limits the generation capabilities. To circumvent this issue, we instead propose the first work that directly generates shapes from a scene graph in an end-to-end manner. In addition, we show that the same model supports scene modification, using the respective scene graph as interface. Leveraging Graph Convolutional Networks (GCN), we train a variational Auto-Encoder on top of the object and edge categories, as well as 3D shapes and scene layouts, allowing later sampling of new scenes and shapes.

PoinTr: Diverse Point Cloud Completion with Geometry-Aware Transformers

Comment: Accepted to ICCV 2021 (Oral Presentation)

Link: http://arxiv.org/abs/2108.08839

Abstract

Point clouds captured in real-world applications are often incomplete due to the limited sensor resolution, single viewpoint, and occlusion. Therefore, recovering the complete point clouds from partial ones becomes an indispensable task in many practical applications. In this paper, we present a new method that reformulates point cloud completion as a set-to-set translation problem and design a new model, called PoinTr, that adopts a transformer encoder-decoder architecture for point cloud completion. By representing the point cloud as a set of unordered groups of points with position embeddings, we convert the point cloud to a sequence of point proxies and employ the transformers for point cloud generation. To facilitate transformers to better leverage the inductive bias about 3D geometric structures of point clouds, we further devise a geometry-aware block that models the local geometric relationships explicitly. The migration of transformers enables our model to better learn structural knowledge and preserve detailed information for point cloud completion. Furthermore, we propose two more challenging benchmarks with more diverse incomplete point clouds that can better reflect real-world scenarios to promote future research. Experimental results show that our method outperforms state-of-the-art methods by a large margin on both the new benchmarks and the existing ones. Code is available at https://github.com/yuxumin/PoinTr

Fine-grained Semantics-aware Representation Enhancement for Self-supervised Monocular Depth Estimation

Comment: ICCV 2021 (Oral)

Link: http://arxiv.org/abs/2108.08829

Abstract

Self-supervised monocular depth estimation has been widely studied, owing to its practical importance and recent promising improvements. However, most works suffer from limited supervision of photometric consistency, especially in weak texture regions and at object boundaries. To overcome this weakness, we propose novel ideas to improve self-supervised monocular depth estimation by leveraging cross-domain information, especially scene semantics. We focus on incorporating implicit semantic knowledge into geometric representation enhancement and suggest two ideas: a metric learning approach that exploits the semantics-guided local geometry to optimize intermediate depth representations, and a novel feature fusion module that judiciously utilizes cross-modality between two heterogeneous feature representations. We comprehensively evaluate our methods on the KITTI dataset and demonstrate that our method outperforms state-of-the-art methods. The source code is available at https://github.com/hyBlue/FSRE-Depth.

Towards Vivid and Diverse Image Colorization with Generative Color Prior

Comment: ICCV 2021

Link: http://arxiv.org/abs/2108.08826

Abstract

Colorization has attracted increasing interest in recent years. Classic reference-based methods usually rely on external color images for plausible results. A large image database or online search engine is inevitably required for retrieving such exemplars. Recent deep-learning-based methods can automatically colorize images at a low cost. However, unsatisfactory artifacts and incoherent colors always accompany the results. In this work, we aim at recovering vivid colors by leveraging the rich and diverse color priors encapsulated in a pretrained Generative Adversarial Network (GAN). Specifically, we first "retrieve" matched features (similar to exemplars) via a GAN encoder and then incorporate these features into the colorization process with feature modulations. Thanks to the powerful generative color prior and delicate designs, our method can produce vivid colors with a single forward pass. Moreover, it is highly convenient to obtain diverse results by modifying GAN latent codes. Our method also inherits the merit of interpretable controls of GANs and can attain controllable and smooth transitions by walking through the GAN latent space. Extensive experiments and user studies demonstrate that our method achieves superior performance over previous works.

Click to Move: Controlling Video Generation with Sparse Motion

Comment: Accepted by International Conference on Computer Vision (ICCV 2021)

Link: http://arxiv.org/abs/2108.08815

Abstract

This paper introduces Click to Move (C2M), a novel framework for video generation where the user can control the motion of the synthesized video through mouse clicks specifying simple object trajectories of the key objects in the scene. Our model receives as input an initial frame, its corresponding segmentation map, and the sparse motion vectors encoding the input provided by the user. It outputs a plausible video sequence starting from the given frame and with a motion that is consistent with the user input. Notably, our proposed deep architecture incorporates a Graph Convolution Network (GCN) modelling the movements of all the objects in the scene in a holistic manner and effectively combining the sparse user motion information and image features. Experimental results show that C2M outperforms existing methods on two publicly available datasets, thus demonstrating the effectiveness of our GCN framework at modelling object interactions. The source code is publicly available at https://github.com/PierfrancescoArdino/C2M.

Causal Attention for Unbiased Visual Recognition

Comment: Accepted by ICCV 2021

Link: http://arxiv.org/abs/2108.08782

Abstract

The attention module does not always help deep models learn causal features that are robust in any confounding context, e.g., a foreground object feature that is invariant to different backgrounds. This is because the confounders trick the attention into capturing spurious correlations that benefit the prediction when the training and testing data are IID (identically and independently distributed), but harm the prediction when the data are OOD (out-of-distribution). The only fundamental solution to learning causal attention is causal intervention, which requires additional annotations of the confounders, e.g., a "dog" model is learned within "grass+dog" and "road+dog" respectively, so the "grass" and "road" contexts will no longer confound the "dog" recognition. However, such annotation is not only prohibitively expensive, but also inherently problematic, as the confounders are elusive in nature. In this paper, we propose a causal attention module (CaaM) that self-annotates the confounders in an unsupervised fashion. In particular, multiple CaaMs can be stacked and integrated in conventional attention CNNs and self-attention Vision Transformers. In OOD settings, deep models with CaaM outperform those without it significantly; even in IID settings, the attention localization is also improved by CaaM, showing great potential in applications that require robust visual saliency. Codes are available at https://github.com/Wangt-CN/CaaM.

Learning to Match Features with Seeded Graph Matching Network

Comment: Accepted by ICCV2021, code to be released at https://github.com/vdvchen/SGMNet

Link: http://arxiv.org/abs/2108.08771

Abstract

Matching local features across images is a fundamental problem in computer vision. Targeting high accuracy and efficiency, we propose Seeded Graph Matching Network, a graph neural network with sparse structure to reduce redundant connectivity and learn compact representations. The network consists of 1) a Seeding Module, which initializes the matching by generating a small set of reliable matches as seeds, and 2) a Seeded Graph Neural Network, which utilizes seed matches to pass messages within/across images and predicts assignment costs. Three novel operations are proposed as basic elements for message passing: 1) Attentional Pooling, which aggregates keypoint features within the image to seed matches; 2) Seed Filtering, which enhances seed features and exchanges messages across images; 3) Attentional Unpooling, which propagates seed features back to original keypoints. Experiments show that our method reduces computational and memory complexity significantly compared with typical attention-based networks while achieving competitive or higher performance.

Category-Level 6D Object Pose Estimation via Cascaded Relation and Recurrent Reconstruction Networks

Comment: accepted by IROS2021

Link: http://arxiv.org/abs/2108.08755

Abstract

Category-level 6D pose estimation, aiming to predict the location and orientation of unseen object instances, is fundamental to many scenarios such as robotic manipulation and augmented reality, yet still remains unsolved. Precisely recovering the instance 3D model in the canonical space and accurately matching it with the observation are essential when estimating the 6D pose for unseen objects. In this paper, we achieve accurate category-level 6D pose estimation via cascaded relation and recurrent reconstruction networks. Specifically, a novel cascaded relation network is dedicated to advanced representation learning to explore the complex and informative relations among the instance RGB image, instance point cloud and category shape prior. Furthermore, we design a recurrent reconstruction network for iterative residual refinement to progressively improve the reconstruction and correspondence estimations from coarse to fine. Finally, the instance 6D pose is obtained by leveraging the estimated dense correspondences between the instance point cloud and the reconstructed 3D model in the canonical space. We have conducted extensive experiments on two well-acknowledged benchmarks of category-level 6D pose estimation, with significant performance improvement over existing approaches. On the representatively strict evaluation metrics of $3D_{75}$ and $5^{\circ}2cm$, our method exceeds the latest state-of-the-art SPD by $4.9\%$ and $17.7\%$ on the CAMERA25 dataset, and by $2.7\%$ and $8.5\%$ on the REAL275 dataset. Codes are available at https://wangjiaze.cn/projects/6DPoseEstimation.html.

Counterfactual Attention Learning for Fine-Grained Visual Categorization and Re-identification

Comment: Accepted to ICCV 2021

Link: http://arxiv.org/abs/2108.08728

Abstract

The attention mechanism has demonstrated great potential in fine-grained visual recognition tasks. In this paper, we present a counterfactual attention learning method to learn more effective attention based on causal inference. Unlike most existing methods that learn visual attention based on conventional likelihood, we propose to learn the attention with counterfactual causality, which provides a tool to measure the attention quality and a powerful supervisory signal to guide the learning process. Specifically, we analyze the effect of the learned visual attention on network prediction through counterfactual intervention and maximize the effect to encourage the network to learn more useful attention for fine-grained image recognition. Empirically, we evaluate our method on a wide range of fine-grained recognition tasks where attention plays a crucial role, including fine-grained image categorization, person re-identification, and vehicle re-identification. The consistent improvement on all benchmarks demonstrates the effectiveness of our method. Code is available at https://github.com/raoyongming/CAL.
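
A minimal PyTorch sketch of the counterfactual-intervention idea described above: the quality of the learned attention is measured as the gap between predictions under the learned attention and predictions under random (counterfactual) attention, and that gap is turned into a training signal. The shapes, pooling scheme, and loss form are illustrative simplifications, not the paper's exact formulation.

```python
import torch
import torch.nn.functional as F

def counterfactual_attention_effect(features, attention, classifier, labels):
    """Effect of learned attention = prediction gap between factual and counterfactual branches.

    features:  (B, C, H, W) feature maps
    attention: (B, 1, H, W) learned attention maps in [0, 1]
    classifier: callable mapping pooled (B, C) features to class logits
    """
    # Factual branch: pool features weighted by the learned attention.
    factual = (features * attention).mean(dim=(2, 3))
    logits_factual = classifier(factual)

    # Counterfactual branch: intervene by replacing attention with random attention.
    random_attention = torch.rand_like(attention)
    counterfactual = (features * random_attention).mean(dim=(2, 3))
    logits_counterfactual = classifier(counterfactual)

    # Minimizing this loss maximizes the effect of the learned attention,
    # i.e., pushes it to be strictly more useful than random attention.
    effect_logits = logits_factual - logits_counterfactual
    return F.cross_entropy(effect_logits, labels)
```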

How to cheat with metrics in single-image HDR reconstruction

Comment: ICCV 2021 workshop on Learning for Computational Imaging (LCI)

Link: http://arxiv.org/abs/2108.08713

Abstract

Single-image high dynamic range (SI-HDR) reconstruction has recently emerged as a problem well-suited for deep learning methods. Each successive technique demonstrates an improvement over existing methods by reporting higher image quality scores. This paper, however, highlights that such improvements in objective metrics do not necessarily translate to visually superior images. The first problem is the use of disparate evaluation conditions in terms of data and metric parameters, calling for a standardized protocol to make it possible to compare between papers. The second problem, which forms the main focus of this paper, is the inherent difficulty in evaluating SI-HDR reconstructions since certain aspects of the reconstruction problem dominate objective differences, thereby introducing a bias. Here, we reproduce a typical evaluation using existing as well as simulated SI-HDR methods to demonstrate how different aspects of the problem affect objective quality metrics. Surprisingly, we found that methods that do not even reconstruct HDR information can compete with state-of-the-art deep learning methods. We show how such results are not representative of the perceived quality and that SI-HDR reconstruction needs better evaluation protocols.

Real-time Image Enhancer via Learnable Spatial-aware 3D Lookup Tables

Comment: Accepted to ICCV2021

Link: http://arxiv.org/abs/2108.08697

Abstract

Recently, deep learning-based image enhancement algorithms achieved state-of-the-art (SOTA) performance on several publicly available datasets. However, most existing methods fail to meet practical requirements either for visual perception or for computation efficiency, especially for high-resolution images. In this paper, we propose a novel real-time image enhancer via learnable spatial-aware 3-dimensional lookup tables (3D LUTs), which well considers global scenario and local spatial information. Specifically, we introduce a lightweight two-head weight predictor that has two outputs. One is a 1D weight vector used for image-level scenario adaptation, the other is a 3D weight map aimed at pixel-wise category fusion. We learn the spatial-aware 3D LUTs and fuse them according to the aforementioned weights in an end-to-end manner. The fused LUT is then used to transform the source image into the target tone in an efficient way. Extensive results show that our model outperforms SOTA image enhancement methods on public datasets both subjectively and objectively, and that our model only takes about 4 ms to process a 4K resolution image on one NVIDIA V100 GPU.
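
To make the LUT-fusion step more tangible, here is a hedged PyTorch sketch that fuses several learnable 3D LUTs with predicted image-level weights and applies the fused LUT via trilinear interpolation (`grid_sample`). It simplifies the paper's design by omitting the pixel-wise 3D weight map, and the mapping of RGB channels to LUT axes is just a convention assumed here.

```python
import torch
import torch.nn.functional as F

def apply_fused_lut(image, luts, weights):
    """Fuse K basis 3D LUTs with predicted weights, then apply the result to an image.

    image:   (B, 3, H, W) RGB in [0, 1]
    luts:    (K, 3, S, S, S) lookup tables of size S^3, each mapping RGB -> RGB
    weights: (B, K) image-level fusion weights (e.g., from a weight predictor)
    """
    # Weighted fusion of the K basis LUTs into one LUT per image.
    fused = torch.einsum('bk,kcdhw->bcdhw', weights, luts)          # (B, 3, S, S, S)
    # grid_sample expects coordinates in [-1, 1]; use the RGB values as 3D indices.
    grid = (image.permute(0, 2, 3, 1) * 2.0 - 1.0).unsqueeze(1)     # (B, 1, H, W, 3)
    out = F.grid_sample(fused, grid, mode='bilinear',
                        padding_mode='border', align_corners=True)  # (B, 3, 1, H, W)
    return out.squeeze(2)

# Usage with random data (shapes only; the real model predicts the weights).
image = torch.rand(2, 3, 64, 64)
luts = torch.rand(4, 3, 17, 17, 17)
weights = torch.softmax(torch.rand(2, 4), dim=1)
enhanced = apply_fused_lut(image, luts, weights)
```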

3DIAS: 3D Shape Reconstruction with Implicit Algebraic Surfaces

Comment: Published at ICCV 2021

Link: http://arxiv.org/abs/2108.08653

Abstract

3D shape representation has substantial effects on 3D shape reconstruction. Primitive-based representations approximate a 3D shape mainly by a set of simple implicit primitives, but the low geometrical complexity of the primitives limits the shape resolution. Moreover, setting a sufficient number of primitives for an arbitrary shape is challenging. To overcome these issues, we propose a constrained implicit algebraic surface as the primitive with few learnable coefficients and higher geometrical complexities, and a deep neural network to produce these primitives. Our experiments demonstrate the superiorities of our method in terms of representation power compared to the state-of-the-art methods in single RGB image 3D shape reconstruction. Furthermore, we show that our method can semantically learn segments of 3D shapes in an unsupervised manner. The code is publicly available at https://myavartanoo.github.io/3dias/.

Spatio-Temporal Interaction Graph Parsing Networks for Human-Object Interaction Recognition

Comment: ACM MM Oral paper

Link: http://arxiv.org/abs/2108.08633

Abstract

For a given video-based Human-Object Interaction scene, modeling the spatio-temporal relationship between humans and objects is an important cue for understanding the contextual information presented in the video. With effective spatio-temporal relationship modeling, it is possible not only to uncover contextual information in each frame but also to directly capture inter-time dependencies. It is more critical to capture the position changes of humans and objects over the spatio-temporal dimension when their appearance features may not show significant changes over time. The full use of appearance features, spatial location and semantic information is also key to improving video-based Human-Object Interaction recognition performance. In this paper, Spatio-Temporal Interaction Graph Parsing Networks (STIGPN) are constructed, which encode the videos with a graph composed of human and object nodes. These nodes are connected by two types of relations: (i) spatial relations modeling the interactions between humans and the interacted objects within each frame; (ii) inter-time relations capturing the long-range dependencies between humans and the interacted objects across frames. With the graph, STIGPN learns spatio-temporal features directly from the whole video-based Human-Object Interaction scenes. Multi-modal features and a multi-stream fusion strategy are used to enhance the reasoning capability of STIGPN. Two Human-Object Interaction video datasets, CAD-120 and Something-Else, are used to evaluate the proposed architectures, and the state-of-the-art performance demonstrates the superiority of STIGPN.

VolumeFusion: Deep Depth Fusion for 3D Scene Reconstruction

Comment: ICCV 2021 Accepted

Link: http://arxiv.org/abs/2108.08623

Abstract

To reconstruct a 3D scene from a set of calibrated views, traditional multi-view stereo techniques rely on two distinct stages: local depth map computation and global depth map fusion. Recent studies concentrate on deep neural architectures for depth estimation using a conventional depth fusion method, or on direct 3D reconstruction networks that regress a Truncated Signed Distance Function (TSDF). In this paper, we advocate that replicating the traditional two-stage framework with deep neural networks improves both the interpretability and the accuracy of the results. As mentioned, our network operates in two steps: 1) the local computation of local depth maps with a deep MVS technique, and 2) the fusion of the depth maps and image features to build a single TSDF volume. In order to improve the matching performance between images acquired from very different viewpoints (e.g., large baselines and rotations), we introduce a rotation-invariant 3D convolution kernel called PosedConv. The effectiveness of the proposed architecture is underlined via a large series of experiments conducted on the ScanNet dataset, where our approach compares favorably against both traditional and deep learning techniques.

Spatially-Adaptive Image Restoration using Distortion-Guided Networks

Comment: Accepted at ICCV 2021

Link: http://arxiv.org/abs/2108.08617

Abstract

We present a general learning-based solution for restoring images suffering from spatially-varying degradations. Prior approaches are typically degradation-specific and employ the same processing across different images and different pixels within. However, we hypothesize that such spatially rigid processing is suboptimal for simultaneously restoring the degraded pixels as well as reconstructing the clean regions of the image. To overcome this limitation, we propose SPAIR, a network design that harnesses distortion-localization information and dynamically adjusts computation to difficult regions in the image. SPAIR comprises two components: (1) a localization network that identifies degraded pixels, and (2) a restoration network that exploits knowledge from the localization network in the filter and feature domains to selectively and adaptively restore degraded pixels. Our key idea is to exploit the non-uniformity of heavy degradations in the spatial domain and suitably embed this knowledge within distortion-guided modules performing sparse normalization, feature extraction and attention. Our architecture is agnostic to the physical formation model and generalizes across several types of spatially-varying degradations. We demonstrate the efficacy of SPAIR individually on four restoration tasks: removal of rain streaks, raindrops, shadows and motion blur. Extensive qualitative and quantitative comparisons with prior art on 11 benchmark datasets demonstrate that our degradation-agnostic network design offers significant performance gains over state-of-the-art degradation-specific architectures. Code is available at https://github.com/human-analysis/spatially-adaptive-image-restoration.

Feature Stylization and Domain-aware Contrastive Learning for Domain Generalization

Comment: Accepted to ACM MM 2021 (oral)

Link: http://arxiv.org/abs/2108.08596

Abstract

Domain generalization aims to enhance model robustness against domain shift without accessing the target domain. Since the available source domains for training are limited, recent approaches focus on generating samples of novel domains. Nevertheless, they either struggle with the optimization problem when synthesizing abundant domains or cause distortion of class semantics. To these ends, we propose a novel domain generalization framework where feature statistics are utilized for stylizing original features into ones with novel domain properties. To preserve class information during stylization, we first decompose features into high- and low-frequency components. Afterward, we stylize the low-frequency components with the novel domain styles sampled from the manipulated statistics, while preserving the shape cues in the high-frequency ones. As the final step, we re-merge both components to synthesize novel domain features. To enhance domain robustness, we utilize the stylized features to maintain model consistency in terms of features as well as outputs. We achieve the feature consistency with the proposed domain-aware supervised contrastive loss, which ensures domain invariance while increasing class discriminability. Experimental results demonstrate the effectiveness of the proposed feature stylization and the domain-aware contrastive loss. Through quantitative comparisons, we verify the lead of our method over existing state-of-the-art methods on two benchmarks, PACS and Office-Home.
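
Below is a rough, self-contained PyTorch sketch of the stylization step described above: features are split into low- and high-frequency components with an FFT low-pass filter, the low-frequency part is re-normalized with novel-domain statistics (AdaIN-style), and the two parts are re-merged. The cutoff value, the box-shaped low-pass mask, and how the novel-domain statistics are sampled are assumptions made for illustration, not the paper's exact design.

```python
import torch

def stylize_low_frequency(features, style_mean, style_std, cutoff=0.25, eps=1e-5):
    """Stylize only the low-frequency part of a feature map, keeping high-frequency shape cues.

    features: (B, C, H, W); style_mean / style_std: (B, C) novel-domain statistics
    (e.g., sampled from manipulated batch statistics). cutoff is the fraction of
    low frequencies kept in the low-pass branch.
    """
    B, C, H, W = features.shape
    freq = torch.fft.fftshift(torch.fft.fft2(features), dim=(-2, -1))

    # Centered box low-pass mask over the shifted spectrum.
    yy, xx = torch.meshgrid(torch.arange(H, device=features.device),
                            torch.arange(W, device=features.device), indexing='ij')
    mask = (((yy - H // 2).abs() <= cutoff * H / 2) &
            ((xx - W // 2).abs() <= cutoff * W / 2)).to(features.dtype)

    low = torch.fft.ifft2(torch.fft.ifftshift(freq * mask, dim=(-2, -1))).real
    high = features - low

    # AdaIN-style re-normalization of the low-frequency component only.
    mu = low.mean(dim=(2, 3), keepdim=True)
    sigma = low.std(dim=(2, 3), keepdim=True) + eps
    stylized_low = (low - mu) / sigma * style_std[..., None, None] + style_mean[..., None, None]
    return stylized_low + high
```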

3D Shapes Local Geometry Codes Learning with SDF

Comment: DLGC workshop in ICCV 2021

Link: http://arxiv.org/abs/2108.08593

Abstract

A signed distance function (SDF) as the 3D shape description is one of the most effective approaches to represent 3D geometry for rendering and reconstruction. Our work is inspired by the state-of-the-art method DeepSDF, which learns and analyzes the 3D shape as the iso-surface of its shell; this method has shown promising results especially in the 3D shape reconstruction and compression domain. In this paper, we consider the degeneration problem of reconstruction coming from the capacity decrease of the DeepSDF model, which approximates the SDF with a neural network and a single latent code. We propose Local Geometry Code Learning (LGCL), a model that improves the original DeepSDF results by learning from the local shape geometry of the full 3D shape. We add an extra graph neural network to split the single transmittable latent code into a set of local latent codes distributed on the 3D shape. These latent codes are used to approximate the SDF in their local regions, which alleviates the complexity of the approximation compared to the original DeepSDF. Furthermore, we introduce a new geometric loss function to facilitate the training of these local latent codes. Note that other local shape adjusting methods use the 3D voxel representation, which makes the problem highly difficult or even insolvable. In contrast, our architecture is based on implicit graph processing and performs the learning regression process directly in the latent code space, thus making the proposed architecture more flexible and also simple to realize. Our experiments on 3D shape reconstruction demonstrate that our LGCL method can keep more details with a significantly smaller SDF decoder and considerably outperforms the original DeepSDF method under the most important quantitative metrics.
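
For intuition, here is a minimal sketch of a DeepSDF-style decoder extended with local latent codes: each query point is conditioned on the code of its nearest anchor point rather than on one global code. The nearest-anchor lookup is a simplification assumed here; LGCL distributes the codes with a graph neural network and trains them with an additional geometric loss.

```python
import torch
import torch.nn as nn

class SDFDecoder(nn.Module):
    """DeepSDF-style MLP: (latent code, query point) -> distance value."""
    def __init__(self, latent_dim=64, hidden=256):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(latent_dim + 3, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
            nn.Linear(hidden, 1),
        )

    def forward(self, codes, points):
        # codes: (N, latent_dim), points: (N, 3) query coordinates
        return self.net(torch.cat([codes, points], dim=-1)).squeeze(-1)

def local_code_lookup(query_points, anchor_points, local_codes):
    """Assign each query point the latent code of its nearest anchor point.

    query_points: (N, 3), anchor_points: (M, 3), local_codes: (M, latent_dim)
    """
    dists = torch.cdist(query_points, anchor_points)   # (N, M) pairwise distances
    nearest = dists.argmin(dim=1)                       # (N,) index of closest anchor
    return local_codes[nearest]

# Usage: query distance values with locally assigned codes.
decoder = SDFDecoder()
anchors = torch.rand(32, 3)
codes = torch.randn(32, 64)
queries = torch.rand(100, 3)
distances = decoder(local_code_lookup(queries, anchors, codes), queries)
```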

Exploiting Scene Graphs for Human-Object Interaction Detection

Comment: Accepted to ICCV 2021

Link: http://arxiv.org/abs/2108.08584

Abstract

Human-Object Interaction (HOI) detection is a fundamental visual task aiming at localizing and recognizing interactions between humans and objects. Existing works focus on the visual and linguistic features of humans and objects. However, they do not capitalise on the high-level and semantic relationships present in the image, which provide crucial contextual and detailed relational knowledge for HOI inference. We propose a novel method to exploit this information, through the scene graph, for the Human-Object Interaction (SG2HOI) detection task. Our method, SG2HOI, incorporates the SG information in two ways: (1) we embed a scene graph into a global context clue, serving as the scene-specific environmental context; and (2) we build a relation-aware message-passing module to gather relationships from objects' neighborhoods and transfer them into interactions. Empirical evaluation shows that our SG2HOI method outperforms the state-of-the-art methods on two benchmark HOI datasets: V-COCO and HICO-DET. Code will be available at https://github.com/ht014/SG2HOI.

StructDepth: Leveraging the structural regularities for self-supervised indoor depth estimation

Comment: Accepted by ICCV2021. Project page: https://github.com/SJTU-ViSYS/StructDepth

Link: http://arxiv.org/abs/2108.08574

Abstract

Self-supervised monocular depth estimation has achieved impressive performance on outdoor datasets. Its performance, however, degrades notably in indoor environments because of the lack of textures. Without rich textures, the photometric consistency is too weak to train a good depth network. Inspired by early works on indoor modeling, we leverage the structural regularities exhibited in indoor scenes to train a better depth network. Specifically, we adopt two extra supervisory signals for self-supervised training: 1) the Manhattan normal constraint and 2) the co-planar constraint. The Manhattan normal constraint enforces the major surfaces (the floor, ceiling, and walls) to be aligned with the dominant directions. The co-planar constraint states that the 3D points should be well fitted by a plane if they are located within the same planar region. To generate the supervisory signals, we adopt two components to classify the major surface normals into dominant directions and detect the planar regions on the fly during training. As the predicted depth becomes more accurate after more training epochs, the supervisory signals also improve and in turn feed back to obtain a better depth model. Through extensive experiments on indoor benchmark datasets, the results show that our network outperforms the state-of-the-art methods. The source code is available at https://github.com/SJTU-ViSYS/StructDepth.
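
As a small illustration of the co-planar constraint mentioned above, the sketch below fits a plane to the 3D points assigned to one planar region (least squares via SVD) and penalizes point-to-plane distances. This is a generic formulation written in PyTorch, not necessarily the paper's exact loss.

```python
import torch

def coplanar_loss(points):
    """Penalize deviation of 3D points in one planar region from their best-fit plane.

    points: (N, 3) back-projected 3D points that a plane detector assigned to the
    same planar region. The plane is fitted by least squares using SVD.
    """
    centered = points - points.mean(dim=0, keepdim=True)
    # Plane normal = right singular vector with the smallest singular value.
    _, _, vh = torch.linalg.svd(centered, full_matrices=False)
    normal = vh[-1]
    distances = centered @ normal            # signed point-to-plane distances
    return distances.abs().mean()
```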

DECA: Deep viewpoint-Equivariant human pose estimation using Capsule Autoencoders

Comment: International Conference on Computer Vision 2021 (ICCV 2021), 8  pages, 4 figures, 4 tables, accepted for ICCV 2021 oral

Link: http://arxiv.org/abs/2108.08557

Abstract

Human Pose Estimation (HPE) aims at retrieving the 3D position of human joints from images or videos. We show that current 3D HPE methods suffer from a lack of viewpoint equivariance, namely they tend to fail or perform poorly when dealing with viewpoints unseen at training time. Deep learning methods often rely on either scale-invariant, translation-invariant, or rotation-invariant operations, such as max-pooling. However, the adoption of such procedures does not necessarily improve viewpoint generalization, rather leading to more data-dependent methods. To tackle this issue, we propose a novel capsule autoencoder network with fast Variational Bayes capsule routing, named DECA. By modeling each joint as a capsule entity, combined with the routing algorithm, our approach can preserve the joints' hierarchical and geometrical structure in the feature space, independently from the viewpoint. By achieving viewpoint equivariance, we drastically reduce the network data dependency at training time, resulting in an improved ability to generalize for unseen viewpoints. In the experimental validation, we outperform other methods on depth images from both seen and unseen viewpoints, both top-view and front-view. In the RGB domain, the same network gives state-of-the-art results on the challenging viewpoint transfer task, also establishing a new framework for top-view HPE. The code can be found at https://github.com/mmlab-cv/DECA.

A Unified Objective for Novel Class Discovery

Comment: ICCV 2021 (Oral)

Link: http://arxiv.org/abs/2108.08536

Abstract

In this paper, we study the problem of Novel Class Discovery (NCD). NCD aims at inferring novel object categories in an unlabeled set by leveraging prior knowledge of a labeled set containing different, but related, classes. Existing approaches tackle this problem by considering multiple objective functions, usually involving specialized loss terms for the labeled and the unlabeled samples respectively, and often requiring auxiliary regularization terms. In this paper, we depart from this traditional scheme and introduce a UNified Objective function (UNO) for discovering novel classes, with the explicit purpose of favoring synergy between supervised and unsupervised learning. Using a multi-view self-labeling strategy, we generate pseudo-labels that can be treated homogeneously with ground truth labels. This leads to a single classification objective operating on both known and unknown classes. Despite its simplicity, UNO outperforms the state of the art by a significant margin on several benchmarks (~+10% on CIFAR-100 and +8% on ImageNet). The project page is available at https://ncd-uno.github.io.
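
A hedged sketch of the multi-view self-labeling idea: sharpened, gradient-stopped predictions from one augmented view serve as pseudo-labels for the other view, so labeled and unlabeled samples can share a single cross-entropy objective. UNO actually produces its pseudo-labels with Sinkhorn-Knopp normalization; the plain softmax sharpening below is a simplification, and the temperature is illustrative.

```python
import torch
import torch.nn.functional as F

def swapped_self_labeling_loss(logits_v1, logits_v2, temperature=0.1):
    """Swapped-prediction loss over two augmented views of unlabeled samples.

    logits_v1, logits_v2: (B, K) classifier outputs for two views of the same B samples.
    Pseudo-labels from one view (sharpened, no gradient) supervise the other view.
    """
    with torch.no_grad():
        targets_v1 = F.softmax(logits_v1 / temperature, dim=-1)
        targets_v2 = F.softmax(logits_v2 / temperature, dim=-1)
    loss_12 = torch.sum(-targets_v2 * F.log_softmax(logits_v1, dim=-1), dim=-1).mean()
    loss_21 = torch.sum(-targets_v1 * F.log_softmax(logits_v2, dim=-1), dim=-1).mean()
    return 0.5 * (loss_12 + loss_21)
```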

Understanding and Mitigating Annotation Bias in Facial Expression Recognition

Comment: To appear in ICCV 2021

Link: http://arxiv.org/abs/2108.08504

Abstract

The performance of a computer vision model depends on the size and quality of its training data. Recent studies have unveiled previously-unknown composition biases in common image datasets which then lead to skewed model outputs, and have proposed methods to mitigate these biases. However, most existing works assume that human-generated annotations can be considered gold-standard and unbiased. In this paper, we reveal that this assumption can be problematic, and that special care should be taken to prevent models from learning such annotation biases. We focus on facial expression recognition and compare the label biases between lab-controlled and in-the-wild datasets. We demonstrate that many expression datasets contain significant annotation biases between genders, especially when it comes to the happy and angry expressions, and that traditional methods cannot fully mitigate such biases in trained models. To remove expression annotation bias, we propose an AU-Calibrated Facial Expression Recognition (AUC-FER) framework that utilizes facial action units (AUs) and incorporates the triplet loss into the objective function. Experimental results suggest that the proposed method is more effective in removing expression annotation bias than existing techniques.

Amplitude-Phase Recombination: Rethinking Robustness of Convolutional Neural Networks in Frequency Domain

Comment: ICCV 2021

Link: http://arxiv.org/abs/2108.08487

Abstract

Recently, the generalization behavior of Convolutional Neural Networks (CNNs) has gradually become transparent through explanation techniques based on frequency-component decomposition. However, the importance of the phase spectrum of the image for a robust vision system is still ignored. In this paper, we notice that the CNN tends to converge at a local optimum that is closely related to the high-frequency components of the training images, while the amplitude spectrum is easily disturbed, for example by noise or common corruptions. In contrast, more empirical studies have found that humans rely more on phase components to achieve robust recognition. This observation leads to more explanations of the CNN's generalization behaviors in both robustness to common perturbations and out-of-distribution detection, and motivates a new perspective on data augmentation designed by re-combining the phase spectrum of the current image and the amplitude spectrum of a distracter image. That is, the generated samples force the CNN to pay more attention to the structured information from phase components and to keep robust to the variation of the amplitude. Experiments on several image datasets indicate that the proposed method achieves state-of-the-art performance on multiple generalization and calibration tasks, including adaptability to common corruptions and surface variations, out-of-distribution detection, and adversarial attack.
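
The proposed augmentation is easy to picture with a few lines of NumPy: take the phase spectrum of the current image and the amplitude spectrum of a distracter image, recombine them, and invert the FFT. The sketch below assumes single-channel images; per-channel application and any mixing ratios used in the paper are omitted.

```python
import numpy as np

def amplitude_phase_recombine(img_a, img_b):
    """Combine the phase spectrum of img_a with the amplitude spectrum of img_b.

    Both inputs are 2D float arrays of the same shape (grayscale images).
    Returns a real-valued image that keeps the structure (phase) of img_a
    under the amplitude statistics of img_b.
    """
    fft_a = np.fft.fft2(img_a)
    fft_b = np.fft.fft2(img_b)
    amplitude_b = np.abs(fft_b)       # amplitude spectrum of the distracter image
    phase_a = np.angle(fft_a)         # phase spectrum of the current image
    recombined = amplitude_b * np.exp(1j * phase_a)
    return np.real(np.fft.ifft2(recombined))

# Usage: augment a training image with the amplitude of a randomly chosen distracter.
rng = np.random.default_rng(0)
current = rng.random((64, 64))
distracter = rng.random((64, 64))
augmented = amplitude_phase_recombine(current, distracter)
```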

Learning Anchored Unsigned Distance Functions with Gradient Direction Alignment for Single-view Garment Reconstruction

Comment: ICCV 2021

Link: http://arxiv.org/abs/2108.08478

Abstract

While single-view 3D reconstruction has made significant progress benefiting from deep shape representations in recent years, garment reconstruction is still not solved well due to open surfaces, diverse topologies and complex geometric details. In this paper, we propose a novel learnable Anchored Unsigned Distance Function (AnchorUDF) representation for 3D garment reconstruction from a single image. AnchorUDF represents 3D shapes by predicting unsigned distance fields (UDFs) to enable open garment surface modeling at arbitrary resolution. To capture diverse garment topologies, AnchorUDF not only computes pixel-aligned local image features of query points, but also leverages a set of anchor points located around the surface to enrich 3D position features for query points, which provides stronger 3D space context for the distance function. Furthermore, in order to obtain a more accurate point projection direction at inference, we explicitly align the spatial gradient direction of AnchorUDF with the ground-truth direction to the surface during training. Extensive experiments on two public 3D garment datasets, i.e., MGN and Deep Fashion3D, demonstrate that AnchorUDF achieves state-of-the-art performance on single-view garment reconstruction.
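
To illustrate the gradient-direction alignment mentioned above, here is a hedged PyTorch sketch: the spatial gradient of the predicted unsigned distance field at query points is obtained with autograd and aligned with ground-truth directions toward the surface via a cosine penalty. The model interface and the exact loss form are assumptions made for illustration.

```python
import torch
import torch.nn.functional as F

def gradient_alignment_loss(udf_model, query_points, gt_directions):
    """Align the spatial gradient of a predicted unsigned distance field with
    ground-truth directions toward the surface.

    udf_model:     differentiable module mapping (N, 3) points to (N,) unsigned distances
    query_points:  (N, 3) sampled query points
    gt_directions: (N, 3) unit vectors pointing from each query point toward the surface
    """
    query_points = query_points.clone().requires_grad_(True)
    udf = udf_model(query_points)
    # Spatial gradient of the distance field with respect to the query coordinates.
    grads, = torch.autograd.grad(udf.sum(), query_points, create_graph=True)
    # Cosine distance between the UDF gradient and the target direction.
    return (1.0 - F.cosine_similarity(grads, gt_directions, dim=-1)).mean()
```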

Medical Image Segmentation using 3D Convolutional Neural Networks: A Review

Comment: 17 pages, 4 figures

Link: http://arxiv.org/abs/2108.08467

Abstract

Computer-aided medical image analysis plays a significant role in assisting medical practitioners with expert clinical diagnosis and deciding the optimal treatment plan. At present, convolutional neural networks (CNNs) are the preferred choice for medical image analysis. In addition, with the rapid advancements in three-dimensional (3D) imaging systems and the availability of excellent hardware and software support to process large volumes of data, 3D deep learning methods are gaining popularity in medical image analysis. Here, we present an extensive review of the recently evolved 3D deep learning methods in medical image segmentation. Furthermore, the research gaps and future directions in 3D medical image segmentation are discussed.

Self-Supervised Video Representation Learning with Meta-Contrastive Network

Comment: Accepted to ICCV 2021

Link: http://arxiv.org/abs/2108.08426

Abstract

Self-supervised learning has been successfully applied to pre-train video representations, which aims at efficient adaptation from the pre-training domain to downstream tasks. Existing approaches merely leverage a contrastive loss to learn instance-level discrimination. However, the lack of category information leads to a hard-positive problem that constrains the generalization ability of this kind of method. We find that the multi-task process of meta learning can provide a solution to this problem. In this paper, we propose a Meta-Contrastive Network (MCN), which combines contrastive learning and meta learning, to enhance the learning ability of existing self-supervised approaches. Our method contains two training stages based on model-agnostic meta learning (MAML), each of which consists of a contrastive branch and a meta branch. Extensive evaluations demonstrate the effectiveness of our method. For two downstream tasks, i.e., video action recognition and video retrieval, MCN outperforms state-of-the-art approaches on the UCF101 and HMDB51 datasets. To be more specific, with an R(2+1)D backbone, MCN achieves Top-1 accuracies of 84.8% and 54.5% for video action recognition, as well as 52.5% and 23.7% for video retrieval.
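
For reference, the instance-level contrastive branch mentioned above typically boils down to an InfoNCE-style loss over two augmented views of the same clip, sketched below in PyTorch; the meta-learning (MAML) branch that MCN adds on top is not shown, and the temperature value is illustrative.

```python
import torch
import torch.nn.functional as F

def info_nce_loss(z1, z2, temperature=0.1):
    """Instance-level contrastive (InfoNCE) loss over two augmented clip embeddings.

    z1, z2: (B, D) embeddings of two views of the same B video clips; matching
    rows are positives, all other rows in the batch act as negatives.
    """
    z1, z2 = F.normalize(z1, dim=-1), F.normalize(z2, dim=-1)
    logits = z1 @ z2.t() / temperature                  # (B, B) similarity matrix
    targets = torch.arange(z1.size(0), device=z1.device)
    return F.cross_entropy(logits, targets)
```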

Generating Smooth Pose Sequences for Diverse Human Motion Prediction

Comment: ICCV 2021 (oral)

Link: http://arxiv.org/abs/2108.08422

Abstract

Recent progress in stochastic motion prediction, i.e., predicting multiple possible future human motions given a single past pose sequence, has led to producing truly diverse future motions and even providing control over the motion of some body parts. However, to achieve this, the state-of-the-art method requires learning several mappings for diversity and a dedicated model for controllable motion prediction. In this paper, we introduce a unified deep generative network for both diverse and controllable motion prediction. To this end, we leverage the intuition that realistic human motions consist of smooth sequences of valid poses, and that, given limited data, learning a pose prior is much more tractable than a motion one. We therefore design a generator that predicts the motion of different body parts sequentially, and introduce a normalizing-flow-based pose prior, together with a joint angle loss, to achieve motion realism. Our experiments on two standard benchmark datasets, Human3.6M and HumanEva-I, demonstrate that our approach outperforms the state-of-the-art baselines in terms of both sample diversity and accuracy. The code is available at https://github.com/wei-mao-2019/gsps

Exploiting Multi-Object Relationships for Detecting Adversarial Attacks in Complex Scenes

Comment: ICCV'21 Accepted

Link: http://arxiv.org/abs/2108.08421

Abstract

Vision systems that deploy Deep Neural Networks (DNNs) are known to be vulnerable to adversarial examples. Recent research has shown that checking the intrinsic consistencies in the input data is a promising way to detect adversarial attacks (e.g., by checking the object co-occurrence relationships in complex scenes). However, existing approaches are tied to specific models and do not offer generalizability. Motivated by the observation that language descriptions of natural scene images have already captured the object co-occurrence relationships that can be learned by a language model, we develop a novel approach to perform context consistency checks using such language models. The distinguishing aspect of our approach is that it is independent of the deployed object detector and yet offers very high accuracy in terms of detecting adversarial examples in practical scenes with multiple objects.

Provable Benefits of Actor-Critic Methods for Offline Reinforcement Learning

Comment: Initial submission; appeared as spotlight talk in ICML 2021 Workshop  on Theory of RL

Link: http://arxiv.org/abs/2108.08812

Abstract

Actor-critic methods are widely used in offline reinforcement learning practice, but are not so well-understood theoretically. We propose a new offline actor-critic algorithm that naturally incorporates the pessimism principle, leading to several key advantages compared to the state of the art. The algorithm can operate when the Bellman evaluation operator is closed with respect to the action value function of the actor's policies; this is a more general setting than the low-rank MDP model. Despite the added generality, the procedure is computationally tractable as it involves the solution of a sequence of second-order programs. We prove an upper bound on the suboptimality gap of the policy returned by the procedure that depends on the data coverage of any arbitrary, possibly data-dependent comparator policy. The achievable guarantee is complemented with a minimax lower bound that is matching up to logarithmic factors.
