蜗牛me

计算机视觉论文整理

经典论文

计算机视觉论文

ImageNet分类
物体检测
物体跟踪
低级视觉
边缘检测
语义分割
视觉注意力和显著性
物体识别
人体姿态估计
CNN原理和性质（Understanding CNN）
图像和语言
图像解说
视频解说
图像生成

微软ResNet

论文：用于图像识别的深度残差网络

作者：何恺明、张祥雨、任少卿和孙剑

链接：http://arxiv.org/pdf/1512.03385v1.pdf

微软PRelu（随机纠正线性单元/权重初始化）

论文：深入学习整流器：在ImageNet分类上超越人类水平

作者：何恺明、张祥雨、任少卿和孙剑

链接：http://arxiv.org/pdf/1502.01852.pdf

谷歌Batch Normalization

论文：批量归一化：通过减少内部协变量来加速深度网络训练

作者：Sergey Ioffe, Christian Szegedy

链接：http://arxiv.org/pdf/1502.03167.pdf

谷歌GoogLeNet

论文：更深的卷积，CVPR 2015

作者：Christian Szegedy, Wei Liu, Yangqing Jia, Pierre Sermanet, Scott Reed, Dragomir Anguelov, Dumitru Erhan, Vincent Vanhoucke, Andrew Rabinovich

链接：http://arxiv.org/pdf/1409.4842.pdf

牛津VGG-Net

论文：大规模视觉识别中的极深卷积网络，ICLR 2015

作者：Karen Simonyan & Andrew Zisserman

链接：http://arxiv.org/pdf/1409.1556.pdf

AlexNet

论文：使用深度卷积神经网络进行ImageNet分类

作者：Alex Krizhevsky, Ilya Sutskever, Geoffrey E. Hinton

链接：http://papers.nips.cc/paper/4824-imagenet-classification-with-deep-convolutional-neural-networks.pdf

物体检测

PVANET

论文：用于实时物体检测的深度轻量神经网络（PVANET：Deep but Lightweight Neural Networks for Real-time Object Detection）

作者：Kye-Hyeon Kim, Sanghoon Hong, Byungseok Roh, Yeongjae Cheon, Minje Park

链接：http://arxiv.org/pdf/1608.08021

纽约大学OverFeat

论文：使用卷积网络进行识别、定位和检测（OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks），ICLR 2014

作者：Pierre Sermanet, David Eigen, Xiang Zhang, Michael Mathieu, Rob Fergus, Yann LeCun

链接：http://arxiv.org/pdf/1312.6229.pdf

伯克利R-CNN

论文：精确物体检测和语义分割的丰富特征层次结构（Rich feature hierarchies for accurate object detection and semantic segmentation），CVPR 2014

作者：Ross Girshick, Jeff Donahue, Trevor Darrell, Jitendra Malik

链接：http://www.cv-foundation.org/openaccess/content_cvpr_2014/papers/Girshick_Rich_Feature_Hierarchies_2014_CVPR_paper.pdf

微软SPP

论文：视觉识别深度卷积网络中的空间金字塔池化（Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition），ECCV 2014

作者：何恺明、张祥雨、任少卿和孙剑

链接：http://arxiv.org/pdf/1406.4729.pdf

微软Fast R-CNN

论文：Fast R-CNN

作者：Ross Girshick

链接：http://arxiv.org/pdf/1504.08083.pdf

微软Faster R-CNN

论文：使用RPN走向实时物体检测（Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks）

作者：任少卿、何恺明、Ross Girshick、孙剑

链接：http://arxiv.org/pdf/1506.01497.pdf

牛津大学R-CNN minus R

论文：R-CNN minus R

作者：Karel Lenc, Andrea Vedaldi

链接：http://arxiv.org/pdf/1506.06981.pdf

端到端行人检测

论文：密集场景中端到端的行人检测（End-to-end People Detection in Crowded Scenes）

作者：Russell Stewart, Mykhaylo Andriluka

链接：http://arxiv.org/pdf/1506.04878.pdf

实时物体检测

论文：你只看一次：统一实时物体检测（You Only Look Once: Unified, Real-Time Object Detection）

作者：Joseph Redmon, Santosh Divvala, Ross Girshick, Ali Farhadi

链接：http://arxiv.org/pdf/1506.02640.pdf

Inside-Outside Net

论文：使用跳跃池化和RNN在场景中检测物体（Inside-Outside Net: Detecting Objects in Context with Skip Pooling and Recurrent Neural Networks）

作者：Sean Bell, C. Lawrence Zitnick, Kavita Bala, Ross Girshick

链接：http://arxiv.org/abs/1512.04143.pdf

微软ResNet

论文：用于图像识别的深度残差网络

作者：何恺明、张祥雨、任少卿和孙剑

链接：http://arxiv.org/pdf/1512.03385v1.pdf

R-FCN

论文：通过区域全卷积网络进行物体识别（R-FCN: Object Detection via Region-based Fully Convolutional Networks）

作者：代季峰，李益，何恺明，孙剑

链接：http://arxiv.org/abs/1605.06409

SSD

论文：单次多框检测器（SSD: Single Shot MultiBox Detector）

作者：Wei Liu, Dragomir Anguelov, Dumitru Erhan, Christian Szegedy, Scott Reed, Cheng-Yang Fu, Alexander C. Berg

链接：http://arxiv.org/pdf/1512.02325v2.pdf

速度/精度权衡

论文：现代卷积物体检测器的速度/精度权衡（Speed/accuracy trade-offs for modern convolutional object detectors）

作者：Jonathan Huang, Vivek Rathod, Chen Sun, Menglong Zhu, Anoop Korattikara, Alireza Fathi, Ian Fischer, Zbigniew Wojna, Yang Song, Sergio Guadarrama, Kevin Murphy

链接：http://arxiv.org/pdf/1611.10012v1.pdf

物体跟踪

论文：用卷积神经网络通过学习可区分的显著性地图实现在线跟踪（Online Tracking by Learning Discriminative Saliency Map with Convolutional Neural Network）

作者：Seunghoon Hong, Tackgeun You, Suha Kwak, Bohyung Han

地址：arXiv:1502.06796.

论文：DeepTrack：通过视觉跟踪的卷积神经网络学习辨别特征表征（DeepTrack: Learning Discriminative Feature Representations by Convolutional Neural Networks for Visual Tracking）

作者：Hanxi Li, Yi Li and Fatih Porikli

发表： BMVC, 2014.

论文：视觉跟踪中，学习深度紧凑图像表示（Learning a Deep Compact Image Representation for Visual Tracking）

作者：N Wang, DY Yeung

发表：NIPS, 2013.

论文：视觉跟踪的分层卷积特征（Hierarchical Convolutional Features for Visual Tracking）

作者：Chao Ma, Jia-Bin Huang, Xiaokang Yang and Ming-Hsuan Yang

发表： ICCV 2015

论文：完全卷积网络的视觉跟踪（Visual Tracking with fully Convolutional Networks）

作者：Lijun Wang, Wanli Ouyang, Xiaogang Wang, and Huchuan Lu,

发表：ICCV 2015

论文：学习多域卷积神经网络进行视觉跟踪（Learning Multi-Domain Convolutional Neural Networks for Visual Tracking）

作者：Hyeonseob Namand Bohyung Han

对象识别（Object Recognition）

论文：卷积神经网络弱监督学习（Weakly-supervised learning with convolutional neural networks）

作者：Maxime Oquab，Leon Bottou，Ivan Laptev，Josef Sivic，CVPR，2015

链接：
http://www.cv-foundation.org/openaccess/content_cvpr_2015/papers/Oquab_Is_Object_Localization_2015_CVPR_paper.pdf

FV-CNN

论文：深度滤波器组用于纹理识别和分割（Deep Filter Banks for Texture Recognition and Segmentation）

作者：Mircea Cimpoi, Subhransu Maji, Andrea Vedaldi, CVPR, 2015.

链接：
http://www.cv-foundation.org/openaccess/content_cvpr_2015/papers/Cimpoi_Deep_Filter_Banks_2015_CVPR_paper.pdf

人体姿态估计（Human Pose Estimation）

论文：使用 Part Affinity Field的实时多人2D姿态估计（Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields）

作者：Zhe Cao, Tomas Simon, Shih-En Wei, and Yaser Sheikh, CVPR, 2017.

论文：Deepcut：多人姿态估计的联合子集分割和标签（Deepcut: Joint subset partition and labeling for multi person pose estimation）

作者：Leonid Pishchulin, Eldar Insafutdinov, Siyu Tang, Bjoern Andres, Mykhaylo Andriluka, Peter Gehler, and Bernt Schiele, CVPR, 2016.

论文：Convolutional pose machines

作者：Shih-En Wei, Varun Ramakrishna, Takeo Kanade, and Yaser Sheikh, CVPR, 2016.

论文：人体姿态估计的 Stacked hourglass networks（Stacked hourglass networks for human pose estimation）

作者：Alejandro Newell, Kaiyu Yang, and Jia Deng, ECCV, 2016.

论文：用于视频中人体姿态估计的Flowing convnets（Flowing convnets for human pose estimation in videos）

作者：Tomas Pfister, James Charles, and Andrew Zisserman, ICCV, 2015.

论文：卷积网络和人类姿态估计图模型的联合训练（Joint training of a convolutional network and a graphical model for human pose estimation）

作者：Jonathan J. Tompson, Arjun Jain, Yann LeCun, Christoph Bregler, NIPS, 2014.

理解CNN

论文：通过测量同变性和等价性来理解图像表示(Understanding image representations by measuring their equivariance and equivalence)

作者：Karel Lenc, Andrea Vedaldi, CVPR, 2015.

链接：
http://www.cv-foundation.org/openaccess/content_cvpr_2015/papers/Lenc_Understanding_Image_Representations_2015_CVPR_paper.pdf

论文：深度神经网络容易被愚弄：无法识别的图像的高置信度预测（Deep Neural Networks are Easily Fooled:High Confidence Predictions for Unrecognizable Images）

作者：Anh Nguyen, Jason Yosinski, Jeff Clune, CVPR, 2015.

链接：
http://www.cv-foundation.org/openaccess/content_cvpr_2015/papers/Nguyen_Deep_Neural_Networks_2015_CVPR_paper.pdf

论文：通过反演理解深度图像表示（Understanding Deep Image Representations by Inverting Them）

作者：Aravindh Mahendran, Andrea Vedaldi, CVPR, 2015

链接：
http://www.cv-foundation.org/openaccess/content_cvpr_2015/papers/Mahendran_Understanding_Deep_Image_2015_CVPR_paper.pdf

论文：深度场景CNN中的对象检测器（Object Detectors Emerge in Deep Scene CNNs）

作者：Bolei Zhou, Aditya Khosla, Agata Lapedriza, Aude Oliva, Antonio Torralba, ICLR, 2015.

链接：http://arxiv.org/abs/1412.6856

论文：用卷积网络反演视觉表示（Inverting Visual Representations with Convolutional Networks）

作者：Alexey Dosovitskiy, Thomas Brox, arXiv, 2015.

链接：http://arxiv.org/abs/1506.02753

论文：可视化和理解卷积网络（Visualizing and Understanding Convolutional Networks）

作者：Matthrew Zeiler, Rob Fergus, ECCV, 2014.

链接：http://www.cs.nyu.edu/~fergus/papers/zeilerECCV2014.pdf

图像与语言

图像说明（Image Captioning）

UCLA / Baidu

用多模型循环神经网络解释图像（Explain Images with Multimodal Recurrent Neural Networks）

Junhua Mao, Wei Xu, Yi Yang, Jiang Wang, Alan L. Yuille, arXiv:1410.1090

http://arxiv.org/pdf/1410.1090

Toronto

使用多模型神经语言模型统一视觉语义嵌入（Unifying Visual-Semantic Embeddings with Multimodal Neural Language Models）

Ryan Kiros, Ruslan Salakhutdinov, Richard S. Zemel, arXiv:1411.2539.

http://arxiv.org/pdf/1411.2539

Berkeley

用于视觉识别和描述的长期循环卷积网络（Long-term Recurrent Convolutional Networks for Visual Recognition and Description）

Jeff Donahue, Lisa Anne Hendricks, Sergio Guadarrama, Marcus Rohrbach, Subhashini Venugopalan, Kate Saenko, Trevor Darrell, arXiv:1411.4389.

http://arxiv.org/pdf/1411.4389

Google

看图写字：神经图像说明生成器（Show and Tell: A Neural Image Caption Generator）

Oriol Vinyals, Alexander Toshev, Samy Bengio, Dumitru Erhan, arXiv:1411.4555.

http://arxiv.org/pdf/1411.4555

Stanford

用于生成图像描述的深度视觉语义对齐（Deep Visual-Semantic Alignments for Generating Image Description）

Andrej Karpathy, Li Fei-Fei, CVPR, 2015.

Web：http://cs.stanford.edu/people/karpathy/deepimagesent/

Paper：http://cs.stanford.edu/people/karpathy/cvpr2015.pdf

UML / UT

使用深度循环神经网络将视频转换为自然语言（Translating Videos to Natural Language Using Deep Recurrent Neural Networks）

Subhashini Venugopalan, Huijuan Xu, Jeff Donahue, Marcus Rohrbach, Raymond Mooney, Kate Saenko, NAACL-HLT, 2015.

http://arxiv.org/pdf/1412.4729

CMU / Microsoft

学习图像说明生成的循环视觉表示（Learning a Recurrent Visual Representation for Image Caption Generation）

Xinlei Chen, C. Lawrence Zitnick, arXiv:1411.5654.

Xinlei Chen, C. Lawrence Zitnick, Mind’s Eye: A Recurrent Visual Representation for Image Caption Generation, CVPR 2015

http://www.cs.cmu.edu/~xinleic/papers/cvpr15_rnn.pdf

Microsoft

从图像说明到视觉概念（From Captions to Visual Concepts and Back）

Hao Fang, Saurabh Gupta, Forrest Iandola, Rupesh Srivastava, Li Deng, Piotr Dollár, Jianfeng Gao, Xiaodong He, Margaret Mitchell, John C. Platt, C. Lawrence Zitnick, Geoffrey Zweig, CVPR, 2015.

http://arxiv.org/pdf/1411.4952

Univ. Montreal / Univ. Toronto

Show, Attend, and Tell：视觉注意力与神经图像标题生成（Show, Attend, and Tell: Neural Image Caption Generation with Visual Attention）

Kelvin Xu, Jimmy Lei Ba, Ryan Kiros, Kyunghyun Cho, Aaron Courville, Ruslan Salakhutdinov, Richard S. Zemel, Yoshua Bengio, arXiv:1502.03044 / ICML 2015

http://www.cs.toronto.edu/~zemel/documents/captionAttn.pdf

Idiap / EPFL / Facebook

基于短语的图像说明（Phrase-based Image Captioning）

Remi Lebret, Pedro O. Pinheiro, Ronan Collobert, arXiv:1502.03671 / ICML 2015

http://arxiv.org/pdf/1502.03671

UCLA / Baidu

像孩子一样学习：从图像句子描述快速学习视觉的新概念（Learning like a Child: Fast Novel Visual Concept Learning from Sentence Descriptions of Images）

Junhua Mao, Wei Xu, Yi Yang, Jiang Wang, Zhiheng Huang, Alan L. Yuille, arXiv:1504.06692

http://arxiv.org/pdf/1504.06692

MS + Berkeley

探索图像说明的最近邻方法（ Exploring Nearest Neighbor Approaches for Image Captioning）

Jacob Devlin, Saurabh Gupta, Ross Girshick, Margaret Mitchell, C. Lawrence Zitnick, arXiv:1505.04467

http://arxiv.org/pdf/1505.04467.pdf

图像说明的语言模型（Language Models for Image Captioning: The Quirks and What Works）

Jacob Devlin, Hao Cheng, Hao Fang, Saurabh Gupta, Li Deng, Xiaodong He, Geoffrey Zweig, Margaret Mitchell, arXiv:1505.01809

http://arxiv.org/pdf/1505.01809.pdf

阿德莱德

具有中间属性层的图像说明（ Image Captioning with an Intermediate Attributes Layer）

Qi Wu, Chunhua Shen, Anton van den Hengel, Lingqiao Liu, Anthony Dick, arXiv:1506.01144

蒂尔堡

通过图片学习语言(Learning language through pictures)

Grzegorz Chrupala, Akos Kadar, Afra Alishahi, arXiv:1506.03694

蒙特利尔大学

使用基于注意力的编码器-解码器网络描述多媒体内容（Describing Multimedia Content using Attention-based Encoder-Decoder Networks）

Kyunghyun Cho, Aaron Courville, Yoshua Bengio, arXiv:1507.01053

康奈尔

图像表示和神经图像说明的新领域（Image Representations and New Domains in Neural Image Captioning）

Jack Hessel, Nicolas Savva, Michael J. Wilber, arXiv:1508.02091

MS + City Univ. of HongKong

Learning Query and Image Similarities with Ranking Canonical Correlation Analysis

Ting Yao, Tao Mei, and Chong-Wah Ngo, ICCV, 2015

视频字幕（Video Captioning）

伯克利

Jeff Donahue, Lisa Anne Hendricks, Sergio Guadarrama, Marcus Rohrbach, Subhashini Venugopalan, Kate Saenko, Trevor Darrell, Long-term Recurrent Convolutional Networks for Visual Recognition and Description, CVPR, 2015.

犹他州/ UML / 伯克利

Subhashini Venugopalan, Huijuan Xu, Jeff Donahue, Marcus Rohrbach, Raymond Mooney, Kate Saenko, Translating Videos to Natural Language Using Deep Recurrent Neural Networks, arXiv:1412.4729.

微软

Yingwei Pan, Tao Mei, Ting Yao, Houqiang Li, Yong Rui, Joint Modeling Embedding and Translation to Bridge Video and Language, arXiv:1505.01861.

犹他州/ UML / 伯克利

Subhashini Venugopalan, Marcus Rohrbach, Jeff Donahue, Raymond Mooney, Trevor Darrell, Kate Saenko, Sequence to Sequence–Video to Text, arXiv:1505.00487.

蒙特利尔大学/ 舍布鲁克

Li Yao, Atousa Torabi, Kyunghyun Cho, Nicolas Ballas, Christopher Pal, Hugo Larochelle, Aaron Courville, Describing Videos by Exploiting Temporal Structure, arXiv:1502.08029

MPI / 伯克利

Anna Rohrbach, Marcus Rohrbach, Bernt Schiele, The Long-Short Story of Movie Description, arXiv:1506.01698

多伦多大学 / MIT

Yukun Zhu, Ryan Kiros, Richard Zemel, Ruslan Salakhutdinov, Raquel Urtasun, Antonio Torralba, Sanja Fidler, Aligning Books and Movies: Towards Story-like Visual Explanations by Watching Movies and Reading Books, arXiv:1506.06724

蒙特利尔大学

Kyunghyun Cho, Aaron Courville, Yoshua Bengio, Describing Multimedia Content using Attention-based Encoder-Decoder Networks, arXiv:1507.01053

TAU / 美国南加州大学

Dotan Kaufman, Gil Levi, Tal Hassner, Lior Wolf, Temporal Tessellation for Video Annotation and Summarization, arXiv:1612.06950.

图像生成

卷积/循环网络

论文：Conditional Image Generation with PixelCNN Decoders”

作者：Aäron van den Oord, Nal Kalchbrenner, Oriol Vinyals, Lasse Espeholt, Alex Graves, Koray Kavukcuoglu

论文：Learning to Generate Chairs with Convolutional Neural Networks

作者：Alexey Dosovitskiy, Jost Tobias Springenberg, Thomas Brox

发表：CVPR, 2015.

论文：DRAW: A Recurrent Neural Network For Image Generation

作者：Karol Gregor, Ivo Danihelka, Alex Graves, Danilo Jimenez Rezende, Daan Wierstra

发表：ICML, 2015.

对抗网络

论文：生成对抗网络（Generative Adversarial Networks）

作者：Ian J. Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, Yoshua Bengio

发表：NIPS, 2014.

论文：使用对抗网络Laplacian Pyramid 的深度生成图像模型（Deep Generative Image Models using a Laplacian Pyramid of Adversarial Networks）

作者：Emily Denton, Soumith Chintala, Arthur Szlam, Rob Fergus

发表：NIPS, 2015.

论文：生成模型演讲概述（A note on the evaluation of generative models）

作者：Lucas Theis, Aäron van den Oord, Matthias Bethge

发表：ICLR 2016.

论文：变分自动编码深度高斯过程（Variationally Auto-Encoded Deep Gaussian Processes）

作者：Zhenwen Dai, Andreas Damianou, Javier Gonzalez, Neil Lawrence

发表：ICLR 2016.

论文：用注意力机制从字幕生成图像（Generating Images from Captions with Attention）

作者：Elman Mansimov, Emilio Parisotto, Jimmy Ba, Ruslan Salakhutdinov

发表： ICLR 2016

论文：分类生成对抗网络的无监督和半监督学习（Unsupervised and Semi-supervised Learning with Categorical Generative Adversarial Networks）

作者：Jost Tobias Springenberg

发表：ICLR 2016

论文：用一个对抗检测表征（Censoring Representations with an Adversary）

作者：Harrison Edwards, Amos Storkey

发表：ICLR 2016

论文：虚拟对抗训练实现分布式顺滑（Distributional Smoothing with Virtual Adversarial Training）

作者：Takeru Miyato, Shin-ichi Maeda, Masanori Koyama, Ken Nakae, Shin Ishii

发表：ICLR 2016

论文：自然图像流形上的生成视觉操作（Generative Visual Manipulation on the Natural Image Manifold）

作者：朱俊彦, Philipp Krahenbuhl, Eli Shechtman, and Alexei A. Efros

发表： ECCV 2016.

论文：深度卷积生成对抗网络的无监督表示学习（Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks）

作者：Alec Radford, Luke Metz, Soumith Chintala

发表： ICLR 2016

问题回答

弗吉尼亚大学 / 微软研究院

论文：VQA: Visual Question Answering, CVPR, 2015 SUNw:Scene Understanding workshop.

作者：Stanislaw Antol, Aishwarya Agrawal, Jiasen Lu, Margaret Mitchell, Dhruv Batra, C. Lawrence Zitnick, Devi Parikh

MPI / 伯克利

论文：Ask Your Neurons: A Neural-based Approach to Answering Questions about Images

作者：Mateusz Malinowski, Marcus Rohrbach, Mario Fritz,

发布： arXiv:1505.01121.

多伦多

论文： Image Question Answering: A Visual Semantic Embedding Model and a New Dataset

作者：Mengye Ren, Ryan Kiros, Richard Zemel

发表： arXiv:1505.02074 / ICML 2015 deep learning workshop.

百度/ 加州大学洛杉矶分校

作者：Hauyuan Gao, Junhua Mao, Jie Zhou, Zhiheng Huang, Lei Wang, 徐伟

论文：Are You Talking to a Machine? Dataset and Methods for Multilingual Image Question Answering

发表： arXiv:1505.05612.

POSTECH（韩国）

论文：Image Question Answering using Convolutional Neural Network with Dynamic Parameter Prediction

作者：Hyeonwoo Noh, Paul Hongsuck Seo, and Bohyung Han

发表： arXiv:1511.05765

CMU / 微软研究院

论文：Stacked Attention Networks for Image Question Answering

作者：Yang, Z., He, X., Gao, J., Deng, L., & Smola, A. (2015)

发表： arXiv:1511.02274.

MetaMind

论文：Dynamic Memory Networks for Visual and Textual Question Answering

作者：Xiong, Caiming, Stephen Merity, and Richard Socher

发表： arXiv:1603.01417 (2016).

首尔国立大学 + NAVER

论文：Multimodal Residual Learning for Visual QA

作者：Jin-Hwa Kim, Sang-Woo Lee, Dong-Hyun Kwak, Min-Oh Heo, Jeonghee Kim, Jung-Woo Ha, Byoung-Tak Zhang

发表：arXiv:1606:01455

UC Berkeley + 索尼

论文：Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding

作者：Akira Fukui, Dong Huk Park, Daylen Yang, Anna Rohrbach, Trevor Darrell, and Marcus Rohrbach

发表：arXiv:1606.01847

Postech

论文：Training Recurrent Answering Units with Joint Loss Minimization for VQA

作者：Hyeonwoo Noh and Bohyung Han

发表： arXiv:1606.03647

首尔国立大学 + NAVER

论文： Hadamard Product for Low-rank Bilinear Pooling

作者：Jin-Hwa Kim, Kyoung Woon On, Jeonghee Kim, Jung-Woo Ha, Byoung-Tak Zhan

发表：arXiv:1610.04325.

视觉注意力和显著性

论文：Predicting Eye Fixations using Convolutional Neural Networks

作者：Nian Liu, Junwei Han, Dingwen Zhang, Shifeng Wen, Tianming Liu

发表：CVPR, 2015.

学习地标的连续搜索

作者：Learning a Sequential Search for Landmarks

论文：Saurabh Singh, Derek Hoiem, David Forsyth

发表：CVPR, 2015.

视觉注意力机制实现多物体识别

论文：Multiple Object Recognition with Visual Attention

作者：Jimmy Lei Ba, Volodymyr Mnih, Koray Kavukcuoglu,

发表：ICLR, 2015.

视觉注意力机制的循环模型

作者：Volodymyr Mnih, Nicolas Heess, Alex Graves, Koray Kavukcuoglu

论文：Recurrent Models of Visual Attention

发表：NIPS, 2014.

低级视觉

超分辨率

Iterative Image Reconstruction

Sven Behnke: Learning Iterative Image Reconstruction. IJCAI, 2001.

Sven Behnke: Learning Iterative Image Reconstruction in the Neural Abstraction Pyramid. International Journal of Computational Intelligence and Applications, vol. 1, no. 4, pp. 427-438, 2001.

Super-Resolution (SRCNN)

Chao Dong, Chen Change Loy, Kaiming He, Xiaoou Tang, Learning a Deep Convolutional Network for Image Super-Resolution, ECCV, 2014.

Chao Dong, Chen Change Loy, Kaiming He, Xiaoou Tang, Image Super-Resolution Using Deep Convolutional Networks, arXiv:1501.00092.

Very Deep Super-Resolution

Jiwon Kim, Jung Kwon Lee, Kyoung Mu Lee, Accurate Image Super-Resolution Using Very Deep Convolutional Networks, arXiv:1511.04587, 2015.

Deeply-Recursive Convolutional Network

Jiwon Kim, Jung Kwon Lee, Kyoung Mu Lee, Deeply-Recursive Convolutional Network for Image Super-Resolution, arXiv:1511.04491, 2015.

Casade-Sparse-Coding-Network

Zhaowen Wang, Ding Liu, Wei Han, Jianchao Yang and Thomas S. Huang, Deep Networks for Image Super-Resolution with Sparse Prior. ICCV, 2015.

Perceptual Losses for Super-Resolution

Justin Johnson, Alexandre Alahi, Li Fei-Fei, Perceptual Losses for Real-Time Style Transfer and Super-Resolution, arXiv:1603.08155, 2016.

SRGAN

Christian Ledig, Lucas Theis, Ferenc Huszar, Jose Caballero, Andrew Cunningham, Alejandro Acosta, Andrew Aitken, Alykhan Tejani, Johannes Totz, Zehan Wang, Wenzhe Shi, Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network, arXiv:1609.04802v3, 2016.

其他应用

Optical Flow (FlowNet)

Philipp Fischer, Alexey Dosovitskiy, Eddy Ilg, Philip Häusser, Caner Hazırbaş, Vladimir Golkov, Patrick van der Smagt, Daniel Cremers, Thomas Brox, FlowNet: Learning Optical Flow with Convolutional Networks, arXiv:1504.06852.

Compression Artifacts Reduction

Chao Dong, Yubin Deng, Chen Change Loy, Xiaoou Tang, Compression Artifacts Reduction by a Deep Convolutional Network, arXiv:1504.06993.

Blur Removal

Christian J. Schuler, Michael Hirsch, Stefan Harmeling, Bernhard Schölkopf, Learning to Deblur, arXiv:1406.7444

Jian Sun, Wenfei Cao, Zongben Xu, Jean Ponce, Learning a Convolutional Neural Network for Non-uniform Motion Blur Removal, CVPR, 2015

Image Deconvolution

Li Xu, Jimmy SJ. Ren, Ce Liu, Jiaya Jia, Deep Convolutional Neural Network for Image Deconvolution, NIPS, 2014.

Deep Edge-Aware Filter

Li Xu, Jimmy SJ. Ren, Qiong Yan, Renjie Liao, Jiaya Jia, Deep Edge-Aware Filters, ICML, 2015.

Computing the Stereo Matching Cost with a Convolutional Neural Network

Jure Žbontar, Yann LeCun, Computing the Stereo Matching Cost with a Convolutional Neural Network, CVPR, 2015.

Colorful Image Colorization Richard Zhang, Phillip Isola, Alexei A. Efros, ECCV, 2016

Feature Learning by Inpainting

Deepak Pathak, Philipp Krahenbuhl, Jeff Donahue, Trevor Darrell, Alexei A. Efros, Context Encoders: Feature Learning by Inpainting, CVPR, 2016

边缘检测

Saining Xie, Zhuowen Tu, Holistically-Nested Edge Detection, arXiv:1504.06375.

DeepEdge

Gedas Bertasius, Jianbo Shi, Lorenzo Torresani, DeepEdge: A Multi-Scale Bifurcated Deep Network for Top-Down Contour Detection, CVPR, 2015.

DeepContour

Wei Shen, Xinggang Wang, Yan Wang, Xiang Bai, Zhijiang Zhang, DeepContour: A Deep Convolutional Feature Learned by Positive-Sharing Loss for Contour Detection, CVPR, 2015.

语义分割

SEC: Seed, Expand and Constrain

Alexander Kolesnikov, Christoph Lampert, Seed, Expand and Constrain: Three Principles for Weakly-Supervised Image Segmentation, ECCV, 2016.

Adelaide

Guosheng Lin, Chunhua Shen, Ian Reid, Anton van dan Hengel, Efficient piecewise training of deep structured models for semantic segmentation, arXiv:1504.01013. (1st ranked in VOC2012)

Guosheng Lin, Chunhua Shen, Ian Reid, Anton van den Hengel, Deeply Learning the Messages in Message Passing Inference, arXiv:1508.02108. (4th ranked in VOC2012)

Deep Parsing Network (DPN)

Ziwei Liu, Xiaoxiao Li, Ping Luo, Chen Change Loy, Xiaoou Tang, Semantic Image Segmentation via Deep Parsing Network, arXiv:1509.02634 / ICCV 2015 (2nd ranked in VOC 2012)

CentraleSuperBoundaries, INRIA

Iasonas Kokkinos, Surpassing Humans in Boundary Detection using Deep Learning, arXiv:1411.07386 (4th ranked in VOC 2012)

BoxSup

Jifeng Dai, Kaiming He, Jian Sun, BoxSup: Exploiting Bounding Boxes to Supervise Convolutional Networks for Semantic Segmentation, arXiv:1503.01640. (6th ranked in VOC2012)

POSTECH

Hyeonwoo Noh, Seunghoon Hong, Bohyung Han, Learning Deconvolution Network for Semantic Segmentation, arXiv:1505.04366. (7th ranked in VOC2012)

Seunghoon Hong, Hyeonwoo Noh, Bohyung Han, Decoupled Deep Neural Network for Semi-supervised Semantic Segmentation, arXiv:1506.04924.

Seunghoon Hong,Junhyuk Oh,Bohyung Han, andHonglak Lee, Learning Transferrable Knowledge for Semantic Segmentation with Deep Convolutional Neural Network, arXiv:1512.07928

Conditional Random Fields as Recurrent Neural Networks

Shuai Zheng, Sadeep Jayasumana, Bernardino Romera-Paredes, Vibhav Vineet, Zhizhong Su, Dalong Du, Chang Huang, Philip H. S. Torr, Conditional Random Fields as Recurrent Neural Networks, arXiv:1502.03240. (8th ranked in VOC2012)

DeepLab

Liang-Chieh Chen, George Papandreou, Kevin Murphy, Alan L. Yuille, Weakly-and semi-supervised learning of a DCNN for semantic image segmentation, arXiv:1502.02734. (9th ranked in VOC2012)

Zoom-out

Mohammadreza Mostajabi, Payman Yadollahpour, Gregory Shakhnarovich, Feedforward Semantic Segmentation With Zoom-Out Features, CVPR, 2015

Joint Calibration

Holger Caesar, Jasper Uijlings, Vittorio Ferrari, Joint Calibration for Semantic Segmentation, arXiv:1507.01581.

Fully Convolutional Networks for Semantic Segmentation

Jonathan Long, Evan Shelhamer, Trevor Darrell, Fully Convolutional Networks for Semantic Segmentation, CVPR, 2015.

Hypercolumn

Bharath Hariharan, Pablo Arbelaez, Ross Girshick, Jitendra Malik, Hypercolumns for Object Segmentation and Fine-Grained Localization, CVPR, 2015.

Deep Hierarchical Parsing

Abhishek Sharma, Oncel Tuzel, David W. Jacobs, Deep Hierarchical Parsing for Semantic Segmentation, CVPR, 2015.

Learning Hierarchical Features for Scene Labeling

Clement Farabet, Camille Couprie, Laurent Najman, Yann LeCun, Scene Parsing with Multiscale Feature Learning, Purity Trees, and Optimal Covers, ICML, 2012.

Clement Farabet, Camille Couprie, Laurent Najman, Yann LeCun, Learning Hierarchical Features for Scene Labeling, PAMI, 2013.

University of Cambridge

Vijay Badrinarayanan, Alex Kendall and Roberto Cipolla “SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation.” arXiv preprint arXiv:1511.00561, 2015.

Alex Kendall, Vijay Badrinarayanan and Roberto Cipolla “Bayesian SegNet: Model Uncertainty in Deep Convolutional Encoder-Decoder Architectures for Scene Understanding.” arXiv preprint arXiv:1511.02680, 2015.

Princeton

Fisher Yu, Vladlen Koltun, “Multi-Scale Context Aggregation by Dilated Convolutions”, ICLR 2016

Univ. of Washington, Allen AI

Hamid Izadinia, Fereshteh Sadeghi, Santosh Kumar Divvala, Yejin Choi, Ali Farhadi, “Segment-Phrase Table for Semantic Segmentation, Visual Entailment and Paraphrasing”, ICCV, 2015

INRIA

Iasonas Kokkinos, “Pusing the Boundaries of Boundary Detection Using deep Learning”, ICLR 2016

UCSB

Niloufar Pourian, S. Karthikeyan, and B.S. Manjunath, “Weakly supervised graph based semantic segmentation by learning communities of image-parts”, ICCV, 2015

其他资源

课程

深度视觉

[斯坦福] CS231n: Convolutional Neural Networks for Visual Recognition

[香港中文大学] ELEG 5040: Advanced Topics in Signal Processing(Introduction to Deep Learning)

· 更多深度课程推荐

[斯坦福] CS224d: Deep Learning for Natural Language Processing

[牛津 Deep Learning by Prof. Nando de Freitas

[纽约大学] Deep Learning by Prof. Yann LeCun

图书

免费在线图书

Deep Learning by Ian Goodfellow, Yoshua Bengio, and Aaron Courville

Neural Networks and Deep Learning by Michael Nielsen

Deep Learning Tutorial by LISA lab, University of Montreal

视频

演讲

Deep Learning, Self-Taught Learning and Unsupervised Feature Learning By Andrew Ng

Recent Developments in Deep Learning By Geoff Hinton

The Unreasonable Effectiveness of Deep Learning by Yann LeCun

Deep Learning of Representations by Yoshua bengio

软件

框架

Tensorflow: An open source software library for numerical computation using data flow graph by Google [Web]
Torch7: Deep learning library in Lua, used by Facebook and Google Deepmind [Web]
Torch-based deep learning libraries: [torchnet],
Caffe: Deep learning framework by the BVLC [Web]
Theano: Mathematical library in Python, maintained by LISA lab [Web]
Theano-based deep learning libraries: [Pylearn2], [Blocks], [Keras], [Lasagne]
MatConvNet: CNNs for MATLAB [Web]
MXNet: A flexible and efficient deep learning library for heterogeneous distributed systems with multi-language support [Web]
Deepgaze: A computer vision library for human-computer interaction based on CNNs [Web]

应用

对抗训练 Code and hyperparameters for the paper “Generative Adversarial Networks” [Web]
理解与可视化 Source code for “Understanding Deep Image Representations by Inverting Them,” CVPR, 2015. [Web]
词义分割 Source code for the paper “Rich feature hierarchies for accurate object detection and semantic segmentation,” CVPR, 2014. [Web] ； Source code for the paper “Fully Convolutional Networks for Semantic Segmentation,” CVPR, 2015. [Web]
超分辨率 Image Super-Resolution for Anime-Style-Art [Web]
边缘检测 Source code for the paper “DeepContour: A Deep Convolutional Feature Learned by Positive-Sharing Loss for Contour Detection,” CVPR, 2015. [Web]
Source code for the paper “Holistically-Nested Edge Detection”, ICCV 2015. [Web]

讲座

[CVPR 2014] Tutorial on Deep Learning in Computer Vision
[CVPR 2015] Applied Deep Learning for Computer Vision with Torch

博客

Deep down the rabbit hole: CVPR 2015 and beyond@Tombone’s Computer Vision Blog
CVPR recap and where we’re going@Zoya Bylinskii (MIT PhD Student)’s Blog
Facebook’s AI Painting@Wired
Inceptionism: Going Deeper into Neural Networks@Google Research
Implementing Neural networks

你可能感兴趣的:(深度学习,github,python,计算机视觉论文)

量化交易系统中如何处理机器学习模型的训练和部署？ openwin_top 量化交易系统开发机器学习人工智能量化交易
microPythonPython最小内核源码解析NI-motion运动控制c语言示例代码解析python编程示例系列python编程示例系列二python的Web神器Streamlit如何应聘高薪职位量化交易系统中，机器学习模型的训练和部署需要遵循一套严密的流程，以确保模型的可靠性、性能和安全性。以下是详细描述以及相关的示例：1.数据收集和预处理数据收集在量化交易中，数据是最重要的资产。收集的数
Mac下载python并安装小小酥*
下载pythonPython官网：https://www.python.org/进入官网后点击download，选择MacOSX版本2.安装MAC系统一般都自带有Python2.x版本的环境，你也可以在链接https://www.python.org/downloads/mac-osx/上下载最新版安装。3.设置环境变量程序和可执行文件可以在许多目录，而这些路径很可能不在操作系统提供可执行文件的搜
Python使用minIO上传下载身似山河挺脊梁 python
前提VSCode+Python3.9minIO有Python的例子1.python生成临时文件2.写入一些数据3.上传到minIO4.获取分享出连接5.发出通知#创建一个客户端minioClient=Minio(endpoint='xx',access_key='xx',secret_key='xx',secure=False)#生成文件名current_datetime=datetime.dat
深入理解Python上下文管理器 ……-…… python 开发语言
1.什么是上下文管理器？2.with语句的魔法3.创建上下文管理器的两种方式3.1基于类的实现3.2使用contextlib模块4.异常处理1.什么是上下文管理器？上下文管理器（ContextManager）是Python中用于精确分配和释放资源的机制。它通过__enter__()和__exit__()两个魔术方法实现了上下文管理协议，确保即使在代码执行出错的情况下，资源也能被正确清理。#经典文件
【Appium】Appium征服安卓自动化：GitHub 10.5k+星开源神器，Python代码实战全解析！山河不见老 python 测试 appium android 自动化
Appium一、为什么开发者都在用Appium？二、环境搭建：5分钟极速配置2.1核心工具链2.2安卓设备连接三、脚本实战：从零编写自动化操作3.1示例1：自动登录微信并发送消息3.2示例2：动态滑动屏幕与数据抓取四、避坑指南4.1元素定位优化4.2稳定性增强4.3云真机集成五、生态扩展：超越安卓的自动化版图一、为什么开发者都在用Appium？万星认证：GitHub超10.5k+星标，活跃社区持续
基于Streamlit实现的音频处理示例大霸王龙音视频 ffmpeg
基于Streamlit实现的音频处理示例，包含录音、语音转文本、文件下载和进度显示功能，整合了多个技术方案：一、环境准备#安装依赖库pipinstallstreamlitstreamlit-webrtcaudio-recorder-streamlitopenai-whisperpython-dotx二、完整示例代码importstreamlitasstfromaudio_recorder_stre
npm错误 gyp错误 vs版本不对 msvs_version不兼容澎湖Java架构师前端 html npm node.js 前端
npm错误gyp错误vs版本不对msvs_version不兼容windowsSDK报错执行更新GYP语句第一种方案第二种方案执行更新GYP语句npminstall-gnode-gyp最新的GYP好像已经不支持Python2.7版本，npm会提示你更新都3.*.*版本安装Node.js的时候一定要勾选以下这个，会自动检测安装缺少的环境第一种方案管理员运行CMD（PowerShell也行）执行更新工具
深入了解 ArangoDB 的图数据库应用与 Python 实践 eahba 数据库 python 开发语言
在当前数据驱动的时代，对连接数据的高效处理和分析需求日益增长。ArangoDB作为一个可扩展的图数据库系统，能够加速从连接数据中获取价值。本文将介绍如何使用Python连接和操作ArangoDB，并展示如何结合图问答链来获取数据洞察。技术背景介绍ArangoDB是一个多模型数据库，支持文档、图和键值类型的数据存储。其强大的图形存储和查询能力使其成为处理复杂数据关系的理想选择。通过JSON支持和单一
不懂英语可以学编程吗?,不懂英文可以学编程吗 P5688346 人工智能
大家好，给大家分享一下英语不好能学python编程吗，很多人还不知道这一点。下面详细解释一下。现在让我们来看看！Sourcecodedownload:本文相关源码提到人工智能，就不得不提Python编程语言，大多数人觉得编程语言肯定会涉及到很多代码，满屏的英文字母，想想就头疼，觉得自己不会英语，肯定学不好Python，但是不会英语到底能不能够学习Python呢，下面小编给大家分析分析。其实各位想要
android视频缓存框架 [AndroidVideoCache](https://github.com/danikula/AndroidVideoCache) 源码解析与评估 MrJarvisDong third party 源码
文章目录android视频缓存框架[AndroidVideoCache](https://github.com/danikula/AndroidVideoCache)源码解析与评估引言使用方式关键类解析HttpProxyCacheServer代理缓存服务类**java.net.ProxySelector**代理选择Pinger判断本地serverSocket是否存活GetRequest封装用于获取
一、Python入门基础 MeyrlNotFound python 开发语言
1.Python简介与环境搭建•了解Python的历史、特点和应用领域Python的历史Python是一种高级编程语言，由GuidovanRossum于1989年发明。Python语言的设计目标是让代码易读、易写、易维护，从而提高开发效率和代码质量。自其诞生以来，Python已从一个简单的系统管理工具发展成为一种广泛应用于多个领域的编程语言。Python的特点1.简单易学：Python的语法简洁明
基于JAVA中的spring框架和jsp实现自然灾害论坛平台项目【附项目源码+论文说明】大雄是个程序员项目实践自然灾害论坛平台 java 项目源码 spring 毕业设计课程设计网页设计
摘要在上个世纪末期，也就是20世纪末，随着计算机技术的发展与进步和数据库方面的知识在互联网的大力运用，互联网技术以及网站技术在网上的大力推广，网上论坛（自然灾害论坛）也逐渐在网兴起，它的出现帮助了网上各种特定的群体进行一个在线的知识传递与信息的交流。本计算机自然灾害论坛设计，采用了JSP（JAVA）技术和MYSQL数据库开发，尝试实现了自然灾害论坛的基本功能以及帮助我们掌握了论坛技术的核心特点。该
npm error gyp info 计算机辅助工程 npm 前端 node.js
在使用npm安装Node.js包时，可能会遇到各种错误，其中gyp错误是比较常见的一种。gyp是Node.js的一个工具，用于编译C++代码。这些错误通常发生在需要编译原生模块的npm包时。下面是一些常见的原因和解决方法：常见原因及解决方法Python未安装或版本不兼容：Node.js使用Python来运行gyp。确保你的系统上安装了Python，并且版本与node-gyp兼容。通常推荐使用Pyt
股票量化交易开发 Yfinance 数字化转型2025 python 开发语言
以下是一段基于Python的股票量化分析代码，包含数据获取、技术指标计算、策略回测和可视化功能：pythonimportyfinanceasyfimportpandasaspdimportnumpyasnpimportmatplotlib.pyplotaspltimportseabornassnsfrombacktestingimportBacktest,Strategyfrombacktesti
sqlmap笔记君如尘网络安全-渗透笔记笔记
1.运行环境sqlmap是用Python编写的，因此首先需要确保你的系统上安装了Python。sqlmap支持Python2.6、2.7和Python3.4及以上版本。2.常用命令通用格式：bythonsqlmap.py-r注入点地址--参数-rpost请求-uget请求--level=测试等级--risk=测试风险-v显示详细信息级别-p针对某个注入点注入-threads更改线程数，加速--ba
python环境部署工具 uv Honnnnnn uv
以原先使用的pipenv工具为例子，通过pipfile.lock生成requirements文件，再将requirements转成pyproject.toml文件，最后生成uv.lock基于当前虚拟环境导出requirements.txt--pipfreeze>requirements.txt（如果原先不是env而是基础的通过requirements.txt文件，省去转化requirements的
穴位按摩培训系统Django-SpringBoot-php-Node.js-flask QQ188083800 django spring boot php
目录具体实现截图技术栈介绍系统设计研究方法：设计步骤设计流程核心代码部分展示研究方法详细视频演示试验方案论文大纲源码获取/详细视频演示具体实现截图技术栈介绍本课题的研究方法和研究步骤基本合理，难度适中，本选题是学生所学专业知识的延续，符合学生专业发展方向，对于提高学生的基本知识和技能以及钻研能力有益。该学生能够在预定时间内完成该课题的设计。研究的选题立意明确，结构合理，研究内容充实，研究方法准确有
【读点论文】Chain Replication for Supporting High Throughput and Availability 寻雾&启示分布式系统论文阅读
在分布式系统中，强一致性往往和高可用、高吞吐是矛盾的。比如传统的关系型数据库，其保证了强一致性，但往往牺牲了可用性和吞吐量。而像NoSQL数据库，虽然其吞吐量、和扩展性很高，但往往只支持最终一致性，无法保证强一致性。由此ChainReplicationforSupportingHighThroughputandAvailability提出了链式复制协议，旨在保证高吞吐、高可用的同时，支持数据的强一
leetcode-hot100-python-专题三：滑动窗口 ༺ Dorothy ༻ leetcode hot100 leetcode python 算法
1、无重复字符的最长子串中等给定一个字符串s，请你找出其中不含有重复字符的最长子串的长度。示例1:输入:s=“abcabcbb”输出:3解释:因为无重复字符的最长子串是“abc”，所以其长度为3示例2:输入:s=“bbbbb”输出:1解释:因为无重复字符的最长子串是“b”，所以其长度为1。示例3:输入:s=“pwwkew”输出:3解释:因为无重复字符的最长子串是“wke”，所以其长度为3。请注意，
Python UV - 安装、升级、卸载云客Coder python uv 开发语言
文章目录安装检查升级设置自动补全卸载UV命令官方文档详见：https://docs.astral.sh/uv/getting-started/installation/安装pipinstalluv检查安装后可运行下面命令，查看是否安装成功uv--version%uv--versionuv0.6.3(a0b9f22a22025-02-24)升级uvselfupdate将重新运行安装程序并可能修改您的
使用Python构建去中心化预测市场：从概念到实现 Echo_Wish Python！实战！python 去中心化开发语言
使用Python构建去中心化预测市场：从概念到实现大家好，我是Echo_Wish。今天，我们将深入探讨一个前沿的区块链应用——去中心化预测市场，并学习如何使用Python来构建一个简易的预测市场平台。预测市场是基于市场参与者对未来事件的预测来产生结果的地方，通常被用来预测政治事件、金融市场走向、体育比赛结果等。传统的预测市场如Augur、Polymarket等，基于去中心化平台，利用区块链技术确保
Python自动登陆、登出南京理工大学NJUST校园网程序 JimesMz python 开发语言
本文程序针对南京理工大学NJUST和NJUST-FREE校园网开发，其他学校无法使用。文章目录开发目的使用说明参考资料开发目的今天突然想要用代码实现一下自动登陆校园网，上网搜寻了一下。知乎有一些教程，CSDN也有一些完整的代码，但是我跟随教程或者直接运行现有代码都没有能够成功登陆，且NJUST校园网付费，我想要一个“登出”功能，借助Kimi自己写了一下。本人技术不精，以实现功能为主。使用说明请确保
Python爬虫笔记一（来自MOOC） Requests库入门小灰不停前进 #Python python pycharm 爬虫
Python爬虫笔记一通用代码框架：importrequestsdefgetHTMLText(url):try:r=requests.get(url,timeput=30)r.raise_for_status()#如果状态不是200，引发HTTPError异常r.encoding=r.apparemt_encodingreturnr.textexcept:return"产生异常"if__name_
Python调用fofa API接口并写入csv文件中 YOHO !GIRL 网络测绘 python 网络安全
前言一.功能目的二.功能调研三.编写代码1.引入库2.读取数据3.写入csv文件中总结前言上一篇我们讲述了目前较为主流的几款网络探测系统，简单介绍了页面的使用方法。链接如下，点击跳转：网络空间测绘引擎集合：Zoomeye、fofa、360、shodan、censys、鹰图然而当我们需要针对单个引擎进行二次开发时，页面就不能满足我们的需求了，这就需要参考API文档进行简单的数据处理，接下来，给大家介
无法访问 GitHub？教你如何轻松解决 CarlowZJ github
在开发过程中，GitHub是开发者不可或缺的代码托管平台。然而，由于网络环境或地区限制，国内用户有时会遇到无法访问GitHub的问题。本文将详细介绍几种常见原因及解决方法，帮助你快速恢复对GitHub的访问。一、常见原因及解决方案1.DNS解析问题DNS解析问题是最常见的原因之一，可能导致GitHub的域名无法正确解析为IP地址。解决方法：更换公共DNS：将本地DNS服务器更换为公共DNS，例如G
SenseVoice 部署记录安静六角开源软件
最近试用了SenseVoice（阿里团队开源的语音转文字）效果可以，可以本地部署，有webui界面，测试了万字以上的转换效果可以。首先部署好conda环境和cuda，这个可以查看他人的文章。步骤1.创建虚拟环境：condacreate-nmainenvpython=3.102.然后安装依赖condaactivatemainenvpipinstall-rC:\Users\xx\Documents\P
使用kubeadm部署高可用IPV4/IPV6集群---V1.32
使用kubeadm部署高可用IPV4/IPV6集群https://github.com/cby-chen/Kubernetes开源不易，帮忙点个star，谢谢了k8s基础系统环境配置配置IP#注意！#若虚拟机是进行克隆的那么网卡的UUID和MachineID会重复#需要重新生成新的UUIDUUID和MachineID#UUID和MachineID重复无法DHCP获取到IPV6地址sshroot@1
Python基于深度学习的动物图片识别技术的研究与实现 Java老徐 Python 毕业设计 python 深度学习开发语言深度学习的动物图片识别技术 Python动物图片识别技术
博主介绍：✌程序员徐师兄、7年大厂程序员经历。全网粉丝12w+、csdn博客专家、掘金/华为云/阿里云/InfoQ等平台优质作者、专注于Java技术领域和毕业项目实战✌文末获取源码联系精彩专栏推荐订阅不然下次找不到哟2022-2024年最全的计算机软件毕业设计选题大全：1000个热门选题推荐✅Java项目精品实战案例《100套》Java微信小程序项目实战《100套》感兴趣的可以先收藏起来，还有大家
【深度学习与大模型基础】第7章-特征分解与奇异值分解 lynn-66 深度学习与大模型基础算法机器学习人工智能
一、特征分解特征分解（EigenDecomposition）是线性代数中的一种重要方法，广泛应用于计算机行业的多个领域，如机器学习、图像处理和数据分析等。特征分解将一个方阵分解为特征值和特征向量的形式，帮助我们理解矩阵的结构和性质。1.特征分解的定义对于一个n×n的方阵A，如果存在一个非零向量v和一个标量λ，使得：则称λ为矩阵A的特征值，v为对应的特征向量。特征分解将矩阵A分解为：其中：Q是由特征
Python实现微信自动发送消息热心市民小汪 python 微信开发语言
实现需求：Python定时发送微信消息importpyautoguiaspgimportpyperclipaspcfromapscheduler.schedulers.blockingimportBlockingScheduler"""实现定时自动发送消息"""#操作间隔为1秒pg.PAUSE=1name='Hello~'msg='是时候点餐啦！！'defmain():#打开微信pg.hotkey
SQL的各种连接查询 xieke90 UNION ALL UNION 外连接内连接 JOIN
一、内连接概念：内连接就是使用比较运算符根据每个表共有的列的值匹配两个表中的行。内连接（join 或者inner join ） SQL语法： select * fron
java编程思想--复用类百合不是茶 java 继承代理组合 final类
复用类看着标题都不知道是什么,再加上java编程思想翻译的比价难懂,所以知道现在才看这本软件界的奇书一:组合语法:就是将对象的引用放到新类中即可代码: package com.wj.reuse; /** * * @author Administrator 组
[开源与生态系统]国产CPU的生态系统 comsci cpu
计算机要从娃娃抓起...而孩子最喜欢玩游戏.... 要让国产CPU在国内市场形成自己的生态系统和产业链,国家和企业就不能够忘记游戏这个非常关键的环节.... 投入一些资金和资源,人力和政策,让游
JVM内存区域划分Eden Space、Survivor Space、Tenured Gen，Perm Gen解释商人shang jvm内存
jvm区域总体分两类，heap区和非heap区。heap区又分：Eden Space（伊甸园）、Survivor Space(幸存者区)、Tenured Gen（老年代-养老区）。非heap区又分：Code Cache(代码缓存区)、Perm Gen（永久代）、Jvm Stack(java虚拟机栈)、Local Method Statck(本地方法栈)。 HotSpot虚拟机GC算法采用分代收
页面上调用 QQ oloz qq
<A href="tencent://message/?uin=707321921&Site=有事Q我&Menu=yes"> <img style="border:0px;" src=http://wpa.qq.com/pa?p=1:707321921:1></a>
一些问题文强chu 问题
1.eclipse 导出 doc 出现“The Javadoc command does not exist.” javadoc command 选择 jdk/bin/javadoc.exe 2.tomcate 配置 web 项目 ..... SQL:3.mysql * 必须得放前面否则 select&nbs
生活没有安全感小桔子生活孤独安全感
圈子好小，身边朋友没几个，交心的更是少之又少。在深圳，除了男朋友，没几个亲密的人。不知不觉男朋友成了唯一的依靠，毫不夸张的说，业余生活的全部。现在感情好，也很幸福的。但是说不准难免人心会变嘛，不发生什么大家都乐融融，发生什么很难处理。我想说如果不幸被分手(无论原因如何)，生活难免变化很大，在深圳，我没交心的朋友。明
php 基础语法 aichenglong php 基本语法
1 .1 php变量必须以$开头 <?php $a=” b”; echo ?> 1 .2 php基本数据库类型 Integer float/double Boolean string 1 .3 复合数据类型数组array和对象 object 1 .4 特殊数据类型 null 资源类型(resource) $co
mybatis tools 配置详解 AILIKES mybatis
MyBatis Generator中文文档 MyBatis Generator中文文档地址： http://generator.sturgeon.mopaas.com/ 该中文文档由于尽可能和原文内容一致，所以有些地方如果不熟悉，看中文版的文档的也会有一定的障碍，所以本章根据该中文文档以及实际应用，使用通俗的语言来讲解详细的配置。本文使用Markdown进行编辑，但是博客显示效
继承与多态的探讨百合不是茶 JAVA面向对象继承对象
继承 extends 多态继承是面向对象最经常使用的特征之一：继承语法是通过继承发、基类的域和方法 //继承就是从现有的类中生成一个新的类，这个新类拥有现有类的所有extends是使用继承的关键字：在A类中定义属性和方法； class A{ //定义属性 int age； //定义方法 public void go
JS的undefined与null的实例 bijian1013 JavaScript JavaScript
<form name="theform" id="theform"> </form> <script language="javascript"> var a alert(typeof(b)); //这里提示undefined if(theform.datas
TDD实践（一） bijian1013 java 敏捷 TDD
一.TDD概述 TDD：测试驱动开发，它的基本思想就是在开发功能代码之前，先编写测试代码。也就是说在明确要开发某个功能后，首先思考如何对这个功能进行测试，并完成测试代码的编写，然后编写相关的代码满足这些测试用例。然后循环进行添加其他功能，直到完全部功能的开发。
[Maven学习笔记十]Maven Profile与资源文件过滤器 bit1129 maven
什么是Maven Profile Maven Profile的含义是针对编译打包环境和编译打包目的配置定制，可以在不同的环境上选择相应的配置，例如DB信息，可以根据是为开发环境编译打包，还是为生产环境编译打包，动态的选择正确的DB配置信息 Profile的激活机制 1.Profile可以手工激活，比如在Intellij Idea的Maven Project视图中可以选择一个P
【Hive八】Hive用户自定义生成表函数(UDTF) bit1129 hive
1. 什么是UDTF UDTF，是User Defined Table-Generating Functions，一眼看上去，貌似是用户自定义生成表函数，这个生成表不应该理解为生成了一个HQL Table，貌似更应该理解为生成了类似关系表的二维行数据集 2. 如何实现UDTF 继承org.apache.hadoop.hive.ql.udf.generic
tfs restful api 加auth 2.0认计 ronin47
　　目前思考如何给tfs的ngx-tfs api增加安全性。有如下两点：　　一是基于客户端的ip设置。这个比较容易实现。　　二是基于OAuth2.0认证，这个需要lua，实现起来相对于一来说，有些难度。　　现在重点介绍第二种方法实现思路。　　前言：我们使用Nginx的Lua中间件建立了OAuth2认证和授权层。如果你也有此打算，阅读下面的文档，实现自动化并获得收益。SeatGe
jdk环境变量配置 byalias java jdk
进行java开发，首先要安装jdk，安装了jdk后还要进行环境变量配置： 1、下载jdk（http://java.sun.com/javase/downloads/index.jsp），我下载的版本是：jdk-7u79-windows-x64.exe 2、安装jdk-7u79-windows-x64.exe 3、配置环境变量：右击"计算机"-->&quo
《代码大全》表驱动法-Table Driven Approach-2 bylijinnan java
package com.ljn.base; import java.io.BufferedReader; import java.io.FileInputStream; import java.io.InputStreamReader; import java.util.ArrayList; import java.util.Collections; import java.uti
SQL 数值四舍五入小数点后保留2位 chicony 四舍五入
1.round() 函数是四舍五入用，第一个参数是我们要被操作的数据，第二个参数是设置我们四舍五入之后小数点后显示几位。 2.numeric 函数的2个参数，第一个表示数据长度，第二个参数表示小数点后位数。例如：　　select cast(round(12.5,2) as numeric(5,2))
c++运算符重载 CrazyMizzz C++
一、加+，减-，乘*，除/ 的运算符重载 Rational operator*(const Rational &x) const{ return Rational(x.a * this->a); } 在这里只写乘法的，加减除的写法类似二、<<输出,>>输入的运算符重载 &nb
hive DDL语法汇总 daizj hive 修改列 DDL 修改表
hive DDL语法汇总１、对表重命名 hive> ALTER TABLE table_name RENAME TO new_table_name; 2、修改表备注 hive> ALTER TABLE table_name SET TBLPROPERTIES ('comment' = new_comm
jbox使用说明 dcj3sjt126com Web
参考网址：http://www.kudystudio.com/jbox/jbox-demo.html jBox v2.3 beta [ 点击下载] 技术交流QQGroup：172543951 100521167 [2011-11-11] jBox v2.3 正式版 - [调整&修复] IE6下有iframe或页面有active、applet控件
UISegmentedControl 开发笔记 dcj3sjt126com
// typedef NS_ENUM(NSInteger, UISegmentedControlStyle) { // UISegmentedControlStylePlain, // large plain &
Slick生成表映射文件 ekian scala
Scala添加SLICK进行数据库操作，需在sbt文件上添加slick-codegen包 "com.typesafe.slick" %% "slick-codegen" % slickVersion 因为我是连接SQL Server数据库，还需添加slick-extensions，jtds包 "com.typesa
ES-TEST gengzg test
package com.MarkNum; import java.io.IOException; import java.util.Date; import java.util.HashMap; import java.util.Map; import javax.servlet.ServletException; import javax.servlet.annotation
为何外键不再推荐使用 hugh.wang mysql DB
表的关联，是一种逻辑关系，并不需要进行物理上的“硬关联”，而且你所期望的关联，其实只是其数据上存在一定的联系而已，而这种联系实际上是在设计之初就定义好的固有逻辑。在业务代码中实现的时候，只要按照设计之初的这种固有关联逻辑来处理数据即可，并不需要在数据库层面进行“硬关联”，因为在数据库层面通过使用外键的方式进行“硬关联”，会带来很多额外的资源消耗来进行一致性和完整性校验，即使很多时候我们并不
领域驱动设计 julyflame VO DAO 设计模式 DTO po
概念： VO（View Object）：视图对象，用于展示层，它的作用是把某个指定页面（或组件）的所有数据封装起来。 DTO（Data Transfer Object）：数据传输对象，这个概念来源于J2EE的设计模式，原来的目的是为了EJB的分布式应用提供粗粒度的数据实体，以减少分布式调用的次数，从而提高分布式调用的性能和降低网络负载，但在这里，我泛指用于展示层与服务层之间的数据传输对
单例设计模式 hm4123660 java Singleton 单例设计模式懒汉式饿汉式
单例模式是一种常用的软件设计模式。在它的核心结构中只包含一个被称为单例类的特殊类。通过单例模式可以保证系统中一个类只有一个实例而且该实例易于外界访问，从而方便对实例个数的控制并节约系统源。如果希望在系统中某个类的对象只能存在一个，单例模式是最好的解决方案。 &nb
logback zhb8015 log logback
一、logback的介绍 Logback是由log4j创始人设计的又一个开源日志组件。logback当前分成三个模块：logback-core,logback- classic和logback-access。logback-core是其它两个模块的基础模块。logback-classic是log4j的一个改良版本。此外logback-class
整合Kafka到Spark Streaming——代码示例和挑战 Stark_Summer spark storm zookeeper PARALLELISM processing
作者Michael G. Noll是瑞士的一位工程师和研究员，效力于Verisign，是Verisign实验室的大规模数据分析基础设施（基础Hadoop）的技术主管。本文，Michael详细的演示了如何将Kafka整合到Spark Streaming中。期间， Michael还提到了将Kafka整合到 Spark Streaming中的一些现状，非常值得阅读，虽然有一些信息在Spark 1.2版
spring-master-slave-commondao 王新春 DAO spring dataSource slave master
互联网的web项目，都有个特点：请求的并发量高，其中请求最耗时的db操作，又是系统优化的重中之重。为此，往往搭建 db的一主多从库的数据库架构。作为web的DAO层，要保证针对主库进行写操作，对多个从库进行读操作。当然在一些请求中，为了避免主从复制的延迟导致的数据不一致性，部分的读操作也要到主库上。（这种需求一般通过业务垂直分开，比如下单业务的代码所部署的机器，读去应该也要从主库读取数