20170605计算机视觉报告小结

听了一个关于计算机视觉的报告,报告名字叫"computer vision ++: Where do we go from here?",做点小结。我对计算机视觉了解的不多,连入门都算不上,但是近来对此产生了兴趣。报告主要讲的是一些应用。

提到了David Marr (neuroscientist)好像是一个大牛,写过一本书:Vision-豆瓣,Vision-Amazon

两个超过人的事情:1.人脸识别,LFW数据集;2.图像分类,ImageNet数据集。

两个数据集:

1.国际权威人脸识别公开测试集LFW(Labeled Faces in the Wild),LFW由马萨诸塞大学于2007年建立,用于评测非约束条件下的人脸识别算法性能,是人脸识别领域使用最广泛的评测集合。

2.ImageNet

应用

图像分类

Liu J, Luo J, Shah M. Recognizing realistic actions from videos[C]// Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference on. IEEE, 2009:1996-2003.

Cao L, Luo J, Liang F, et al. Heterogeneous feature machines for visual recognition[C]// IEEE, International Conference on Computer Vision. IEEE, 2010:1095-1102.

行为识别

Li Q, Qiu Z, Yao T, et al. Action Recognition by Learning Deep Multi-Granular Spatio-Temporal Video Representation[C]// ACM on International Conference on Multimedia Retrieval. ACM, 2016:159-166.

vision+Language

Unsupervised Alignment of Actions in Video with Text Descriptions

You Q, Jin H, Wang Z, et al. Image Captioning with Semantic Attention[J]. 2016:4651-4659.

Li Y, Song Y, Cao L, et al. TGIF: A New Dataset and Benchmark on Animated GIF Description[C]// IEEE Conference on Computer Vision and Pattern Recognition. IEEE, 2016:4641-4650.

什么是Image Captioning?

感觉是从图像中获取信息,caption应该是字幕的意思,那么应该是从图像中识别一些信息并把它标注出来,看起来应该是这样(Image Captioning - Kiran Vodrahalli):

20170605计算机视觉报告小结_第1张图片

视频和文本的无监督对准

Naim I, Song Y C, Liu Q, et al. Unsupervised alignment of natural language instructions with video segments[C]// Twenty-Eighth AAAI Conference on Artificial Intelligence. AAAI Press, 2014:1558-1564.

Image/Video Captioning with visual and semantic attention

You Q, Jin H, Wang Z, et al. Image Captioning with Semantic Attention[J]. 2016:4651-4659.

情感计算

情感计算—人工智能的重要发展方向

You Q, Luo J, Jin H, et al. Building a Large Scale Dataset for Image Emotion Recognition: The Fine Print and The Benchmark[J]. 2016.

You Q, Cao L, Jin H, et al. Robust Visual-Textual Sentiment Analysis: When Attention meets Tree-structured Recursive Neural Networks[C]// ACM, 2016:1008-1017.

研究自拍的

Chen T, Chen Y, Luo J. A Selfie is Worth a Thousand Words: Mining Personal Patterns behind User Selfie-posting Behaviours[J]. 2017.

用大数据来找到热销衣服的特征:

Chen K T, Luo J. When Fashion Meets Big Data: Discriminative Mining of Best Selling Clothing Features[J]. 2017.

医疗方面的应用

帮助新手医生练习手术

你可能感兴趣的:(计算机视觉)