酸罗卜不酸II

Overview and Resources of Scene Text Detection

Scene Text Detection Resources

Reposted from the SCUT-DLVC Lab at South China University of Technology. Original link: https://github.com/HCIILAB/Scene-Text-Detection

Author: Chongyu Liu

1. Datasets
- 1.1 Horizontal-Text Datasets
- 1.2 Arbitrary-Quadrilateral-Text Datasets
- 1.3 Irregular-Text Datasets
- 1.4 Synthetic Datasets
- 1.5 Comparison of Datasets
2. Summary of Scene Text Detection Resources
- 2.1 Comparison of Methods
  - 2.1.1 Traditional Methods
  - 2.1.2 Segmentation-based Methods
  - 2.1.3 Regression-based Methods
  - 2.1.4 Hybrid Methods
- 2.2 Detection Results
  - 2.2.1 Detection Results on Horizontal-Text Datasets
  - 2.2.2 Detection Results on Arbitrary-Quadrilateral-Text Datasets
  - 2.2.3 Detection Results on Irregular-Text Datasets
3. Survey
4. Evaluation
5. OCR Service
6. References and Code

1. Datasets

1.1 Horizontal-Text Datasets

ICDAR 2003 (IC03):
- Introduction: It contains 509 images in total, 258 for training and 251 for testing. Specifically, the training set contains 1110 text instances and the testing set contains 1156. It has word-level annotations. IC03 only considers English text instances.
- Link: IC03-download
ICDAR 2011 (IC11):
- Introduction: IC11 is an English dataset for text detection. It contains 484 images, 229 for training and 255 for testing. There are 1564 text instances in this dataset. It provides both word-level and character-level annotations.
- Link: IC11-download
ICDAR 2013 (IC13):
- Introduction: IC13 is almost the same as IC11. It contains 462 images in total, 229 for training and 233 for testing. Specifically, the training set contains 849 text instances and the testing set contains 1095.
- Link: IC13-download

1.2 Arbitrary-Quadrilateral-Text Datasets

USTB-SV1K:
- Introduction: USTB-SV1K is an English dataset. It contains 1000 street images from Google Street View with 2955 text instances in total. It only provides word-level annotations.
- Link: USTB-SV1K-download
SVT:
- Introduction: It contains 350 images with 725 English text instances in total. SVT has both character-level and word-level annotations. The images of SVT are harvested from Google Street View and have low resolution.
- Link: SVT-download
SVT-P:
- Introduction: It contains 639 cropped word images for testing. Images were selected from the side-view angle snapshots in Google Street View. Therefore, most images are heavily distorted by the non-frontal view angle. It is an improved version of SVT.
- Link: SVT-P-download (Password : vnis)
ICDAR 2015 (IC15):
- Introduction: It contains 1500 images in total, 1000 for training and 500 for testing. Specifically, it contains 17548 text instances. It provides word-level annotations. IC15 is the first incidental scene text dataset and it only considers English words.
- Link: IC15-download
COCO-Text:
- Introduction: It contains 63686 images in total, 43686 for training, 10000 for validation and 10000 for testing. Specifically, it contains 145859 cropped word images for testing, including handwritten and printed, clear and blurry, English and non-English text.
- Link: COCO-Text-download
MSRA-TD500:
- Introduction: It contains 500 images in total. It provides text-line-level rather than word-level annotations, and polygon boxes rather than axis-aligned rectangles for text region annotation. It contains both English and Chinese text instances.
- Link: MSRA-TD500-download
MLT 2017:
- Introduction: It contains 10000 natural images in total. It provides word-level annotations. MLT covers 9 languages. It is a more realistic and complex dataset for scene text detection and recognition…
- Link: MLT-download
MLT 2019:
- Introduction: It contains 18000 images in total. It provides word-level annotations. Compared to MLT 2017, this dataset covers 10 languages. It is a more realistic and complex dataset for scene text detection and recognition…
- Link: MLT-2019-download
CTW:
- Introduction: It contains 32285 high-resolution street view images of Chinese text, with 1018402 character instances in total. All images are annotated at the character level, including the underlying character type, the bounding box, and 6 other attributes. These attributes indicate whether the background is complex, and whether the character is raised, hand-written or printed, occluded, distorted, or rendered as word art.
- Link: CTW-download
RCTW-17:
- Introduction: It contains 12514 images in total, 11514 for training and 1000 for testing. Images in RCTW-17 were mostly collected by camera or mobile phone, and the others were generated images. Text instances are annotated with parallelograms. It is the first large-scale Chinese dataset, and was also the largest published one at the time.
- Link: RCTW-17-download
ReCTS:
- Introduction: This is a large-scale Chinese street view trademark dataset. It is labeled at the Chinese word level and the Chinese text-line level, using arbitrary quadrilaterals. It contains 20000 images in total.
- Link: ReCTS-download

1.3 Irregular-Text Datasets

CUTE80:
- Introduction: It contains 80 high-resolution images taken in natural scenes. Specifically, it contains 288 cropped word images for testing. The dataset focuses on curved text. No lexicon is provided.
- Link: CUTE80-download
Total-Text:
- Introduction: It contains 1,555 images in total. Specifically, it contains 11,459 cropped word images with more than three different text orientations: horizontal, multi-oriented and curved.
- Link: Total-Text-download
SCUT-CTW1500:
- Introduction: It contains 1500 images in total, 1000 for training and 500 for testing. Specifically, it contains 10751 cropped word images for testing. Annotations in CTW-1500 are polygons with 14 vertices. The dataset mainly consists of Chinese and English text.
- Link: CTW-1500-download
LSVT:
- Introduction: LSVT consists of 20,000 testing images, 30,000 fully annotated training images, and 400,000 weakly annotated training images; the weak annotations are referred to as partial labels. The labeled text regions demonstrate the diversity of text: horizontal, multi-oriented and curved.
- Link: LSVT-download
ArT:
- Introduction: ArT consists of 10,166 images, 5,603 for training and 4,563 for testing. The images were collected with text shape diversity in mind, and all text shapes are well represented in ArT.
- Link: ArT-download
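
The 14-vertex polygon annotations mentioned for CTW-1500 can be handled like any other simple polygon. As a minimal sketch (the coordinates below are hypothetical and not taken from the dataset, and `polygon_area` is an illustrative helper rather than part of any dataset toolkit), the shoelace formula gives the area enclosed by such an annotation:

```python
from typing import List, Tuple

Point = Tuple[float, float]


def polygon_area(vertices: List[Point]) -> float:
    """Area of a simple polygon via the shoelace formula."""
    if len(vertices) < 3:
        raise ValueError("a polygon needs at least 3 vertices")
    twice_area = 0.0
    for i, (x1, y1) in enumerate(vertices):
        # Wrap around so the last vertex connects back to the first.
        x2, y2 = vertices[(i + 1) % len(vertices)]
        twice_area += x1 * y2 - x2 * y1
    return abs(twice_area) / 2.0


# Hypothetical 14-vertex annotation: 7 points along the top edge
# (left to right) followed by 7 points along the bottom edge (right to left).
top = [(float(x), 0.0) for x in range(0, 70, 10)]
bottom = [(float(x), 20.0) for x in range(60, -10, -10)]
polygon = top + bottom

print(len(polygon))           # 14
print(polygon_area(polygon))  # 1200.0
```

The same helper works for quadrilateral annotations, since a quadrilateral is just a 4-vertex polygon.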

1.4 Synthetic Datasets

Synth80k:
- Introduction: It contains 800 thousand images with approximately 8 million synthetic word instances. Each text instance is annotated with its text string and with word-level and character-level bounding boxes.
- Link: Synth80k-download
SynthText:
- Introduction: It contains 6 million cropped word images. The generation process is similar to that of Synth90k. It is also annotated in horizontal style.
- Link: SynthText-download

1.5 Comparison of Datasets

Datasets	Language	Image Total	Image Train	Image Test	Text instance Total	Text instance Train	Text instance Test	Text Shape (Horizontal / Arbitrary-Quadrilateral / Multi-oriented), Annotation level (Char / Word / Text-Line)
IC03	English	509	258	251	2266	1110	1156	✓
IC11	English	484	229	255	1564	～	～	✓
IC13	English	462	229	233	1944	849	1095	✓
USTB-SV1K	English	1000	500	500	2955	～	～	✓
SVT	English	350	100	250	725	211	514	✓
SVT-P	English	238	～	～	639	～	～	✓
IC15	English	1500	1000	500	17548	12318	5230	✓
COCO-Text	English	63686	43686	20000	145859	118309	27550	✓
MSRA-TD500	English/Chinese	500	300	200	～	～	～	✓
MLT 2017	Multi-lingual	18000	7200	10800	～	～	～	✓
MLT 2019	Multi-lingual	20000	10000	10000	～	～	～	✓
CTW	Chinese	32285	25887	6398	1018402	812872	205530	✓
RCTW-17	English/Chinese	12514	11514	1000	～	～	～	✓
ReCTS	Chinese	20000	～	～	～	～	～	✓
CUTE80	English	～	～	～	～	～	～	✓
Total-Text	English	1525	1225	300	9330	～	～	✓
CTW-1500	English/Chinese	1500	1000	500	10751	～	～	✓
LSVT	English/Chinese	450000	430000	20000	～	～	～	✓
ArT	English/Chinese	10166	5603	4563	～	～	～	✓
Synth80k	English	80k	～	～	～	～	～	✓
SynthText	English	800k	～	～	～	～	～	✓

2. Summary of Scene Text Detection Resources

2.1 Comparison of Methods

Scene text detection methods can be divided into four categories:

(a) Traditional methods;

(b) Segmentation-based methods;

(c) Regression-based methods;

(d) Hybrid methods.

Note that: (1) “Hori” stands for horizontal scene text datasets. (2) “Quad” stands for arbitrary-quadrilateral-text datasets. (3) “Irreg” stands for irregular scene text datasets. (4) “Traditional method” refers to methods that do not rely on deep learning.

2.1.1 Traditional Methods

Method	Model	Code	Hori	Quad	Irreg	Source	Time	Highlight
Yao et al. [1]	TD-Mixture	✕	✓	✓	✕	CVPR	2012	1) A new dataset, MSRA-TD500, and an evaluation protocol. 2) A two-level classification scheme and two sets of feature extractors.
Yin et al. [2]		✕	✓	✕	✕	TPAMI	2013	Extract Maximally Stable Extremal Regions (MSERs) as character candidates and group them together.
Le et al. [5]	HOCC	✕	✓	✓	✕	CVPR	2014	HOCC + MSERs
Yin et al. [7]		✕	✓	✓	✕	TPAMI	2015	Presenting a unified distance metric learning framework for adaptive hierarchical clustering.
Wu et al. [9]		✕	✓	✓	✕	TMM	2015	Exploring gradient directional symmetry at component level for smoothing edge components before text detection.
Tian et al. [17]		✕	✓	✕	✕	IJCAI	2016	Scene text is first detected locally in individual frames and finally linked by an optimal tracking trajectory.
Yang et al. [33]		✕	✓	✓	✕	TIP	2017	A text detector locates character candidates and extracts text regions, which are then linked by an optimal tracking trajectory.
Liang et al. [8]		✕	✓	✓	✓	TIP	2015	Exploring maximally stable extremal regions along with the stroke width transform for detecting candidate text regions.
Michal et al.[12]	FASText	✕	✓	✓	✕	ICCV	2015	Stroke keypoints are efficiently detected and then exploited to obtain stroke segmentations.

2.1.2 Segmentation-based Methods

Method	Model	Code / Hori / Quad / Irreg	Source	Time	Highlight
Li et al. [3]		✓	TIP	2014	(1) Develops three novel cues tailored for character detection and a Bayesian method for their integration; (2) designs a Markov random field model to exploit the inherent dependencies between characters.
Zhang et al. [14]		✓	CVPR	2016	Utilizing an FCN for salient map detection and for predicting the centroid of each character.
Zhu et al. [16]		✓	CVPR	2016	Performs a graph-based segmentation of connected components into words (Word-Graph).
He et al. [18]	Text-CNN	✓	TIP	2016	Developing a new learning mechanism to train the Text-CNN with multi-level and rich supervised information.
Yao et al. [21]		✓	arXiv	2016	Proposing to localize text in a holistic manner, by casting scene text detection as a semantic segmentation problem.
Hu et al. [27]	WordSup	✓	ICCV	2017	Proposing a weakly supervised framework that can utilize word annotations. The detected characters are then fed to a text structure analysis module.
Wu et al. [28]		✓	ICCV	2017	Introducing the border class to the text detection problem for the first time, and validating that the decoding process is largely simplified with the help of the text border.
Tang et al.[32]		✓	TIP	2017	A text-aware candidate text region (CTR) extraction model + CTR refinement model.
Dai et al. [35]	FTSN	✓	arXiv	2017	Detecting and segmenting the text instance jointly and simultaneously, leveraging merits from both the semantic segmentation task and the region-proposal-based object detection task.
Wang et al. [38]		✓	ICDAR	2017	Proposing a novel character candidate extraction method based on super-pixel segmentation and hierarchical clustering.
Deng et al. [40]	PixelLink	✓	AAAI	2018	Text instances are first segmented out by linking pixels within the same instance together.
Liu et al. [42]	MCN	✓	CVPR	2018	Stochastic Flow Graph (SFG) + Markov Clustering.
Lyu et al. [43]		✓	CVPR	2018	Detecting scene text by localizing corner points of text bounding boxes and segmenting text regions in relative positions.
Chu et al. [45]	Border	✓	ECCV	2018	Presenting a novel scene text detection technique that makes use of semantics-aware text borders and bootstrapping-based text segment augmentation.
Long et al. [46]	TextSnake	✓	ECCV	2018	Proposing TextSnake, which is able to effectively represent text instances in horizontal, oriented and curved forms based on the symmetry axis.
Yang et al. [47]	IncepText	✓	IJCAI	2018	Designing a novel Inception-Text module and introducing deformable PSROI pooling to deal with multi-oriented text detection.
Yue et al. [48]		✓	BMVC	2018	Proposing a general framework for text detection called Guided CNN to achieve the two goals simultaneously.
Zhong et al. [53]	AF-RPN	✓	arXiv	2018	Presenting AF-RPN as an anchor-free and scale-friendly region proposal network for the Faster R-CNN framework.
Wang et al. [54]	PSENet	✓	CVPR	2019	Proposing a novel Progressive Scale Expansion Network (PSENet), designed as a segmentation-based detector with multiple predictions for each text instance.
Xu et al.[57]	TextField	✓	arXiv	2018	Presenting a novel direction field which can represent scene texts of arbitrary shapes.
Tian et al. [58]	FTDN	✓	ICIP	2018	FTDN is able to segment the text region and simultaneously regress the text box at pixel level.
Tian et al. [83]		✓	CVPR	2019	Constraining the embedding features of pixels inside the same text region to share similar properties.
Huang et al. [4]	MSERs-CNN	✓	ECCV	2014	Combining MSERs with CNN.
Sun et al. [6]		✓		2015	Presenting a robust text detection approach based on color-enhanced CER and neural networks.
Baek et al. [62]	CRAFT	✓	CVPR	2019	Proposing CRAFT, which effectively detects text areas by exploring each character and the affinity between characters.

2.1.3 Regression-based Methods

Method	Model	Code / Hori / Quad / Irreg	Source	Time	Highlight
Gupta et al. [15]	FCRN	✓	CVPR	2016	(a) Proposing a fast and scalable engine to generate synthetic images of text in clutter; (b) FCRN.
Zhong et al. [20]	DeepText	✓	arXiv	2016	(a) Inception-RPN; (b) Utilizing ambiguous text category (ATC) information and multilevel region-of-interest pooling (MLRP).
Liao et al. [22]	TextBoxes	✓	AAAI	2017	Mainly based on the SSD object detection framework.
Liu et al. [25]	DMPNet	✓	CVPR	2017	Quadrilateral sliding windows + shared Monte-Carlo method for fast and accurate computing of the polygonal areas + a sequential protocol for relative regression.
He et al. [26]	DDR	✓	ICCV	2017	Proposing an FCN that has bi-task outputs, where one is pixel-wise classification between text and non-text, and the other is direct regression to determine the vertex coordinates of quadrilateral text boundaries.
Jiang et al. [36]	R2CNN	✓	arXiv	2017	Using the Region Proposal Network (RPN) to generate axis-aligned bounding boxes that enclose the texts with different orientations.
Xing et al. [37]	ArbiText	✓	arXiv	2017	Adopting circle anchors and incorporating a pyramid pooling module into the Single Shot MultiBox Detector framework.
Zhang et al. [39]	FEN	✓	AAAI	2018	Proposing a refined scene text detector with a novel Feature Enhancement Network (FEN) for Region Proposal and Text Detection Refinement.
Wang et al. [41]	ITN	✓	CVPR	2018	ITN is presented to learn the geometry-aware representation encoding the unique geometric configurations of scene text instances with in-network transformation embedding.
Liao et al. [44]	RRD	✓	CVPR	2018	The regression branch extracts rotation-sensitive features, while the classification branch extracts rotation-invariant features by pooling the rotation-sensitive features.
Liao et al. [49]	TextBoxes++	✓	TIP	2018	Mainly based on the SSD object detection framework; it replaces the rectangular box representation of conventional object detectors with a quadrilateral or oriented rectangle representation.
He et al. [50]		✓	TIP	2018	Proposing a scene text detection framework based on a fully convolutional network with a bi-task prediction module.
Ma et al. [51]	RRPN	✓	TMM	2018	RRPN + RRoI Pooling.
Zhu et al. [55]	SLPR	✓	arXiv	2018	SLPR regresses multiple points on the edge of the text line and then utilizes these points to sketch the outline of the text.
Deng et al. [56]		✓	arXiv	2018	CRPN employs corners to estimate the possible locations of text instances. It also designs an embedded data augmentation module inside the region-wise subnetwork.
Cai et al. [59]	FFN	✓	ICIP	2018	Proposing a Feature Fusion Network to deal with text regions that differ enormously in size.
Sabyasachi et al. [60]	RGC	✓	ICIP	2018	Proposing a novel recurrent architecture to improve the learning of a feature map at a given time.
Liu et al. [63]	CTD	✓		2019	CTD + TLOC + PNMS
Xie et al. [79]	DeRPN	✓	AAAI	2019	DeRPN utilizes an anchor string mechanism instead of anchor boxes in the RPN.
Wang et al. [82]		✓	CVPR	2019	Text-RPN + RNN
Liu et al. [84]		✓	CVPR	2019	CSE mechanism
He et al. [29]	SSTD	✓	ICCV	2017	Proposing an attention mechanism, then developing a hierarchical inception module which efficiently aggregates multi-scale inception features.
Tian et al. [11]		✓	ICCV	2015	Cascade boosting detects character candidates, and the min-cost flow network model gets the final result.
Tian et al. [13]	CTPN	✓	ECCV	2016	1) RPN + LSTM. 2) The RPN incorporates a new vertical anchor mechanism, and the LSTM connects the regions to get the final result.
He et al. [19]		✓	ACCV	2016	An ER detector detects regions to get a coarse prediction of text regions. Then the local context is aggregated to classify the remaining regions and obtain a final prediction.
Shi et al. [23]	SegLink	✓	CVPR	2017	Decomposing text into segments and links. A link connects two adjacent segments.
Tian et al. [30]	WeText	✓	ICCV	2017	Proposing a weakly supervised scene text detection method (WeText).
Zhu et al. [31]	RTN	✓	ICDAR	2017	Mainly based on CTPN's vertical proposal mechanism.
Ren et al. [34]		✓	TMM	2017	Proposing a CNN-based detector. It contains a text structure component detector layer, a spatial pyramid layer, and a multi-input-layer deep belief network (DBN).
Zhang et al. [10]		✓	CVPR	2015	The proposed algorithm exploits the symmetry property of character groups and allows for direct extraction of text lines from natural images.

2.1.4 Hybrid Methods

Method	Model	Code / Hori / Quad / Irreg	Source	Time	Highlight
Tang et al. [52]	SSFT	✓	TMM	2018	Proposing a novel scene text detection method that involves superpixel-based stroke feature transform (SSFT) and deep-learning-based region classification (DLRC).
Xie et al.[61]	SPCNet	✓	AAAI	2019	Text Context module + Re-Score mechanism.
Liu et al. [64]	PMTD	✓	arXiv	2019	Performing “soft” semantic segmentation. It assigns a soft pyramid label (i.e., a real value between 0 and 1) to each pixel within a text instance.
Liu et al. [80]	BDN	✓	IJCAI	2019	Discretizing bounding boxes into key edges to address label confusion for text detection.
Zhang et al. [81]	LOMO	✓	CVPR	2019	DR + IRM + SEM
Zhou et al. [24]	EAST	✓	CVPR	2017	The pipeline directly predicts words or text lines of arbitrary orientations and quadrilateral shapes in full images with instance segmentation.
Yue et al. [48]		✓	BMVC	2018	Proposing a general framework for text detection called Guided CNN to achieve the two goals simultaneously.
Zhong et al. [53]	AF-RPN	✓	arXiv	2018	Presenting AF-RPN as an anchor-free and scale-friendly region proposal network for the Faster R-CNN framework.

2.2 Detection Results

2.2.1 Detection Results on Horizontal-Text Datasets

Method	Model	Source	Time	Method Category	IC11 [68] / IC13 [69] / IC05 [67]
Yao et al. [1]	TD-Mixture	CVPR	2012	Traditional	0.69 0.66 0.67
Yin et al. [2]		TPAMI	2013	Traditional	0.86 0.68 0.76
Yin et al. [7]		TPAMI	2015	Traditional	0.838 0.66 0.738
Wu et al. [9]		TMM	2015	Traditional	0.76 0.70 0.73
Liang et al. [8]		TIP	2015	Traditional	0.77 0.68 0.71; 0.76 0.68 0.72
Michal et al.[12]	FASText	ICCV	2015	Traditional	0.84 0.69 0.77
Li et al. [3]		TIP	2014	Segmentation	0.80 0.62 0.70
Zhang et al. [14]		CVPR	2016	Segmentation	0.88 0.78 0.83
He et al. [18]	Text-CNN	TIP	2016	Segmentation	0.91 0.74 0.82; 0.93 0.73 0.82; 0.87 0.73 0.79
Yao et al. [21]		arXiv	2016	Segmentation	0.889 0.802 0.843
Hu et al. [27]	WordSup	ICCV	2017	Segmentation	0.933 0.875 0.903
Tang et al.[32]		TIP	2017	Segmentation	0.90 0.86 0.88; 0.92 0.87 0.89
Wang et al. [38]		ICDAR	2017	Segmentation	0.87 0.78 0.82; 0.87 0.82 0.84
Deng et al. [40]	PixelLink	AAAI	2018	Segmentation	0.886 0.875 0.881
Liu et al. [42]	MCN	CVPR	2018	Segmentation	0.88 0.87 0.88
Lyu et al. [43]		CVPR	2018	Segmentation	0.92 0.844 0.880
Chu et al. [45]	Border	ECCV	2018	Segmentation	0.915 0.871 0.892
Wang et al. [54]	PSENet	CVPR	2019	Segmentation	0.94 0.90 0.92
Huang et al. [4]	MSERs-CNN	ECCV	2014	Segmentation	0.88 0.71 0.78; 0.84 0.67 0.75
Sun et al. [6]			2015	Segmentation	0.92 0.91 0.94 0.92 0.93
Gupta et al. [15]	FCRN	CVPR	2016	Regression	0.94 0.77 0.85; 0.938 0.764 0.842
Zhong et al. [20]	DeepText	arXiv	2016	Regression	0.87 0.83 0.85 0.81 0.83
Liao et al. [22]	TextBoxes	AAAI	2017	Regression	0.89 0.82 0.86; 0.89 0.83 0.86
Liu et al. [25]	DMPNet	CVPR	2017	Regression	0.93 0.83 0.870
Jiang et al. [36]	R2CNN	arXiv	2017	Regression	0.92 0.81 0.86
Xing et al. [37]	ArbiText	arXiv	2017	Regression	0.826 0.936 0.877
Wang et al. [41]	ITN	CVPR	2018	Regression	0.896 0.889 0.892; 0.941 0.893 0.916
Liao et al. [49]	TextBoxes++	TIP	2018	Regression	0.92 0.86 0.89
He et al. [50]		TIP	2018	Regression	0.91 0.84 0.88
Ma et al. [51]	RRPN	TMM	2018	Regression	0.95 0.89 0.91
Zhu et al. [55]	SLPR	arXiv	2018	Regression	0.90 0.72 0.80
Cai et al. [59]	FFN	ICIP	2018	Regression	0.92 0.84 0.876
Sabyasachi et al. [60]	RGC	ICIP	2018	Regression	0.89 0.77 0.83
Wang et al. [82]		CVPR	2019	Regression	0.937 0.878 0.907
Liu et al. [84]		CVPR	2019	Regression	0.937 0.897 0.917
He et al. [29]	SSTD	ICCV	2017	Regression	0.89 0.86 0.88
Tian et al. [11]		ICCV	2015	Regression	0.86 0.76 0.81; 0.852 0.759 0.802
Tian et al. [13]	CTPN	ECCV	2016	Regression	0.93 0.83 0.88
He et al. [19]		ACCV	2016	Regression	0.90 0.75 0.81
Shi et al. [23]	SegLink	CVPR	2017	Regression	0.877 0.83 0.853
Tian et al. [30]	WeText	ICCV	2017	Regression	0.911 0.831 0.869
Zhu et al. [31]	RTN	ICDAR	2017	Regression	0.94 0.89 0.91
Ren et al. [34]		TMM	2017	Regression	0.78 0.67 0.72; 0.81 0.67 0.73
Zhang et al. [10]		CVPR	2015	Regression	0.84 0.76 0.80; 0.88 0.74 0.80
Tang et al. [52]	SSFT	TMM	2018	Hybrid	0.906 0.847 0.876; 0.911 0.861 0.885
Xie et al.[61]	SPCNet	AAAI	2019	Hybrid	0.94 0.91 0.92
Liu et al. [80]	BDN	IJCAI	2019	Hybrid	0.887 0.894 0.89
Zhou et al. [24]	EAST	CVPR	2017	Hybrid	0.93 0.83 0.870
Yue et al. [48]		BMVC	2018	Hybrid	0.885 0.846 0.870
Zhong et al. [53]	AF-RPN	arXiv	2018	Hybrid	0.94 0.90 0.92

2.2.2 Detection Results on Arbitrary-Quadrilateral-Text Datasets

Method	Model	Source	Time	Method Category	IC15 [70] / MSRA-TD500 [71] / USTB-SV1K [65] / SVT [66]
Le et al. [5]	HOCC	CVPR	2014	Traditional	0.71 0.62 0.66
Yin et al. [7]		TPAMI	2015	Traditional	0.81 0.63 0.71; 0.499 0.454 0.475
Wu et al. [9]		TMM	2015	Traditional	0.63 0.70 0.66
Tian et al. [17]		IJCAI	2016	Traditional	0.95 0.58 0.721; 0.537 0.488 0.51
Yang et al. [33]		TIP	2017	Traditional	0.95 0.58 0.72; 0.54 0.49 0.51
Liang et al. [8]		TIP	2015	Traditional	0.74 0.66 0.70
Zhang et al. [14]		CVPR	2016	Segmentation	0.71 0.43 0.54; 0.83 0.67 0.74
Zhu et al. [16]		CVPR	2016	Segmentation	0.81 0.91 0.85
He et al. [18]	Text-CNN	TIP	2016	Segmentation	0.76 0.61 0.69
Yao et al. [21]		arXiv	2016	Segmentation	0.723 0.587 0.648; 0.765 0.753 0.759
Hu et al. [27]	WordSup	ICCV	2017	Segmentation	0.793 0.77 0.782
Wu et al. [28]		ICCV	2017	Segmentation	0.91 0.78 0.84; 0.77 0.78 0.77
Dai et al. [35]	FTSN	arXiv	2017	Segmentation	0.886 0.80 0.841; 0.876 0.771 0.82
Deng et al. [40]	PixelLink	AAAI	2018	Segmentation	0.855 0.820 0.837; 0.830 0.732 0.778
Liu et al. [42]	MCN	CVPR	2018	Segmentation	0.72 0.80 0.76; 0.88 0.79 0.83
Lyu et al. [43]		CVPR	2018	Segmentation	0.895 0.797 0.843; 0.876 0.762 0.815
Chu et al. [45]	Border	ECCV	2018	Segmentation	0.830 0.774 0.801
Long et al. [46]	TextSnake	ECCV	2018	Segmentation	0.849 0.804 0.826; 0.832 0.739 0.783
Yang et al. [47]	IncepText	IJCAI	2018	Segmentation	0.938 0.873 0.905; 0.875 0.790 0.830
Wang et al. [54]	PSENet	CVPR	2019	Segmentation	0.8692 0.845 0.8569
Xu et al.[57]	TextField	arXiv	2018	Segmentation	0.843 0.805 0.824; 0.874 0.759 0.813
Tian et al. [58]	FTDN	ICIP	2018	Segmentation	0.847 0.773 0.809
Tian et al. [83]		CVPR	2019	Segmentation	0.883 0.850 0.866; 0.842 0.817 0.829
Baek et al. [62]	CRAFT	CVPR	2019	Segmentation	0.898 0.843 0.869; 0.882 0.782 0.829
Gupta et al. [15]	FCRN	CVPR	2016	Regression	0.651 0.599 0.624
Liu et al. [25]	DMPNet	CVPR	2017	Regression	0.732 0.682 0.706
He et al. [26]	DDR	ICCV	2017	Regression	0.82 0.80 0.81; 0.77 0.70 0.74
Jiang et al. [36]	R2CNN	arXiv	2017	Regression	0.856 0.797 0.825
Xing et al. [37]	ArbiText	arXiv	2017	Regression	0.792 0.735 0.759; 0.78 0.72 0.75
Wang et al. [41]	ITN	CVPR	2018	Regression	0.857 0.741 0.795; 0.903 0.723 0.803
Liao et al. [44]	RRD	CVPR	2018	Regression	0.88 0.8 0.838; 0.876 0.73 0.79
Liao et al. [49]	TextBoxes++	TIP	2018	Regression	0.878 0.785 0.829
He et al. [50]		TIP	2018	Regression	0.85 0.80 0.82; 0.91 0.81 0.86
Ma et al. [51]	RRPN	TMM	2018	Regression	0.822 0.732 0.774; 0.821 0.677 0.742
Zhu et al. [55]	SLPR	arXiv	2018	Regression	0.855 0.836 0.845
Deng et al. [56]		arXiv	2018	Regression	0.89 0.81 0.845
Sabyasachi et al. [60]	RGC	ICIP	2018	Regression	0.83 0.81 0.82; 0.85 0.76 0.80
Wang et al. [82]		CVPR	2019	Regression	0.892 0.86 0.876; 0.852 0.821 0.836
He et al. [29]	SSTD	ICCV	2017	Regression	0.80 0.73 0.77
Tian et al. [13]	CTPN	ECCV	2016	Regression	0.74 0.52 0.61
He et al. [19]		ACCV	2016	Regression	0.87 0.73 0.79
Shi et al. [23]	SegLink	CVPR	2017	Regression	0.731 0.768 0.75; 0.86 0.70 0.77
Tang et al. [52]	SSFT	TMM	2018	Hybrid	0.541 0.758 0.631
Xie et al.[61]	SPCNet	AAAI	2019	Hybrid	0.89 0.86 0.87
Liu et al. [64]	PMTD	arXiv	2019	Hybrid	0.913 0.874 0.893
Liu et al. [80]	BDN	IJCAI	2019	Hybrid	0.881 0.846 0.863; 0.87 0.815 0.842
Zhang et al. [81]	LOMO	CVPR	2019	Hybrid	0.878 0.876 0.877
Zhou et al. [24]	EAST	CVPR	2017	Hybrid	0.833 0.783 0.807; 0.873 0.674 0.761
Yue et al. [48]		BMVC	2018	Hybrid	0.866 0.789 0.823; 0.691 0.660 0.675
Zhong et al. [53]	AF-RPN	arXiv	2018	Hybrid	0.89 0.83 0.86

Method	Model	Source	Time	Method Category	COCO-Text [72] / RCTW-17 [73] / MLT [76] / OSTD [77]
Le et al. [5]	HOCC	CVPR	2014	Traditional	0.80 0.73 0.76
Yao et al. [21]		arXiv	2016	Segmentation	0.432 0.27 0.333
Hu et al. [27]	WordSup	ICCV	2017	Segmentation	0.452 0.309 0.368
Lyu et al. [43]		CVPR	2018	Segmentation	0.351 0.348 0.349; 0.743 0.706 0.724
Chu et al. [45]	Border	ECCV	2018	Segmentation	0.782 0.588 0.671; 0.777 0.621 0.690
Yang et al. [47]	IncepText	IJCAI	2018	Segmentation	0.785 0.569 0.660
Wang et al. [54]	PSENet	CVPR	2019	Segmentation	0.7535 0.6918 0.7213
Baek et al. [62]	CRAFT	CVPR	2019	Segmentation	0.806 0.682 0.739
He et al. [29]	SSTD	ICCV	2017	Regression	0.46 0.31 0.37
Gupta et al. [15]	FCRN	CVPR	2016	Regression	0.844 0.763 0.801
Liao et al. [49]	TextBoxes++	TIP	2018	Regression	0.61 0.57 0.59
Ma et al. [51]	RRPN	TMM	2018	Regression	0.7669 0.5794 0.6601
Deng et al. [56]		arXiv	2018	Regression	0.555 0.633 0.591
Cai et al. [59]	FFN	ICIP	2018	Regression	0.43 0.35 0.39
Xie et al. [79]	DeRPN	AAAI	2019	Regression	0.586 0.557 0.571
He et al. [29]	SSTD	ICCV	2017	Regression	0.46 0.31 0.37
Xie et al.[61]	SPCNet	AAAI	2019	Hybrid	0.806 0.686 0.741
Liu et al. [64]	PMTD	arXiv	2019	Hybrid	0.844 0.763 0.801
Liu et al. [80]	BDN	IJCAI	2019	Hybrid	0.791 0.698 0.742
Zhang et al. [81]	LOMO	CVPR	2019	Hybrid	0.791 0.602 0.684; 0.802 0.672 0.731
Zhou et al. [24]	EAST	CVPR	2017	Hybrid	0.504 0.324 0.395
Zhong et al. [53]	AF-RPN	arXiv	2018	Hybrid	0.75 0.66 0.70

2.2.3 Detection Results on Irregular-Text Datasets

In this section, we only select those methods suitable for irregular text detection.

Method	Model	Source	Time	Method Category	Total-Text [74] P	Total-Text [74] R	Total-Text [74] F	SCUT-CTW1500 [75] P	SCUT-CTW1500 [75] R	SCUT-CTW1500 [75] F
Baek et al. [62]	CRAFT	CVPR	2019	Segmentation	0.876	0.799	0.836	0.860	0.811	0.835
Long et al. [46]	TextSnake	ECCV	2018	Segmentation	0.827	0.745	0.784	0.679	0.853	0.756
Tian et al. [83]		CVPR	2019	Segmentation	~	~	~	0.817	0.842	0.801
Wang et al. [54]	PSENet	CVPR	2019	Segmentation	0.840	0.779	0.809	0.848	0.797	0.822
Zhu et al. [55]	SLPR	arXiv	2018	Regression	~	~	~	0.801	0.701	0.748
Liu et al. [63]	CTD+TLOC	PR	2019	Regression	~	~	~	0.774	0.698	0.734
Wang et al. [82]		CVPR	2019	Regression	~	~	~	0.801	0.802	0.801
Liu et al. [84]		CVPR	2019	Regression	0.814	0.791	0.802	0.787	0.761	0.774
Zhang et al. [81]	LOMO	CVPR	2019	Hybrid	0.876	0.793	0.833	0.857	0.765	0.808
Xie et al.[61]	SPCNet	AAAI	2019	Hybrid	0.83	0.83	0.83	~	~	~
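
Assuming the P, R and F columns denote precision, recall and the F-measure (their harmonic mean), the F value of a row can be recomputed from its P and R. The sketch below is illustrative only; `f_measure` is a hypothetical helper name, and the inputs are the CRAFT values reported in the table.

```python
def f_measure(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (the F1 score)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# CRAFT on Total-Text (P = 0.876, R = 0.799).
print(round(f_measure(0.876, 0.799), 3))  # 0.836

# CRAFT on SCUT-CTW1500 (P = 0.860, R = 0.811).
print(round(f_measure(0.860, 0.811), 3))  # 0.835
```

Small differences in the last digit for other rows are to be expected, since the reported P and R are themselves rounded.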

3. Survey

[A] [TPAMI-2015] Ye Q, Doermann D. Text detection and recognition in imagery: A survey[J]. IEEE transactions on pattern analysis and machine intelligence, 2015, 37(7): 1480-1500. paper

[B] [Frontiers-Comput. Sci-2016] Zhu Y, Yao C, Bai X. Scene text detection and recognition: Recent advances and future trends[J]. Frontiers of Computer Science, 2016, 10(1): 19-36. paper

[C] [arXiv-2018] Long S, He X, Yao C. Scene Text Detection and Recognition: The Deep Learning Era[J]. arXiv preprint arXiv:1811.04256, 2018. paper

4. Evaluation

If you are interested in developing better scene text detection metrics, the references below may be useful.

[A] Wolf, Christian, and Jean-Michel Jolion. “Object count/area graphs for the evaluation of object detection and segmentation algorithms.” International Journal of Document Analysis and Recognition (IJDAR) 8.4 (2006): 280-296. paper

[B] D. Karatzas, L. Gomez-Bigorda, A. Nicolaou, S. K. Ghosh, A. D. Bagdanov, M. Iwamura, J. Matas, L. Neumann, V. R. Chandrasekhar, S. Lu, F. Shafait, S. Uchida, and E. Valveny. ICDAR 2015 competition on robust reading. In ICDAR, pages 1156–1160, 2015. paper

[C] Calarasanu, Stefania, Jonathan Fabrizio, and Severine Dubuisson. “What is a good evaluation protocol for text localization systems? Concerns, arguments, comparisons and solutions.” Image and Vision Computing 46 (2016): 1-17. paper

[D] Shi, Baoguang, et al. “ICDAR2017 competition on reading chinese text in the wild (RCTW-17).” 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR). Vol. 1. IEEE, 2017. paper

[E] Nayef, N; Yin, F; Bizid, I; et al. ICDAR2017 robust reading challenge on multi-lingual scene text detection and script identification-rrc-mlt. In Document Analysis and Recognition (ICDAR), 2017 14th IAPR International Conference on, volume 1, 1454–1459. IEEE. paper

[F] Dangla, Aliona, et al. “A first step toward a fair comparison of evaluation protocols for text detection algorithms.” 2018 13th IAPR International Workshop on Document Analysis Systems (DAS). IEEE, 2018. paper

[G] He, Mengchao and Liu, Yuliang, et al. ICPR2018 Contest on Robust Reading for Multi-Type Web images. ICPR 2018. paper

[H] Liu, Yuliang and Jin, Lianwen, et al. “Tightness-aware Evaluation Protocol for Scene Text Detection” Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2019. paper code
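
Several of the protocols referenced above score a detector by how well its outputs overlap with ground-truth regions. The following is a simplified sketch of that idea, assuming axis-aligned boxes, greedy one-to-one matching, and an IoU threshold of 0.5. Real protocols typically work on quadrilaterals or polygons and handle details such as ignored regions, so the function names, the box format, and the threshold here are illustrative assumptions rather than any protocol's reference implementation.

```python
from typing import List, Tuple

Box = Tuple[float, float, float, float]  # (x_min, y_min, x_max, y_max)


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def evaluate(gt: List[Box], det: List[Box], thresh: float = 0.5):
    """Greedy one-to-one matching; returns (precision, recall, f_measure)."""
    matched = set()
    true_positives = 0
    for d in det:
        # Find the best still-unmatched ground-truth box for this detection.
        best_j, best_iou = -1, 0.0
        for j, g in enumerate(gt):
            if j in matched:
                continue
            overlap = iou(d, g)
            if overlap > best_iou:
                best_j, best_iou = j, overlap
        if best_j >= 0 and best_iou >= thresh:
            matched.add(best_j)
            true_positives += 1
    precision = true_positives / len(det) if det else 0.0
    recall = true_positives / len(gt) if gt else 0.0
    total = precision + recall
    f = 2 * precision * recall / total if total > 0 else 0.0
    return precision, recall, f


# Hypothetical example: two ground-truth boxes, three detections.
ground_truth = [(0, 0, 10, 10), (20, 20, 30, 30)]
detections = [(1, 1, 10, 10), (50, 50, 60, 60), (20, 20, 30, 31)]
p, r, f = evaluate(ground_truth, detections)
print(round(p, 3), round(r, 3), round(f, 3))  # 0.667 1.0 0.8
```

Two of the three detections overlap a ground-truth box with IoU above the threshold, so precision is 2/3 and recall is 1.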

5. OCR Service

OCR	API	Free
Tesseract OCR Engine	×	√
Azure	√	√
ABBYY	√	√
OCR Space	√	√
SODA PDF OCR	√	√
Free Online OCR	√	√
Online OCR	√	√
Super Tools	√	√
Online Chinese Recognition	√	√
Calamari OCR	×	√
Tencent OCR	√	×

6. References and Code


[1] Yao C, Bai X, Liu W, et al. Detecting texts of arbitrary orientations in natural images. 2012 IEEE Conference on Computer Vision and Pattern Recognition(CVPR), 2012: 1083-1090. Paper
[2] Yin X C, Yin X, Huang K, et al. Robust text detection in natural scene images. IEEE Transactions on Pattern Analysis & Machine Intelligence, 2013, 36(5): 970-83. Paper
[3] Li Y, Jia W, Shen C, et al. Characterness: An indicator of text in the wild. IEEE transactions on image processing, 2014, 23(4): 1666-1677. Paper
[4] Huang W, Qiao Y, Tang X. Robust scene text detection with convolution neural network induced mser trees. European Conference on Computer Vision(ECCV), 2014: 497-511. Paper
[5] Kang L, Li Y, Doermann D. Orientation robust text line detection in natural images. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2014: 4034-4041. Paper
[6] Sun L, Huo Q, Jia W, et al. A robust approach for text detection from natural scene images. Pattern Recognition, 2015, 48(9): 2906-2920. Paper
[7] Yin X C, Pei W Y, Zhang J, et al. Multi-orientation scene text detection with adaptive clustering. IEEE Transactions on Pattern Analysis & Machine Intelligence, 2015 (9): 1930-1937. Paper
[8] Liang G, Shivakumara P, Lu T, et al. Multi-spectral fusion based approach for arbitrarily oriented scene text detection in video images. IEEE Transactions on Image Processing, 2015, 24(11): 4488-4501. Paper
[9] Wu L, Shivakumara P, Lu T, et al. A New Technique for Multi-Oriented Scene Text Line Detection and Tracking in Video. IEEE Trans. Multimedia, 2015, 17(8): 1137-1152. Paper
[10] Zheng Z, Wei S, et al. Symmetry-based text line detection in natural scenes. IEEE Conference on Computer Vision & Pattern Recognition(CVPR), 2015. Paper
[11] Tian S, Pan Y, Huang C, et al. Text flow: A unified text detection system in natural scene images. Proceedings of the IEEE international conference on computer vision(ICCV). 2015: 4651-4659. Paper
[12] Buta M, et al. FASText: Efficient unconstrained scene text detector. 2015 IEEE International Conference on Computer Vision (ICCV). 2015: 1206-1214. Paper
[13] Tian Z, Huang W, He T, et al. Detecting text in natural image with connectionist text proposal network. European conference on computer vision(ECCV), 2016: 56-72. Paper Code
[14] Zhang Z, Zhang C, Shen W, et al. Multi-oriented text detection with fully convolutional networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition(CVPR). 2016: 4159-4167. Paper
[15] Gupta A, Vedaldi A, Zisserman A. Synthetic data for text localisation in natural images. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition(CVPR). 2016: 2315-2324. Paper Code
[16] S. Zhu and R. Zanibbi, A Text Detection System for Natural Scenes with Convolutional Feature Learning and Cascaded Classification, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016: 625-632. Paper
[17] Tian S, Pei W Y, Zuo Z Y, et al. Scene Text Detection in Video by Learning Locally and Globally. IJCAI. 2016: 2647-2653. Paper
[18] He T, Huang W, Qiao Y, et al. Text-attentional convolutional neural network for scene text detection. IEEE transactions on image processing, 2016, 25(6): 2529-2541. Paper
[19] He, Dafang and Yang, Xiao and Huang, Wenyi and Zhou, Zihan and Kifer, Daniel and Giles, C Lee. Aggregating local context for accurate scene text detection. ACCV, 2016. Paper
[20] Zhong Z, Jin L, Zhang S, et al. Deeptext: A unified framework for text proposal generation and text detection in natural images. arXiv preprint arXiv:1605.07314, 2016. Paper
[21] Yao C, Bai X, Sang N, et al. Scene text detection via holistic, multi-channel prediction. arXiv preprint arXiv:1606.09002, 2016. Paper
[22] Liao M, Shi B, Bai X, et al. TextBoxes: A Fast Text Detector with a Single Deep Neural Network. AAAI. 2017: 4161-4167. Paper Code
[23] Shi B, Bai X, Belongie S. Detecting Oriented Text in Natural Images by Linking Segments. 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, 2017: 3482-3490. Paper Code
[24] Zhou X, Yao C, Wen H, et al. EAST: an efficient and accurate scene text detector. CVPR, 2017: 2642-2651. Paper Code
[25] Liu Y, Jin L. Deep matching prior network: Toward tighter multi-oriented text detection. CVPR, 2017: 3454-3461. Paper
[26] He W, Zhang X Y, Yin F, et al. Deep Direct Regression for Multi-Oriented Scene Text Detection. Proceedings of the IEEE International Conference on Computer Vision (ICCV). 2017: 745-753. Paper
[27] Hu H, Zhang C, Luo Y, et al. Wordsup: Exploiting word annotations for character based text detection. ICCV, 2017. Paper
[28] Wu Y, Natarajan P. Self-organized text detection with minimal post-processing via border learning. ICCV, 2017. Paper
[29] He P, Huang W, He T, et al. Single shot text detector with regional attention. The IEEE International Conference on Computer Vision (ICCV). 2017, 6(7). Paper Code
[30] Tian S, Lu S, Li C. Wetext: Scene text detection under weak supervision. ICCV, 2017. Paper
[31] Zhu, Xiangyu and Jiang, Yingying et al. Deep Residual Text Detection Network for Scene Text. ICDAR, 2017. Paper
[32] Tang Y, Wu X. Scene Text Detection and Segmentation Based on Cascaded Convolution Neural Networks. IEEE Transactions on Image Processing, 2017, 26(3): 1509-1520. Paper
[33] Yang C, Yin X C, Pei W Y, et al. Tracking Based Multi-Orientation Scene Text Detection: A Unified Framework with Dynamic Programming. IEEE Transactions on Image Processing, 2017. Paper
[34] X. Ren, Y. Zhou, J. He, K. Chen, X. Yang and J. Sun, A Convolutional Neural Network-Based Chinese Text Detection Algorithm via Text Structure Modeling. in IEEE Transactions on Multimedia, vol. 19, no. 3, pp. 506-518, March 2017. Paper
[35] Dai Y, Huang Z, Gao Y, et al. Fused text segmentation networks for multi-oriented scene text detection. arXiv preprint arXiv:1709.03272, 2017. Paper
[36] Jiang Y, Zhu X, Wang X, et al. R2CNN: rotational region CNN for orientation robust scene text detection. arXiv preprint arXiv:1706.09579, 2017. Paper
[37] Xing D, Li Z, Chen X, et al. ArbiText: Arbitrary-Oriented Text Detection in Unconstrained Scene. arXiv preprint arXiv:1711.11249, 2017. Paper
[38] C. Wang, F. Yin and C. Liu, Scene Text Detection with Novel Superpixel Based Character Candidate Extraction. in 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), 2017, pp. 929-934. Paper
[39] Sheng Zhang, Yuliang Liu, Lianwen Jin et al. Feature Enhancement Network: A Refined Scene Text Detector. In AAAI 2018. Paper
[40] Dan Deng et al. PixelLink: Detecting Scene Text via Instance Segmentation. In AAAI 2018. Paper Code
[41] Fangfang Wang, Liming Zhao, Xi L et al. Geometry-Aware Scene Text Detection with Instance Transformation Network. In CVPR 2018. Paper
[42] Zichuan Liu, Guosheng Lin, Sheng Yang et al. Learning Markov Clustering Networks for Scene Text Detection. In CVPR 2018. Paper
[43] Pengyuan Lyu, Cong Yao, Wenhao Wu et al. Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation. In CVPR 2018. Paper
[44] Minghui L, Zhen Z, Baoguang S. Rotation-Sensitive Regression for Oriented Scene Text Detection. In CVPR 2018. Paper
[45] Chuhui Xue et al. Accurate Scene Text Detection through Border Semantics Awareness and Bootstrapping. In ECCV 2018. Paper
[46] Long, Shangbang and Ruan, Jiaqiang, et al. TextSnake: A Flexible Representation for Detecting Text of Arbitrary Shapes. In ECCV, 2018. Paper
[47] Qiangpeng Yang, Mengli Cheng et al. IncepText: A New Inception-Text Module with Deformable PSROI Pooling for Multi-Oriented Scene Text Detection. In IJCAI 2018. Paper
[48] Xiaoyu Yue et al. Boosting up Scene Text Detectors with Guided CNN. In BMVC 2018. Paper
[49] Liao M, Shi B, Bai X. TextBoxes++: A Single-Shot Oriented Scene Text Detector. IEEE Transactions on Image Processing, 2018, 27(8): 3676-3690. Paper Code
[50] W. He, X. Zhang, F. Yin and C. Liu, Multi-Oriented and Multi-Lingual Scene Text Detection With Direct Regression, in IEEE Transactions on Image Processing, vol. 27, no. 11, pp.5406-5419, 2018. Paper
[51] Ma J, Shao W, Ye H, et al. Arbitrary-oriented scene text detection via rotation proposals. IEEE Transactions on Multimedia, 2018. Paper Code
[52] Youbao Tang and Xiangqian Wu. Scene Text Detection Using Superpixel-Based Stroke Feature Transform and Deep Learning Based Region Classification. In TMM, 2018. Paper
[53] Zhuoyao Zhong, Lei Sun and Qiang Huo. An Anchor-Free Region Proposal Network for Faster R-CNN based Text Detection Approaches. arXiv preprint arXiv:1804.09003. 2018. Paper
[54] Wenhai W, Enze X, et al. Shape Robust Text Detection with Progressive Scale Expansion Network. In CVPR 2019. Paper Code
[55] Zhu Y, Du J. Sliding Line Point Regression for Shape Robust Scene Text Detection. arXiv preprint arXiv:1801.09969, 2018. Paper
[56] Linjie D, Yanxiang Gong, et al. Detecting Multi-Oriented Text with Corner-based Region Proposals. arXiv preprint arXiv:1804.02690, 2018. Paper Code
[57] Yongchao Xu, Yukang Wang, Wei Zhou, et al. TextField: Learning A Deep Direction Field for Irregular Scene Text Detection. arXiv preprint arXiv:1812.01393, 2018. Paper
[58] Xiaowei Tian, Dao Wu, Rui Wang, Xiaochun Cao. Focal Text: an Accurate Text Detection with Focal Loss. In ICIP 2018. Paper
[59] Chenqin C, Pin L, Bing S. Feature Fusion Network for Scene Text Detection. In ICIP, 2018. Paper
[60] Sabyasachi Mohanty et al. Recurrent Global Convolutional Network for Scene Text Detection. In ICIP 2018. Paper
[61] Enze Xie, et al. Scene Text Detection with Supervised Pyramid Context Network. In AAAI 2019. Paper
[62] Youngmin Baek, Bado Lee, et al. Character Region Awareness for Text Detection. In CVPR 2019. Paper
[63] Yuliang L, Lianwen J, Shuaitao Z, et al. Curved Scene Text Detection via Transverse and Longitudinal Sequence Connection. Pattern Recognition, 2019. Paper Code
[64] Jingchao Liu, Xuebo Liu, et al, Pyramid Mask Text Detector. arXiv preprint arXiv:1903.11800, 2019. Paper Code
[79] Lele Xie, Yuliang Liu, Lianwen Jin, Zecheng Xie, DeRPN: Taking a further step toward more general object detection. In AAAI, 2019. Paper Code
[80] Yuliang Liu, Lianwen Jin, et al, Omnidirectional Scene Text Detection with Sequential-free Box Discretization. In IJ
