Project URL: https://github.com/zhongqianli/svm_classification
Python 2.7+ or Python 3.6+
OpenCV 2.4+ or OpenCV 3+
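This is a general-purpose SVM model training platform.
It uses SVMs for a variety of classification tasks, such as digit recognition, blink detection, cosmetic contact lens detection, and liveness detection.
Hand-crafted features, such as LBP, HOG, and color-texture features, can be customized through the feature_extractor module.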
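To give a feel for what a hand-crafted feature looks like, here is a minimal pure-Python sketch of a basic 8-neighbour LBP histogram. It is only an illustration: the function name lbp_histogram and the list-of-rows image format are assumptions, and the project's own feature_extractor module (which presumably relies on OpenCV) may compute LBP differently.

```python
def lbp_histogram(image):
    """Return a normalized 256-bin LBP histogram for a 2-D grayscale image.

    `image` is a list of equal-length rows of integer pixel values
    (an assumed format chosen to keep this sketch dependency-free).
    """
    height, width = len(image), len(image[0])
    if height < 3 or width < 3:
        raise ValueError("image must be at least 3x3")
    # Offsets of the 8 neighbours, clockwise from the top-left pixel.
    offsets = [(-1, -1), (-1, 0), (-1, 1), (0, 1),
               (1, 1), (1, 0), (1, -1), (0, -1)]
    hist = [0] * 256
    # Border pixels are skipped because they lack a full neighbourhood.
    for y in range(1, height - 1):
        for x in range(1, width - 1):
            center = image[y][x]
            code = 0
            for bit, (dy, dx) in enumerate(offsets):
                # Set the bit when the neighbour is at least as bright.
                if image[y + dy][x + dx] >= center:
                    code |= 1 << bit
            hist[code] += 1
    total = float(sum(hist))
    return [count / total for count in hist]
```

The resulting 256-value vector could serve as one row of the SVM training matrix, in the same way the HOG features in the logs provide 324 values per sample.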
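The config module configures the hand-crafted feature type, the SVM parameters, the data paths, and so on.
Todo: building on the current framework, upgrade it into a general-purpose machine learning model training platform and add common machine learning classification algorithms such as ANN and Logistic Regression.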
(1) Set the data paths in save_imagepath.train.sh and save_imagepath.test.sh
Generate the imagepath lists in config:
$ sh save_imagepath.train.sh
$ sh save_imagepath.test.sh
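The contents of the two shell scripts are not reproduced here. As a rough Python equivalent of what such a script might do, the sketch below walks an image directory and writes one image path per line to a list file. The function name save_imagepath, the extension list, and the one-path-per-line format are assumptions for illustration, not taken from the repository.

```python
import os

# Assumed set of image extensions; adjust to the actual dataset.
IMAGE_EXTENSIONS = (".jpg", ".jpeg", ".png", ".bmp")


def save_imagepath(image_dir, listfile):
    """Write the path of every image under image_dir to listfile, one per line.

    Returns the number of paths written.
    """
    paths = []
    for root, _dirs, files in os.walk(image_dir):
        for name in files:
            if name.lower().endswith(IMAGE_EXTENSIONS):
                paths.append(os.path.join(root, name))
    paths.sort()  # deterministic order makes the list reproducible
    with open(listfile, "w") as f:
        for path in paths:
            f.write(path + "\n")
    return len(paths)
```

One list file per class would then be referenced from the train_label_listfile and test_label_listfile sections of config.json.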
(2) Edit the parameters in config.json
data
svm
train_label_listfile
test_label_listfile
(3) Train the model
$ python svm_train.py './config/blink_detect.48x32/config.json'
Or run the run.sh script.
The model is saved in the model directory.
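If both an open eye and a closed eye are detected within a video clip, a blink is considered detected.
The problem can therefore be reduced to eye-state classification, that is, classifying open eyes versus closed eyes.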
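The decision rule described above can be sketched in a few lines. This assumes the classifier has already produced one eye-state label per video frame; the label values OPEN = 1 and CLOSED = 2 and the function name blink_detected are hypothetical, since the text does not say which class index corresponds to which eye state.

```python
# Hypothetical label values; the actual mapping of class "1" and "2"
# to open/closed eyes is not stated in this document.
OPEN, CLOSED = 1, 2


def blink_detected(eye_states):
    """Return True if the clip contains both an open and a closed eye.

    `eye_states` is an iterable of per-frame labels predicted by the SVM.
    """
    states = set(eye_states)
    return OPEN in states and CLOSED in states
```

For example, blink_detected([1, 1, 2, 1]) is true, while a clip whose frames are all labelled 1 yields false.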
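Images of open and closed eyes need to be collected under different poses and lighting conditions. The example project uses 2000 training samples and 1000 test samples.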
{
    "description":
    {
        "version" : 1.0,
        "object label_listfile" : "label_listfile means label and listfile which includes imagepaths.",
        "object svm" : "svm config.",
        "class num limits" : "no"
    },
    "data":
    {
        "image_channel": 1,
        "feature_type": "hog_cv2.48x32"
    },
    "svm":
    {
        "model_name": "model/blink_detect.48x32.binary.xml",
        "kernel" : "linear",
        "autotrain": true,
        "binary_classify": true,
        "c": 2.67,
        "gamma": 5.383
    },
    "train_label_listfile":
    {
        "1":
        [
            "./config/blink_detect.48x32/train/class1.lst"
        ],
        "2":
        [
            "./config/blink_detect.48x32/train/class2.lst"
        ]
    },
    "test_label_listfile":
    {
        "1":
        [
            "./config/blink_detect.48x32/test/class1.lst"
        ],
        "2":
        [
            "./config/blink_detect.48x32/test/class2.lst"
        ]
    }
}
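A config of this shape could be read along the following lines. This is a sketch under stated assumptions rather than the project's actual loader: the function names load_config and load_samples are hypothetical, and each .lst file is assumed to hold one image path per line, with the JSON key used as the integer class label.

```python
import json

REQUIRED_SECTIONS = ("data", "svm", "train_label_listfile", "test_label_listfile")


def load_config(config_path):
    """Parse config.json and check that the expected sections exist."""
    with open(config_path) as f:
        config = json.load(f)
    for key in REQUIRED_SECTIONS:
        if key not in config:
            raise KeyError("missing config section: " + key)
    return config


def load_samples(label_listfile):
    """Return (paths, labels) from a {label: [listfile, ...]} mapping.

    Assumes every list file contains one image path per line.
    """
    paths, labels = [], []
    for label in sorted(label_listfile, key=int):
        for listfile in label_listfile[label]:
            with open(listfile) as f:
                for line in f:
                    line = line.strip()
                    if line:  # skip blank lines
                        paths.append(line)
                        labels.append(int(label))
    return paths, labels
```

Calling load_samples(config["train_label_listfile"]) would then give the image paths and matching labels to pass on to feature extraction and training.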
[train] load data …
svm training…
data config: {'image_channel': 1, 'feature_type': 'hog_cv2.48x32'}
svm config: {'model_name': 'model/blink_detect.48x32.binary.xml', 'kernel': 'linear', 'autotrain': True, 'binary_classify': True, 'c': 2.67, 'gamma': 5.383}
train auto
data dims = 324
class_num = 2
labels = [1, 2]
total_count = [10, 10]
error_count = [0, 0]
accurate = [ 1. 1.]
total accurate = 1.0
[test] load data …
svm testing…
model_name: model/blink_detect.48x32.binary.xml
data dims = 324
class_num = 2
labels = [1, 2]
total_count = [10, 10]
error_count = [0, 0]
accurate = [ 1. 1.]
total accurate = 1.0
This example uses a subset of the MNIST database. The example project has 2400 training samples and 2400 test samples.
Python 3.6.1
OpenCV 3.4.1
[train] load data …
svm training…
data config: {'image_channel': 1, 'feature_type': 'hog_cv2.32x32'}
svm config: {'model_name': 'model/digits.32x32.svm_hog.xml', 'kernel': 'linear', 'autotrain': True, 'binary_classify': False, 'c': 2.67, 'gamma': 5.383}
train auto
data dims = 324
class_num = 10
labels = [0, 1, 2, 3, 4, 5, 6, 7, 8, 9]
total_count = [250, 250, 250, 150, 250, 250, 250, 250, 250, 250]
error_count = [1, 2, 1, 4, 3, 2, 1, 2, 3, 4]
accurate = [ 0.99599999 0.99199998 0.99599999 0.97333336 0.98799998 0.99199998
0.99599999 0.99199998 0.98799998 0.98400003]
total accurate = 0.9904166460037231
[test] load data …
svm testing…
model_name: model/digits.32x32.svm_hog.xml
data dims = 324
class_num = 10
labels = [0, 1, 2, 3, 4, 5, 6, 7, 8, 9]
total_count = [250, 250, 250, 350, 250, 250, 250, 250, 250, 250]
error_count = [1, 5, 7, 16, 4, 1, 2, 10, 16, 6]
accurate = [ 0.99599999 0.98000002 0.972 0.95428574 0.98400003 0.99599999
0.99199998 0.95999998 0.93599999 0.97600001]
total accurate = 0.9738461375236511
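The accuracy figures in the logs are consistent with computing, per class, one minus the error count divided by the sample count, and overall, one minus the total errors divided by the total samples. The sketch below illustrates that arithmetic; the function name accuracy_report is hypothetical, and the project's own evaluation code may be organised differently.

```python
def accuracy_report(total_count, error_count):
    """Return (per_class_accuracy, overall_accuracy).

    total_count[i] is the number of samples of class i and
    error_count[i] the number of those that were misclassified.
    """
    per_class = [1.0 - float(e) / t for t, e in zip(total_count, error_count)]
    overall = 1.0 - float(sum(error_count)) / sum(total_count)
    return per_class, overall


# Counts copied from the digits test log.
per_class, overall = accuracy_report(
    [250, 250, 250, 350, 250, 250, 250, 250, 250, 250],
    [1, 5, 7, 16, 4, 1, 2, 10, 16, 6],
)
```

With these counts the overall accuracy is 1 - 68/2600, which agrees with the logged total up to single-precision rounding.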