羽星_s

AutoGluon Usage Examples (Tabular, Image, and Multimodal)

Preface

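A few days ago, while watching Mu Li's course, I came across an AutoML package called AutoGluon. According to Mu Li, it reached the top 10% in Kaggle's Titanic survival prediction and took first place in a house price prediction competition (using a multimodal tabular + text model).
Here I use three datasets: employee satisfaction prediction (Table), Children vs Adults Classification (Image), and stray cat adoption prediction (Multimodal). The data can be downloaded by clicking each dataset title.
Below I walk through three examples in turn: prediction on tabular data, classification of image data, and multimodal prediction (tabular, text, and image).
To install AutoGluon (from within a Jupyter Notebook): ! pip install autogluon
For more on the AutoGluon API, see the official website and the GitHub repository.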

Tabular Data Prediction

The figure below describes the fields in the employee satisfaction dataset:

Importing packages

Import the required packages

# Load packages
import numpy as np
import pandas as pd
from plotnine import *
import seaborn as sns
from scipy import stats

import matplotlib as mpl
import matplotlib.pyplot as plt
# Make Chinese characters render correctly in plots
plt.rcParams['font.sans-serif']=['SimHei']
plt.rcParams['axes.unicode_minus'] = False

# Embed figures in the notebook
%matplotlib inline
# Increase figure resolution
%config InlineBackend.figure_format='retina'

# Data splitting
from sklearn.model_selection import train_test_split
# Evaluation metric
from sklearn.metrics import mean_squared_error

# Suppress warnings
import warnings
warnings.filterwarnings('ignore')

Data import and exploration

Load the training and test data

df_train = pd.read_csv("../input/employee-satisfaction/训练集.csv", encoding="gbk",index_col = 'id')
df_test = pd.read_csv("../input/employee-satisfaction/测试集.csv", encoding="gbk",index_col = 'id')
df_train.head()

Convert the string columns to numeric values

# Map the string columns to numeric codes
df_train.replace({'package':{'a':0,'b':1,'c':2,'d':3,'e':4},
                  'salary':{'low':0,'medium':1,'high':2}},
                 inplace=True)
df_test.replace({'package':{'a':0,'b':1,'c':2,'d':3,'e':4},
                  'salary':{'low':0,'medium':1,'high':2}},
                 inplace=True)
df_train.head()

One-hot encode the division column

df_train = pd.get_dummies(df_train,columns=['division'])
df_test = pd.get_dummies(df_test,columns=['division'])
df_train.head()
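
Because get_dummies is called on the training and test frames separately, the two results only share the same columns if every division value appears in both splits. The sketch below uses made-up toy frames (not the real dataset) to show one way of aligning the test columns to the training columns with reindex.

```python
import pandas as pd

# Toy stand-ins for df_train / df_test; the test split lacks the "hr" category.
train = pd.DataFrame({"division": ["sales", "IT", "hr"], "salary": [0, 1, 2]})
test = pd.DataFrame({"division": ["sales", "IT"], "salary": [1, 0]})

train_enc = pd.get_dummies(train, columns=["division"])
test_enc = pd.get_dummies(test, columns=["division"])

# Give the test frame exactly the training columns, in the same order.
# Dummy columns that are missing from the test split are filled with 0.
test_aligned = test_enc.reindex(columns=train_enc.columns, fill_value=0)

print(sorted(set(train_enc.columns) - set(test_enc.columns)))
print(test_aligned.columns.tolist() == train_enc.columns.tolist())
```

On the real data the label column (satisfaction_level) would need to be left out of the column list passed to reindex, since the test set does not contain it.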

Outlier check

# Draw box plots to check for outliers
i = 0
frows = 2
fcols = 2
plt.figure(dpi = 600,figsize=(12, 8))
for lab in df_train.columns[[0,2,8]]:
    i += 1
    plt.subplot(frows,fcols,i)
    plt.boxplot(x=df_train[lab].values,labels=[lab])
# plt.savefig('1.png')

Data normalization

# Min-max normalization
from sklearn import preprocessing 
# The feature columns to normalize exclude the label column
features_columns = [col for col in df_train.columns if col not in ['satisfaction_level']]

min_max_scaler = preprocessing.MinMaxScaler()

min_max_scaler = min_max_scaler.fit(df_train[features_columns])

train_data_scaler = min_max_scaler.transform(df_train[features_columns])
test_data_scaler = min_max_scaler.transform(df_test[features_columns])

train_data_scaler = pd.DataFrame(train_data_scaler)
train_data_scaler.columns = features_columns

test_data_scaler = pd.DataFrame(test_data_scaler)
test_data_scaler.columns = features_columns

train_data_scaler['satisfaction_level'] = df_train['satisfaction_level'].values
df_train = train_data_scaler
df_test = test_data_scaler
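
The scaler above is fitted on the training features only and then applied to both splits. As a rough illustration of what that means, the sketch below reproduces the default min-max formula, (x - min) / (max - min), by hand on made-up numbers; it is not the MinMaxScaler implementation itself, and the column values are invented.

```python
import pandas as pd

# Made-up values; only the training split defines min and max.
train = pd.DataFrame({"average_monthly_hours": [100.0, 200.0, 300.0]})
test = pd.DataFrame({"average_monthly_hours": [150.0, 350.0]})

col_min = train.min()
col_max = train.max()

# Same statistics applied to both splits, as with fit() on train and transform() on each.
train_scaled = (train - col_min) / (col_max - col_min)
test_scaled = (test - col_min) / (col_max - col_min)

print(train_scaled["average_monthly_hours"].tolist())  # [0.0, 0.5, 1.0]
print(test_scaled["average_monthly_hours"].tolist())   # [0.25, 1.25]
```

A test value larger than the training maximum lands above 1, which is expected when the statistics come from the training split alone.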

Check whether the training and test distributions differ

# Compare the training and test distributions
dist_cols = 6
dist_rows = len(df_test.columns)

plt.figure(figsize=(4*dist_cols,4*dist_rows))

for i, col in enumerate(df_test.columns):
    ax=plt.subplot(dist_rows,dist_cols,i+1)
    ax = sns.kdeplot(df_train[col], color="Red", shade=True)
    ax = sns.kdeplot(df_test[col], color="Blue", shade=True)
    ax.set_xlabel(col)
    ax.set_ylabel("Frequency")
    ax = ax.legend(["train","test"])
plt.show()
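
The density plots can be complemented with a number per column. One common choice is the two-sample Kolmogorov-Smirnov statistic, the largest gap between the two empirical CDFs (0 means identical samples, 1 means no overlap). The sketch below is a hand-written NumPy version on synthetic data; scipy.stats.ks_2samp computes the same statistic and also returns a p-value.

```python
import numpy as np

def ks_statistic(a, b):
    """Largest absolute difference between the empirical CDFs of a and b."""
    a = np.sort(np.asarray(a, dtype=float))
    b = np.sort(np.asarray(b, dtype=float))
    grid = np.concatenate([a, b])
    # Fraction of each sample that is <= every point of the pooled grid.
    cdf_a = np.searchsorted(a, grid, side="right") / a.size
    cdf_b = np.searchsorted(b, grid, side="right") / b.size
    return float(np.max(np.abs(cdf_a - cdf_b)))

# Synthetic columns standing in for df_train[col] and df_test[col].
rng = np.random.default_rng(0)
same = ks_statistic(rng.normal(0, 1, 500), rng.normal(0, 1, 500))
shifted = ks_statistic(rng.normal(0, 1, 500), rng.normal(2, 1, 500))
print(f"same distribution: {same:.3f}, shifted distribution: {shifted:.3f}")
```

Applied in a loop over df_test.columns, a large value would flag a feature whose distribution differs between the two splits.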

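Plot the correlation heatmap between features

# Plot the correlation heatmap
plt.figure(dpi = 300,figsize=(20, 16))
# Get the column labels
column = df_train.columns.tolist()  
mcorr = df_train[column].corr(method="spearman")  
# Create an all-False mask with the same shape as the correlation matrix
mask = np.zeros_like(mcorr, dtype=bool)  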
mask[np.triu_indices_from(mask)] = True  
cmap = sns.diverging_palette(220, 10, as_cmap=True)  
g = sns.heatmap(mcorr, mask=mask, cmap=cmap, square=True, annot=True, fmt='0.2f')  
# plt.savefig('2.png')
plt.show()

Auto ML

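Convert the data to the format AutoGluon expects and define the label to predict. The settings ignore time cost in pursuit of the best model and use 5-fold cross-validation (bagging) with model ensembling.
Training on CPU took about 20 minutes; the best model is WeightedEnsemble_L3.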

from autogluon.tabular import TabularDataset, TabularPredictor
train_data = TabularDataset(df_train)
# Label to predict
label = 'satisfaction_level'
# Directory where the models are saved
save_path = 'agModels-predictClass'
# Build the predictor; verbosity ranges from 0 to 4, and the default of 2 is fine
predictor = TabularPredictor(label=label,path=save_path,verbosity=0)
# presets='best_quality' ignores time cost in pursuit of the best model
predictor.fit(train_data,presets='best_quality',num_bag_folds=5,num_bag_sets=1,num_stack_levels=1)
# Show model performance
predictor.leaderboard(silent=True)

Output:

	model	score_val	pred_time_val	fit_time	pred_time_val_marginal	fit_time_marginal	stack_level	can_infer	fit_order
0	WeightedEnsemble_L3	-0.172081	7.343329	333.876595	0.001310	0.623922	3	True	22
1	ExtraTreesMSE_BAG_L2	-0.172605	5.116233	172.232698	1.107705	7.378392	2	True	17
2	CatBoost_BAG_L2	-0.173452	4.080730	179.991769	0.072202	15.137464	2	True	16
3	LightGBMXT_BAG_L2	-0.173462	4.172402	174.590106	0.163874	9.735801	2	True	13
4	RandomForestMSE_BAG_L2	-0.173785	4.957497	191.662132	0.948968	26.807827	2	True	15
5	LightGBM_BAG_L2	-0.173939	4.105616	175.162125	0.097088	10.307820	2	True	14
6	WeightedEnsemble_L2	-0.174116	2.045973	34.980294	0.001211	0.699372	2	True	12
7	XGBoost_BAG_L2	-0.174219	4.094137	180.652536	0.085609	15.798230	2	True	19
8	LightGBMLarge_BAG_L2	-0.174762	4.210132	182.043600	0.201604	17.189295	2	True	21
9	NeuralNetFastAI_BAG_L2	-0.175477	4.541154	209.018030	0.532626	44.163725	2	True	18
10	RandomForestMSE_BAG_L1	-0.175685	0.904012	7.887363	0.904012	7.887363	1	True	5
11	ExtraTreesMSE_BAG_L1	-0.177117	0.847087	4.248557	0.847087	4.248557	1	True	7
12	NeuralNetTorch_BAG_L2	-0.177767	4.301633	212.179403	0.293104	47.325097	2	True	20
13	LightGBMLarge_BAG_L1	-0.179927	0.274584	11.694356	0.274584	11.694356	1	True	11
14	XGBoost_BAG_L1	-0.180700	0.115822	7.230349	0.115822	7.230349	1	True	9
15	CatBoost_BAG_L1	-0.180793	0.053771	14.898683	0.053771	14.898683	1	True	6
16	LightGBM_BAG_L1	-0.181259	0.313841	8.541626	0.313841	8.541626	1	True	4
17	LightGBMXT_BAG_L1	-0.183204	0.583974	9.379441	0.583974	9.379441	1	True	3
18	NeuralNetFastAI_BAG_L1	-0.188396	0.415200	45.523810	0.415200	45.523810	1	True	8
19	NeuralNetTorch_BAG_L1	-0.192048	0.241119	54.613551	0.241119	54.613551	1	True	10
20	KNeighborsDist_BAG_L1	-0.195021	0.124069	0.015969	0.124069	0.015969	1	True	2
21	KNeighborsUnif_BAG_L1	-0.196171	0.135048	0.820599	0.135048	0.820599	1	True	1
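
The score_val values are negative because AutoGluon reports every metric in a higher-is-better form, so error metrics are shown with their sign flipped. Assuming the predictor used its default regression metric (root mean squared error), -0.172081 would correspond to a validation RMSE of about 0.172. The sketch below illustrates the sign convention on made-up numbers.

```python
import numpy as np

def rmse(y_true, y_pred):
    """Root mean squared error between two equally long sequences."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    return float(np.sqrt(np.mean((y_true - y_pred) ** 2)))

# Made-up targets and predictions, not taken from the dataset.
y_true = [0.5, 0.8, 0.2, 0.9]
y_pred = [0.4, 0.7, 0.4, 0.9]

error = rmse(y_true, y_pred)   # lower is better
score = -error                 # higher is better, as shown in the leaderboard
print(f"RMSE = {error:.4f}, leaderboard-style score = {score:.4f}")
```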

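Feature importance

# Delete the other models (reduces memory overhead); dry_run=False performs the deletion instead of only listing it
predictor.delete_models(models_to_keep='best', dry_run=False)
# Show the best model
predictor.get_model_best()
# Show feature importance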
predictor.feature_importance(train_data)

Output:

	importance	stddev	p_value	n	p99_high	p99_low
number_project	0.140325	0.001946	4.437726e-09	5	0.144332	0.136318
average_monthly_hours	0.120057	0.001612	3.897295e-09	5	0.123376	0.116739
time_spend_company	0.113286	0.002141	1.531254e-08	5	0.117695	0.108877
last_evaluation	0.108663	0.000795	3.442178e-10	5	0.110301	0.107026
package	0.071335	0.000921	3.339475e-09	5	0.073232	0.069438
salary	0.034672	0.001662	6.312180e-07	5	0.038093	0.031250
Work_accident	0.016181	0.000579	1.958612e-07	5	0.017373	0.014990
division_sales	0.016054	0.000679	3.826687e-07	5	0.017451	0.014656
division_technical	0.015590	0.000977	1.843122e-06	5	0.017603	0.013578
division_support	0.012123	0.000587	6.598496e-07	5	0.013332	0.010913
division_IT	0.007362	0.000297	3.168499e-07	5	0.007973	0.006750
division_product_mng	0.006758	0.000402	1.501938e-06	5	0.007587	0.005930
division_marketing	0.006032	0.000617	1.297950e-05	5	0.007302	0.004761
division_accounting	0.005793	0.000534	8.553800e-06	5	0.006892	0.004694
division_RandD	0.005265	0.000286	1.042488e-06	5	0.005854	0.004676
division_hr	0.004269	0.000494	2.108536e-05	5	0.005286	0.003253
division_management	0.003864	0.000752	1.638736e-04	5	0.005413	0.002316
promotion_last_5years	0.002596	0.000383	5.523791e-05	5	0.003385	0.001807

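Prediction

Load the test dataset; AutoGluon automatically uses the best model for prediction

# Load the data to predict on
test_data = TabularDataset(df_test)
# Load the model
predictor = TabularPredictor.load(save_path)
# Get the predictions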
y_pred = predictor.predict(test_data)
y_pred
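
One detail to watch: rebuilding the frames from the scaler's bare arrays resets the index, so the id index set by index_col='id' is no longer attached to the predictions. The sketch below uses toy data and stand-in values (nothing here comes from the real run) to show how ids saved before scaling could be reattached by position.

```python
import pandas as pd

# Toy stand-in for the raw test set, indexed by id as with index_col='id'.
raw_test = pd.DataFrame(
    {"last_evaluation": [0.5, 0.9, 0.7]},
    index=pd.Index([101, 102, 103], name="id"),
)
test_ids = raw_test.index.to_numpy()  # hypothetical: saved before scaling

# A frame rebuilt from a bare array gets a fresh 0..n-1 index.
scaled_test = pd.DataFrame(raw_test.to_numpy(), columns=raw_test.columns)

# Stand-in for predictor.predict(...), aligned with scaled_test's index.
y_pred = pd.Series([0.61, 0.42, 0.77], index=scaled_test.index,
                   name="satisfaction_level")

# Row order is unchanged, so ids and predictions pair up by position.
submission = pd.DataFrame({"id": test_ids,
                           "satisfaction_level": y_pred.to_numpy()})
print(submission)
```

The resulting frame could then be written out with submission.to_csv(..., index=False).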

Image Classification

Splitting the dataset

import autogluon.core as ag
from autogluon.vision import ImagePredictor, ImageDataset
train_data, _, test_data = ImageDataset.from_folders('../input/children-vs-adults-images', train='train', test='test')
print('train #', len(train_data), 'test #', len(test_data))

Output:

train # 680 test # 120
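
ImageDataset.from_folders takes the class labels from the subfolder names inside each split folder, so the layout is assumed to be train/<class>/<image> and test/<class>/<image>. The sketch below is a small standard-library helper for checking such a layout before training. The class folder names "adults" and "children" and the file names are hypothetical, created in a temporary directory for the demonstration.

```python
import tempfile
from pathlib import Path

def count_images_per_class(split_dir, exts=(".jpg", ".jpeg", ".png")):
    """Count image files in each class subfolder of a split directory."""
    counts = {}
    for class_dir in sorted(Path(split_dir).iterdir()):
        if class_dir.is_dir():
            counts[class_dir.name] = sum(
                1 for f in class_dir.iterdir() if f.suffix.lower() in exts
            )
    return counts

# Build a tiny fake dataset to demonstrate the helper.
with tempfile.TemporaryDirectory() as root:
    for cls, n in (("adults", 3), ("children", 2)):
        class_dir = Path(root) / "train" / cls
        class_dir.mkdir(parents=True)
        for i in range(n):
            (class_dir / f"{i}.jpg").touch()
    demo_counts = count_images_per_class(Path(root) / "train")

print(demo_counts)  # {'adults': 3, 'children': 2}
```

Pointed at the real train and test folders, the per-class counts should add up to the totals printed above.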

Auto ML

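Again, ignore time cost and aim for the best model.
Training this model on a Kaggle GPU took almost 12 hours! The model reaches 96.05% accuracy on the training set and 97.06% on the validation set, which is very good.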

predictor = ImagePredictor(verbosity=2)
predictor.fit(train_data,presets ='best_quality')

Output:

Finished, total runtime is 42713.23 s
{ 'best_config': { 'augmentation': { 'auto_augment': None,
                                     'color_jitter': 0.4,
                                     'cutmix': 0.0,
                                     'cutmix_minmax': None,
                                     'drop': 0.0,
                                     'drop_block': None,
                                     'drop_path': None,
                                     'hflip': 0.5,
                                     'mixup': 0.0,
                                     'mixup_mode': 'batch',
                                     'mixup_off_epoch': 0,
                                     'mixup_prob': 1.0,
                                     'mixup_switch_prob': 0.5,
                                     'no_aug': False,
                                     'ratio': (0.75, 1.3333333333333333),
                                     'scale': (0.08, 1.0),
                                     'smoothing': 0.1,
                                     'train_interpolation': 'random',
                                     'vflip': 0.0},
                   'data': { 'crop_pct': 0.99,
                             'img_size': None,
                             'input_size': None,
                             'interpolation': '',
                             'mean': None,
                             'std': None,
                             'validation_batch_size_multiplier': 1},
                   'estimator': <class 'gluoncv.auto.estimators.torch_image_classification.torch_image_classification.TorchImageClassificationEstimator'>,
                   'gpus': [0],
                   'img_cls': { 'global_pool_type': None,
                                'model': 'swin_base_patch4_window7_224',
                                'pretrained': True},
                   'misc': { 'amp': False,
                             'apex_amp': False,
                             'eval_metric': 'top1',
                             'log_interval': 50,
                             'native_amp': False,
                             'num_workers': 2,
                             'pin_mem': False,
                             'prefetcher': False,
                             'save_images': False,
                             'seed': 467,
                             'torchscript': False,
                             'tta': 0,
                             'use_multi_epochs_loader': False},
                   'model_ema': { 'model_ema': True,
                                  'model_ema_decay': 0.9998,
                                  'model_ema_force_cpu': False},
                   'optimizer': { 'clip_grad': None,
                                  'clip_mode': 'norm',
                                  'momentum': 0.9,
                                  'opt': 'sgd',
                                  'opt_betas': None,
                                  'opt_eps': None,
                                  'weight_decay': 0.0001},
                   'train': { 'batch_size': 16,
                              'bn_eps': None,
                              'bn_momentum': None,
                              'cooldown_epochs': 10,
                              'decay_epochs': 30,
                              'decay_rate': 0.1,
                              'early_stop_baseline': -inf,
                              'early_stop_max_value': inf,
                              'early_stop_min_delta': 0.001,
                              'early_stop_patience': 50,
                              'epochs': 200,
                              'lr': 0.0005115112828551085,
                              'lr_cycle_limit': 1,
                              'lr_cycle_mul': 1.0,
                              'lr_noise': None,
                              'lr_noise_pct': 0.67,
                              'lr_noise_std': 1.0,
                              'min_lr': 1e-05,
                              'output_lr_mult': 0.1,
                              'patience_epochs': 10,
                              'sched': 'step',
                              'start_epoch': 0,
                              'sync_bn': False,
                              'transfer_lr_mult': 0.01,
                              'warmup_epochs': 3,
                              'warmup_lr': 0.0001}},
  'total_time': 42712.701295375824,
  'train_acc': 0.9605263157894737,
  'valid_acc': 0.9705882352941176}

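Prediction

The best model reaches 89.2% accuracy on the test set, which is also a very good result and 2 points better than the model I tuned by hand!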

test_acc = predictor.evaluate(test_data)
print('Top-1 test acc: %.3f' % test_acc['top1'])

Output:

Top-1 test acc: 0.892
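
For scale: with 120 test images, a top-1 accuracy that prints as 0.892 corresponds to 107 correct predictions (107 / 120 ≈ 0.8917). The sketch below computes top-1 accuracy by hand on made-up labels; the 60/60 class split and the positions of the errors are invented for illustration.

```python
def top1_accuracy(y_true, y_pred):
    """Fraction of predictions that match the true label."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)

# 120 made-up labels with 13 deliberately wrong predictions.
y_true = [0] * 60 + [1] * 60
y_pred = list(y_true)
for i in range(13):
    y_pred[i] = 1 - y_pred[i]

acc = top1_accuracy(y_true, y_pred)
print('Top-1 test acc: %.3f' % acc)  # Top-1 test acc: 0.892
```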

Print the predicted labels for the test set

result = predictor.predict(test_data)
print(result)

Output:

0      0
1      0
2      0
3      0
4      0
      ..
115    1
116    1
117    1
118    1
119    1
Name: label, Length: 120, dtype: int64

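Multimodal Data Prediction

Multimodal data here means tabular data + text data + image data; the prediction is based on an ensemble of multiple models.

Downloading the dataset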

import os

download_dir = './ag_petfinder_tutorial'
zip_file = 'https://automl-mm-bench.s3.amazonaws.com/petfinder_kaggle.zip'
from autogluon.core.utils.loaders import load_zip
load_zip.unzip(zip_file, unzip_dir=download_dir)

dataset_path = download_dir + '/petfinder_processed'
os.listdir(dataset_path)
# train_images: training set images
# test_images: test set images
# train.csv: training set labels, features, and text
# test.csv: test set labels, features, and text
# dev.csv: labels for the whole dataset

Loading the data

train_data = pd.read_csv(f'{dataset_path}/train.csv', index_col=0)
test_data = pd.read_csv(f'{dataset_path}/dev.csv', index_col=0)
train_data.info()

Output:

<class 'pandas.core.frame.DataFrame'>
Int64Index: 11994 entries, 10721 to 5640
Data columns (total 25 columns):
 #   Column         Non-Null Count  Dtype  
---  ------         --------------  -----  
 0   Type           11994 non-null  int64  
 1   Name           10988 non-null  object 
 2   Age            11994 non-null  int64  
 3   Breed1         11994 non-null  int64  
 4   Breed2         11994 non-null  int64  
 5   Gender         11994 non-null  int64  
 6   Color1         11994 non-null  int64  
 7   Color2         11994 non-null  int64  
 8   Color3         11994 non-null  int64  
 9   MaturitySize   11994 non-null  int64  
 10  FurLength      11994 non-null  int64  
 11  Vaccinated     11994 non-null  int64  
 12  Dewormed       11994 non-null  int64  
 13  Sterilized     11994 non-null  int64  
 14  Health         11994 non-null  int64  
 15  Quantity       11994 non-null  int64  
 16  Fee            11994 non-null  int64  
 17  State          11994 non-null  int64  
 18  RescuerID      11994 non-null  object 
 19  VideoAmt       11994 non-null  int64  
 20  Description    11986 non-null  object 
 21  PetID          11994 non-null  object 
 22  PhotoAmt       11994 non-null  float64
 23  AdoptionSpeed  11994 non-null  int64  
 24  Images         11994 non-null  object 
dtypes: float64(1), int64(19), object(5)
memory usage: 2.4+ MB

Tell AutoGluon which column is the label and which column holds the image paths

# Value to predict
label = 'AdoptionSpeed'
# Column holding the image paths
image_col = 'Images'

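Each animal has 2 to 3 different photos. Because AutoGluon can read only one image per row, only the first path in the image path column is kept.

# Keep the first image of each row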
train_data[image_col] = train_data[image_col].apply(lambda ele: ele.split(';')[0])
test_data[image_col] = test_data[image_col].apply(lambda ele: ele.split(';')[0])

train_data[image_col].iloc[0]

Convert the image paths to absolute paths

# Expand the image paths from the CSV file into full paths
def path_expander(path, base_folder):
    path_l = path.split(';')
    return ';'.join([os.path.abspath(os.path.join(base_folder, path)) for path in path_l])

train_data[image_col] = train_data[image_col].apply(lambda ele: path_expander(ele, base_folder=dataset_path))
test_data[image_col] = test_data[image_col].apply(lambda ele: path_expander(ele, base_folder=dataset_path))

train_data[image_col].iloc[0]
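
Before a long training run it can be worth confirming that the expanded paths really point to files. The sketch below is a small standard-library check, demonstrated on a temporary folder with hypothetical file names (a-1.jpg exists, b-1.jpg does not).

```python
import os
import tempfile

def missing_paths(paths):
    """Return the paths that do not point to an existing file."""
    return [p for p in paths if not os.path.isfile(p)]

# Demonstration on a temporary folder with one real and one absent file.
with tempfile.TemporaryDirectory() as base_folder:
    image_dir = os.path.join(base_folder, "train_images")
    os.makedirs(image_dir)
    existing = os.path.join(image_dir, "a-1.jpg")
    open(existing, "w").close()
    absent = os.path.join(image_dir, "b-1.jpg")

    demo_missing = missing_paths([existing, absent])
    only_absent_reported = demo_missing == [absent]

print(len(demo_missing), only_absent_reported)  # 1 True
```

On the real data the same helper could be called as missing_paths(train_data[image_col]); an empty list would mean every row has a readable image path.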

Displaying an image

example_row = train_data.iloc[1]
example_image = example_row['Images']

from IPython.display import Image, display
pil_img = Image(filename=example_image)
display(pil_img)

Subsample the data to quickly identify which models are worth training

# Subsample the data (to learn which models are worth training)
train_data = train_data.sample(5000, random_state=0)

Multimodal feature extraction

from autogluon.tabular import FeatureMetadata
feature_metadata = FeatureMetadata.from_df(train_data)

print(feature_metadata)

Output:

('float', [])        :  1 | ['PhotoAmt']
('int', [])          : 19 | ['Type', 'Age', 'Breed1', 'Breed2', 'Gender', ...]
('object', [])       :  4 | ['Name', 'RescuerID', 'PetID', 'Images']
('object', ['text']) :  1 | ['Description']

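Mark the Images column as an image path feature (so the trainer can recognize it)

# Mark the Images column as an image path feature (so the trainer can recognize it)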
feature_metadata = feature_metadata.add_special_types({image_col: ['image_path']})
print(feature_metadata)

Output:

('float', [])              :  1 | ['PhotoAmt']
('int', [])                : 19 | ['Type', 'Age', 'Breed1', 'Breed2', 'Gender', ...]
('object', [])             :  3 | ['Name', 'RescuerID', 'PetID']
('object', ['image_path']) :  1 | ['Images']
('object', ['text'])       :  1 | ['Description']

Auto ML

from autogluon.tabular.configs.hyperparameter_configs import get_hyperparameter_config
# Multimodal training configuration
hyperparameters = get_hyperparameter_config('multimodal')
hyperparameters

Limit training to at most 8 hours and use GPU acceleration

from autogluon.tabular import TabularPredictor
predictor = TabularPredictor(label=label).fit(
    train_data=train_data,
    hyperparameters=hyperparameters,
    feature_metadata=feature_metadata,
    presets = 'best_quality',
    time_limit=8*3600,
    ag_args_fit={'num_gpus': 1})  # request one GPU per model fit

Output:

Fitting model: LightGBMLarge_BAG_L1 ... Training model for up to 17493.06s of the 27095.97s of remaining time.
	Fitting 8 child models (S1F1 - S1F8) | Fitting with ParallelLocalFoldFittingStrategy
	0.432	 = Validation score   (accuracy)
	386.71s	 = Training   runtime
	1.49s	 = Validation runtime
Fitting model: TextPredictor_BAG_L1 ... Training model for up to 17096.1s of the 26699.01s of remaining time.
	Fitting 8 child models (S1F1 - S1F8) | Fitting with ParallelLocalFoldFittingStrategy
E1019 05:14:49.161276010    2762 chttp2_transport.cc:1103]   Received a GOAWAY with error code ENHANCE_YOUR_CALM and debug data equal to "too_many_pings"
	0.3634	 = Validation score   (accuracy)
	3734.32s	 = Training   runtime
	18.76s	 = Validation runtime
Fitting model: ImagePredictor_BAG_L1 ... Training model for up to 13344.84s of the 22947.75s of remaining time.
	Fitting 8 child models (S1F1 - S1F8) | Fitting with ParallelLocalFoldFittingStrategy
	0.3524	 = Validation score   (accuracy)
	4517.97s	 = Training   runtime
	28.6s	 = Validation runtime
Completed 1/20 k-fold bagging repeats ...
Fitting model: WeightedEnsemble_L2 ... Training model for up to 1919.15s of the 18419.43s of remaining time.
	0.4578	 = Validation score   (accuracy)
	1.42s	 = Training   runtime
	0.0s	 = Validation runtime
Fitting 6 L2 models ...
Fitting model: LightGBM_BAG_L2 ... Training model for up to 18417.99s of the 18417.59s of remaining time.
	Fitting 8 child models (S1F1 - S1F8) | Fitting with ParallelLocalFoldFittingStrategy
	0.4564	 = Validation score   (accuracy)
	199.13s	 = Training   runtime
	1.04s	 = Validation runtime
Fitting model: LightGBMXT_BAG_L2 ... Training model for up to 18213.83s of the 18213.57s of remaining time.
	Fitting 8 child models (S1F1 - S1F8) | Fitting with ParallelLocalFoldFittingStrategy
	0.4528	 = Validation score   (accuracy)
	140.68s	 = Training   runtime
	0.68s	 = Validation runtime
Fitting model: CatBoost_BAG_L2 ... Training model for up to 18067.64s of the 18067.38s of remaining time.
	Fitting 8 child models (S1F1 - S1F8) | Fitting with ParallelLocalFoldFittingStrategy
	0.463	 = Validation score   (accuracy)
	1012.6s	 = Training   runtime
	1.73s	 = Validation runtime
Fitting model: XGBoost_BAG_L2 ... Training model for up to 17049.43s of the 17049.16s of remaining time.
	Fitting 8 child models (S1F1 - S1F8) | Fitting with ParallelLocalFoldFittingStrategy
	0.4614	 = Validation score   (accuracy)
	355.78s	 = Training   runtime
	0.67s	 = Validation runtime
Fitting model: NeuralNetTorch_BAG_L2 ... Training model for up to 16687.78s of the 16687.5s of remaining time.
	Fitting 8 child models (S1F1 - S1F8) | Fitting with ParallelLocalFoldFittingStrategy
	0.4528	 = Validation score   (accuracy)
	62.11s	 = Training   runtime
	0.85s	 = Validation runtime
Fitting model: LightGBMLarge_BAG_L2 ... Training model for up to 16621.21s of the 16620.96s of remaining time.
	Fitting 8 child models (S1F1 - S1F8) | Fitting with ParallelLocalFoldFittingStrategy
	0.4588	 = Validation score   (accuracy)
	698.95s	 = Training   runtime
	2.09s	 = Validation runtime
Repeating k-fold bagging: 2/20
Fitting model: LightGBM_BAG_L2 ... Training model for up to 15917.11s of the 15916.86s of remaining time.
	Fitting 8 child models (S2F1 - S2F8) | Fitting with ParallelLocalFoldFittingStrategy
	0.4474	 = Validation score   (accuracy)
	361.11s	 = Training   runtime
	1.7s	 = Validation runtime
Fitting model: LightGBMXT_BAG_L2 ... Training model for up to 15750.46s of the 15750.21s of remaining time.
	Fitting 8 child models (S2F1 - S2F8) | Fitting with ParallelLocalFoldFittingStrategy
	0.4566	 = Validation score   (accuracy)
	383.67s	 = Training   runtime
	2.51s	 = Validation runtime
Fitting model: CatBoost_BAG_L2 ... Training model for up to 15502.33s of the 15502.08s of remaining time.
	Fitting 8 child models (S2F1 - S2F8) | Fitting with ParallelLocalFoldFittingStrategy
	0.4534	 = Validation score   (accuracy)
	2029.12s	 = Training   runtime
	3.6s	 = Validation runtime
Fitting model: XGBoost_BAG_L2 ... Training model for up to 14480.59s of the 14480.34s of remaining time.
	Fitting 8 child models (S2F1 - S2F8) | Fitting with ParallelLocalFoldFittingStrategy
	0.4574	 = Validation score   (accuracy)
	713.87s	 = Training   runtime
	1.32s	 = Validation runtime
Fitting model: NeuralNetTorch_BAG_L2 ... Training model for up to 14118.13s of the 14117.89s of remaining time.
	Fitting 8 child models (S2F1 - S2F8) | Fitting with ParallelLocalFoldFittingStrategy
	0.4494	 = Validation score   (accuracy)
	123.94s	 = Training   runtime
	1.64s	 = Validation runtime
Fitting model: LightGBMLarge_BAG_L2 ... Training model for up to 14051.54s of the 14051.05s of remaining time.
	Fitting 8 child models (S2F1 - S2F8) | Fitting with ParallelLocalFoldFittingStrategy
	0.4516	 = Validation score   (accuracy)
	1290.52s	 = Training   runtime
	3.15s	 = Validation runtime
Repeating k-fold bagging: 3/20
Fitting model: LightGBM_BAG_L2 ... Training model for up to 13454.83s of the 13454.58s of remaining time.
	Fitting 8 child models (S3F1 - S3F8) | Fitting with ParallelLocalFoldFittingStrategy
	0.4534	 = Validation score   (accuracy)
	565.3s	 = Training   runtime
	2.79s	 = Validation runtime
Fitting model: LightGBMXT_BAG_L2 ... Training model for up to 13245.22s of the 13244.89s of remaining time.
	Fitting 8 child models (S3F1 - S3F8) | Fitting with ParallelLocalFoldFittingStrategy
	0.4552	 = Validation score   (accuracy)
	562.97s	 = Training   runtime
	3.64s	 = Validation runtime
Fitting model: CatBoost_BAG_L2 ... Training model for up to 13061.41s of the 13061.16s of remaining time.
	Fitting 8 child models (S3F1 - S3F8) | Fitting with ParallelLocalFoldFittingStrategy
	0.4538	 = Validation score   (accuracy)
	2989.07s	 = Training   runtime
	5.31s	 = Validation runtime
Fitting model: XGBoost_BAG_L2 ... Training model for up to 12096.7s of the 12096.4s of remaining time.
	Fitting 8 child models (S3F1 - S3F8) | Fitting with ParallelLocalFoldFittingStrategy
	0.4628	 = Validation score   (accuracy)
	1065.02s	 = Training   runtime
	1.96s	 = Validation runtime
Fitting model: NeuralNetTorch_BAG_L2 ... Training model for up to 11741.2s of the 11740.95s of remaining time.
	Fitting 8 child models (S3F1 - S3F8) | Fitting with ParallelLocalFoldFittingStrategy
	0.4536	 = Validation score   (accuracy)
	191.06s	 = Training   runtime
	2.32s	 = Validation runtime
Fitting model: LightGBMLarge_BAG_L2 ... Training model for up to 11669.58s of the 11669.33s of remaining time.
	Fitting 8 child models (S3F1 - S3F8) | Fitting with ParallelLocalFoldFittingStrategy
	0.455	 = Validation score   (accuracy)
	1921.24s	 = Training   runtime
	4.67s	 = Validation runtime
Repeating k-fold bagging: 4/20
Fitting model: LightGBM_BAG_L2 ... Training model for up to 11032.26s of the 11032.01s of remaining time.
	Fitting 8 child models (S4F1 - S4F8) | Fitting with ParallelLocalFoldFittingStrategy
	0.4564	 = Validation score   (accuracy)
	779.74s	 = Training   runtime
	4.27s	 = Validation runtime
Fitting model: LightGBMXT_BAG_L2 ... Training model for up to 10813.47s of the 10813.22s of remaining time.
	Fitting 8 child models (S4F1 - S4F8) | Fitting with ParallelLocalFoldFittingStrategy
	0.4564	 = Validation score   (accuracy)
	724.24s	 = Training   runtime
	4.74s	 = Validation runtime
Fitting model: CatBoost_BAG_L2 ... Training model for up to 10647.91s of the 10647.64s of remaining time.
	Fitting 8 child models (S4F1 - S4F8) | Fitting with ParallelLocalFoldFittingStrategy
	0.4544	 = Validation score   (accuracy)
	4043.74s	 = Training   runtime
	7.46s	 = Validation runtime
Fitting model: XGBoost_BAG_L2 ... Training model for up to 9588.75s of the 9588.44s of remaining time.
	Fitting 8 child models (S4F1 - S4F8) | Fitting with ParallelLocalFoldFittingStrategy
	0.4632	 = Validation score   (accuracy)
	1380.01s	 = Training   runtime
	2.51s	 = Validation runtime
Fitting model: NeuralNetTorch_BAG_L2 ... Training model for up to 9269.72s of the 9269.47s of remaining time.
	Fitting 8 child models (S4F1 - S4F8) | Fitting with ParallelLocalFoldFittingStrategy
	0.4528	 = Validation score   (accuracy)
	249.06s	 = Training   runtime
	3.08s	 = Validation runtime
Fitting model: LightGBMLarge_BAG_L2 ... Training model for up to 9207.7s of the 9207.42s of remaining time.
	Fitting 8 child models (S4F1 - S4F8) | Fitting with ParallelLocalFoldFittingStrategy
	0.4544	 = Validation score   (accuracy)
	2604.8s	 = Training   runtime
	6.68s	 = Validation runtime
Repeating k-fold bagging: 5/20
Fitting model: LightGBM_BAG_L2 ... Training model for up to 8518.63s of the 8518.39s of remaining time.
	Fitting 8 child models (S5F1 - S5F8) | Fitting with ParallelLocalFoldFittingStrategy
	0.4568	 = Validation score   (accuracy)
	969.05s	 = Training   runtime
	5.1s	 = Validation runtime
Fitting model: LightGBMXT_BAG_L2 ... Training model for up to 8324.31s of the 8324.06s of remaining time.
	Fitting 8 child models (S5F1 - S5F8) | Fitting with ParallelLocalFoldFittingStrategy
	0.46	 = Validation score   (accuracy)
	954.32s	 = Training   runtime
	6.73s	 = Validation runtime
Fitting model: CatBoost_BAG_L2 ... Training model for up to 8089.46s of the 8089.21s of remaining time.
	Fitting 8 child models (S5F1 - S5F8) | Fitting with ParallelLocalFoldFittingStrategy
	0.4526	 = Validation score   (accuracy)
	4982.74s	 = Training   runtime
	9.3s	 = Validation runtime
Fitting model: XGBoost_BAG_L2 ... Training model for up to 7145.47s of the 7145.16s of remaining time.
	Fitting 8 child models (S5F1 - S5F8) | Fitting with ParallelLocalFoldFittingStrategy
	0.4618	 = Validation score   (accuracy)
	1687.99s	 = Training   runtime
	3.07s	 = Validation runtime
Fitting model: NeuralNetTorch_BAG_L2 ... Training model for up to 6833.41s of the 6833.16s of remaining time.
	Fitting 8 child models (S5F1 - S5F8) | Fitting with ParallelLocalFoldFittingStrategy
	0.4534	 = Validation score   (accuracy)
	311.52s	 = Training   runtime
	3.78s	 = Validation runtime
Fitting model: LightGBMLarge_BAG_L2 ... Training model for up to 6766.63s of the 6766.35s of remaining time.
	Fitting 8 child models (S5F1 - S5F8) | Fitting with ParallelLocalFoldFittingStrategy
	0.4552	 = Validation score   (accuracy)
	3278.15s	 = Training   runtime
	8.55s	 = Validation runtime
Repeating k-fold bagging: 6/20
Fitting model: LightGBM_BAG_L2 ... Training model for up to 6087.9s of the 6087.65s of remaining time.
	Fitting 8 child models (S6F1 - S6F8) | Fitting with ParallelLocalFoldFittingStrategy
	0.4586	 = Validation score   (accuracy)
	1164.77s	 = Training   runtime
	6.08s	 = Validation runtime
Fitting model: LightGBMXT_BAG_L2 ... Training model for up to 5886.76s of the 5886.47s of remaining time.
	Fitting 8 child models (S6F1 - S6F8) | Fitting with ParallelLocalFoldFittingStrategy
	0.4624	 = Validation score   (accuracy)
	1100.85s	 = Training   runtime
	7.32s	 = Validation runtime
Fitting model: CatBoost_BAG_L2 ... Training model for up to 5735.64s of the 5735.38s of remaining time.
	Fitting 8 child models (S6F1 - S6F8) | Fitting with ParallelLocalFoldFittingStrategy
	0.4518	 = Validation score   (accuracy)
	5980.75s	 = Training   runtime
	11.3s	 = Validation runtime
Fitting model: XGBoost_BAG_L2 ... Training model for up to 4732.41s of the 4732.16s of remaining time.
	Fitting 8 child models (S6F1 - S6F8) | Fitting with ParallelLocalFoldFittingStrategy
	0.4578	 = Validation score   (accuracy)
	1990.26s	 = Training   runtime
	3.59s	 = Validation runtime
Fitting model: NeuralNetTorch_BAG_L2 ... Training model for up to 4425.39s of the 4425.12s of remaining time.
	Fitting 8 child models (S6F1 - S6F8) | Fitting with ParallelLocalFoldFittingStrategy
	0.4546	 = Validation score   (accuracy)
	377.89s	 = Training   runtime
	4.44s	 = Validation runtime
Fitting model: LightGBMLarge_BAG_L2 ... Training model for up to 4354.59s of the 4354.33s of remaining time.
	Fitting 8 child models (S6F1 - S6F8) | Fitting with ParallelLocalFoldFittingStrategy
	0.4524	 = Validation score   (accuracy)
	3901.7s	 = Training   runtime
	10.22s	 = Validation runtime
Repeating k-fold bagging: 7/20
Fitting model: LightGBM_BAG_L2 ... Training model for up to 3725.6s of the 3725.34s of remaining time.
	Fitting 8 child models (S7F1 - S7F8) | Fitting with ParallelLocalFoldFittingStrategy
	0.4596	 = Validation score   (accuracy)
	1347.21s	 = Training   runtime
	6.95s	 = Validation runtime
Fitting model: LightGBMXT_BAG_L2 ... Training model for up to 3538.96s of the 3538.7s of remaining time.
	Fitting 8 child models (S7F1 - S7F8) | Fitting with ParallelLocalFoldFittingStrategy
	0.4622	 = Validation score   (accuracy)
	1247.92s	 = Training   runtime
	7.96s	 = Validation runtime
Fitting model: CatBoost_BAG_L2 ... Training model for up to 3387.65s of the 3387.4s of remaining time.
	Fitting 8 child models (S7F1 - S7F8) | Fitting with ParallelLocalFoldFittingStrategy
	0.452	 = Validation score   (accuracy)
	6844.92s	 = Training   runtime
	13.13s	 = Validation runtime
Fitting model: XGBoost_BAG_L2 ... Training model for up to 2519.12s of the 2518.87s of remaining time.
	Fitting 8 child models (S7F1 - S7F8) | Fitting with ParallelLocalFoldFittingStrategy
	0.4578	 = Validation score   (accuracy)
	2340.68s	 = Training   runtime
	4.2s	 = Validation runtime
Fitting model: NeuralNetTorch_BAG_L2 ... Training model for up to 2164.52s of the 2164.27s of remaining time.
	Fitting 8 child models (S7F1 - S7F8) | Fitting with ParallelLocalFoldFittingStrategy
	0.4576	 = Validation score   (accuracy)
	441.27s	 = Training   runtime
	5.19s	 = Validation runtime
Fitting model: LightGBMLarge_BAG_L2 ... Training model for up to 2097.19s of the 2096.93s of remaining time.
	Fitting 8 child models (S7F1 - S7F8) | Fitting with ParallelLocalFoldFittingStrategy
	0.4582	 = Validation score   (accuracy)
	4558.04s	 = Training   runtime
	11.89s	 = Validation runtime
Completed 7/20 k-fold bagging repeats ...
Fitting model: WeightedEnsemble_L3 ... Training model for up to 1841.8s of the 1436.21s of remaining time.
	0.4694	 = Validation score   (accuracy)
	0.93s	 = Training   runtime
	0.0s	 = Validation runtime
AutoGluon training complete, total runtime = 27364.79s ... Best model: "WeightedEnsemble_L3"
TabularPredictor saved. To load, use: predictor = TabularPredictor.load("AutogluonModels/ag-20221019_043937/")
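
A log this long is easier to read once it is reduced to one score per model. The sketch below is a small standard-library parser that assumes the line shapes seen above ("Fitting model: <name> ..." followed later by a "= Validation score" line). Because bagging repeats refit the same model names, it keeps the last score reported for each model. The sample text is copied from the log above.

```python
import re

MODEL_RE = re.compile(r"^Fitting model: (\S+) \.\.\.")
SCORE_RE = re.compile(r"^\s*([0-9.]+)\s*= Validation score")

def parse_validation_scores(log_text):
    """Map each model name to the last validation score reported for it."""
    scores = {}
    current = None
    for line in log_text.splitlines():
        model_match = MODEL_RE.match(line)
        if model_match:
            current = model_match.group(1)
            continue
        score_match = SCORE_RE.match(line)
        if score_match and current is not None:
            scores[current] = float(score_match.group(1))
    return scores

sample_log = (
    "Fitting model: XGBoost_BAG_L2 ... Training model for up to 2519.12s of the 2518.87s of remaining time.\n"
    "\tFitting 8 child models (S7F1 - S7F8) | Fitting with ParallelLocalFoldFittingStrategy\n"
    "\t0.4578\t = Validation score   (accuracy)\n"
    "\t2340.68s\t = Training   runtime\n"
    "Fitting model: WeightedEnsemble_L3 ... Training model for up to 1841.8s of the 1436.21s of remaining time.\n"
    "\t0.4694\t = Validation score   (accuracy)\n"
)

scores = parse_validation_scores(sample_log)
print(scores)  # {'XGBoost_BAG_L2': 0.4578, 'WeightedEnsemble_L3': 0.4694}
```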

Prediction

Compare how each model performs on the test set

leaderboard = predictor.leaderboard(test_data)

                   model  score_test  score_val  pred_time_test  pred_time_val      fit_time  pred_time_test_marginal  pred_time_val_marginal  fit_time_marginal  stack_level  can_infer  fit_order
0         CatBoost_BAG_L2    0.440147     0.4520      311.027459      67.010538  17144.274338                 3.961333               13.129473        6844.922442            2       True         12
1   NeuralNetTorch_BAG_L2    0.439146     0.4576      313.881686      59.070805  10740.619594                 6.815560                5.189740         441.267699            2       True         14
2     WeightedEnsemble_L3    0.438146     0.4694      356.545686      84.355455  21175.077252                 0.013631                0.000960           0.931303            3       True         16
3       LightGBMXT_BAG_L2    0.435145     0.4622      331.060749      61.838714  11547.272638                23.994623                7.957649        1247.920743            2       True         11
4     WeightedEnsemble_L2    0.434478     0.4578      301.874017      52.586521  10151.541209                 0.006723                0.001295           1.415491            2       True          9
5         LightGBM_BAG_L2    0.431811     0.4596      328.935630      60.831540  11646.558565                21.869503                6.950476        1347.206670            2       True         10
6    LightGBMLarge_BAG_L2    0.431811     0.4582      345.884833      65.766117  14857.395032                38.818707               11.885052        4558.043137            2       True         15
7          XGBoost_BAG_L2    0.429810     0.4578      321.760539      58.077633  12640.035064                14.694413                4.196568        2340.683169            2       True         13
8    LightGBMLarge_BAG_L1    0.426475     0.4320        6.381093       1.489012    386.712939                 6.381093                1.489012         386.712939            1       True          6
9         LightGBM_BAG_L1    0.423474     0.4302        2.985550       0.803576    132.148328                 2.985550                0.803576         132.148328            1       True          1
10         XGBoost_BAG_L1    0.421140     0.4302        2.731979       0.632295    190.459603                 2.731979                0.632295         190.459603            1       True          4
11        CatBoost_BAG_L1    0.420140     0.4452        0.933754       1.721086   1114.353101                 0.933754                1.721086        1114.353101            1       True          3
12      LightGBMXT_BAG_L1    0.410470     0.4126        5.198832       1.295838    149.226177                 5.198832                1.295838         149.226177            1       True          2
13  NeuralNetTorch_BAG_L1    0.397799     0.4078        0.728582       0.579873     74.164125                 0.728582                0.579873          74.164125            1       True          5
14   TextPredictor_BAG_L1    0.370457     0.3634       96.319121      18.761692   3734.320718                96.319121               18.761692        3734.320718            1       True          7
15  ImagePredictor_BAG_L1    0.350784     0.3524      191.787215      28.597692   4517.966904               191.787215               28.597692        4517.966904            1       True          8
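
The leaderboard is easier to read once validation and test scores are put side by side. The sketch below is not part of the original run: it rebuilds a small DataFrame from four rows copied from the output above and adds a hypothetical column, `val_minus_test`, to show how optimistic each validation score was. With the real object, the same two lines of arithmetic should work directly on the `leaderboard` DataFrame.

```python
import pandas as pd

# Four rows copied by hand from the leaderboard output above.
rows = [
    ("CatBoost_BAG_L2", 0.440147, 0.4520, 2),
    ("WeightedEnsemble_L3", 0.438146, 0.4694, 3),
    ("LightGBM_BAG_L1", 0.423474, 0.4302, 1),
    ("TextPredictor_BAG_L1", 0.370457, 0.3634, 1),
]
lb = pd.DataFrame(rows, columns=["model", "score_test", "score_val", "stack_level"])

# Hypothetical helper column: a positive value means the validation
# accuracy was higher than the accuracy measured on the test set.
lb["val_minus_test"] = lb["score_val"] - lb["score_test"]

best_on_test = lb.loc[lb["score_test"].idxmax(), "model"]
best_on_val = lb.loc[lb["score_val"].idxmax(), "model"]

print(lb.sort_values("val_minus_test", ascending=False).to_string(index=False))
print("best on test:", best_on_test)
print("best on validation:", best_on_val)
```

In this subset the model AutoGluon picks as best (`WeightedEnsemble_L3`, chosen by validation score) is not the one with the highest test score (`CatBoost_BAG_L2`), and the stacked ensemble shows the largest gap between the two scores.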

Output the predicted values for the test set

y_pred = predictor.predict(test_data)
y_pred
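
Since `leaderboard(test_data)` reported test scores, the test data here must carry true labels, so the predictions can also be checked by hand. The following is a minimal sketch in plain Python, not the author's code: the function `accuracy` and the demo lists are made up for illustration. With the real objects you would pass something like `list(test_data[label])` and `list(y_pred)`, where `label` stands for the name of the target column (an assumption, since that name is not shown in this part of the post).

```python
from collections import Counter


def accuracy(y_true, y_pred):
    """Fraction of positions where the prediction equals the true label."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if len(y_true) == 0:
        raise ValueError("cannot score an empty prediction list")
    hits = sum(t == p for t, p in zip(y_true, y_pred))
    return hits / len(y_true)


# Hypothetical labels and predictions, for illustration only.
y_true_demo = [0, 1, 2, 2, 3, 4, 1, 0]
y_pred_demo = [0, 1, 2, 3, 3, 4, 2, 0]

demo_accuracy = accuracy(y_true_demo, y_pred_demo)
# Counting predicted classes shows whether the model collapses onto a few labels.
predicted_counts = Counter(y_pred_demo)

print(f"accuracy = {demo_accuracy:.3f}")
print("predicted class counts:", dict(predicted_counts))
```

Looking at the class counts next to the accuracy is useful for a multi-class target like this one, because a score in the 0.44 range can hide a model that rarely predicts some classes at all.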

Conclusion

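AutoGluon is a powerful AutoML package, but the price is compute: it needs more than 10 times what manual tuning does (the multimodal run above took 27364.79 s in total, about 7.6 hours). For tasks where you already have experience, tuning by hand is worth considering and gets you to deployment quickly.
The AutoGluon website has many more examples, such as text (NLP) prediction, object detection, and time series. I leave them out here for reasons of length and will update this post when I have time.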
