Dataset:
https://download.csdn.net/download/qq_37401291/87392009
# Import necessary libraries
import numpy as np
import pandas as pd
import seaborn as sns
from pylab import rcParams
import matplotlib.pyplot as plt
from matplotlib import rc
from sklearn.model_selection import train_test_split
from sklearn.metrics import confusion_matrix, classification_report
from collections import defaultdict
from textwrap import wrap
# Torch ML libraries
import transformers
from transformers import BertModel, BertTokenizer, AdamW, get_linear_schedule_with_warmup
import torch
from torch import nn, optim
from torch.utils.data import Dataset, DataLoader
# Misc.
import warnings
warnings.filterwarnings('ignore')
import datetime
# Record the current time before training starts
starttime = datetime.datetime.now()
# Set initial variables and constants
# % config InlineBackend.figure_format='retina'
# Graph Designs
sns.set(style='whitegrid', palette='muted', font_scale=1.2)
HAPPY_COLORS_PALETTE = ["#01BEFE", "#FFDD00", "#FF7D00", "#FF006D", "#ADFF02", "#8F00FF"]
sns.set_palette(sns.color_palette(HAPPY_COLORS_PALETTE))
rcParams['figure.figsize'] = 12, 8
# Random seed for reproducibility
RANDOM_SEED = 42
np.random.seed(RANDOM_SEED)
torch.manual_seed(RANDOM_SEED)
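NumPy and the CPU RNG are seeded above; when training on a GPU it is common to also seed CUDA (an optional addition, not in the original notebook):
torch.cuda.manual_seed_all(RANDOM_SEED)  # seed all CUDA devices for reproducibility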
# Set GPU
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
df = pd.read_csv('D:/2022/code/ai/nlp-learn/reviews.csv')
df.shape
(12495, 12)
df
| | reviewId | userName | userImage | content | score | thumbsUpCount | reviewCreatedVersion | at | replyContent | repliedAt | sortOrder | appId |
---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | gp:AOqpTOEhZuqSqqWnaKRgv-9ABYdajFUB0WugPGh-SG-... | Eric Tie | https://play-lh.googleusercontent.com/a-/AOh14... | I cannot open the app anymore | 1 | 0 | 5.4.0.6 | 2020-10-27 21:24:41 | NaN | NaN | newest | com.anydo |
1 | gp:AOqpTOH0WP4IQKBZ2LrdNmFy_YmpPCVrV3diEU9KGm3... | john alpha | https://play-lh.googleusercontent.com/a-/AOh14... | I have been begging for a refund from this app... | 1 | 0 | NaN | 2020-10-27 14:03:28 | Please note that from checking our records, yo... | 2020-10-27 15:05:52 | newest | com.anydo |
2 | gp:AOqpTOEMCkJB8Iq1p-r9dPwnSYadA5BkPWTf32Z1azu... | Sudhakar .S | https://play-lh.googleusercontent.com/a-/AOh14... | Very costly for the premium version (approx In... | 1 | 0 | NaN | 2020-10-27 08:18:40 | NaN | NaN | newest | com.anydo |
3 | gp:AOqpTOGFrUWuKGycpje8kszj3uwHN6tU_fd4gLVFy9z... | [email protected] DAVID S | https://play-lh.googleusercontent.com/-75aK0WF... | Used to keep me organized, but all the 2020 UP... | 1 | 0 | NaN | 2020-10-26 13:28:07 | What do you find troublesome about the update?... | 2020-10-26 14:58:29 | newest | com.anydo |
4 | gp:AOqpTOHls7DW8wmDFzTkHwxuqFkdNQtKHmO6Pt9jhZE... | Louann Stoker | https://play-lh.googleusercontent.com/-pBcY_Z-... | Dan Birthday Oct 28 | 1 | 0 | 5.6.0.7 | 2020-10-26 06:10:50 | NaN | NaN | newest | com.anydo |
... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... |
12490 | gp:AOqpTOEQPqib7pb6vFyjMY9JEfsMs_u8WCdqg6mbcar... | Mildred Olima | https://play-lh.googleusercontent.com/a-/AOh14... | I really like the planner, it helps me achieve... | 5 | 0 | 4.5.4 | 2018-12-21 00:13:09 | NaN | NaN | newest | com.appxy.planner |
12491 | gp:AOqpTOE1KKOOVVKUfhAfXQs2NfJpoywfucrJCMK3Hmu... | Roaring Grizzly Bear | https://play-lh.googleusercontent.com/a-/AOh14... | **** | 5 | 0 | NaN | 2018-12-12 21:52:56 | NaN | NaN | newest | com.appxy.planner |
12492 | gp:AOqpTOFEn5UgYYggqiHKauDJVLLN8-16nk1AfZbEhkj... | amirbadang | https://play-lh.googleusercontent.com/-CM2FcrU... | Very useful apps. You must try it | 5 | 0 | 4.5.4 | 2018-12-11 15:49:43 | NaN | NaN | newest | com.appxy.planner |
12493 | gp:AOqpTOHOH6YdYLR91qZdYpeIVkMI-LeAE0EwYgrctid... | Emma Stebbins | https://play-lh.googleusercontent.com/-oCj6g6k... | Would pay for this if there were even more add... | 5 | 0 | 4.5.4 | 2018-12-06 04:59:26 | NaN | NaN | newest | com.appxy.planner |
12494 | gp:AOqpTOFuJtS1McUdEZuLCnRn7k-UUcGNml7XqxKTSk2... | DAVOR SPASENOSKI | https://play-lh.googleusercontent.com/a-/AOh14... | Sooow good | 5 | 0 | 4.5.4 | 2018-11-26 01:19:13 | NaN | NaN | newest | com.appxy.planner |
12495 rows × 12 columns
df.head()
| | reviewId | userName | userImage | content | score | thumbsUpCount | reviewCreatedVersion | at | replyContent | repliedAt | sortOrder | appId |
---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | gp:AOqpTOEhZuqSqqWnaKRgv-9ABYdajFUB0WugPGh-SG-... | Eric Tie | https://play-lh.googleusercontent.com/a-/AOh14... | I cannot open the app anymore | 1 | 0 | 5.4.0.6 | 2020-10-27 21:24:41 | NaN | NaN | newest | com.anydo |
1 | gp:AOqpTOH0WP4IQKBZ2LrdNmFy_YmpPCVrV3diEU9KGm3... | john alpha | https://play-lh.googleusercontent.com/a-/AOh14... | I have been begging for a refund from this app... | 1 | 0 | NaN | 2020-10-27 14:03:28 | Please note that from checking our records, yo... | 2020-10-27 15:05:52 | newest | com.anydo |
2 | gp:AOqpTOEMCkJB8Iq1p-r9dPwnSYadA5BkPWTf32Z1azu... | Sudhakar .S | https://play-lh.googleusercontent.com/a-/AOh14... | Very costly for the premium version (approx In... | 1 | 0 | NaN | 2020-10-27 08:18:40 | NaN | NaN | newest | com.anydo |
3 | gp:AOqpTOGFrUWuKGycpje8kszj3uwHN6tU_fd4gLVFy9z... | [email protected] DAVID S | https://play-lh.googleusercontent.com/-75aK0WF... | Used to keep me organized, but all the 2020 UP... | 1 | 0 | NaN | 2020-10-26 13:28:07 | What do you find troublesome about the update?... | 2020-10-26 14:58:29 | newest | com.anydo |
4 | gp:AOqpTOHls7DW8wmDFzTkHwxuqFkdNQtKHmO6Pt9jhZE... | Louann Stoker | https://play-lh.googleusercontent.com/-pBcY_Z-... | Dan Birthday Oct 28 | 1 | 0 | 5.6.0.7 | 2020-10-26 06:10:50 | NaN | NaN | newest | com.anydo |
df.isnull().sum()
reviewId 0
userName 0
userImage 0
content 0
score 0
thumbsUpCount 0
reviewCreatedVersion 2162
at 0
replyContent 6677
repliedAt 6677
sortOrder 0
appId 0
dtype: int64
# # Let's have a look at the class balance.
# sns.countplot(df.score)
# # print(sns)
# plt.xlabel('review score')
# df.score
# ps =df.groupby('score')['score'].count()
# ps
# Function to convert score to sentiment
def to_sentiment(rating):
rating = int(rating)
# Convert to class
if rating <= 2:
return 0
elif rating == 3:
return 1
else:
return 2
# Apply to the dataset
df['sentiment'] = df.score.apply(to_sentiment)
# Plot the distribution
class_names = ['negative', 'neutral', 'positive']
print(df.sentiment)
# ax = sns.countplot(df.sentiment)
# plt.xlabel('review sentiment')
# ax.set_xticklabels(class_names)
0 0
1 0
2 0
3 0
4 0
..
12490 2
12491 2
12492 2
12493 2
12494 2
Name: sentiment, Length: 12495, dtype: int64
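Since the countplot calls are commented out above, a quick text-based check of the class balance can stand in for the plot (a minimal sketch, using the sentiment column created above):
# Count reviews per sentiment class (0 = negative, 1 = neutral, 2 = positive)
print(df.sentiment.value_counts().sort_index())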
# Set the model name
MODEL_NAME = 'bert-base-cased'
# Build a BERT based tokenizer
tokenizer = BertTokenizer.from_pretrained(MODEL_NAME)
# Some of the common BERT tokens
print(tokenizer.sep_token, tokenizer.sep_token_id) # marker for ending of a sentence
print(tokenizer.cls_token, tokenizer.cls_token_id) # added at the start of every sequence; its final hidden state is used for classification
print(tokenizer.pad_token, tokenizer.pad_token_id) # special token for padding
print(tokenizer.unk_token, tokenizer.unk_token_id) # placeholder for tokens not in the vocabulary
[SEP] 102
[CLS] 101
[PAD] 0
[UNK] 100
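To make the role of these special tokens concrete, here is a small illustrative example (not part of the original notebook) that encodes one review and decodes the ids back into tokens:
sample_txt = 'I cannot open the app anymore'
sample_encoding = tokenizer.encode_plus(
    sample_txt,
    max_length=12,
    truncation=True,
    add_special_tokens=True,   # adds [CLS] at the start and [SEP] at the end
    pad_to_max_length=True,    # pads with [PAD] up to max_length
    return_attention_mask=True,
    return_tensors='pt',
)
print(sample_encoding['input_ids'][0])
print(tokenizer.convert_ids_to_tokens(sample_encoding['input_ids'][0].tolist()))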
# Store length of each review
token_lens = []
# Iterate through the content column
for txt in df.content:
tokens = tokenizer.encode(txt, max_length=512)
token_lens.append(len(tokens))
Truncation was not explicitly activated but `max_length` is provided a specific value, please use `truncation=True` to explicitly truncate examples to max length. Defaulting to 'longest_first' truncation strategy. If you encode pairs of sequences (GLUE-style) with the tokenizer you can select this strategy more precisely by providing a specific strategy to `truncation`.
# plot the distribution of review lengths
sns.distplot(token_lens)
plt.xlim([0, 256])
plt.xlabel('Token count')
Text(0.5, 0, 'Token count')
MAX_LEN = 160
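The token-length distribution above suggests that 160 tokens covers the vast majority of reviews; a quick sanity check of that coverage (a small sketch, reusing the token_lens list collected above):
# Fraction of reviews whose tokenized length is at most MAX_LEN
print(f'{np.mean(np.array(token_lens) <= MAX_LEN):.1%} of reviews fit within {MAX_LEN} tokens')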
class GPReviewDataset(Dataset):
# Constructor Function
def __init__(self, reviews, targets, tokenizer, max_len):
self.reviews = reviews
self.targets = targets
self.tokenizer = tokenizer
self.max_len = max_len
# Length magic method
def __len__(self):
return len(self.reviews)
# get item magic method
def __getitem__(self, item):
review = str(self.reviews[item])
target = self.targets[item]
# Encoded format to be returned
encoding = self.tokenizer.encode_plus(
review,
add_special_tokens=True,
max_length=self.max_len,
return_token_type_ids=False,
pad_to_max_length=True,
return_attention_mask=True,
return_tensors='pt',
)
return {
'review_text': review,
'input_ids': encoding['input_ids'].flatten(),
'attention_mask': encoding['attention_mask'].flatten(),
'targets': torch.tensor(target, dtype=torch.long)
}
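Before wrapping the dataset in a DataLoader, a single item can be inspected as a sanity check (a minimal sketch, reusing df, tokenizer and MAX_LEN from above):
sample_ds = GPReviewDataset(
    reviews=df.content.to_numpy(),
    targets=df.sentiment.to_numpy(),
    tokenizer=tokenizer,
    max_len=MAX_LEN
)
item = sample_ds[0]
print(item['review_text'])
print(item['input_ids'].shape)       # expected: torch.Size([160])
print(item['attention_mask'].shape)  # expected: torch.Size([160])
print(item['targets'])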
df_train, df_test = train_test_split(df, test_size=0.2, random_state=42)
df_val, df_test = train_test_split(df_test, test_size=0.5, random_state=42)
print(df_train.shape, df_val.shape, df_test.shape)
(9996, 13) (1249, 13) (1250, 13)
def create_data_loader(df, tokenizer, max_len, batch_size):
ds = GPReviewDataset(
reviews=df.content.to_numpy(),
targets=df.sentiment.to_numpy(),
tokenizer=tokenizer,
max_len=max_len
)
return DataLoader(
ds,
batch_size=batch_size,
num_workers=0
)
# Create train, test and val data loaders
BATCH_SIZE = 32
train_data_loader = create_data_loader(df_train, tokenizer, MAX_LEN, BATCH_SIZE)
val_data_loader = create_data_loader(df_val, tokenizer, MAX_LEN, BATCH_SIZE)
test_data_loader = create_data_loader(df_test, tokenizer, MAX_LEN, BATCH_SIZE)
# Examples
data = next(iter(train_data_loader))
print(data.keys())
dict_keys(['review_text', 'input_ids', 'attention_mask', 'targets'])
print(data['input_ids'].shape)
print(data['attention_mask'].shape)
print(data['targets'].shape)
torch.Size([32, 160])
torch.Size([32, 160])
torch.Size([32])
# Load the basic BERT model
bert_model = BertModel.from_pretrained(MODEL_NAME)
Some weights of the model checkpoint at bert-base-cased were not used when initializing BertModel: ['cls.seq_relationship.bias', 'cls.predictions.decoder.weight', 'cls.predictions.transform.dense.weight', 'cls.predictions.bias', 'cls.seq_relationship.weight', 'cls.predictions.transform.dense.bias', 'cls.predictions.transform.LayerNorm.bias', 'cls.predictions.transform.LayerNorm.weight']
- This IS expected if you are initializing BertModel from the checkpoint of a model trained on another task or with another architecture (e.g. initializing a BertForSequenceClassification model from a BertForPreTraining model).
- This IS NOT expected if you are initializing BertModel from the checkpoint of a model that you expect to be exactly identical (initializing a BertForSequenceClassification model from a BertForSequenceClassification model).
# Build the Sentiment Classifier class
class SentimentClassifier(nn.Module):
# Constructor class
def __init__(self, n_classes):
super(SentimentClassifier, self).__init__()
self.bert = BertModel.from_pretrained(MODEL_NAME)
self.drop = nn.Dropout(p=0.3)
self.out = nn.Linear(self.bert.config.hidden_size, n_classes)
# Forward propagation method
def forward(self, input_ids, attention_mask):
_, pooled_output = self.bert(
input_ids=input_ids,
attention_mask=attention_mask,
return_dict=False
)
# Add a dropout layer
output = self.drop(pooled_output)
return self.out(output)
# Instantiate the classifier and move it to the device (GPU if available)
model = SentimentClassifier(len(class_names))
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
model = model.to(device)
Some weights of the model checkpoint at bert-base-cased were not used when initializing BertModel: ['cls.seq_relationship.bias', 'cls.predictions.decoder.weight', 'cls.predictions.transform.dense.weight', 'cls.predictions.bias', 'cls.seq_relationship.weight', 'cls.predictions.transform.dense.bias', 'cls.predictions.transform.LayerNorm.bias', 'cls.predictions.transform.LayerNorm.weight']
- This IS expected if you are initializing BertModel from the checkpoint of a model trained on another task or with another architecture (e.g. initializing a BertForSequenceClassification model from a BertForPreTraining model).
- This IS NOT expected if you are initializing BertModel from the checkpoint of a model that you expect to be exactly identical (initializing a BertForSequenceClassification model from a BertForSequenceClassification model).
# Number of hidden units
print(bert_model.config.hidden_size)
768
# Number of training epochs
EPOCHS = 10
# AdamW optimizer (from transformers)
optimizer = AdamW(model.parameters(), lr=2e-5, correct_bias=False)
total_steps = len(train_data_loader) * EPOCHS
scheduler = get_linear_schedule_with_warmup(
optimizer,
num_warmup_steps=0,
num_training_steps=total_steps
)
# Set the loss function
loss_fn = nn.CrossEntropyLoss().to(device)
# # Function for a single training iteration (original full-precision FP32 version, kept for reference)
# def train_epoch(model, data_loader, loss_fn, optimizer, device, scheduler, n_examples):
# model = model.train()
# losses = []
# correct_predictions = 0
#
# for d in data_loader:
# input_ids = d["input_ids"].to(device)
# attention_mask = d["attention_mask"].to(device)
# targets = d["targets"].to(device)
#
# outputs = model(
# input_ids=input_ids,
# attention_mask=attention_mask
# )
#
# _, preds = torch.max(outputs, dim=1)
# loss = loss_fn(outputs, targets)
# correct_predictions += torch.sum(preds == targets)
# losses.append(loss.item())
#
# # Backward prop
# loss.backward()
#
# # Gradient Descent
# nn.utils.clip_grad_norm_(model.parameters(), max_norm=1.0)
# optimizer.step()
# scheduler.step()
# optimizer.zero_grad()
#
# return correct_predictions.double() / n_examples, np.mean(losses)
from torch.cuda.amp import GradScaler, autocast
scaler = GradScaler()
def train_epoch(model, data_loader, loss_fn, optimizer, device, scheduler, n_examples):
    model = model.train()
    losses = []
    correct_predictions = 0
    for d in data_loader:
        input_ids = d["input_ids"].to(device)
        attention_mask = d["attention_mask"].to(device)
        targets = d["targets"].to(device)
        optimizer.zero_grad()
        # Run the forward pass and loss computation in mixed precision
        with autocast():
            outputs = model(
                input_ids=input_ids,
                attention_mask=attention_mask
            )
            _, preds = torch.max(outputs, dim=1)
            loss = loss_fn(outputs, targets)
        correct_predictions += torch.sum(preds == targets)
        losses.append(loss.item())
        # Scales the loss and calls backward() on the scaled loss to create scaled gradients.
        # Backward passes under autocast are not recommended; backward ops run in the same
        # dtype autocast chose for the corresponding forward ops.
        scaler.scale(loss).backward()
        # Unscale the gradients before clipping so max_norm applies to the true values.
        scaler.unscale_(optimizer)
        nn.utils.clip_grad_norm_(model.parameters(), max_norm=1.0)
        # scaler.step() calls optimizer.step() unless the gradients contain infs or NaNs.
        scaler.step(optimizer)
        # Update the scale factor for the next iteration, then advance the LR scheduler.
        scaler.update()
        scheduler.step()
    return correct_predictions.double() / n_examples, np.mean(losses)
def eval_model(model, data_loader, loss_fn, device, n_examples):
model = model.eval()
losses = []
correct_predictions = 0
with torch.no_grad():
for d in data_loader:
input_ids = d["input_ids"].to(device)
attention_mask = d["attention_mask"].to(device)
targets = d["targets"].to(device)
# Get model outputs
outputs = model(
input_ids=input_ids,
attention_mask=attention_mask
)
_, preds = torch.max(outputs, dim=1)
loss = loss_fn(outputs, targets)
correct_predictions += torch.sum(preds == targets)
losses.append(loss.item())
return correct_predictions.double() / n_examples, np.mean(losses)
history = defaultdict(list)
best_accuracy = 0
for epoch in range(EPOCHS):
# Show details
print(f"Epoch {epoch + 1}/{EPOCHS}")
print("-" * 10)
train_acc, train_loss = train_epoch(
model,
train_data_loader,
loss_fn,
optimizer,
device,
scheduler,
len(df_train)
)
print(f"Train loss {train_loss} accuracy {train_acc}")
# Get model performance (accuracy and loss)
val_acc, val_loss = eval_model(
model,
val_data_loader,
loss_fn,
device,
len(df_val)
)
print(f"Val loss {val_loss} accuracy {val_acc}")
print()
history['train_acc'].append(train_acc)
history['train_loss'].append(train_loss)
history['val_acc'].append(val_acc)
history['val_loss'].append(val_loss)
# If we beat prev performance
if val_acc > best_accuracy:
torch.save(model.state_dict(), 'best_model_state.bin')
best_accuracy = val_acc
# Record the end time and print the elapsed training time in seconds
endtime = datetime.datetime.now()
print((endtime - starttime).seconds)
Epoch 1/10
----------
Train loss 0.6848097632106501 accuracy 0.7208883553421369
Val loss 0.5925843127071857 accuracy 0.7574059247397917
Epoch 2/10
----------
Train loss 0.48870645539638713 accuracy 0.8079231692677071
Val loss 0.6062256038188935 accuracy 0.7493995196156925
Epoch 3/10
----------
Train loss 0.34348880144925165 accuracy 0.8686474589835935
Val loss 0.6998532168567181 accuracy 0.743795036028823
Epoch 4/10
----------
Train loss 0.2768642743127034 accuracy 0.8961584633853542
Val loss 0.7555158618837595 accuracy 0.7445956765412329
Epoch 5/10
----------
Train loss 0.19659621281602893 accuracy 0.9308723489395759
Val loss 0.8499629437923432 accuracy 0.7141713370696556
Epoch 6/10
----------
Train loss 0.13516816481674154 accuracy 0.9560824329731893
Val loss 1.0227949187159537 accuracy 0.7101681345076061
Epoch 7/10
----------
Train loss 0.10121473114336499 accuracy 0.9680872348939576
Val loss 1.142523455619812 accuracy 0.7341873498799039
Epoch 8/10
----------
Train loss 0.08803147610244207 accuracy 0.9714885954381753
Val loss 1.170446154475212 accuracy 0.7421937550040032
Epoch 9/10
----------
Train loss 0.07703767744705271 accuracy 0.9755902360944378
Val loss 1.1894072636961937 accuracy 0.7389911929543634
Epoch 10/10
----------
Train loss 0.0677171527135064 accuracy 0.9771908763505402
Val loss 1.3171145111322402 accuracy 0.7453963170536428
716
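The accuracy values stored in history can also be plotted to visualize how the model starts overfitting after the first couple of epochs (a sketch, not part of the original run; the accuracies are 0-dim tensors, so they are converted to plain floats first):
train_acc_hist = [float(a) for a in history['train_acc']]
val_acc_hist = [float(a) for a in history['val_acc']]
plt.plot(train_acc_hist, label='train accuracy')
plt.plot(val_acc_hist, label='validation accuracy')
plt.title('Training history')
plt.ylabel('Accuracy')
plt.xlabel('Epoch')
plt.legend()
plt.ylim([0, 1])
plt.show()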
# Load the best saved model weights
model.load_state_dict(torch.load('best_model_state.bin'))
test_acc, _ = eval_model(
model,
test_data_loader,
loss_fn,
device,
len(df_test)
)
test_acc.item()
def get_predictions(model, data_loader):
model = model.eval()
review_texts = []
predictions = []
prediction_probs = []
real_values = []
with torch.no_grad():
for d in data_loader:
texts = d["review_text"]
input_ids = d["input_ids"].to(device)
attention_mask = d["attention_mask"].to(device)
targets = d["targets"].to(device)
# Get model outputs
outputs = model(
input_ids=input_ids,
attention_mask=attention_mask
)
_, preds = torch.max(outputs, dim=1)
review_texts.extend(texts)
predictions.extend(preds)
prediction_probs.extend(torch.softmax(outputs, dim=1))  # convert logits to probabilities
real_values.extend(targets)
predictions = torch.stack(predictions).cpu()
prediction_probs = torch.stack(prediction_probs).cpu()
real_values = torch.stack(real_values).cpu()
return review_texts, predictions, prediction_probs, real_values
y_review_texts, y_pred, y_pred_probs, y_test = get_predictions(
model,
test_data_loader
)
print(classification_report(y_test, y_pred, target_names=class_names))
def show_confusion_matrix(confusion_matrix):
hmap = sns.heatmap(confusion_matrix, annot=True, fmt="d", cmap="Blues")
hmap.yaxis.set_ticklabels(hmap.yaxis.get_ticklabels(), rotation=0, ha='right')
hmap.xaxis.set_ticklabels(hmap.xaxis.get_ticklabels(), rotation=30, ha='right')
plt.ylabel('True sentiment')
plt.xlabel('Predicted sentiment');
cm = confusion_matrix(y_test, y_pred)
df_cm = pd.DataFrame(cm, index=class_names, columns=class_names)
show_confusion_matrix(df_cm)
review_text = "I love completing my todos! Best app ever!!!"
encoded_review = tokenizer.encode_plus(
review_text,
max_length=MAX_LEN,
add_special_tokens=True,
return_token_type_ids=False,
pad_to_max_length=True,
return_attention_mask=True,
return_tensors='pt',
)
input_ids = encoded_review['input_ids'].to(device)
attention_mask = encoded_review['attention_mask'].to(device)
output = model(input_ids, attention_mask)
_, prediction = torch.max(output, dim=1)
print(f'Review text: {review_text}')
print(f'Sentiment : {class_names[prediction]}')
              precision    recall  f1-score   support

    negative       0.72      0.90      0.80       480
     neutral       0.70      0.03      0.06       216
    positive       0.78      0.90      0.84       554

    accuracy                           0.75      1250
   macro avg       0.73      0.61      0.57      1250
weighted avg       0.74      0.75      0.69      1250
Review text: I love completing my todos! Best app ever!!!
Sentiment : positive
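For reuse, the single-review inference above can be wrapped into a small helper that also reports a softmax confidence (a hedged sketch reusing tokenizer, model, device, MAX_LEN and class_names from above; predict_sentiment is a hypothetical helper name, not part of the original code):
def predict_sentiment(text):
    # Encode a single raw review the same way the training data was encoded
    encoded = tokenizer.encode_plus(
        text,
        max_length=MAX_LEN,
        truncation=True,
        add_special_tokens=True,
        return_token_type_ids=False,
        pad_to_max_length=True,
        return_attention_mask=True,
        return_tensors='pt',
    )
    with torch.no_grad():
        logits = model(
            encoded['input_ids'].to(device),
            encoded['attention_mask'].to(device)
        )
        probs = torch.softmax(logits, dim=1)
    confidence, prediction = torch.max(probs, dim=1)
    return class_names[prediction], confidence.item()

print(predict_sentiment("The new update keeps crashing on my phone"))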