weixin_38616018

deeplabv3 + mobilenetv2 做语义分割并封装成c++部署到移动端，linux，windows等平台(史上最详细)

训练

Deeplab项目安装以及测试

首先为了确保版本支持，先得确认你的tensorflow的版本是1.10以上的。我的linux系统上装的是1.14的tensorflow，因为我一直用的这个版本。

克隆deeplab项目

git clone https://gitee.com/yujiahao123/models2

这是我为了速度fork的deeplab，因为tensorflow的github clone实在太慢了，老是中断，特别难受，当开始我还不知道有码云这种神器，让同事Roland兄弟帮我从加拿大下载然后发邮件给我才拿到，哈哈。

添加项目依赖路径

sudo gedit ~/.bashr

在这个文件最后加上一句

export
PYTHONPATH=/home/william/models/research/slim:/home/william/models/research:$PYTHONPATH

这边把路径改成你自己的
然后激活一下环境

source ~/.bashrc

测试deeplab

cd /home/william/models/research/deeplab

执行

python model_test.py

如果最后输出了OK就说明安装成功了

数据集处理

我的数据是同事Roland兄弟给我的(顺便一提加拿大到中国的文件传输实在太恶心了,下载几个小时百分之八十的时候下载失败伤不起啊)，但是要做处理才能使用，于是我写了一些代码来处理数据，来保证和官方提供的pascal voc格式一致

将数据转换成TFRecord

创建 tfrecord

mkdir tfrecord

将上述制作的数据集打包成TFRecord，使用的是build_voc2012_data.py 在目录/home/bai/models/research/deeplab/datasets下执行

python build_voc2012_data.py 
--image_folder="/home/william/dataset/yoho/Images" --semantic_segmentation_folder="/home/william/dataset/yoho/masks" --list_folder="/home/william/dataset/yoho/index" 
--image_format="jpg" 
--output_dir="/home/william/dataset/yoho/tfrecord"

image_folder ：数据集中原输入数据的文件目录地址
semantic_segmentation_folder：数据集中标签的文件目录地址
list_folder : 将数据集分类成训练集、验证集等的指引目录文件目录
image_format : 输入图片数据的格式，CamVid的是png格式
output_dir：制作的TFRecord存放的目录地址(自己创建)

网络训练

在datasets/data_generator.py文件中，添加camvid数据集描述：

_YOHO_INFORMATION = DatasetDescriptor(
    splits_to_sizes={
     
        'train': 256,  # num of samples in images/training
        'val': 49,  # num of samples in images/validation
    },
    num_classes=3,
    ignore_label=255,
)

因为yoho共有3个classes

注册数据集

同时在datasets/data_generator.py文件，添加对应数据集的名称：

_DATASETS_INFORMATION = {
         
    'cityscapes': _CITYSCAPES_INFORMATION,    
    'pascal_voc_seg': _PASCAL_VOC_SEG_INFORMATION,    
    'ade20k': _ADE20K_INFORMATION,
    'yoho':_YOHO_INFORMATION, #自己的数据集 
    }

修改代码

因为是在DeepLab的基础上ﬁne-tune我们自己的数据集，所以需要修改一些代码

修改train.py

其中有一些选项：

使用预训练的所有权重，设置initialize_last_layer=True
只使用网络的backbone，设置initialize_last_layer=False和 last_layers_contain_logits_only=False
使用所有的预训练权重，除了logits以外。因为如果是自己的数据集，对应的classes不同(这个我们前面已经设置不加载logits), 可设置initialize_last_layer=False和 last_layers_contain_logits_only=True

修改train_utils.py

对应的utils/train_utils.py中，将关于 exclude_list 的设置修改，作用是在使用预训练权重时候，不加载该 logit 层：

exclude_list = ['global_step','logits'] 
if not initialize_last_layer:
 exclude_list.extend(last_layers)

下载预训练权重

因为我们的数据比较少，除了数据增强之外，使用别人训练好的模型做fineturn是一个好的选择
在model_zoo上下载预训练模型：
这里因为我打算将模型部署到移动端，所以我选择了专门给移动端设计的mobilenetv2
下载地址：https://github.com/tensorflow/models/blob/master/research/deeplab/g3doc/model_zoo.md
我用的是deeplabv3_mnv2_cityscapes_train这个预训练模型

训练

我们现在基本需要的都准备好了，那就可以开始炼丹啦！
训练我们需要运行deeplab下的train.py这个脚本

python train.py
--logtostderr
--train_logdir=/home/william/model/models-master/models-master/research/deeplab/exp/yoho/train3
--dataset_dir=/home/william/dataset/yoho/tfrecord
--training_number_of_steps=100
--train_split="train"
--model_variant="mobilenet_v2"
--output_stride=16
--base_learning_rate=3e-5
--train_crop_size=513,513
--train_batch_size=2
--dataset="yoho"
--tf_initial_checkpoint=/home/william/model/models-master/models-master/research/deeplab/exp/yoho/train2/model.ckpt-520

测试结果可视化

训练我们需要运行deeplab下的vis.py这个脚本

python vis.py
--vis_split="val"
--model_variant="mobilenet_v2"
--vis_crop_size=3000,2000
--output_stride=16
--checkpoint_dir="/home/william/model/models-master/models-master/research/deeplab/exp/yoho/train3"
--dataset="yoho"
--colormap_type="pascal"
--vis_logdir="/home/william/model/models-master/models-master/research/deeplab/exp/yoho/vis"
--dataset_dir="/home/william/dataset/yoho/tfrecord"

跑完了之后我们再/home/william/model/models-master/models-master/research/deeplab/exp/yoho/vis这个目录下就能看一下我们的模型的效果

大致看一下效果还可以

性能评估

评估我们需要运行deeplab下的vis.py这个脚本

python vis.py
--eval_split="val"
--model_variant="mobilenet_v2"
--eval_crop_size=3000,2000
--output_stride=16
--dataset="yoho"
--checkpoint_dir="/home/william/model/models-master/models-master/research/deeplab/exp/yoho/train3"
--eval_logdir="/home/william/model/models-master/models-master/research/deeplab/exp/yoho/eval"
--dataset_dir="/home/william/dataset/yoho/tfrecord"
--max_number_of_evaluations=1

最后会输出一个mloU值，这个值越高表示效果越好，我跑了一下mloU值能达到85左右还行

模型输出

模型输出我们需要运行deeplab下的export_model.py这个脚本

python export_model.py
--logtostderr
--checkpoint_path="/home/william/model/models-master/models-master/research/deeplab/exp/yoho/train3/model.ckpt-100"
--export_path="/home/william/model/models-master/models-master/research/deeplab/exp/yoho/save/frozen_inference_graph.pb"
--model_variant="mobilenet_v2"
--num_classes=3

这个脚本会把tensor的变量转成常量，生成传说中的冻结图，就是frozen_inference_graph.pb这玩意

部署

我们有了frozen_inference_graph.pb模型文件之后我们就可以将我们的语义分割网络部署在各个平台了，接下来我会从安卓，linux，windos等各个平台来说怎么把一个论文提出的东西变成一个实实在在的产品

移动端

模型转换

移动端tensorflow的官网提供了适合手机的轻量封装库tensorflow lite，要使用这个tensorflow lite必须先将模型从.pb转换为.tfrecord,对于模型转换，tensorflow官网提供了一个工具。

模型转换的史诗级大坑

转换模型的时候有一个大坑，让我踩了快两礼拜，伤不起啊。
刚开始我模型是这样直接转换的

tflite_convert \
  --output_file=test.lite \
  --graph_def_file=frozen_inference_graph.pb \
  --input_arrays=ImageTensor \
  --output_arrays=SemanticPredictions \
  --input_shapes=1,3000,2000,3 \
  --inference_input_type=QUANTIZED_UINT8 \
  --inference_type=FLOAT \
  --mean_values=128 \
  --std_dev_values=128

先不说转换模型能不能转换成功，就算转换成功了，真正部署到各种平台的时候会报错
比如安卓端

这个问题卡了我半天，问同事，问主管，问tensorflow作者都没有解决我的问题
这个是我提的issue
https://github.com/tensorflow/tensorflow/issues/42622
找了很长时间都没有相关的资料
这边卡了很久
靠人不如靠自己
最后解决方法无意间看到了这个issue才解决了我的问题
https://github.com/tensorflow/tensorflow/issues/23747
主要原因是因为tensorflow的前操作和后处理tensorflow lite并不支持
所以如果直接转换模型的话会出现奇奇怪怪的错误
这个issue上是这么转换，sub_2是第二层而ResizeBilinear_2是倒数第二层，跳过了前处理和后处理
然后自己实现了前处理uint转float32和后处理argmax函数，问题解决

tflite_convert \
    --output_file=./deeplabv3_513.tflite \
    --graph_def_file=frozen_inference_graph.pb \
    --input_arrays=sub_2 \
    --output_arrays=ResizeBilinear_2 \
    --input_shapes=1,3000,2000,3 \
    --inference_type=FLOAT

最终转换方式

最后写一下我的转换方式,我是先给输入输出添加了一些signature，然后存成了SaveModel格式，其实和这个冻结图是一样的，不过带上了一些输入输出的信息

def export_saved_model(sess, input, output):
    output_path = 'seg/'
    print('Exporting trained model to ', output_path)
    builder = tf.saved_model.builder.SavedModelBuilder(output_path)
    input_tensor_info = tf.saved_model.utils.build_tensor_info(input)
    tensor_info_input = {
     'images': input_tensor_info}
    output_tensor_info = tf.saved_model.utils.build_tensor_info(output)
    tensor_info_outputs = {
     'output': output_tensor_info}

    preditction_signature = (
        tf.saved_model.signature_def_utils.build_signature_def(
            inputs=tensor_info_input,
            outputs=tensor_info_outputs,
            method_name=tf.saved_model.signature_constants.PREDICT_METHOD_NAME
        ))
    legcy_init_op = tf.group(tf.tables_initializer(), name='legacy_init_op')

    builder.add_meta_graph_and_variables(
        sess, [tf.saved_model.tag_constants.SERVING],
        signature_def_map={
     
            'predict_images': preditction_signature
        })
    builder.save()
    print('Successfully export model to %s' % output_path)

然后再转成tensroflow lite可以使用的tfcord模型

tflite_convert \
    --output_file=./deeplabv3_513.tflite \
    --saved_model_dir==seg \
    --input_arrays=sub_2 \
    --output_arrays=ResizeBilinear_2 \
    --input_shapes=1,3000,2000,3 \
    --inference_type=FLOAT

量化(可选)

转成SaveModel之后转成tfcord模型之前可以做量化，可以使你的模型更小，跑的速度更快
量化代码

import tensorflow as tf

converter = tf.lite.TFLiteConverter.from_saved_model(saved_model_dir)
converter.optimizations = [tf.lite.Optimize.OPTIMIZE_FOR_SIZE]
tflite_quant_model = converter.convert()
open("converted_model.tflite", "wb").write(tflite_quantized_model)

编译tensorflow lite

首先我们得先把tensorflow的项目下载下来，为了速度我们还是先把项目fork到码云上

git clone https://gitee.com/yujiahao123/tensorflow

这样就能较快的拿到tensorflow的源码
然后先下载tensorflow lite的依赖
这里有一些依赖是国外的资源所以可能得挂VPN有可能下载不了
运行tensorflow/tensorflow/lite/tools/make路径下的download_dependencies.sh脚本，这个脚本会把lite所需要的依赖下载到make下的download文件夹里
然后我们再下载bazel，这是一个类似于CMake一样的编译工具，我们需要这个东西来编译我们的tensorflow lite的库
我们运行下面这行脚本可以得到tensorflow lite的.so动态库
32bit armeabi-v7a:

bazel build -c opt --config=android_arm //tensorflow/lite:libtensorflowlite.so

64bit arm64-v8a:

bazel build -c opt --config=android_arm64 //tensorflow/lite:libtensorflowlite.so

有了这个动态库我们就可以快乐的使用他的接口啦！
我们需要将他的头文件添加进来，下面是官方的说明
Currently, there is no straightforward way to extract all header files needed,
so you must include all header files in tensorflow/lite/ from the TensorFlow
repository. Additionally, you will need header files from
FlatBuffers and
Abseil.
由此可见，我们需要tensorflow\lite还有FlatBuffers和Abseil的头文件
也就是我们之前download的目录下的downloads\flatbuffers\include和downloads\absl这两个目录

安卓端c++代码

#include 
#include 
#include
#include 
#include "tensorflow/lite/model.h"
#include "tensorflow/lite/kernels/register.h"
#include "opencv2/opencv.hpp"
using namespace std;
using namespace cv;

string jstringTostring(JNIEnv* env, jstring jstr) {
     
    char *rtn = NULL;
    jclass clsstring = env->FindClass("java/lang/String");
    jstring strencode = env->NewStringUTF("GB2312");
    jmethodID mid = env->GetMethodID(clsstring, "getBytes", "(Ljava/lang/String;)[B");
    jbyteArray barr = (jbyteArray) env->CallObjectMethod(jstr, mid, strencode);
    jsize alen = env->GetArrayLength(barr);
    jbyte *ba = env->GetByteArrayElements(barr, JNI_FALSE);
    if (alen > 0) {
     
        rtn = (char *) malloc(alen + 1);
        memcpy(rtn, ba, alen);
        rtn[alen] = 0;
    }
    env->ReleaseByteArrayElements(barr, ba, 0);
    string stemp(rtn);
    free(rtn);
    return stemp;
}

//后处理找到softmax之后的最大概率值
static int Argmax(float* array, int size) {
     
    float max_value = -10000;
    int max_index = 0;
    for (int32_t i = 0; i < size; i++) {
     
        if (array[i] > max_value) {
     
            max_value = array[i];
            max_index = i;
        }
    }
    return max_index;
}

Mat RunInference(string picname){
     
    unique_ptr<tflite::FlatBufferModel> model;
    unique_ptr<tflite::Interpreter> interpreter;
    model = tflite::FlatBufferModel::BuildFromFile("/sdcard/model/deeplabv3_513.tflite");//我们转换好的模型，我这边直接拷贝到手机里面了，你们可以放到asset文件夹下面
    model->error_reporter();

    tflite::ops::builtin::BuiltinOpResolver resolver;
    tflite::InterpreterBuilder(*model,resolver)(& interpreter);

    interpreter->AllocateTensors();

    __android_log_print(ANDROID_LOG_INFO, "mydebug", "Success\n");


    int input = interpreter->inputs()[0];
    TfLiteIntArray* dims = interpreter->tensor(input)->dims;

    int height = dims->data[1];
    int width = dims->data[2];
    int channels = dims->data[3];

    Mat img = imread(picname);

    //__android_log_print(ANDROID_LOG_INFO, "mydebug", "height %d",img.rows);
    auto img_inputs = interpreter->typed_tensor<float>(input);
    //前处理，加赋值给tensor
    for(int i = 0;i<img.cols*img.rows*3;i++){
     
        img_inputs[i] = (img.data[i]- 128.0)/128.0;
    }
    interpreter->Invoke();
    int output = interpreter->outputs()[0];
    TfLiteIntArray* output_dims = interpreter->tensor(output)->dims;

    for(int i = 0;i<4;i++){
     
        cout<<output_dims->data[i]<<endl;
    }
    float* outputsoftmax = interpreter->typed_output_tensor<float>(0);
    int* outputlabel = new int[3000*2000];
    for(int i = 0;i<height*width;i++){
     
        outputlabel[i] = Argmax(outputsoftmax+3*i,3);
    }

    Mat mat = cv::Mat(3000, 2000, CV_8UC1);
    for (int i = 0; i < mat.rows; i++)
    {
     
        for (int j = 0; j < mat.cols; j++)
        {
     
            mat.at<uchar>(i, j) = (uchar)1.0*outputlabel[b] * 100;
        }
    }
    delete[] outputlabel;
    Mat im_color;
    applyColorMap(mat, im_color, cv::COLORMAP_JET); //采用colormap
    resize(im_color,im_color,cv::Size(320,480));
    return im_color;
}

extern "C" JNIEXPORT jintArray JNICALL
Java_com_example_myapplication_MainActivity_decodeFile(
        JNIEnv* env,
        jobject /* this */,jstring picname) {
     
    string picnam = jstringTostring(env,picname);
    Mat im_color = RunInference(picnam);
    int size = im_color.cols * im_color.rows *4;

    jbyte * outImage = new jbyte[size];

    __android_log_print(ANDROID_LOG_INFO, "mydebug", "h: %d",im_color.rows);
    cvtColor(im_color,im_color,CV_RGB2BGRA);
    jintArray result  = env->NewIntArray(im_color.cols * im_color.rows);
    env->SetIntArrayRegion(result,0,im_color.cols * im_color.rows,(jint *)im_color.data);
    return result;
}

安卓java代码

protected void onCreate(Bundle savedInstanceState) {
     
        super.onCreate(savedInstanceState);
        verifyStoragePermissions(this);
        // Example of a call to a native method
        setContentView(R.layout.activity_main);
        int[] test = decodeFile("/sdcard/pic/000014_image.png");//我们上面写好的的c++函数，通过JNI调用接口
        Bitmap result = Bitmap.createBitmap(320,480, Bitmap.Config.RGB_565);
        result.setPixels(test, 0, 320, 0, 0,320, 480);
        ImageView imageView = (ImageView) findViewById(R.id.imt_photo);
        imageView.setImageBitmap(result);//显示返回结果
    }

好了我们可以看一下效果
这是原图

这个是安卓端的效果图

部署到IOS端

需要MAC环境，在装虚拟机中，未完待续

桌面/PC

linux和windows通用的python

直接贴代码

import tflite_runtime.interpreter as tflite
import tensorflow as tf
import numpy as np
import matplotlib.pyplot as plt # plt 用于显示图片
import cv2 as cv
interpreter = tf.lite.Interpreter(model_path='/home/william/CLionProjects/DEMO/model/deeplabv3_513.tflite')
interpreter.allocate_tensors()
# Get input and output tensors.
input_details = interpreter.get_input_details()
output_details = interpreter.get_output_details()
testpic = cv.imread("000014_image.png")
testpic = (1.0*testpic-128)/128
testpic = testpic.reshape(1,3000,2000,3)
testpic = testpic.astype(np.float32)
interpreter.set_tensor(input_details[0]['index'], testpic)
interpreter.invoke()
# The function `get_tensor()` returns a copy of the tensor data.
# Use `tensor()` in order to get a pointer to the tensor.
output_data = interpreter.get_tensor(output_details[0]['index'])
test=output_data.argmax(axis=3)
test = test.reshape(3000,2000)
plt.imshow(test) # 显示图片
plt.axis('off') # 不显示坐标轴
plt.show()

效果

python可以用来实验模型效果，但是实际在工程中使用的时候为了效率还是得用c/c++

部署到linux端

部署到linux比部署到移动端更简单，运行了download_dependencies.sh这个脚本之后，再运行make目录下的build_lib.sh就可以生成再linux上可以运行的.so动态库了
linux端代码和安卓端代码基本上一模一样，安卓端多了一层JNI，linux端不用JNI是纯c++的，代码我就不放出来了，有需要的私聊
这是linux端的效果

部署到windows端

使用Tensorflow lite windows端c++接口是很麻烦的，因为我找了好几天没有找到相关的资料
部署到windows端我并没有使用tensorflow lite，因为刚开始的时候没有尝试移动端和linux端的编译，而是使用的windows作为实验，但是tensorflow官网里面也没有任何tensorflow lite在windows端源码如何编译的资料
虽然windows端的python是可以跑通的，但是如何用c++来调用python又是另一个问题了，而且tensorflow lite的代码本来就是c/c++的，用c++调用python再调用c++这样也有问题，无奈之下，我开始尝试直接使用tensorflow库，这边直接贴代码
在tensorflow/cc的文件夹下面新建一个myinteface的文件夹，然后封装代码和外层接口代码都放在里面
这是tensorflow封装层接口

//这个是tensorflow的推理封装库
#include "direct.h"
#include "tf_inference_lib.h"

TfInferenceLib::TfInferenceLib(model_params_t* model_params,
    tensor_array_t* input_tensors,
    tensor_array_t* output_tensors)
    : m_input_tensors(input_tensors)
    , m_output_tensors(output_tensors)
{
     
    if (NULL == model_params) {
     
        MY_ERROR("model params is null \n");
    }
    m_model_params = new model_params_t;
    if (NULL == m_model_params) {
     
        MY_ERROR("Alloc local model params error!!!\n");
    }
    memcpy(m_model_params, model_params, sizeof(model_params_t));
}

TfInferenceLib::~TfInferenceLib()
{
     
    delete m_model_params;
}

std::string TfInferenceLib::getCurrentModelDir()
{
     
    char buffer[256];
    getcwd(buffer, 256);
    std::string strDir = buffer;
    std::cout << "Currrent dir is " << strDir << std::endl;
    return strDir;
}

void TfInferenceLib::TensorDebugInfo(std::string tensor_name, tensorflow::Tensor& tensor)
{
     
    std::cout << "tensor name"
              << ":" << tensor_name << std::endl;
    std::cout << "shape:[";
    int rank = tensor.dims();
    for (int i = 0; i < rank; i++) {
     
        std::cout << tensor.dim_size(i) << ",";
    }
    std::cout << "]" << std::endl;
    tensorflow::DataType eTensorType = tensor.dtype();
    std::cout << "Tensor type: " << eTensorType << std::endl;
    std::cout << "Tensor data size: " << tensor.tensor_data().size() << std::endl;
    std::cout << tensor.SummarizeValue(tensor.NumElements(), true);
    std::cout << std::endl
              << std::endl;
}

result_t TfInferenceLib::parseOutputTensors(std::vector<tensorflow::Tensor>& tVecOutputs,
    tensor_array_t* output_tensor_array)
{
     
    assert(tVecOutputs.size() == output_tensor_array->nArraySize);
    for (int i = 0; i < output_tensor_array->nArraySize; i++) {
     
        tensorflow::Tensor cur_tf_tensor = tVecOutputs[i];
        tensor_t* cur_tensor = &(output_tensor_array->pTensorArray[i]);
        tensor_params_t* cur_tensor_info = cur_tensor->pTensorInfo;
        cur_tensor_info->type = (tensor_types_t)cur_tf_tensor.dtype();
        cur_tensor_info->nElementSize = cur_tf_tensor.NumElements();
        cur_tensor_info->nDims = cur_tf_tensor.dims();
        for (int j = 0; j < cur_tensor_info->nDims; j++) {
     
            cur_tensor_info->pShape[j] = cur_tf_tensor.dim_size(j);
        }
        assert(cur_tensor_info->nLength == cur_tf_tensor.tensor_data().size());
        memcpy(cur_tensor->pValue, cur_tf_tensor.tensor_data().data(), cur_tensor_info->nLength);
    }
    return SUCCESS;
}

result_t TfInferenceLib::tfLoadSavedModel()
{
     
    tensorflow::Status load_status;
    std::string strModelAbsolutePath = getCurrentModelDir() + "/" + m_model_params->model_path;
    if (tensorflow::MaybeSavedModelDirectory(strModelAbsolutePath)) {
     
        std::cout << "Saved model is exists, path is " << strModelAbsolutePath << std::endl;
    } else {
     
        std::cout << "Saved model is not exists! path is " << strModelAbsolutePath << std::endl;
        return FAILED;
    }
    //设定显存的参数
    if (m_model_params->gpu_memory_faction < 0.0001 || m_model_params->gpu_memory_faction > 1.0) {
     
        m_model_params->gpu_memory_faction = 0.99;
    }
    m_session_options.config.mutable_gpu_options()->set_per_process_gpu_memory_fraction(
        m_model_params->gpu_memory_faction);
    m_session_options.config.mutable_gpu_options()->set_allow_growth(true);
    //Load model
    MY_DEBUG("Begin to load mode , path is  %s\n", strModelAbsolutePath.c_str());
    //load_status = tensorflow::LoadEncSavedModel(
    //    m_session_options, m_run_options, strModelAbsolutePath,
    //    { m_model_params->paModelTagSet }, &m_bundle,
    //    m_model_params->bIsCipher,
    //    m_model_params->gpu_id);
    load_status = tensorflow::LoadSavedModel(
        m_session_options, m_run_options, strModelAbsolutePath,
        {
      m_model_params->paModelTagSet }, &m_bundle);
    if (load_status.ok()) {
     
        std::cout << "Load model succeed!" << std::endl;
        return SUCCESS;
    } else {
     
        std::cout << "Error load model :" << load_status << std::endl;
        return MODEL_LOAD_FAILED;
    }
    return SUCCESS;
}

result_t TfInferenceLib::tfInferenceTensors()
{
     
    tensorflow::Status run_status;
    //拿到模型的函数签名
    const auto signature_def_map = m_bundle.meta_graph_def.signature_def();
    const auto signature_def = signature_def_map.at(m_input_tensors->pcSignatureDef);
    //准备输入Tensor矢量
    std::vector<tensorflow::Tensor> input_tensor_vec;
    std::vector<std::string> vecInputTensorNames;
    for (int j = 0; j < m_input_tensors->nArraySize; ++j) {
     
        tensor_t* cur_tensor = &(m_input_tensors->pTensorArray[j]);
        tensor_params_t* cur_tensor_info = cur_tensor->pTensorInfo;
        tensorflow::TensorShape cur_tensor_shape;
        for (int k = 0; k < cur_tensor_info->nDims; ++k) {
     
            cur_tensor_shape.AddDim(cur_tensor_info->pShape[k]);
        }
        if (cur_tensor_info->type == DT_UINT8) {
     
            tensorflow::Tensor input_tensor(tensorflow::DT_UINT8, cur_tensor_shape);
            memcpy(input_tensor.flat<u8>().data(), cur_tensor->pValue, cur_tensor_info->nLength);
            input_tensor_vec.push_back(input_tensor);
        } else if (cur_tensor_info->type == DT_INT32) {
     
            tensorflow::Tensor input_tensor(tensorflow::DT_INT32, cur_tensor_shape);
            memcpy(input_tensor.flat<int>().data(), cur_tensor->pValue, cur_tensor_info->nLength);
            input_tensor_vec.push_back(input_tensor);
        } else if (cur_tensor_info->type == DT_FLOAT) {
     
            tensorflow::Tensor input_tensor(tensorflow::DT_FLOAT, cur_tensor_shape);
            memcpy(input_tensor.flat<float>().data(), cur_tensor->pValue, cur_tensor_info->nLength);
            input_tensor_vec.push_back(input_tensor);
        } else if (cur_tensor_info->type == DT_STRING) {
     
            tensorflow::Tensor input_tensor;
            std::unique_ptr<tensorflow::TensorProto> upProto(new tensorflow::TensorProto());
            upProto->set_dtype(tensorflow::DataType::DT_STRING);
            upProto->add_string_val(cur_tensor->pValue, cur_tensor_info->nLength);
            upProto->mutable_tensor_shape()->add_dim()->set_size(1);
            if (!input_tensor.FromProto(*(std::move(upProto)))) {
     
                printf("Tensor[%d]::FromProto failed\n", j);
                return FAILED;
            }
            input_tensor_vec.push_back(input_tensor);
        }
        vecInputTensorNames.push_back(signature_def.inputs().at(cur_tensor_info->aTensorName).name());
    }
    //构造输入pair的矢量
    std::vector<std::pair<std::string, tensorflow::Tensor>> inputs;
    for (int i = 0; i < vecInputTensorNames.size(); i++) {
     
        inputs.push_back(std::make_pair(vecInputTensorNames[i], input_tensor_vec[i]));
    }
    //拿到输出Tensor真实的名字
    std::vector<std::string> vecWantOutTensorNames;
    std::vector<tensorflow::Tensor> tVecOutputs;
    for (int i = 0; i < m_output_tensors->nArraySize; i++) {
     
        std::string outputTensorName = m_output_tensors->pTensorArray[i].pTensorInfo->aTensorName;
        std::string outputTensorInternalName = signature_def.outputs().at(outputTensorName).name();
        vecWantOutTensorNames.push_back(outputTensorInternalName);
    }
    //进行推理
    MY_DEBUG("Begin to run session!\n");
    run_status = m_bundle.session->Run(inputs, vecWantOutTensorNames, {
     }, &tVecOutputs);
    if (run_status.ok()) {
     
        std::cout << "Session run succeed!!!\n";
    } else {
     
        std::cout << "Session run error: " << run_status << std::endl;
        return FAILED;
    }
    //解析输出
    result_t post_status = parseOutputTensors(tVecOutputs, m_output_tensors);
    return post_status;
}

这是外层接口

#define DLL_IMPLEMENT
#include 
#include "my_interface.h"
#include "tf_inference_lib.h"
result_t my_alloc_tensors(tensor_params_array_t* tensors_params,
    tensor_array_t** tensors)
{
     
    MY_CHECK_NULL(tensors_params, PARAM_NULL);
    tensor_array_t* ptTensorArray = new tensor_array_t;
    MY_CHECK_NULL(ptTensorArray, MEMORY_MALLOC_FAILED);
    ptTensorArray->nArraySize = tensors_params->nArraySize;
    ptTensorArray->pTensorArray = new tensor_t[tensors_params->nArraySize];
    MY_CHECK_NULL(ptTensorArray->pTensorArray, MEMORY_MALLOC_FAILED);
    for (int i = 0; i < ptTensorArray->nArraySize; i++) {
     
        int nDataSize = 0;
        tensor_params_t* curTensorParam = &(tensors_params->pTensorParamArray[i]);
        tensor_t* curTensor = &(ptTensorArray->pTensorArray[i]);
        switch (curTensorParam->type) {
     
        case DT_FLOAT:
            nDataSize = sizeof(float);
            break;
        case DT_DOUBLE:
            nDataSize = sizeof(double);
            break;
        case DT_INT32:
            nDataSize = sizeof(int);
            break;
        case DT_UINT8:
            nDataSize = sizeof(u8);
            break;
        case DT_STRING:
            nDataSize = sizeof(s8);
            break;
        case DT_BOOL:
            nDataSize = sizeof(bool);
            break;
        default:
            nDataSize = 1;
            break;
        }
        curTensorParam->nElementSize = 1;
        for (int j = 0; j < curTensorParam->nDims; j++) {
     
            curTensorParam->nElementSize *= curTensorParam->pShape[j];
        }
        curTensorParam->nLength = curTensorParam->nElementSize * nDataSize;
        curTensor->pValue = new u8[curTensorParam->nLength];
        MY_CHECK_NULL(curTensor->pValue, MEMORY_MALLOC_FAILED);
        curTensor->pTensorInfo = new tensor_params_t;
        memcpy(curTensor->pTensorInfo, curTensorParam, sizeof(tensor_params_t));
    }
    strcpy(ptTensorArray->pcSignatureDef, tensors_params->pcSignatureDef);
    *tensors = ptTensorArray;
    return SUCCESS;
}
result_t release_tensors(tensor_array_t* tensors)
{
     
    MY_CHECK_NULL(tensors, PARAM_NULL);
    for (int i = 0; i < tensors->nArraySize; i++) {
     
        tensor_t* curTensor = &(tensors->pTensorArray[i]);
        delete[]((u8*)(curTensor->pValue));
        delete (curTensor->pTensorInfo);
    }
    delete[] tensors->pTensorArray;
    delete tensors;
    return SUCCESS;
}
/**
 * 功能： 申请输入/输出tensor array的内存
 * 参数：
 *     input_tensors_params（in） ： 输入的tensor参数结构体；
 *     output_tensors_params(in) : 输出的tensor参数结构体；
 *     input_tensors（out) :        申请的输入tensor数组；
 *     output_tensors（out) :       申请的输出tensor数组；
 **/
DLL_API result_t init_tensors(tensor_params_array_t* input_tensors_params,
    tensor_params_array_t* output_tensors_params,
    tensor_array_t** input_tensors,
    tensor_array_t** output_tensors)
{
     
    result_t result = SUCCESS;
    //分配输入Tensor数组内存
    result = my_alloc_tensors(input_tensors_params, input_tensors);
    if (result != SUCCESS) {
     
        MY_ERROR("Alloc input tensors error !!!\n");
    }
    //分配输出Tensor数组内存
    result = my_alloc_tensors(output_tensors_params, output_tensors);
    if (result != SUCCESS) {
     
        MY_ERROR("Alloc input tensors error !!!\n");
    }
    return result;
}
/**
 * 功能： 释放申请的tensor array的内存
 * 参数
 *     input_tensors（in) :        申请的输入tensor数组指针；
 *     output_tensors（in) :       申请的输出tensor数组指针；
 **/
DLL_API result_t deinit_tensors(tensor_array_t* input_tensors,
    tensor_array_t* output_tensors)
{
     
    result_t res = SUCCESS;
    res = release_tensors(input_tensors);
    if (res != SUCCESS) {
     
        MY_ERROR("Release input tensors error!!!\n");
    }
    res = release_tensors(output_tensors);
    if (res != SUCCESS) {
     
        MY_ERROR("Release output tensors error!!!\n");
    }
    return SUCCESS;
}
result_t gpu_card_visible(char* visible_card_id_list)
{
     
    char env_name[] = "CUDA_VISIBLE_DEVICES";
    const char* new_cuda_value = visible_card_id_list;
    char* old_cuda_value = getenv(env_name);
    if (NULL == old_cuda_value) {
     
        old_cuda_value = "";
    }
    std::cout << "old cuda value is :" << old_cuda_value << std::endl;
#ifdef _WIN32
    char tmp_env_exp[256] = {
      0 };
    sprintf(tmp_env_exp, "%s=%s", env_name, new_cuda_value);
    putenv(tmp_env_exp);
#else
    setenv(env_name, new_cuda_value, 1);
#endif
    char* new_seted_cuda_value = getenv(env_name);
    std::cout << "New setted cuda value is :" << new_seted_cuda_value;
    return SUCCESS;
}
/**
 * 功能： 根据模型参数装载tensorflow模型
 * 参数：
 *     model_param（in) : 模型的输入参数
 *     input_tensors（in) :        输入tensor数组指针；
 *     output_tensors（in) :       输出tensor数组指针；
 *     model_handle（out) :装载好的模型句柄 
 **/
DLL_API result_t load_model(model_params_t* load_model_param,
    tensor_array_t* input_tensors,
    tensor_array_t* output_tensors,
    model_handle_t* load_model_handle)
{
     
    TfInferenceLib* tfInferenceLib = new TfInferenceLib(load_model_param, input_tensors, output_tensors);
    gpu_card_visible(load_model_param->visibleCard);
    tfInferenceLib->tfLoadSavedModel();
    load_model_handle->model_handle = tfInferenceLib;
    return SUCCESS;
}
/**
 * 功能： 释放模型的内存
 * 参数：
 *      model_handle（in):要释放的模型句柄
**/
DLL_API result_t release_model(model_handle_t* load_model_handle)
{
     
    TfInferenceLib* tfInferenceLib = (TfInferenceLib*)load_model_handle->model_handle;
    delete tfInferenceLib;
    return SUCCESS;
}
/**
 *  功能：进行推理，推理后的结果放到output_tensors中
 *  参数：
 *       model_handle(in) 模型句柄
 **/
DLL_API result_t inference_tensors(model_handle_t* load_model_handle)
{
     
    TfInferenceLib* tfInferenceLib = (TfInferenceLib*)load_model_handle->model_handle;
    result_t res = tfInferenceLib->tfInferenceTensors();

    return res;
}

然后修改tensorflow/cc文件夹下面的BUILD文件
在最后加上

c_library(
    name = "my_tensorflow",
    srcs = [
        "my_inference/common.h",
        "my_inference/my_interface.cc",
        "my_inference/my_interface.h",
        "my_inference/tf_inference_lib.cc",
        "my_inference/tf_inference_lib.h",
    ],
    #linkshared = 1,
    deps = [
        ":cc_ops",
        ":client_session",
        ":coordinator",
        ":queue_runner",
        ":scope",
        "//tensorflow/cc/saved_model:constants",
        "//tensorflow/cc/saved_model:loader",
        "//tensorflow/cc/saved_model:signature_constants",
        "//tensorflow/cc/saved_model:tag_constants",
        "//tensorflow/core:core_cpu",
        "//tensorflow/core:framework",
        "//tensorflow/core:lib",
        "//tensorflow/core:lib_internal",
        "//tensorflow/core:protos_all_cc",
        "//tensorflow/core:tensorflow",
    ],
)

然后再修改tensorflow的BUILD文件

tf_cc_shared_object(
    name = "tensorflow_cc",
    linkopts = select({
        "//tensorflow:macos": [
            "-Wl,-exported_symbols_list,$(location //tensorflow:tf_exported_symbols.lds)",
        ],
        "//tensorflow:windows": [],
        "//conditions:default": [
            "-z defs",
            "-Wl,--version-script,$(location //tensorflow:tf_version_script.lds)",
        ],
    }),
    per_os_targets = True,
    soversion = VERSION,
    visibility = ["//visibility:public"],
    # add win_def_file for tensorflow_cc
    win_def_file = select({
        # We need this DEF file to properly export symbols on Windows
        "//tensorflow:windows": ":tensorflow_filtered_def_file",
        "//conditions:default": None,
    }),
    deps = [
        "//tensorflow:tf_exported_symbols.lds",
        "//tensorflow:tf_version_script.lds",
        "//tensorflow/c:c_api",
        "//tensorflow/c/eager:c_api",
        "//tensorflow/cc:cc_ops",
        "//tensorflow/cc:client_session",
        "//tensorflow/cc:scope",
        "//tensorflow/cc/profiler",
        "//tensorflow/cc:my_tensorflow",             #这是我们添加的接口
        "//tensorflow/core:tensorflow",
    ] + if_ngraph(["@ngraph_tf//:ngraph_tf"]),
)

用bazel编译tensorflow的tensorflow_cc，顺利的话我们就可以编译出来tensorflow_cc.dll和tensorflow_cc.lib

然后我们在我们自己的工程里就可以引入动态库和静态库调用我们的接口了
最外一层还得加一个总的decodeFile的接口我还没时间写

调用接口实现，下面是我的一个例子

#include 
#include 
#include 
#include "opencv2/opencv.hpp"
#include "DCS_DeepLearningRegionDetection.h"

#define BATH_SIZE 1

int main(int argc, char **argv)
{
     
	//申请输入输出内存
	tensor_params_array_t in_tensor_params_ar = {
      0 };
	tensor_params_array_t out_tensor_params_ar = {
      0 };
	tensor_array_t *input_tensor_array = NULL;
	tensor_array_t *output_tensor_array = NULL;

	//输入Tensor数组参数设置
	in_tensor_params_ar.nArraySize = 1;
	strcpy(in_tensor_params_ar.pcSignatureDef, "predict_images");
	in_tensor_params_ar.pTensorParamArray = (tensor_params_t *)malloc(
		in_tensor_params_ar.nArraySize * sizeof(tensor_params_t));

	tensor_params_t *cur_in_tensor_params = &(in_tensor_params_ar.pTensorParamArray[0]);
	cur_in_tensor_params->nDims = 4;
	cur_in_tensor_params->type = DT_UINT8;
	cur_in_tensor_params->pShape[0] = BATH_SIZE;
	cur_in_tensor_params->pShape[1] = 3000;  //H
	cur_in_tensor_params->pShape[2] = 2000; //W
	cur_in_tensor_params->pShape[3] = 3;    //channel
	strcpy(cur_in_tensor_params->aTensorName, "ImageTensor");

	//输出Tensor数组参数设置
	out_tensor_params_ar.nArraySize = 1;
	strcpy(out_tensor_params_ar.pcSignatureDef, "predict_images");
	out_tensor_params_ar.pTensorParamArray = (tensor_params_t *)malloc(
		out_tensor_params_ar.nArraySize * sizeof(tensor_params_t));
	tensor_params_t *cur_out_tensor_params0 = &(out_tensor_params_ar.pTensorParamArray[0]);
	cur_out_tensor_params0->type = DT_INT32;
	cur_out_tensor_params0->nDims = 3;
	cur_out_tensor_params0->pShape[0] = BATH_SIZE;
	cur_out_tensor_params0->pShape[1] = 3000;
	cur_out_tensor_params0->pShape[2] = 2000;
	strcpy(cur_out_tensor_params0->aTensorName, "SemanticPredictions");


	//调用API申请Tensor数组内存
	if (SUCCESS != init_tensors(&in_tensor_params_ar, &out_tensor_params_ar,
		&input_tensor_array, &output_tensor_array))
	{
     
		printf("Open tensor memory error\n");
	}

	//设置模型加载参数
	model_params_t tModelParams = {
      0 };
	model_handle_t tModelHandel = {
      0 };
	tModelParams.cpu_or_gpu = 0;

	strcpy(tModelParams.visibleCard, "0");
	//strcpy(tModelParam.visibleCard, "0,1");
	tModelParams.gpu_id = 0;

	tModelParams.gpu_memory_faction = 0.9;

	//tModelParams.bIsCipher = true;
	//strcpy(tModelParams.model_path, "models/object_detection_enc/1");

	tModelParams.bIsCipher = false;
	strcpy(tModelParams.model_path, "models/test/1");
	strcpy(tModelParams.paModelTagSet, "serve");

	//调用API装载模型
	if (load_model(&tModelParams, input_tensor_array, output_tensor_array, &tModelHandel) != SUCCESS)
	{
     
		printf("Load Model error!!!\n");
	}


	cv::Mat bgrImage, rgbImage;
	bgrImage = cv::imread("test_data/000014_image.png");
	cv::cvtColor(bgrImage, rgbImage, cv::COLOR_BGR2RGB);
	int img_size = rgbImage.rows * rgbImage.cols * rgbImage.channels();

	tensor_t *cur_input_tensor = &(input_tensor_array->pTensorArray[0]);
	tensor_params_t *cur_input_tensor_info = cur_input_tensor->pTensorInfo;
	std::cout << "Cur tensor value length: " << cur_input_tensor_info->nLength << std::endl;
	assert(img_size == cur_input_tensor_info->nLength);
	memcpy(cur_input_tensor->pValue, rgbImage.ptr<unsigned char>(0), img_size);

	printf("Call api to inferencing.....\n");
	inference_tensors(&tModelHandel);
	printf("End inference!!!\n");

	//打印推理结果
	tensor_t * cur_output_tensor_class = &(output_tensor_array->pTensorArray[0]);

	int *seg = (int*)cur_output_tensor_class->pValue;

	cv::Mat mat = cv::Mat(3000, 2000, CV_8UC1);
	int b = 0;
	for (int i = 0; i < mat.rows; i++)
	{
     
		for (int j = 0; j < mat.cols; j++)
		{
     
			mat.at<uchar>(i, j) = (uchar)1.0*seg[b] * 100;
			b++;
		}
	}
	cv::Mat im_color;
	cv::applyColorMap(mat, im_color, cv::COLORMAP_JET);
	//释放申请的Tensor数组内存
	deinit_tensors(input_tensor_array, output_tensor_array);

	release_model(&tModelHandel);

	free(in_tensor_params_ar.pTensorParamArray);
	free(out_tensor_params_ar.pTensorParamArray);

	system("pause");
}

这是window端Debug下用imagewatch看的效果

window端 tflite

tensorflow官方没有提供tflite的windows版本，在GitHub上找到了别人实现的window版本
https://github.com/qintao97/tensorflow_lite

未完待续，最后还剩IOS了

你可能感兴趣的:(tensorflow,lite,语义分割,图像,tensorflow,深度学习)

微软 LIDA 库：基于大模型的自动化数据分析与可视化窝窝和牛牛 microsoft 数据分析
微软LIDA库：基于大模型的自动化数据分析与可视化一、核心架构与LLM交互流程调用LLM生成数据摘要基于LLM推理分析目标LLM生成可视化代码结合图像生成模型优化原始数据Summarizer模块结构化摘要GoalExplorer模块可视化目标列表VizGenerator模块可执行图表代码Infographer模块风格化信息图表二、LLM交互核心功能1.多模型支持架构兼容主流LLM服务商：通过统一接
OpenGL ES 如何渲染 16bit 图像？字节流动 OpenGL ES 3.0 OpenGLES 音视频图形渲染 Android c++
未经作者（微信ID：Byte-Flow）允许，禁止转载文章首发于公众号：字节流动最近有不少读者私信问OpenGLES如何处理16bit图像（P010）？然后我直接贴给他们一段在OpenGL环境下验证过的上传16bit图像数据的代码glTexImage2D(GL_TEXTURE_2D,0,GL_R16UI,width,height,0,GL_RED_INTEGER,GL_UNSIGNED_SHORT
给普通人看的深度学习说明书：用快递系统理解AI如何思考嵌入式Jerry Python AI 人工智能深度学习
第一章：理解AI的思维方式（快递版）1.1快递分拣站的故事假设你管理一个快递分拣站：传统方法：手动制定规则（比如根据邮编分拣）机器学习：观察老员工的分拣记录，总结规律深度学习：搭建自动分拣流水线，自主发现隐藏规则1.2神经网络就像智能分拣机传送带（输入层）：接收包裹信息（图片像素/文字等）#就像扫描快递单input_data=[0.2,0.7,0.1]#归一化后的特征数据分拣工人（隐藏层）：每个工
解析大模型归一化：提升训练稳定性和性能的关键技术秋声studio 口语化解析深度学习人工智能大模型归一化
引言在深度学习领域，特别是在处理大型神经网络模型时，归一化（Normalization）是一项至关重要的技术。它可以提高模型的训练稳定性和性能，在加速收敛方面发挥了重要作用。本文将深入探讨大模型归一化的原理、常见方法及其应用场景，并结合实际案例和代码示例进行说明。一、归一化的作用与理论基础归一化的主要目的是为了提高模型的训练稳定性和性能。具体来说，归一化有以下几个关键作用：提高训练稳定性：在神经网
PyTorch数据归一化处理：transforms 2401_87555420 pytorch 人工智能 python
##1.数据归一化处理：transforms.Normalize###1.1理解torchvision*torchvision.transforms：常用的图像预处理方法*torchvision.datasets：常用的数据集Dataset实现*torchvision.models：常用的CV（预训练）模型实现torchvision.transforms:常用的数据预处理方法，提升泛化能力，包括：
Python 向量检索库Faiss使用懒大王爱吃狼 python python 开发语言自动化 Python基础 python教程
Faiss（FacebookAISimilaritySearch）是一个由FacebookAIResearch开发的库，它专门用于高效地搜索和聚类大量向量。Faiss能够在几毫秒内搜索数亿个向量，这使得它非常适合于实现近似最近邻（ANN）搜索，这在许多应用中都非常有用，比如图像检索、推荐系统和自然语言处理。以下是如何使用Faiss的基本步骤和示例：1.安装Faiss首先，你需要安装Faiss。你可
深入解析深度学习中的过拟合与欠拟合诊断、解决与工程实践古月居GYH 深度学习人工智能
一、引言：模型泛化能力的核心挑战在深度学习模型开发中，欠拟合与过拟合是影响泛化能力的两个核心矛盾。据GoogleBrain研究统计，工业级深度学习项目中有63%的失败案例与这两个问题直接相关。本文将从基础概念到工程实践，系统解析其本质特征、诊断方法及解决方案，并辅以可复现的代码案例。二、核心概念与通熟易懂解释简单而言，欠拟合是指模型不能在训练集上获得足够低的误差。换句换说，就是模型复杂度低，模型在
CBNet--一种新的目标检测的复合骨干网体系结构 weixin_45963617 深度学习系列
一、Introduction一般来说，在一个典型的基于CNN的目标检测器中，使用主干网络来提取检测对象的基本特征，该网络通常是为图像分类任务而设计的，并在ImageNet上预训练。毫无疑问，更强大的主干网可以带来更好的检测性能。尽管最先进的基于深度的大骨干网络的探测器取得了很好的结果，但仍有很大改进空间。此外，通过设计一个新的更强大的主干网络并在ImageNet上预训练来获取好的检测性能是十分昂贵
初始OpenCV 指尖下的技术 OpenCV opencv 人工智能计算机视觉
OpenCV是一个功能强大、应用广泛的计算机视觉库，它为开发人员提供了丰富的工具和算法，可以帮助他们快速构建各种视觉应用。随着计算机视觉技术的不断发展，OpenCV也将会继续发挥重要的作用。OpenCV提供了大量的计算机视觉算法和图像处理工具，广泛应用于图像和视频的处理、分析以及机器学习领域。所以学习人计算机视觉或者图像处理方面的知识，OpenCV是一个要重点学习的工具库。首先介绍一下OpenCV
【2017-2025】Adobe Photoshop【PS】软件下载安装 adkjcbqvblq adobe photoshop ui
获取安装包https://pan.baidu.com/s/1NLUthiAyC2chlSEwbf1LRQ?pwd=4ppq1.起源与发展1.1初试啼声AdobePhotoshop的历史可以追溯到1987年，当时由托马斯·诺尔（ThomasKnoll）和他的兄弟约翰·诺尔（JohnKnoll）共同开发。托马斯在父亲的帮助下，开始了图像处理的编程尝试。他们的初始产品是一个用于Mac系统的程序，最初名为
Umi-OCR 实践教程：离线、免费、高效的图像文字识别工具几道之旅人工智能智能体及数字员工 ocr 人工智能
一、工具简介Umi-OCR是一款开源、免费且支持离线运行的OCR（光学字符识别）工具，适用于Windows和Linux系统。它基于深度学习技术，能够高效提取图像中的文字，支持多语言识别、批量处理、截屏识别等功能，尤其适合对隐私敏感或网络受限的场景。核心亮点：离线运行：无需联网，保护隐私。多引擎支持：提供Paddle（高性能）和Rapid（低配兼容）两种引擎。批量处理：支持图片、PDF、电子书等多格
基于ChatGPT、GIS与Python机器学习的地质灾害风险评估、易发性分析、信息化建库及灾后重建高级实践 weixin_贾防洪评价风险评估滑坡泥石流地质灾害
第一章、ChatGPT、DeepSeek大语言模型提示词与地质灾害基础及平台介绍【基础实践篇】1、什么是大模型？大模型（LargeLanguageModel,LLM）是一种基于深度学习技术的大规模自然语言处理模型。代表性大模型：GPT-4、BERT、T5、ChatGPT等。特点：多任务能力：可以完成文本生成、分类、翻译、问答等任务。上下文理解：能理解复杂的上下文信息。广泛适配性：适合科研、教育、行
anythingLLM 使用教程惟贤箬溪穷玩Ai AIGC 人工智能
一、anythingLLM简介anythingLLM是一款灵活且功能强大的语言模型，它基于先进的深度学习架构构建，旨在为用户提供多样化的自然语言处理服务。其设计理念注重通用性和可扩展性，能够适应多种领域和任务，无论是文本生成、智能问答，还是翻译、摘要提取等，都能展现出出色的性能。与同类模型相比，anythingLLM具有训练数据丰富、模型优化程度高的优势，能够生成更符合逻辑、更具实用性的文本内容。
深度解析大模型推理框架：原理、应用与实践百度_开发者中心人工智能大模型自然语言处理
在当今数据驱动的时代，大模型推理框架已经成为人工智能领域的重要支柱。本文将通过简明扼要、清晰易懂的方式，带领读者深入了解大模型推理框架的原理、应用领域和实践经验，帮助读者更好地掌握这一技术，并在实际工作中发挥其价值。一、大模型推理框架简介大模型推理框架是指一种基于深度学习技术的推理框架，主要用于解决大规模数据集下的复杂问题。该框架通过对海量数据进行高效的训练和推理，能够快速地对各种复杂场景进行分析
大模型推理框架：从理论到实践的全面解析百度_开发者中心人工智能大模型自然语言处理
在数据驱动的时代，深度学习技术已经渗透到各个行业，从图像识别到自然语言处理，从推荐系统到智能客服，其应用无处不在。然而，深度学习模型的训练和推理过程往往涉及大量数据和复杂计算，传统的计算框架难以满足需求。因此，大模型推理框架应运而生，成为解决这一问题的关键。一、大模型推理框架基本概念大模型推理框架是一种基于深度学习技术的推理框架，它通过对海量数据进行高效的训练和推理，能够快速地对各种复杂场景进行分
GStreamer —— 3.2、Qt+GStreamer+OpenCV制作图像处理播放器(对每帧图像处理)，支持本地mp4文件、rtsp流、usb摄像头等（可跨平台，附源码）信必诺 GStreamer Qt GStreamer Qt
运行效果介绍本项目是一个结合了Qt、GStreamer和OpenCV的跨平台图像处理播放器项目。该
人脸识别的一些代码饿了就干饭 CV相关人脸识别
1、cv2入门函数imread及其相关操作2、（详解）opencv里的cv2.resize改变图片大小Python3、机器学习之人脸识别face_recognition使用4、使用face_recognition进行人脸校准5、简单的人脸识别通用流程示意图（这个看着写的挺好的）6、face_recognition和图像处理中left、top、right、bottom解释7、使用pillow库对图片
【人工智能】大模型的幻觉问题：DeepSeek 的解决策略与实践蒙娜丽宁 Python杂谈人工智能人工智能
《PythonOpenCV从菜鸟到高手》带你进入图像处理与计算机视觉的大门！解锁Python编程的无限可能：《奇妙的Python》带你漫游代码世界大语言模型（LLM）的“幻觉”问题，即模型生成与事实不符或脱离上下文的内容，是限制其广泛应用的关键挑战之一。本文深入探讨了幻觉问题的成因，包括训练数据的偏差、推理过程中的过度泛化以及缺乏外部验证机制。以DeepSeek系列模型为研究对象，我们分析了其在解
Yolo系列之Yolo的基本理解是十一月末 YOLO python 开发语言 yolo
YOLO的基本理解目录YOLO的基本理解1YOLO1.1概念1.2算法2单、多阶段对比2.1FLOPs和FPS2.2one-stage单阶段2.3two-stage两阶段1YOLO1.1概念YOLO(YouOnlyLookOnce)是一种基于深度学习的目标检测算法，由JosephRedmon等人于2016年提出。它的核心思想是将目标检测问题转化为一个回归问题，通过一个神经网络直接预测目标的类别和位
PyTorch基础知识讲解（一）完整训练流程示例苏雨流丰机器学习 pytorch 人工智能 python 机器学习深度学习
文章目录Tutorial1.数据处理2.网络模型定义3.损失函数、模型优化、模型训练、模型评价4.模型保存、模型加载、模型推理Tutorial大多数机器学习工作流程涉及处理数据、创建模型、优化模型参数和保存训练好的模型。本教程向你介绍一个用PyTorch实现的完整的ML工作流程，并提供链接来了解这些概念中的每一个。我们将使用FashionMNIST数据集来训练一个神经网络，预测输入图像是否属于以下
入门 Canvas：Web 绘图的强大工具 Hopebearer_ 前端 es6 javascript canva可画
文章目录入门Canvas：Web绘图的强大工具一、Canvas简介二、Canvas的基本用法（一）绘制基本图形（二）绘制文本三、Canvas的应用场景（一）数据可视化（二）游戏开发（三）图像编辑四、Canvas的动画效果五、Canvas的优势与局限性（一）优势（二）局限性六、总结入门Canvas：Web绘图的强大工具在Web开发的广阔天地中，为了满足用户对丰富、交互性强的体验的不断追求，前端技术持
探索HTML5 Canvas：创造动态与交互性网页内容的强大工具 A-Kamen html5 前端 html
探索HTML5Canvas：创造动态与交互性网页内容的强大工具引言在HTML5的众多新特性中，Canvas无疑是最引人注目的元素之一。它为网页设计师和开发者提供了一个通过JavaScript和HTML直接在网页上绘制图形、图像以及进行动画处理的画布。Canvas的灵活性和强大功能，使得它成为创造动态、交互性网页内容的首选工具。本文将深入探讨HTML5Canvas的基本用法、应用场景以及如何利用它来
TensorFlow和Pytorch在功能上的区别以及优势 Honeysea_70 #算法 tensorflow pytorch 人工智能
功能上的区别1.计算图TensorFlow：使用静态计算图（StaticGraph）。在运行模型之前，需要先构建完整的计算图，然后通过会话（Session）运行图。优点是性能优化更高效，适合大规模分布式训练和生产环境部署。缺点是调试相对复杂，因为计算图的构建和运行是分离的。PyTorch：使用动态计算图（DynamicGraph）。计算图是动态构建和执行的，每次迭代都会重新构建图。优点是调试方便，
AI时代个人财富增长实战指南：从零基础到精通变现的完整路径 A达峰绮人工智能
（本文基于人工智能技术发展规律，结合互联网经济底层逻辑，为普通从业者构建系统性AI应用框架）一、建立AI认知基础：技术理解与工具掌握技术分类认知人工智能工具分为四大功能模块：自然语言处理（文本生成、对话交互）、计算机视觉（图像视频处理）、数据分析（预测建模）、自动化控制（流程优化）。建议新手首先掌握语言类工具的基础操作，逐步扩展到其他领域。工具操作逻辑通用AI工具通常包含三大核心功能模块：输入界面
大语言模型学习路线：从入门到实战大模型官方资料语言模型学习人工智能产品经理自然语言处理搜索引擎
大语言模型学习路线：从入门到实战在人工智能领域，大语言模型（LargeLanguageModels,LLMs）正迅速成为一个热点话题。本学习路线旨在为有基本Python编程和深度学习基础的学习者提供一个清晰、系统的大模型学习指南，帮助你在这一领域快速成长。本学习路线更新至2024年02月，后期部分内容或工具可能需要更新。适应人群已掌握Python基础具备基本的深度学习知识学习步骤本路线将通过四个核
深度学习与目标检测系列(六) 本文约(4.5万字) | 全面解读复现ResNet | Pytorch | 小酒馆燃着灯深度学习目标检测 pytorch 人工智能 ResNet 残差连接残差网络
文章目录解读Abstract—摘要翻译精读主要内容Introduction—介绍翻译精读背景RelatedWork—相关工作ResidualRepresentations—残差表达翻译精读主要内容ShortcutConnections—短路连接翻译精读主要内容DeepResidualLearning—深度残差学习ResidualLearning—残差学习翻译精读ResNet目的以前方法本文改进本质
深度学习与目标检测系列(三) 本文约(4万字) | 全面解读复现AlexNet | Pytorch | 小酒馆燃着灯深度学习目标检测 pytorch AlexNet 人工智能
文章目录解读Abstract-摘要翻译精读主要内容1.Introduction—前言翻译精读主要内容：本文主要贡献：2.TheDataset-数据集翻译精读主要内容：ImageNet简介：图像处理方法：3.TheArchitecture—网络结构3.1ReLUNonlinearity—非线性激活函数ReLU翻译精读传统方法及不足本文改进方法本文的改进结果3.2TrainingonMultipleG
python科学绘图-matplotlib绘制三维函数图像，并且在函数底部绘制等值线 zhan114514 python科学绘图 python matplotlib 开发语言
python使用matplotlib库绘制三维函数图像，并且在底部绘制等值线。三维图像函数surface=ax.plot_surface(X,Y,zss,camp=色带)等值线函数contour=ax.contour(xs,ys,zss,zdir=在哪个轴绘制,offset=在该轴什么位置绘制,camp=色带,zorder=图层位置)颜色条函数plt.colorbar(surface,shrink
外星人入侵-Python-二 Java版蜡笔小新 Python python pygame 开发语言
武装飞船开发一个名为《外星人入侵》的游戏吧！为此将使用Pygame，这是一组功能强大而有趣的模块，可用于管理图形、动画乃至声音，让你能够更轻松地开发复杂的游戏。通过使用Pygame来处理在屏幕上绘制图像等任务，可将重点放在程序的高级逻辑上。你将安装Pygame，再创建一艘能够根据用户输入左右移动和射击的飞船。在接下来的两章，你将创建一群作为射杀目标的外星人，并改进该游戏：限制可供玩家使用的飞船数，
不搞花里胡哨！CMU最新开源：极简风格的LiDAR全景分割+跟踪！ 3Ｄ视觉工坊 3D视觉从入门到精通 3D视觉
来源：3D视觉工坊在公众号「3D视觉工坊」后台，回复「原论文」可获取论文pdf、代码链接添加微信：dddvisiona，备注：三维点云，拉你入群。文末附行业细分群1.笔者个人体会激光雷达全景分割（LPS）一般遵循自下而上的以分割为中心的范式，利用聚类获得对象实例来建立语义分割网络。但是最近CMU&Meta等大佬们重新思考了这种方法，并提出了一个简单而有效的检测中心网络，用于LPS和跟踪。这项工作也
微信开发者验证接口开发 362217990 微信开发者 token 验证
微信开发者接口验证。 Token，自己随便定义，与微信填写一致就可以了。根据微信接入指南描述 http://mp.weixin.qq.com/wiki/17/2d4265491f12608cd170a95559800f2d.html 第一步：填写服务器配置第二步：验证服务器地址的有效性第三步：依据接口文档实现业务逻辑这里主要讲第二步验证服务器有效性。建一个
一个小编程题-类似约瑟夫环问题 BrokenDreams 编程
今天群友出了一题：一个数列,把第一个元素删除,然后把第二个元素放到数列的最后,依次操作下去,直到把数列中所有的数都删除,要求依次打印出这个过程中删除的数。 &
linux复习笔记之bash shell (5) 关于减号-的作用 eksliang linux关于减号“-”的含义 linux关于减号“-”的用途 linux关于“-”的含义 linux关于减号的含义
转载请出自出处： http://eksliang.iteye.com/blog/2105677 管道命令在bash的连续处理程序中是相当重要的，尤其在使用到前一个命令的studout（标准输出）作为这次的stdin（标准输入）时，就显得太重要了，某些命令需要用到文件名，例如上篇文档的的切割命令（split）、还有
Unix(3) 18289753290 unix ksh
1)若该变量需要在其他子进程执行，则可用"$变量名称"或${变量}累加内容什么是子进程？在我目前这个shell情况下，去打开一个新的shell，新的那个shell就是子进程。一般状态下，父进程的自定义变量是无法在子进程内使用的，但通过export将变量变成环境变量后就能够在子进程里面应用了。 2)条件判断： &&代表and ||代表or&nbs
关于ListView中性能优化中图片加载问题酷的飞上天空 ListView
ListView的性能优化网上很多信息，但是涉及到异步加载图片问题就会出现问题。具体参看上篇文章http://314858770.iteye.com/admin/blogs/1217594 如果每次都重新inflate一个新的View出来肯定会造成性能损失严重，可能会出现listview滚动是很卡的情况，还会出现内存溢出。现在想出一个方法就是每次都添加一个标识，然后设置图
德国总理默多克：给国人的一堂“震撼教育”课永夜-极光教育
http://bbs.voc.com.cn/topic-2443617-1-1.html德国总理默多克：给国人的一堂“震撼教育”课　安吉拉—默克尔，一位经历过社会主义的东德人，她利用自己的博客，发表一番来华前的谈话，该说的话，都在上面说了，全世界想看想传播——去看看默克尔总理的博客吧！　　德国总理默克尔以她的低调、朴素、谦和、平易近人等品格给国人留下了深刻印象。她以实际行动为中国人上了一堂
关于Java继承的一个小问题。。。随便小屋 java
今天看Java 编程思想的时候遇见一个问题，运行的结果和自己想想的完全不一样。先把代码贴出来！ //CanFight接口 interface Canfight { void fight(); } //ActionCharacter类 class ActionCharacter { public void fight() { System.out.pr
23种基本的设计模式 aijuans 设计模式
Abstract Factory：提供一个创建一系列相关或相互依赖对象的接口，而无需指定它们具体的类。　　Adapter：将一个类的接口转换成客户希望的另外一个接口。A d a p t e r模式使得原本由于接口不兼容而不能一起工作的那些类可以一起工作。　　Bridge：将抽象部分与它的实现部分分离，使它们都可以独立地变化。　　Builder：将一个复杂对象的构建与它的表示分离，使得同
《周鸿祎自述：我的互联网方法论》读书笔记 aoyouzi 读书笔记
从用户的角度来看,能解决问题的产品才是好产品,能方便/快速地解决问题的产品,就是一流产品. 商业模式不是赚钱模式一款产品免费获得海量用户后,它的边际成本趋于0,然后再通过广告或者增值服务的方式赚钱,实际上就是创造了新的价值链. 商业模式的基础是用户,木有用户,任何商业模式都是浮云.商业模式的核心是产品,本质是通过产品为用户创造价值. 商业模式还包括寻找需求
JavaScript动态改变样式访问技术百合不是茶 JavaScript style属性 ClassName属性
一:style属性格式: HTML元素.style.样式属性="值"; 创建菜单:在html标签中创建或者在head标签中用数组创建 <html> <head> <title>style改变样式</title> </head> &l
jQuery的deferred对象详解 bijian1013 jquery deferred对象
jQuery的开发速度很快，几乎每半年一个大版本，每两个月一个小版本。每个版本都会引入一些新功能，从jQuery 1.5.0版本开始引入的一个新功能----deferred对象。 &nb
淘宝开放平台TOP Bill_chen C++c 物流 C#
淘宝网开放平台首页：http://open.taobao.com/ 淘宝开放平台是淘宝TOP团队的产品，TOP即TaoBao Open Platform，是淘宝合作伙伴开发、发布、交易其服务的平台。支撑TOP的三条主线为： 1.开放数据和业务流程 * 以API数据形式开放商品、交易、物流等业务； &
【大型网站架构一】大型网站架构概述 bit1129 网站架构
大型互联网特点面对海量用户、海量数据大型互联网架构的关键指标高并发高性能高可用高可扩展性线性伸缩性安全性大型互联网技术要点前端优化 CDN缓存反向代理 KV缓存消息系统分布式存储 NoSQL数据库搜索监控安全想到的问题： 1.对于订单系统这种事务型系统，如
eclipse插件hibernate tools安装白糖_ Hibernate
eclipse helios(3.6)版 1.启动eclipse 2.选择 Help > Install New Software...> 3.添加如下地址： http://download.jboss.org/jbosstools/updates/stable/helios/ 4.选择性安装：hibernate tools在All Jboss tool
Jquery easyui Form表单提交注意事项 bozch jquery easyui
jquery easyui对表单的提交进行了封装，提交的方式采用的是ajax的方式，在开发的时候应该注意的事项如下： 1、在定义form标签的时候，要将method属性设置成post或者get，特别是进行大字段的文本信息提交的时候，要将method设置成post方式提交，否则页面会抛出跨域访问等异常。所以这个要
Trie tree(字典树)的Java实现及其应用-统计以某字符串为前缀的单词的数量 bylijinnan java实现
import java.util.LinkedList; public class CaseInsensitiveTrie { /** 字典树的Java实现。实现了插入、查询以及深度优先遍历。 Trie tree's java implementation.(Insert,Search,DFS) Problem Description Igna
html css 鼠标形状样式汇总 chenbowen00 html css
css鼠标手型cursor中hand与pointer Example：CSS鼠标手型效果 <a href="#" style="cursor:hand">CSS鼠标手型效果</a><br/> Example：CSS鼠标手型效果 <a href="#" style=&qu
[IT与投资]IT投资的几个原则 comsci it
无论是想在电商,软件,硬件还是互联网领域投资,都需要大量资金,虽然各个国家政府在媒体上都给予大家承诺,既要让市场的流动性宽松,又要保持经济的高速增长....但是,事实上,整个市场和社会对于真正的资金投入是非常渴望的,也就是说,表面上看起来,市场很活跃,但是投入的资金并不是很充足的......
oracle with语句详解 daizj oracle with with as
oracle with语句详解转在oracle中，select 查询语句，可以使用with,就是一个子查询，oracle 会把子查询的结果放到临时表中，可以反复使用例子:注意，这是sql语句，不是pl/sql语句，可以直接放到jdbc执行的 ----------------------------------------------------------------
hbase的简单操作 deng520159 数据库 hbase
近期公司用hbase来存储日志,然后再来分析 ,把hbase开发经常要用的命令找了出来. 用ssh登陆安装hbase那台linux后用hbase shell进行hbase命令控制台! 表的管理 1）查看有哪些表 hbase(main)> list 2）创建表 # 语法：create <table>, {NAME => <family&g
C语言scanf继续学习、算术运算符学习和逻辑运算符 dcj3sjt126com c
/* 2013年3月11日20:37:32 地点：北京潘家园功能：完成用户格式化输入多个值目的：学习scanf函数的使用 */ # include <stdio.h> int main(void) { int i, j, k; printf("please input three number:\n"); //提示用
2015越来越好 dcj3sjt126com 歌曲
越来越好房子大了电话小了感觉越来越好假期多了收入高了工作越来越好商品精了价格活了心情越来越好天更蓝了水更清了环境越来越好活得有奔头人会步步高想做到你要努力去做到幸福的笑容天天挂眉梢越来越好婆媳和了家庭暖了生活越来越好孩子高了懂事多了学习越来越好朋友多了心相通了大家越来越好道路宽了心气顺了日子越来越好活的有精神人就不显
java.sql.SQLException: Value '0000-00-00' can not be represented as java.sql.Tim feiteyizu mysql
数据表中有记录的time字段（属性为timestamp）其值为：“0000-00-00 00:00:00” 程序使用select 语句从中取数据时出现以下异常： java.sql.SQLException:Value '0000-00-00' can not be represented as java.sql.Date java.sql.SQLException: Valu
Ehcache（07）——Ehcache对并发的支持 234390216 并发 ehcache 锁 ReadLock WriteLock
Ehcache对并发的支持在高并发的情况下，使用Ehcache缓存时，由于并发的读与写，我们读的数据有可能是错误的，我们写的数据也有可能意外的被覆盖。所幸的是Ehcache为我们提供了针对于缓存元素Key的Read（读）、Write（写）锁。当一个线程获取了某一Key的Read锁之后，其它线程获取针对于同
mysql中blob,text字段的合成索引 jackyrong mysql
在mysql中，原来有一个叫合成索引的，可以提高blob,text字段的效率性能，但只能用在精确查询，核心是增加一个列，然后可以用md5进行散列，用散列值查找则速度快比如： create table abc(id varchar(10),context blog,hash_value varchar(40)); insert into abc(1,rep
逻辑运算与移位运算 latty 位运算逻辑运算
源码：正数的补码与原码相同例+7 源码：00000111 补码：00000111 （用8位二进制表示一个数）负数的补码：符号位为1，其余位为该数绝对值的原码按位取反；然后整个数加1。 -7 源码： 10000111 ，其绝对值为00000111 取反加一：11111001 为-7补码已知一个数的补码，求原码的操作分两种情况：
利用XSD 验证XML文件 newerdragon java xml xsd
XSD文件（XML Schema 语言也称作 XML Schema 定义（XML Schema Definition，XSD）。具体使用方法和定义请参看： http://www.w3school.com.cn/schema/index.asp java自jdk1.5以上新增了SchemaFactory类可以实现对XSD验证的支持，使用起来也很方便。以下代码可用在J
搭建 CentOS 6 服务器(12) - Samba rensanning centos
（1）安装 # yum -y install samba Installed: samba.i686 0:3.6.9-169.el6_5 # pdbedit -a rensn new password:123456 retype new password:123456 …… （2）Home文件夹 # mkdir /etc
Learn Nodejs 01 toknowme nodejs
（1）下载nodejs https://nodejs.org/download/ 选择相应的版本进行下载（2）安装nodejs 安装的方式比较多，请baidu下我这边下载的是“node-v0.12.7-linux-x64.tar.gz”这个版本（1）上传服务器（2）解压 tar -zxvf node-v0.12.
jquery控制自动刷新的代码举例 xp9802 jquery
1、html内容部分复制代码代码示例: <div id='log_reload'> <select name="id_s" size="1"> <option value='2'>-2s-</option> <option value='3'>-3s-</option