江太翁

mediapipe流水线分析一

object detection Graph

以目标检测为例分析mediapip流水线处理机制

一流水线上游输入处理

1 Calculator

算子它是在MediaPipe框架中用于创建插件/算子机制的基础

在MediaPipe中，插件是一种可扩展的计算模块，可以用于实现各种不同的计算功能。calculator_base.h 文件定义了一个基类，所有插件都需要继承这个基类，并实现其中的函数或方法。通过使用这个基类，MediaPipe可以统一管理插件的接口和功能，使得在创建复杂的多媒体处理程序时更加灵活和可扩展。插件可以像拼积木一样组合和排列，以实现不同的功能和效果。

calculator_base.h 文件通常会包含一些基本的函数和属性，例如插件的初始化、更新、清理等操作，以及插件之间的通信和数据交换等。这个基类为插件的实现提供了一个统一的框架和规范，使得开发者可以根据自己的需求和创意来创建自定义的插件，并将其集成到MediaPipe的多媒体处理程序中。

它计算图里的每个node都是calculator,是计算图的逻辑计算的载体，一个calculator可以接受0或多个stream或side packet, 输出0或多个stream或side packet. Calculator需要继承相同的基类并实现所需要的接口，并且要在framework中进行注册，以便可以通过配置文件进行构建。

1.1 CalculatorBase

calculator_base.h 头文件，它定义了MediaPipe框架中用于创建插件/算子机制的基础类。

// Copyright 2019 The MediaPipe Authors.
//
// Licensed under the Apache License, Version 2.0 (the "License");
// you may not use this file except in compliance with the License.
// You may obtain a copy of the License at
//
//      http://www.apache.org/licenses/LICENSE-2.0
//
// Unless required by applicable law or agreed to in writing, software
// distributed under the License is distributed on an "AS IS" BASIS,
// WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
// See the License for the specific language governing permissions and
// limitations under the License.

#ifndef MEDIAPIPE_FRAMEWORK_COLLECTION_H_
#define MEDIAPIPE_FRAMEWORK_COLLECTION_H_

#include 
#include 
#include 
#include 
#include 
#include 
#include 

#include "absl/base/macros.h"
#include "absl/memory/memory.h"
#include "absl/strings/str_cat.h"
#include "absl/strings/string_view.h"
#include "mediapipe/framework/collection_item_id.h"
#include "mediapipe/framework/port/logging.h"
#include "mediapipe/framework/tool/tag_map.h"
#include "mediapipe/framework/tool/tag_map_helper.h"
#include "mediapipe/framework/tool/validate_name.h"
#include "mediapipe/framework/type_map.h"

namespace mediapipe {
namespace internal {

// A class to handle errors that occur in Collection.  For most
// collections, these errors should be fatal.  However, for a collection
// more like PacketTypeSet, the errors should be deferred and handled
// later.
//
// This class is thread compatible.
template 
struct CollectionErrorHandlerFatal {
  // An error occurred during object lookup for the provided tag and
  // index.  The returned object reference will be provided instead.
  //
  // Since there isn't any state and we're not returning anything, we
  // get away with only one version of this function (which is const
  // but returns a non-const reference).
  T& GetFallback(const absl::string_view tag, int index) const {
    LOG(FATAL) << "Failed to get tag \"" << tag << "\" index " << index;
    std::abort();
  }
};

enum class CollectionStorage { kStoreValue = 0, kStorePointer };

// A collection of objects of type T.
//
// If storage == kStorePointer then T* will be stored instead of T, but
// the accessor functions will still return T types.  The T objects must
// be owned elsewhere and remain alive as long as the collection is used.
// To set the pointers use the GetPtr() function.
//
// The ErrorHandler object allows errors to be deferred to a later time.
//
// This class is thread compatible as long as the ErrorHandler object is also
// thread compatible.
template >
class Collection {
 private:
  template 
  class DoubleDerefIterator;

 public:
  using value_type = T;

  // The iterator is over value_type, requiring a double dereference if
  // storage == kStorePointer.
  using iterator =
      typename std::conditional,
                                value_type*>::type;
  using const_iterator =
      typename std::conditional,
                                const value_type*>::type;
  using difference_type = ptrdiff_t;
  using size_type = size_t;
  using pointer = value_type*;
  using reference = value_type&;

  // The type that is stored by data_;
  using stored_type =
      typename std::conditional::type;

  // Collection must be initialized on construction.
  Collection() = delete;
  Collection(const Collection&) = delete;
  Collection& operator=(const Collection&) = delete;
  // Makes a Collection using the given TagMap (which should be shared
  // between collections).
  // Refer to mediapipe::tool::CreateTagMap for examples of how to construct a
  // collection from a vector of "TAG::name" strings, or from an integer
  // number of indexes, etc.
  explicit Collection(std::shared_ptr tag_map);
  // Makes a Collection using the information in the TagAndNameInfo.
  ABSL_DEPRECATED("Use Collection(tool::TagMap)")
  explicit Collection(const tool::TagAndNameInfo& info);
  // Convenience constructor which initializes a collection to use
  // indexes and have num_entries inputs.
  ABSL_DEPRECATED("Use Collection(tool::TagMap)")
  explicit Collection(int num_entries);
  // Convenience constructor which initializes a collection to use tags
  // with the given names.
  // Note: initializer_list constructor should not be marked explicit.
  ABSL_DEPRECATED("Use Collection(tool::TagMap)")
  Collection(const std::initializer_list& tag_names);

  // Access the data at a given CollectionItemId.  This is the most efficient
  // way to access data within the collection.
  //
  // Do not assume that Index(2) == Get(collection.TagMap()->BeginId() + 2).
  value_type& Get(CollectionItemId id);
  const value_type& Get(CollectionItemId id) const;

  // Convenience functions.
  value_type& Get(absl::string_view tag, int index);
  const value_type& Get(absl::string_view tag, int index) const;

  // Equivalent to Get("", index);
  value_type& Index(int index);
  const value_type& Index(int index) const;

  // Equivalent to Get(tag, 0);
  value_type& Tag(absl::string_view tag);
  const value_type& Tag(absl::string_view tag) const;

  // These functions only exist for collections with storage ==
  // kStorePointer.  GetPtr returns the stored ptr value rather than
  // the value_type.  The non-const version returns a reference so that
  // the pointer can be set.
  value_type*& GetPtr(CollectionItemId id);
  // Const version returns a pointer to a const value (a const-ref to
  // a pointer wouldn't be useful in this context).
  const value_type* GetPtr(CollectionItemId id) const;

  // Returns true if the collection has a tag other than "".
  // TODO Deprecate and remove this function.
  bool UsesTags() const;

  // Returns a description of the collection.
  std::string DebugString() const;

  // Return the tag_map.
  const std::shared_ptr& TagMap() const;

  // Iteration functions for use of the collection in a range based
  // for loop.  The items are provided in sorted tag order with indexes
  // sequential within tags.
  iterator begin();
  iterator end();
  const_iterator begin() const;
  const_iterator end() const;

  // Returns the error handler object.
  const ErrorHandler& GetErrorHandler() const { return error_handler_; }

  
  // The remaining public functions directly call their equivalent
  // in tool::TagMap.  They are guaranteed to be equivalent for any
  // Collection initialized using an equivalent tool::TagMap.
  

  // Returns true if the provided tag is available (not necessarily set yet).
  bool HasTag(const absl::string_view tag) const {
    return tag_map_->HasTag(tag);
  }

  // Returns the number of entries in this collection.
  int NumEntries() const { return tag_map_->NumEntries(); }

  // Returns the number of entries with the provided tag.
  int NumEntries(const absl::string_view tag) const {
    return tag_map_->NumEntries(tag);
  }

  // Get the id for the tag and index.  This id is guaranteed valid for
  // any Collection which was initialized with an equivalent tool::TagMap.
  // If the tag or index are invalid then an invalid CollectionItemId
  // is returned (with id.IsValid() == false).
  //
  // The id for indexes within the same tag are guaranteed to
  // be sequential.  Meaning, if tag "BLAH" has 3 indexes, then
  // ++GetId("BLAH", 1) == GetId("BLAH", 2)
  // However, be careful in using this fact, as it circumvents the
  // validity checks in GetId() (i.e. ++GetId("BLAH", 2) looks like it
  // is valid, while GetId("BLAH", 3) is not valid).
  CollectionItemId GetId(const absl::string_view tag, int index) const {
    return tag_map_->GetId(tag, index);
  }

  // Returns the names of the tags in this collection.
  std::set GetTags() const { return tag_map_->GetTags(); }

  // Get a tag and index for the specified id.  If the id is not valid,
  // then {"", -1} will be returned.
  std::pair TagAndIndexFromId(CollectionItemId id) const {
    return tag_map_->TagAndIndexFromId(id);
  }

  // The CollectionItemId corresponding to the first element in the collection.
  // Looping over all elements can be done as follows.
  //   for (CollectionItemId id = collection.BeginId();
  //        id < collection.EndId(); ++id) {
  //   }
  // However, if only one collection is involved, prefer using a range
  // based for loop.
  //   for (Packet packet : Inputs()) {
  //   }
  CollectionItemId BeginId() const { return tag_map_->BeginId(); }
  // The CollectionItemId corresponding to an element immediately after
  // the last element of the collection.
  CollectionItemId EndId() const { return tag_map_->EndId(); }

  // Same as BeginId()/EndId() but for only one tag.  If the tag doesn't
  // exist then an invalid CollectionItemId is returned.  It is guaranteed
  // that a loop constructed in this way will successfully not be entered
  // for invalid tags.
  //   for (CollectionItemId id = collection.BeginId(tag);
  //        id < collection.EndId(tag); ++id) {
  //   }
  CollectionItemId BeginId(const absl::string_view tag) const {
    return tag_map_->BeginId(tag);
  }
  CollectionItemId EndId(const absl::string_view tag) const {
    return tag_map_->EndId(tag);
  }

  // Equal Collections contain equal mappings and equal elements.
  bool operator==(const Collection& other) const {
    if (tag_map_->Mapping() != other.TagMap()->Mapping()) {
      return false;
    }
    for (CollectionItemId id = BeginId(); id < EndId(); ++id) {
      if (Get(id) != other.Get(id)) {
        return false;
      }
    }
    return true;
  }
  bool operator!=(const Collection& other) const {
    return !(*this == other);
  }

 private:
  // An iterator which is identical to ItType** except that the
  // dereference operator (operator*) does a double dereference and
  // returns an ItType.
  //
  // This class is thread compatible.
  template 
  class DoubleDerefIterator {
   public:
    using iterator_category = std::random_access_iterator_tag;
    using value_type = ItType;
    using difference_type = std::ptrdiff_t;
    using pointer = ItType*;
    using reference = ItType&;

    DoubleDerefIterator() : ptr_(nullptr) {}

    reference operator*() { return **ptr_; }

    pointer operator->() { return *ptr_; }

    reference operator[](difference_type d) { return **(ptr_ + d); }

    // Member operators.
    DoubleDerefIterator& operator++() {
      ++ptr_;
      return *this;
    }
    DoubleDerefIterator operator++(int) {
      DoubleDerefIterator output(ptr_);
      ++ptr_;
      return output;
    }
    DoubleDerefIterator& operator--() {
      --ptr_;
      return *this;
    }
    DoubleDerefIterator operator--(int) {
      DoubleDerefIterator output(ptr_);
      --ptr_;
      return output;
    }
    DoubleDerefIterator& operator+=(difference_type d) {
      ptr_ += d;
      return *this;
    }
    DoubleDerefIterator& operator-=(difference_type d) {
      ptr_ -= d;
      return *this;
    }

    // Non-member binary operators.
    friend bool operator==(DoubleDerefIterator lhs, DoubleDerefIterator rhs) {
      return lhs.ptr_ == rhs.ptr_;
    }
    friend bool operator!=(DoubleDerefIterator lhs, DoubleDerefIterator rhs) {
      return lhs.ptr_ != rhs.ptr_;
    }
    friend bool operator<(DoubleDerefIterator lhs, DoubleDerefIterator rhs) {
      return lhs.ptr_ < rhs.ptr_;
    }
    friend bool operator<=(DoubleDerefIterator lhs, DoubleDerefIterator rhs) {
      return lhs.ptr_ <= rhs.ptr_;
    }
    friend bool operator>(DoubleDerefIterator lhs, DoubleDerefIterator rhs) {
      return lhs.ptr_ > rhs.ptr_;
    }
    friend bool operator>=(DoubleDerefIterator lhs, DoubleDerefIterator rhs) {
      return lhs.ptr_ >= rhs.ptr_;
    }

    friend DoubleDerefIterator operator+(DoubleDerefIterator lhs,
                                         difference_type d) {
      return lhs.ptr_ + d;
    }
    friend DoubleDerefIterator operator+(difference_type d,
                                         DoubleDerefIterator rhs) {
      return rhs.ptr_ + d;
    }
    friend DoubleDerefIterator& operator-(DoubleDerefIterator lhs,
                                          difference_type d) {
      return lhs.ptr_ - d;
    }
    friend difference_type operator-(DoubleDerefIterator lhs,
                                     DoubleDerefIterator rhs) {
      return lhs.ptr_ - rhs.ptr_;
    }

   private:
    explicit DoubleDerefIterator(ItType* const* data) : ptr_(data) {}

    ItType* const* ptr_;

    friend class Collection;
  };

  // TagMap for the collection.
  std::shared_ptr tag_map_;

  // Indexed by Id.  Use an array directly so that the type does not
  // have to be copy constructable.  The array has tag_map_->NumEntries()
  // elements.
  std::unique_ptr data_;

  // A class which allows errors to be reported flexibly.  The default
  // instantiation performs a LOG(FATAL) and does not have any member
  // variables (zero size).
  ErrorHandler error_handler_;
};

// Definitions of templated functions for Collection.

template 
Collection::Collection(
    std::shared_ptr tag_map)
    : tag_map_(std::move(tag_map)) {
  if (tag_map_->NumEntries() != 0) {
    data_ = absl::make_unique(tag_map_->NumEntries());
  }
}

template 
Collection::Collection(
    const tool::TagAndNameInfo& info)
    : Collection(tool::TagMap::Create(info).value()) {}

template 
Collection::Collection(const int num_entries)
    : Collection(tool::CreateTagMap(num_entries).value()) {}

template 
Collection::Collection(
    const std::initializer_list& tag_names)
    : Collection(tool::CreateTagMapFromTags(tag_names).value()) {}

template 
bool Collection::UsesTags() const {
  auto& mapping = tag_map_->Mapping();
  if (mapping.size() > 1) {
    // At least one tag is not "".
    return true;
  }
  if (mapping.empty()) {
    // The mapping is empty, it doesn't use tags.
    return false;
  }
  // If the one tag present is non-empty then we are using tags.
  return !mapping.begin()->first.empty();
}

template 
typename Collection::value_type&
Collection::Get(CollectionItemId id) {
  CHECK_LE(BeginId(), id);
  CHECK_LT(id, EndId());
  return begin()[id.value()];
}

template 
const typename Collection::value_type&
Collection::Get(CollectionItemId id) const {
  CHECK_LE(BeginId(), id);
  CHECK_LT(id, EndId());
  return begin()[id.value()];
}

template 
typename Collection::value_type*&
Collection::GetPtr(CollectionItemId id) {
  static_assert(storage == CollectionStorage::kStorePointer,
                "mediapipe::internal::Collection::GetPtr() is only "
                "available for collections that were defined with template "
                "argument storage == CollectionStorage::kStorePointer.");
  CHECK_LE(BeginId(), id);
  CHECK_LT(id, EndId());
  return data_[id.value()];
}

template 
const typename Collection::value_type*
Collection::GetPtr(CollectionItemId id) const {
  static_assert(storage == CollectionStorage::kStorePointer,
                "mediapipe::internal::Collection::GetPtr() is only "
                "available for collections that were defined with template "
                "argument storage == CollectionStorage::kStorePointer.");
  CHECK_LE(BeginId(), id);
  CHECK_LT(id, EndId());
  return data_[id.value()];
}

template 
typename Collection::value_type&
Collection::Get(const absl::string_view tag,
                                          int index) {
  CollectionItemId id = GetId(tag, index);
  if (!id.IsValid()) {
    return error_handler_.GetFallback(tag, index);
  }
  return begin()[id.value()];
}

template 
const typename Collection::value_type&
Collection::Get(const absl::string_view tag,
                                          int index) const {
  CollectionItemId id = GetId(tag, index);
  if (!id.IsValid()) {
    return error_handler_.GetFallback(tag, index);
  }
  return begin()[id.value()];
}

template 
typename Collection::value_type&
Collection::Index(int index) {
  return Get("", index);
}

template 
const typename Collection::value_type&
Collection::Index(int index) const {
  return Get("", index);
}

template 
typename Collection::value_type&
Collection::Tag(const absl::string_view tag) {
  return Get(tag, 0);
}

template 
const typename Collection::value_type&
Collection::Tag(const absl::string_view tag) const {
  return Get(tag, 0);
}

template 
std::string Collection::DebugString() const {
  std::string output =
      absl::StrCat("Collection of \"", MediaPipeTypeStringOrDemangled(),
                   "\" with\n", tag_map_->DebugString());
  return output;
}

template 
const std::shared_ptr&
Collection::TagMap() const {
  return tag_map_;
}

template 
typename Collection::iterator
Collection::begin() {
  return iterator(data_.get());
}

template 
typename Collection::iterator
Collection::end() {
  return iterator(data_.get() + tag_map_->NumEntries());
}

template 
typename Collection::const_iterator
Collection::begin() const {
  return const_iterator(data_.get());
}

template 
typename Collection::const_iterator
Collection::end() const {
  return const_iterator(data_.get() + tag_map_->NumEntries());
}

}  // namespace internal

// Returns c.HasTag(tag) && !Tag(tag)->IsEmpty() (just for convenience).
// This version is used with Calculator.
template 
bool HasTagValue(const internal::Collection& c,
                 const absl::string_view tag) {
  return c.HasTag(tag) && !c.Tag(tag)->IsEmpty();
}

// Returns c.HasTag(tag) && !Tag(tag).IsEmpty() (just for convenience).
// This version is used with CalculatorBase.
template 
bool HasTagValue(const internal::Collection& c,
                 const absl::string_view tag) {
  return c.HasTag(tag) && !c.Tag(tag).IsEmpty();
}

// Returns c.HasTag(tag) && !Tag(tag).IsEmpty() (just for convenience).
// This version is used with Calculator or CalculatorBase.
template 
bool HasTagValue(const C& c, const absl::string_view tag) {
  return HasTagValue(c->Inputs(), tag);
}

}  // namespace mediapipe

#endif  // MEDIAPIPE_FRAMEWORK_COLLECTION_H_

2 mediapipe 流水线

下面将根据Graph数据流走向深入源码部分需要一定mediapie基础知识储备

mediapipe源码中大量使用boost absel库等apii及智能指针提升性能或者增强鲁棒性

2.1 GpuBufferToImageFrameCalculator

2.1.1 GPU

根据宏定义 MEDIAPIPE_GPU_BUFFER_USE_CV_PIXEL_BUFFER 决定调用 opecv cpu处理还是Gpu处理
GPU 渲染管线运行在opengl上下文GLContext openglThread中

absl::Status GpuBufferToImageFrameCalculator::Process(CalculatorContext* cc) { if (cc->Inputs().Index(0).Value().ValidateAsType().ok()) { cc->Outputs().Index(0).AddPacket(cc->Inputs().Index(0).Value()); return absl::OkStatus(); } #ifdef HAVE_GPU_BUFFER if (cc->Inputs().Index(0).Value().ValidateAsType().ok()) { const auto& input = cc->Inputs().Index(0).Get(); #if MEDIAPIPE_GPU_BUFFER_USE_CV_PIXEL_BUFFER std::unique_ptr frame = CreateImageFrameForCVPixelBuffer(GetCVPixelBufferRef(input)); cc->Outputs().Index(0).Add(frame.release(), cc->InputTimestamp()); #else helper_.RunInGlContext([this, &input, &cc]() { auto src = helper_.CreateSourceTexture(input); std::unique_ptr frame = absl::make_unique( ImageFormatForGpuBufferFormat(input.format()), src.width(), src.height(), ImageFrame::kGlDefaultAlignmentBoundary); helper_.BindFramebuffer(src); const auto info = GlTextureInfoForGpuBufferFormat(input.format(), 0, helper_.GetGlVersion()); glReadPixels(0, 0, src.width(), src.height(), info.gl_format, info.gl_type, frame->MutablePixelData()); glFlush(); cc->Outputs().Index(0).Add(frame.release(), cc->InputTimestamp()); src.Release(); }); #endif // MEDIAPIPE_GPU_BUFFER_USE_CV_PIXEL_BUFFER return absl::OkStatus(); } #endif // defined(HAVE_GPU_BUFFER) return absl::Status(absl::StatusCode::kInvalidArgument, "Input packets must be ImageFrame or GpuBuffer."); }

2.1.2 CPU

…

2.2 FlowLimiterCalculator

节流图像流向下游流量控制。它穿过第一个传入的图像不变，并等待 tflitetensorstodetectioncalculator 下游图完成在它通过另一个之前生成相应的检测形象。所有在等待期间进入的图像都将被删除，从而限制了图像的数量在这个计算器和。之间的飞行图像的数目 tflitetensorstodetectioncalculator到1。这防止了中间的节点从传入的图像和数据排队过多，这导致增加延迟和内存使用，在实时移动应用程序中不需要。它还消除不必要的计算，例如，ImageTransformationCalculator可能会被拖到下游，如果后续的tfliteeconvertercalculator或tfliteinterencecalculator仍在忙处理之前的输入。

2.2.1 FlowLimiterCalculator

flow_limiter_calculator.cc

FlowLimiterCalculator类是MediaPipe框架中用于控制数据流的一个组件。它主要负责限制输入数据的流速，以避免模型过载或运算资源不足的情况。

在源代码中，FlowLimiterCalculator类通常会定义一个数据结构来存储和管理输入数据的流速限制信息。它包含以下内容：
数据缓冲区：用于存储输入数据，以便在下一帧图像可用之前进行运算。
计时器：用于计算输入数据到达的时间间隔，并根据设定的流速限制来决定是否将下一帧图像送入模型。
流速限制参数：这些参数可以设定输入数据的最大流速，例如每秒处理的帧数。
状态变量：用于记录当前处理的帧数和已处理的帧数，以便在达到流速限制时停止处理新的帧。
FlowLimiterCalculator类的核心功能如下：

接收输入数据：每当有新的帧图像可用时，FlowLimiterCalculator会接收并存储在数据缓冲区中。
计算时间间隔：计时器会记录当前帧与上一帧之间的时间间隔。
判断是否达到流速限制：根据计时器记录的时间间隔和设定的流速限制参数，FlowLimiterCalculator会判断是否达到最大流速。如果达到流速限制，将停止处理新的帧，直到当前处理的帧数达到已处理的帧数为止。
处理帧：如果当前帧没有被丢弃（即未达到流速限制），FlowLimiterCalculator会将该帧送入模型进行运算处理。
更新状态变量：每次处理完一帧后，FlowLimiterCalculator会更新已处理的帧数状态变量，以便在达到流速限制时正确停止处理新的帧。

// Releases input packets allowed by the max_in_flight constraint. absl::Status Process(CalculatorContext* cc) final { options_ = tool::RetrieveOptions(options_, cc->Inputs()); // Process the FINISHED input stream. Packet finished_packet = cc->Inputs().Tag(kFinishedTag).Value(); if (finished_packet.Timestamp() == cc->InputTimestamp()) { while (!frames_in_flight_.empty() && frames_in_flight_.front() <= finished_packet.Timestamp()) { frames_in_flight_.pop_front(); } } // Process the frame input streams. for (int i = 0; i < cc->Inputs().NumEntries(""); ++i) { Packet packet = cc->Inputs().Get("", i).Value(); if (!packet.IsEmpty()) { input_queues_[i].push_back(packet); } } // Abandon expired frames in flight. Note that old frames are abandoned // when much newer frame timestamps arrive regardless of elapsed time. TimestampDiff timeout = options_.in_flight_timeout(); Timestamp latest_ts = cc->Inputs().Get("", 0).Value().Timestamp(); if (timeout > 0 && latest_ts == cc->InputTimestamp() && latest_ts < Timestamp::Max()) { while (!frames_in_flight_.empty() && (latest_ts - frames_in_flight_.front()) > timeout) { frames_in_flight_.pop_front(); } } // Release allowed frames from the main input queue. auto& input_queue = input_queues_[0]; while (ProcessingAllowed() && !input_queue.empty()) { Packet packet = input_queue.front(); input_queue.pop_front(); cc->Outputs().Get("", 0).AddPacket(packet); SendAllow(true, packet.Timestamp(), cc); frames_in_flight_.push_back(packet.Timestamp()); } // Limit the number of queued frames. // Note that frames can be dropped after frames are released because // frame-packets and FINISH-packets never arrive in the same Process call. while (input_queue.size() > options_.max_in_queue()) { Packet packet = input_queue.front(); input_queue.pop_front(); SendAllow(false, packet.Timestamp(), cc); } // Propagate the input timestamp bound. if (!input_queue.empty()) { Timestamp bound = input_queue.front().Timestamp(); SetNextTimestampBound(bound, &cc->Outputs().Get("", 0)); } else { Timestamp bound = cc->Inputs().Get("", 0).Value().Timestamp().NextAllowedInStream(); SetNextTimestampBound(bound, &cc->Outputs().Get("", 0)); if (cc->Outputs().HasTag(kAllowTag)) { SetNextTimestampBound(bound, &cc->Outputs().Tag(kAllowTag)); } } ProcessAuxiliaryInputs(cc); return absl::OkStatus(); }

code fragment

just flag tag

// Outputs a packet indicating whether a frame was sent or dropped. void SendAllow(bool allow, Timestamp ts, CalculatorContext* cc) { if (cc->Outputs().HasTag(kAllowTag)) { cc->Outputs().Tag(kAllowTag).AddPacket(MakePacket(allow).At(ts)); } }

调度器如何处理呢

3 输入变换

image_transformation_calculator.cc

ImageTransformationCalculator 类是一个用于图像处理的类，它主要负责应用各种图像变换。这些变换可以包括旋转、缩放、剪切、扭曲等。此类通常用于图像增强、图像恢复以及计算机视觉任务。

3.1 ImageTransformationCalculator

图像操作功能：ImageTransformationCalculator 类应该具有能够读取、写入和处理图像的功能。这可能包括对图像进行解码和编码，处理图像文件，以及在内存中操作图像数据。
变换计算功能：此类应该具有能够计算和应用各种图像变换的功能。这可能包括旋转、缩放、剪切、扭曲、平移等变换。这些变换的计算可能需要使用一些数学和计算机视觉库，如OpenCV。
可配置性：此类可能具有一些配置选项，允许用户指定变换的类型、参数以及其他选项。这使得用户可以根据自己的需求定制变换的计算和应用方式。
线程安全性：此类可能需要支持多线程操作。这可能涉及到在多个线程之间共享图像数据和变换状态，以及同步访问共享资源的问题。
错误处理和异常处理：此类可能需要具有一些错误处理和异常处理的机制，以处理例如无法读取图像文件、无法应用某些变换等情况。
性能优化：由于图像处理可能是一个计算密集型的任务，因此此类可能需要使用一些性能优化技术来提高计算效率，例如使用并行计算、缓存等技术。

3.2 process

absl::Status ImageTransformationCalculator::Process(CalculatorContext* cc) { // First update the video header if it is given, based on the rotation and // dimensions specified as side packets or options. This will only be done // once, so streaming transformation changes will not be reflected in // the header. if (cc->Inputs().HasTag(kVideoPrestreamTag) && !cc->Inputs().Tag(kVideoPrestreamTag).IsEmpty() && cc->Outputs().HasTag(kVideoPrestreamTag)) { mediapipe::VideoHeader header = cc->Inputs().Tag(kVideoPrestreamTag).Get(); // Update the header's width and height if needed. ComputeOutputDimensions(header.width, header.height, &header.width, &header.height); cc->Outputs() .Tag(kVideoPrestreamTag) .AddPacket(mediapipe::MakePacket(header).At( mediapipe::Timestamp::PreStream())); } // Override values if specified so. if (cc->Inputs().HasTag("ROTATION_DEGREES") && !cc->Inputs().Tag("ROTATION_DEGREES").IsEmpty()) { rotation_ = DegreesToRotationMode(cc->Inputs().Tag("ROTATION_DEGREES").Get()); } if (cc->Inputs().HasTag("FLIP_HORIZONTALLY") && !cc->Inputs().Tag("FLIP_HORIZONTALLY").IsEmpty()) { flip_horizontally_ = cc->Inputs().Tag("FLIP_HORIZONTALLY").Get(); } if (cc->Inputs().HasTag("FLIP_VERTICALLY") && !cc->Inputs().Tag("FLIP_VERTICALLY").IsEmpty()) { flip_vertically_ = cc->Inputs().Tag("FLIP_VERTICALLY").Get(); } if (cc->Inputs().HasTag("OUTPUT_DIMENSIONS")) { if (cc->Inputs().Tag("OUTPUT_DIMENSIONS").IsEmpty()) { return absl::OkStatus(); } else { const auto& image_size = cc->Inputs().Tag("OUTPUT_DIMENSIONS").Get>(); output_width_ = image_size.first; output_height_ = image_size.second; } } if (use_gpu_) { #if !MEDIAPIPE_DISABLE_GPU if (cc->Inputs().Tag(kGpuBufferTag).IsEmpty()) { return absl::OkStatus(); } return gpu_helper_.RunInGlContext( [this, cc]() -> absl::Status { return RenderGpu(cc); }); #endif // !MEDIAPIPE_DISABLE_GPU } else { if (cc->Inputs().Tag(kImageFrameTag).IsEmpty()) { return absl::OkStatus(); } return RenderCpu(cc); } return absl::OkStatus(); }

这段代码定义了一个名为RunInGlContext的模板函数，它接受一个函数作为参数，并在一个lambda表达式中执行该函数，然后返回一个absl::OkStatus()，即使函数本身没有返回任何结果。

这段代码的主要目的是方便那些需要在OpenGL上下文中执行函数的情况，尤其是当这些函数没有返回结果（即返回类型为void）时。由于std::function不能正确处理返回void的函数，因此在这里使用了模板以避免歧义。

3.2.1 Cpu opencv process

cpu porocess opencv api

absl::Status ImageTransformationCalculator::RenderCpu(CalculatorContext* cc) { cv::Mat input_mat; mediapipe::ImageFormat::Format format; const auto& input = cc->Inputs().Tag(kImageFrameTag).Get(); input_mat = formats::MatView(&input); format = input.Format(); const int input_width = input_mat.cols; const int input_height = input_mat.rows; int output_width; int output_height; ComputeOutputDimensions(input_width, input_height, &output_width, &output_height); if (output_width_ > 0 && output_height_ > 0) { cv::Mat scaled_mat; if (scale_mode_ == mediapipe::ScaleMode_Mode_STRETCH) { int scale_flag = input_mat.cols > output_width_ && input_mat.rows > output_height_ ? cv::INTER_AREA : cv::INTER_LINEAR; cv::resize(input_mat, scaled_mat, cv::Size(output_width_, output_height_), 0, 0, scale_flag); } else { const float scale = std::min(static_cast(output_width_) / input_width, static_cast(output_height_) / input_height); const int target_width = std::round(input_width * scale); const int target_height = std::round(input_height * scale); int scale_flag = scale < 1.0f ? cv::INTER_AREA : cv::INTER_LINEAR; if (scale_mode_ == mediapipe::ScaleMode_Mode_FIT) { cv::Mat intermediate_mat; cv::resize(input_mat, intermediate_mat, cv::Size(target_width, target_height), 0, 0, scale_flag); const int top = (output_height_ - target_height) / 2; const int bottom = output_height_ - target_height - top; const int left = (output_width_ - target_width) / 2; const int right = output_width_ - target_width - left; cv::copyMakeBorder(intermediate_mat, scaled_mat, top, bottom, left, right, options_.constant_padding() ? cv::BORDER_CONSTANT : cv::BORDER_REPLICATE); } else { cv::resize(input_mat, scaled_mat, cv::Size(target_width, target_height), 0, 0, scale_flag); output_width = target_width; output_height = target_height; } } input_mat = scaled_mat; }

3.2.2 Gpu

该部分区分了android ios opengl 跨平台库，平台的opengl上下文已在开头 gpu_server内初始化完成，该部分构建render类通过gpu处理二维数据且以FBO方式，然后将GpuBuffer 设置到输出packet 送往下一节点

TEXTURE_EXTERNAL_OES

TEXTURE_EXTERNAL_OES 和普通纹理的主要区别在于它们的定义和用途。

普通纹理完全由 OpenGL ES 定义、分配和管理。它们是在 OpenGL ES 上下文中创建和使用的纹理。
TEXTURE_EXTERNAL_OES 是一种特殊类型的纹理，它在别处定义和分配，并以某种实现定义的方式导入 OpenGL ES。这种纹理主要用于导入 YUV 视频数据。系统中的一些外部实体定义了格式——它对应用程序不可见，颜色空间转换由驱动程序堆栈神奇地处理。具体支持哪些格式是实现定义的。这种纹理的主要优势是它们能够直接从 BufferQueue 数据进行渲染。例如，在 Android 平台上，BufferQueue 是连接图形数据生产方和消费方的队列，也就表示 OES 纹理能直接拿到某些生产方产生的图形数据进行渲染。

absl::Status ImageTransformationCalculator::RenderGpu(CalculatorContext* cc) { #if !MEDIAPIPE_DISABLE_GPU const auto& input = cc->Inputs().Tag(kGpuBufferTag).Get(); const int input_width = input.width(); const int input_height = input.height(); int output_width; int output_height; ComputeOutputDimensions(input_width, input_height, &output_width, &output_height); if (scale_mode_ == mediapipe::ScaleMode_Mode_FILL_AND_CROP) { const float scale = std::min(static_cast(output_width_) / input_width, static_cast(output_height_) / input_height); output_width = std::round(input_width * scale); output_height = std::round(input_height * scale); } if (cc->Outputs().HasTag("LETTERBOX_PADDING")) { auto padding = absl::make_unique>(); ComputeOutputLetterboxPadding(input_width, input_height, output_width, output_height, padding.get()); cc->Outputs() .Tag("LETTERBOX_PADDING") .Add(padding.release(), cc->InputTimestamp()); } QuadRenderer* renderer = nullptr; GlTexture src1; #if defined(MEDIAPIPE_IOS) if (input.format() == GpuBufferFormat::kBiPlanar420YpCbCr8VideoRange || input.format() == GpuBufferFormat::kBiPlanar420YpCbCr8FullRange) { if (!yuv_renderer_) { yuv_renderer_ = absl::make_unique(); MP_RETURN_IF_ERROR( yuv_renderer_->GlSetup(::mediapipe::kYUV2TexToRGBFragmentShader, {"video_frame_y", "video_frame_uv"})); } renderer = yuv_renderer_.get(); src1 = gpu_helper_.CreateSourceTexture(input, 0); } else // NOLINT(readability/braces) #endif // iOS { src1 = gpu_helper_.CreateSourceTexture(input); #if defined(TEXTURE_EXTERNAL_OES) if (src1.target() == GL_TEXTURE_EXTERNAL_OES) { if (!ext_rgb_renderer_) { ext_rgb_renderer_ = absl::make_unique(); MP_RETURN_IF_ERROR(ext_rgb_renderer_->GlSetup( ::mediapipe::kBasicTexturedFragmentShaderOES, {"video_frame"})); } renderer = ext_rgb_renderer_.get(); } else // NOLINT(readability/braces) #endif // TEXTURE_EXTERNAL_OES { if (!rgb_renderer_) { rgb_renderer_ = absl::make_unique(); MP_RETURN_IF_ERROR(rgb_renderer_->GlSetup()); } renderer = rgb_renderer_.get(); } } RET_CHECK(renderer) << "Unsupported input texture type"; mediapipe::FrameScaleMode scale_mode = mediapipe::FrameScaleModeFromProto( scale_mode_, mediapipe::FrameScaleMode::kStretch); mediapipe::FrameRotation rotation = mediapipe::FrameRotationFromDegrees(RotationModeToDegrees(rotation_)); auto dst = gpu_helper_.CreateDestinationTexture(output_width, output_height, input.format()); gpu_helper_.BindFramebuffer(dst); glActiveTexture(GL_TEXTURE1); glBindTexture(src1.target(), src1.name()); MP_RETURN_IF_ERROR(renderer->GlRender( src1.width(), src1.height(), dst.width(), dst.height(), scale_mode, rotation, flip_horizontally_, flip_vertically_, /*flip_texture=*/false)); glActiveTexture(GL_TEXTURE1); glBindTexture(src1.target(), 0); // Execute GL commands, before getting result. glFlush(); auto output = dst.template GetFrame(); cc->Outputs().Tag(kGpuBufferTag).Add(output.release(), cc->InputTimestamp()); #endif // !MEDIAPIPE_DISABLE_GPU return absl::OkStatus(); }

鸢尾花分类项目 GUI 编织幻境的妖分类数据挖掘人工智能
1.机器学习的定义机器学习是一门人工智能的分支，专注于开发算法和统计模型，使计算机能够在没有明确编程的情况下从数据中自动学习和改进。通过识别数据中的模式和规律，机器学习系统可以做出预测或决策。常见的应用包括图像识别、语音识别、推荐系统等。2.为什么使用鸢尾花数据集（Irisdataset）鸢尾花数据集是一个经典的多类分类问题数据集，由英国统计学家和遗传学家RonaldFisher在1936年引入。
《神经网络与深度学习》(邱锡鹏) 内容概要【不含数学推导】 code_stream #机器学习神经网络
第1章绪论基本概念：介绍了人工智能的发展历程及不同阶段的特点，如符号主义、连接主义、行为主义等。还阐述了深度学习在人工智能领域的重要地位和发展现状，以及其在图像、语音、自然语言处理等多个领域的成功应用。术语解释人工智能：旨在让机器模拟人类智能的技术和科学。深度学习：一种基于对数据进行表征学习的方法，通过构建具有很多层的神经网络模型，自动从大量数据中学习复杂的模式和特征。第2章机器学习概述基本概念：
图像识别与应用狂踹瘸子那条好脚 python
图像识别作为人工智能领域的重要分支，近年来取得了显著进展，其中卷积神经网络（CNN）功不可没。CNN凭借其强大的特征提取能力，在图像分类、目标检测、人脸识别等任务中表现出色，成为图像识别领域的核心技术。一、卷积神经网络：图像识别的利器CNN是一种专门处理网格状数据的深度学习模型，其结构设计灵感来源于生物视觉系统。与全连接神经网络不同，CNN通过卷积层、池化层等结构，能够有效提取图像的局部特征，并逐
知识图谱构建概念、工具、实例调研熟悉的黑曼巴知识图谱人工智能
一、知识图谱的概念知识图谱（Knowledgegraph）知识图谱是一种用图模型来描述知识和建模世界万物之间的关联关系的技术方法。知识图谱由节点和边组成。节点可以是实体，如一个人、一本书等，或是抽象的概念，如人工智能、知识图谱等。边可以是实体的属性，如姓名、书名或是实体之间的关系，如朋友、配偶。知识图谱的早期理念来自SemanticWeb（语义网络），其最初理想是把基于文本链接的万维网落转化为基于
【deepseek与chatGPT辩论】辩论题： “人工智能是否应当具备自主决策能力？” 海宁不掉头发软件工程人工智能人工智能 chatgpt deepseek
探讨辩论题这个提案涉及创建一个精确的辩论题目，旨在测试deepseek的应答能力。创建辩论题目提议设计一个辩论题目以测试deepseek的应答能力。希望这个题目具有挑战性并能够测量其回应质量。好的，来一道适合深度学习的辩论题：辩论题：“人工智能是否应当具备自主决策能力？”这个话题涉及到人工智能的发展、伦理以及未来应用，可以从以下几个方面展开辩论：支持方：认为人工智能的自主决策能力能够加速科技进步，
GenAI 平台，3 分钟即可构建基于 Claude、DeepSeek 的 AI Agent DO_Community 人工智能
DigitalOcean云服务在前不久发布了GenAI平台——一个让任何团队都能在几分钟内构建和部署AI代理的平台。DigitalOcean的GenAI平台持续扩展，让人工智能驱动的开发变得更加易用、灵活且强大。近日，Digitalocean宣布将Anthropic的Claude模型和DeepSeekR1引入Digitalocean的生态系统，为你提供更多构建和部署AI应用的选择。通过Anthro
智享AI直播三代系统，马斯克旗下AI人工智能直播工具,媲美DeepSeek！ V__17671155793 人工智能
智享AI直播三代系统，马斯克旗下AI人工智能直播工具,媲美DeepSeek！在科技飞速发展的当下，人工智能正以前所未有的态势重塑着各个行业的格局。直播领域，作为信息传播与商业交互的前沿阵地，也在AI技术的赋能下迎来了颠覆性的变革。其中，马斯克旗下的智享AI直播三代系统宛如一颗璀璨的新星，横空出世，以其卓越的性能和创新的理念，迅速在竞争激烈的直播市场中崭露头角，甚至被业界誉为可媲美DeepSeek的
DeepSeek与ChatGPT：会取代搜索引擎和人工客服的人工智能革命云边有个稻草人热门文章 chatgpt 搜索引擎人工智能 DeepSeek
云边有个稻草人-CSDN博客在众多创新技术中，DeepSeek和ChatGPT无疑是最为引人注目的。它们通过强大的搜索和对话生成能力，能够改变我们与计算机交互的方式，帮助我们高效地获取信息，增强智能服务。本文将深入探讨这两项技术如何结合使用，为用户提供更精准、更流畅的对话和搜索体验。目录一、介绍1.1什么是DeepSeek？1.2什么是ChatGPT？1.3DeepSeek与ChatGPT的结合：
LLM与知识图谱融合:智能运维知识库构建 AI天才研究院 DeepSeek R1 &大数据AI人工智能大模型 AI大模型企业级应用开发实战 AI实战计算科学神经计算深度学习神经网络大数据人工智能大型语言模型 AI AGI LLM Java Python 架构设计 Agent RPA
1.背景介绍随着信息技术的飞速发展，IT运维管理面临着越来越大的挑战。海量的设备、复杂的网络环境、日益增长的数据量，使得传统的运维方式难以满足需求。为了提高运维效率和质量，智能运维应运而生。智能运维的核心是将人工智能技术应用于运维领域，通过机器学习、深度学习等算法，实现自动化、智能化的运维管理。其中，大语言模型（LLM）和知识图谱是两个重要的技术方向。LLM能够理解和生成自然语言，可以用于构建智能
Python中LLM的知识图谱构建：动态更新与推理二进制独立开发 GenAI与Python 非纯粹GenAI python 知识图谱开发语言自然语言处理人工智能分布式机器学习
文章目录引言1.知识图谱的基本概念1.1知识图谱的定义1.2知识图谱的构建流程2.利用LLM进行知识抽取2.1实体识别2.2关系抽取2.3属性抽取3.知识融合3.1实体对齐3.2冲突消解4.知识存储5.知识推理5.1规则推理5.2基于LLM的推理6.动态更新6.1增量更新6.2实时更新7.结论引言随着人工智能技术的飞速发展，知识图谱（KnowledgeGraph,KG）作为一种结构化的知识表示方法
无需配置！深脑云一键启用DeepSeek全系AI模型小深ai硬件分享人工智能深度学习服务器
解锁无限算力潜能，开启DeepSeek镜像云算力新征程！在人工智能风起云涌的时代，算力就是驱动创新的引擎，而优质的模型镜像则是引领变革的密钥。我们向您介绍一下我们的深脑云算力平台，这里汇聚了DeepSeek的各大版本镜像，为您的科研、开发与创新之路注入强大动力！强大的DeepSeek模型家族DeepSeek，作为AI领域的璀璨明星，以其卓越的性能和先进的技术架构闻名遐迩。我们的平台精心整合了Dee
AI服务器散热黑科技：让芯片“冷静”提速小深ai硬件分享人工智能深度学习服务器
AI服务器为何需要散热黑科技在人工智能飞速发展的当下，AI服务器作为核心支撑，作用重大。从互联网智能推荐，到医疗疾病诊断辅助，从金融风险预测，到教育个性化学习，AI服务器广泛应用，为各类复杂人工智能应用提供强大算力。然而，AI服务器在运行时面临着严峻的散热挑战。随着人工智能技术的不断发展，对AI服务器的计算能力要求越来越高，这使得服务器的功率密度急剧增加。以GPT-4的训练为例，它需要大量的GPU
深度应用场景：DeepSeek —— 探索AI赋能的智慧未来人工智能专属驿站人工智能
深度应用场景：DeepSeek——探索AI赋能的智慧未来随着人工智能的迅猛发展，数据的价值已不再局限于简单的存储与处理，它们正变得更加智能与高效。DeepSeek，这一创新的AI技术平台，正以其独特的深度学习能力，开启了各行各业的智能化变革。让我们走进一个由DeepSeek打造的深度应用场景，探索它如何推动未来的发展。1.智能医疗：精准诊断，拯救生命想象一下，医生们不再是唯一的诊断专家，而是与AI
在 DeepSeek 驱动的编程变革中抓住机遇并脱颖而出智想天开 AI技术人工智能 deep learning
公众号地址:在DeepSeek驱动的编程变革中抓住机遇并脱颖而出更多内容请关注公众号：智想天开前言在DeepSeek引领的新一轮AI技术革新中，程序员们正面临着前所未有的挑战。随着DeepSeek等人工智能工具的迅猛发展，编程领域正在发生深刻变革。这些先进的工具不仅能够自动化完成繁重的代码生成和调试任务，还能够根据大量数据提供优化建议，改变了传统编程的工作流程。虽然这些技术为提高工作效率和解放开发
项目管理新趋势！2024年，Jira与Codes你更倾向谁？ Codes_AndyLiu jira teambition redmine 项目管理软件项目管理工具项目管理 jira 国产平替
一、项目管理软件新趋势概述2024年，项目管理软件呈现出诸多新趋势，这些趋势对于项目管理的重要性日益凸显。在数字化转型方面，项目管理软件成为企业实现数字化转型的关键工具。让老板感知数据，让中层管理者感受先进，让基层员工感到舒心.人工智能与自动化在项目管理软件中的应用也越来越广泛。项目管理软件正朝着智能化、自动化的方向迈进，利用AI技术提供个性化和场景化解决方案。例如，工作周报AI化，自动化测试，代
【人工智能】提升编程效率的6种GPT实用应用技巧！保姆级讲解！ ChatGPT-千鑫人工智能 AI领域人工智能 gpt AI编程
文章目录实用教程：六大AI编程技巧解锁效率提升技巧1：快速实现需求demo操作步骤技巧2：代码审查——AI帮你提升代码质量操作步骤技巧3：错误排查——AI助你快速定位问题操作步骤技巧4：代码注释——AI帮你理解复杂逻辑操作步骤技巧5：数据整理——AI帮你高效准备测试数据操作步骤技巧6：学习未知代码库——AI助你快速掌握新工具操作步骤使用教程：全面掌握CodeMoss的高效编程工具（1）VSCode
利用人工智能增强可读性：自动为文本添加标点符号姚家湾 AI 标点符号
在数字通信时代，文本的清晰度和可读性至关重要。无论是转录口语、处理原始文本数据还是改进用户生成的内容，标点符号在传达预期信息方面都起着至关重要的作用。但是，手动编辑文本以添加标点符号可能非常耗时且容易出错。这就是人工智能(AI)发挥作用的地方，它提供了一种强大的解决方案，可以自动将标点符号插入句子中。目前，利用大模型的能力，完全可以胜任添加标点符号的工作，不需要其它特别的处理程序。参考代码from
《从编程小白到人工智能大神：大学新生Python入门攻略》千帆过尽. python 人工智能
前言在如今这个技术飞速发展的时代，编程已经成为许多大学生不可或缺的技能，尤其是对于人工智能方向的学生来说，编程更是必不可少的一部分。作为一名大三学生，并且专注于Python和人工智能方向，我深知刚开始学习编程时的挑战与迷茫。希望本文能帮助作为大学新生的你们在编程入门的过程中少走弯路，提供一条清晰有效的学习路径。一、编程语言选择作为编程新手，选择一门适合自己的编程语言至关重要。对于希望进入人工智能领
华为的云端训练算力与迭代效率 AI大模型应用之禅 DeepSeek R1 &AI大模型与大数据计算科学神经计算深度学习神经网络大数据人工智能大型语言模型 AI AGI LLM Java Python 架构设计 Agent RPA
华为云、云端训练、算力、迭代效率、人工智能、深度学习、模型训练、分布式训练、优化算法1.背景介绍人工智能（AI）技术近年来发展迅速，深度学习作为其核心驱动力，在图像识别、自然语言处理、语音识别等领域取得了突破性进展。然而，深度学习模型的训练需要海量数据和强大的计算资源，这成为AI技术发展面临的瓶颈之一。云计算作为一种新型的计算模式，为深度学习提供了强大的算力支持。华为云作为国内领先的云计算平台，在
【第四届网络安全、人工智能与数字经济国际学术会议（CSAIDE 2025】网络安全，人工智能，数字经济的研究禁默学术会议话题探讨 web安全人工智能安全数字经济学术论文
重要信息会议官网：www.csaide.net会议时间：2025年3月7-9日会议地点：马来西亚-马来西亚理工大学新山校区（线上+线下混合）简介过去几年，数字经济蓬勃发展，已成为全球经济增长的驱动力。然而，网络安全成为其最大的挑战。为了确保数字经济的可持续发展，人工智能被认为是至关重要的技术手段。第四届网络安全、人工智能与数字经济（CSAIDE2025）将于2025年3月7日至9日在马来西亚举行。
Python从0到100（四）：Python中的运算符介绍(补充) 是Dream呀 python java 数据库
前言：零基础学Python：Python从0到100最新最全教程。想做这件事情很久了，这次我更新了自己所写过的所有博客，汇集成了Python从0到100，共一百节课，帮助大家一个月时间里从零基础到学习Python基础语法、Python爬虫、Web开发、计算机视觉、机器学习、神经网络以及人工智能相关知识，成为学习学习和学业的先行者！欢迎大家订阅专栏：零基础学Python：Python从0到100最新
Python从0到100（三十五）：beautifulsoup的学习是Dream呀 Dream的茶话会 python beautifulsoup 学习
前言：零基础学Python：Python从0到100最新最全教程。想做这件事情很久了，这次我更新了自己所写过的所有博客，汇集成了Python从0到100，共一百节课，帮助大家一个月时间里从零基础到学习Python基础语法、Python爬虫、Web开发、计算机视觉、机器学习、神经网络以及人工智能相关知识，成为学习学习和学业的先行者！欢迎大家订阅专栏：零基础学Python：Python从0到100最新
《深入浅出AI》前言知识：深度学习基础总结 GoAI 深入浅出AI 人工智能深度学习机器学习 cnn rnn 生成对抗网络神经网络
个人主页:GoAI|公众号:GoAI的学习小屋|交流群:704932595|个人简介：掘金签约作者、百度飞桨PPDE、领航团团长、开源特训营导师、CSDN、阿里云社区人工智能领域博客专家、新星计划计算机视觉方向导师等，专注大数据与人工智能知识分享。AI学习星球推荐：GoAI的学习社区知识星球是一个致力于提供《机器学习|深度学习|CV|NLP|大模型|多模态|AIGC》各个最新AI方向综述、论文等成
人工智能与机器学习入门：决策树应用决策树机器学习入门
在人工智能与机器学习入门：使用Kaggle完成Titanic推断学习一文中，给出了使用Kaggle进行机器学习入门的方法，本文基于上文的需求。尝试使用决策树模型来训练数据，并进行test数据集的测试。什么是决策树决策树，简单来讲可以认为是一个大的ifelse判断树，有了决策树后，测试集中的数据便可以使用该决策树进行判断了。比如根据Titanic的训练数据构造了上次决策树后，便可以根据测试数据的性别
怎么使用DeepSeek？DeepSeek使用教程轻创思维网络
1.简介DeepSeek是一款基于人工智能技术的智能搜索引擎和信息检索工具。它能够通过自然语言处理技术理解用户的查询需求，并提供精准、全面的搜索结果。无论您是想查找信息、解答问题还是进行创意写作，DeepSeek都能为您提供高效的支持。2.主要功能智能搜索：支持自然语言输入，快速获取精准结果。多语言支持：支持中文、英文及其他多种语言的输入和输出。知识库覆盖：整合海量互联网信息，覆盖百科、新闻、学术
DeepSeek的实用方法DeepSeek+kimi生成PPT C_V_Better AI人工智能人工智能 ppt ai
在人工智能领域，DeepSeek和KimiAI作为强大的语言模型，为开发者和普通用户提供了丰富的功能。本文将详细介绍DeepSeek的实用方法，以及如何结合KimiAI生成PPT，帮助您快速上手并发挥其强大能力。一、DeepSeek的使用方法（一）注册与登录访问官网：打开浏览器，输入DeepSeek官网。注册账号：点击“注册”按钮，填写邮箱地址、设置密码，并完成邮箱验证。登录：注册成功后，使用注册
从零到入门：人工智能学习路径全解析这题有点难度人工智能学习
一、打破迷雾：重新认识人工智能人工智能（AI）早已不再是科幻电影中的专属概念，而是渗透到我们生活的方方面面。从手机里的语音助手到电商平台的推荐系统，从自动驾驶到医疗影像分析，AI技术正在重塑人类社会的运行方式。对于初学者而言，建立正确的认知框架至关重要：1.技术图谱解析：机器学习（ML）：AI的核心驱动力，使计算机具备从数据中学习的能力深度学习（DL）：基于神经网络的进阶技术，擅长处理图像、语音等
常用的高性能计算工具有哪些这题有点难度人工智能学习
在当今数字化时代，高性能计算（HPC）已成为推动科学、工程、技术以及商业创新的核心力量。无论是模拟宇宙的起源、设计新型航空器，还是训练复杂的人工智能模型，HPC都扮演着不可或缺的角色。本文将深入探讨高性能计算的定义、其背后的强大工具，以及它们如何助力各领域的突破性发展。一、高性能计算：定义与意义高性能计算（HPC）是一种利用超级计算机或大规模集群来处理复杂计算任务的技术。它通过并行计算和优化算法，
合作伙伴中心Partner Center中添加了Copilot预览版 xueyunshengling 微软合作伙伴计划合作伙伴中心 copilot Copilot预览版
目录一、引言二、Copilot功能概述2.1Copilot简介2.2Copilot的核心功能2.3Copilot的访问和使用三、Copilot的使用方法3.1Copilot功能区域3.2Copilot使用示例3.2.1编写有效提示3.2.2使用反馈循环四、负责任的人工智能4.1Copilot结果的可靠性4.2意外或冒犯性内容的处理4.3Copilot数据收集五、总结一、引言合作伙伴中心（预览版）中
《DeepSeek模型压缩：在高效与性能间寻平衡》人工智能深度学习
在人工智能飞速发展的当下，大语言模型不断迭代升级，规模与性能同步攀升。DeepSeek作为其中的佼佼者，在模型压缩技术上不断探索，力求在减小模型体积的同时，最大程度保留模型性能，为更广泛的应用场景提供支持。量化：用低精度表达，换存储空间与计算效率量化技术是DeepSeek模型压缩的关键手段之一，它将模型中的高精度浮点数参数转换为低比特数的整数或定点数，从而实现存储空间的大幅缩减与计算速度的提升。从
jvm调优总结（从基本概念到深度优化） oloz java jvm jdk 虚拟机应用服务器
JVM参数详解：http://www.cnblogs.com/redcreen/archive/2011/05/04/2037057.html Java虚拟机中，数据类型可以分为两类：基本类型和引用类型。基本类型的变量保存原始值，即：他代表的值就是数值本身；而引用类型的变量保存引用值。“引用值”代表了某个对象的引用，而不是对象本身，对象本身存放在这个引用值所表示的地址的位置。
【Scala十六】Scala核心十：柯里化函数 bit1129 scala
本篇文章重点说明什么是函数柯里化，这个语法现象的背后动机是什么，有什么样的应用场景，以及与部分应用函数(Partial Applied Function)之间的联系 1. 什么是柯里化函数 A way to write functions with multiple parameter lists. For instance def f(x: Int)(y: Int) is a
HashMap dalan_123 java
HashMap在java中对很多人来说都是熟的；基于hash表的map接口的非同步实现。允许使用null和null键；同时不能保证元素的顺序；也就是从来都不保证其中的元素的顺序恒久不变。 1、数据结构在java中，最基本的数据结构无外乎：数组和引用（指针），所有的数据结构都可以用这两个来构造，HashMap也不例外，归根到底HashMap就是一个链表散列的数据
Java Swing如何实时刷新JTextArea，以显示刚才加append的内容周凡杨 java 更新 swing JTextArea
在代码中执行完textArea.append("message")后，如果你想让这个更新立刻显示在界面上而不是等swing的主线程返回后刷新，我们一般会在该语句后调用textArea.invalidate()和textArea.repaint()。问题是这个方法并不能有任何效果，textArea的内容没有任何变化，这或许是swing的一个bug，有一个笨拙的办法可以实现
servlet或struts的Action处理ajax请求 g21121 servlet
其实处理ajax的请求非常简单，直接看代码就行了： //如果用的是struts //HttpServletResponse response = ServletActionContext.getResponse(); // 设置输出为文字流 response.setContentType("text/plain"); // 设置字符集 res
FineReport的公式编辑框的语法简介老A不折腾 finereport 公式总结
FINEREPORT用到公式的地方非常多，单元格（以=开头的便被解析为公式），条件显示，数据字典，报表填报属性值定义，图表标题，轴定义，页眉页脚，甚至单元格的其他属性中的鼠标悬浮提示内容都可以写公式。简单的说下自己感觉的公式要注意的几个地方： 1.if语句语法刚接触感觉比较奇怪，if(条件式子,值1,值2)，if可以嵌套，if(条件式子1，值1，if(条件式子2，值2，值3)
linux mysql 数据库乱码的解决办法墙头上一根草 linux mysql 数据库乱码
linux 上mysql数据库区分大小写的配置 lower_case_table_names=1 1-不区分大小写 0-区分大小写修改/etc/my.cnf 具体的修改内容如下: [client] default-character-set=utf8 [mysqld] datadir=/var/lib/mysql socket=/va
我的spring学习笔记6-ApplicationContext实例化的参数兼容思想 aijuans Spring 3
ApplicationContext能读取多个Bean定义文件，方法是： ApplicationContext appContext = new ClassPathXmlApplicationContext（ new String[]｛“bean-config1.xml”，“bean-config2.xml”，“bean-config3.xml”，“bean-config4.xml
mysql 基准测试之sysbench annan211 基准测试 mysql基准测试 MySQL测试 sysbench
1 执行如下命令，安装sysbench-0.5： tar xzvf sysbench-0.5.tar.gz cd sysbench-0.5 chmod +x autogen.sh ./autogen.sh ./configure --with-mysql --with-mysql-includes=/usr/local/mysql
sql的复杂查询使用案列与技巧百合不是茶 oracle sql 函数数据分页合并查询
本片博客使用的数据库表是oracle中的scott用户表; ------------------- 自然连接查询查询 smith 的上司(两种方法) &
深入学习Thread类 bijian1013 java thread 多线程 java多线程
一．线程的名字下面来看一下Thread类的name属性，它的类型是String。它其实就是线程的名字。在Thread类中，有String getName()和void setName(String)两个方法用来设置和获取这个属性的值。同时，Thr
JSON串转换成Map以及如何转换到对应的数据类型 bijian1013 java fastjson net.sf.json
在实际开发中，难免会碰到JSON串转换成Map的情况，下面来看看这方面的实例。另外，由于fastjson只支持JDK1.5及以上版本，因此在JDK1.4的项目中可以采用net.sf.json来处理。一.fastjson实例 JsonUtil.java package com.study; impor
【RPC框架HttpInvoker一】HttpInvoker：Spring自带RPC框架 bit1129 spring
HttpInvoker是Spring原生的RPC调用框架，HttpInvoker同Burlap和Hessian一样，提供了一致的服务Exporter以及客户端的服务代理工厂Bean，这篇文章主要是复制粘贴了Hessian与Spring集成一文，【RPC框架Hessian四】Hessian与Spring集成在【RPC框架Hessian二】Hessian 对象序列化和反序列化一文中
【Mahout二】基于Mahout CBayes算法的20newsgroup的脚本分析 bit1129 Mahout
#!/bin/bash # # Licensed to the Apache Software Foundation (ASF) under one or more # contributor license agreements. See the NOTICE file distributed with # this work for additional information re
nginx三种获取用户真实ip的方法 ronin47
随着nginx的迅速崛起，越来越多公司将apache更换成nginx. 同时也越来越多人使用nginx作为负载均衡, 并且代理前面可能还加上了CDN加速，但是随之也遇到一个问题：nginx如何获取用户的真实IP地址,如果后端是apache,请跳转到<apache获取用户真实IP地址>，如果是后端真实服务器是nginx，那么继续往下看。实例环境：用户IP 120.22.11.11
java-判断二叉树是不是平衡 bylijinnan java
参考了 http://zhedahht.blog.163.com/blog/static/25411174201142733927831/ 但是用java来实现有一个问题。由于Java无法像C那样“传递参数的地址，函数返回时能得到参数的值”，唯有新建一个辅助类：AuxClass import ljn.help.*; public class BalancedBTree {
BeanUtils.copyProperties VS PropertyUtils.copyProperties 诸葛不亮 PropertyUtils BeanUtils
BeanUtils.copyProperties VS PropertyUtils.copyProperties 作为两个bean属性copy的工具类，他们被广泛使用，同时也很容易误用，给人造成困然；比如：昨天发现同事在使用BeanUtils.copyProperties copy有integer类型属性的bean时，没有考虑到会将null转换为0，而后面的业
[金融与信息安全]最简单的数据结构最安全 comsci 数据结构
现在最流行的数据库的数据存储文件都具有复杂的文件头格式，用操作系统的记事本软件是无法正常浏览的，这样的情况会有什么问题呢？从信息安全的角度来看，如果我们数据库系统仅仅把这种格式的数据文件做异地备份，如果相同版本的所有数据库管理系统都同时被攻击，那么
vi区段删除 Cwind linux vi 区段删除
区段删除是编辑和分析一些冗长的配置文件或日志文件时比较常用的操作。简记下vi区段删除要点备忘。 vi概述引文中并未将末行模式单独列为一种模式。单不单列并不重要，能区分命令模式与末行模式即可。 vi区段删除步骤： 1. 在末行模式下使用:set nu显示行号非必须，随光标移动vi右下角也会显示行号，能够正确找到并记录删除开始行
清除tomcat缓存的方法总结 dashuaifu tomcat 缓存
用tomcat容器，大家可能会发现这样的问题，修改jsp文件后，但用IE打开依然是以前的Jsp的页面。出现这种现象的原因主要是tomcat缓存的原因。解决办法如下: 在jsp文件头加上 <meta http-equiv="Expires" content="0"> <meta http-equiv="kiben&qu
不要盲目的在项目中使用LESS CSS dcj3sjt126com Web less
　如果你还不知道LESS CSS是什么东西，可以看一下这篇文章，是我一朋友写给新人看的《CSS——LESS》　　不可否认，LESS CSS是个强大的工具，它弥补了css没有变量、无法运算等一些“先天缺陷”，但它似乎给我一种错觉，就是为了功能而实现功能。　　比如它的引用功能 ? .rounded_corners{
[入门]更上一层楼 dcj3sjt126com PHP yii2
更上一层楼通篇阅读完整个“入门”部分，你就完成了一个完整 Yii 应用的创建。在此过程中你学到了如何实现一些常用功能，例如通过 HTML 表单从用户那获取数据，从数据库中获取数据并以分页形式显示。你还学到了如何通过 Gii 去自动生成代码。使用 Gii 生成代码把 Web 开发中多数繁杂的过程转化为仅仅填写几个表单就行。本章将介绍一些有助于更好使用 Yii 的资源：
Apache HttpClient使用详解 eksliang httpclient http协议
Http协议的重要性相信不用我多说了，HttpClient相比传统JDK自带的URLConnection，增加了易用性和灵活性（具体区别，日后我们再讨论），它不仅是客户端发送Http请求变得容易，而且也方便了开发人员测试接口（基于Http协议的），即提高了开发的效率，也方便提高代码的健壮性。因此熟练掌握HttpClient是很重要的必修内容，掌握HttpClient后，相信对于Http协议的了解会
zxing二维码扫描功能 gundumw100 android zxing
经常要用到二维码扫描功能现给出示例代码 import com.google.zxing.WriterException; import com.zxing.activity.CaptureActivity; import com.zxing.encoding.EncodingHandler; import android.app.Activity; import an
纯HTML+CSS带说明的黄色导航菜单 ini html Web html5 css hovertree
HoverTree带说明的CSS菜单:纯HTML+CSS结构链接带说明的黄色导航在线体验效果：http://hovertree.com/texiao/css/1.htm代码如下,保存到HTML文件可以看到效果： <!DOCTYPE html > <html > <head> <title>HoverTree
fastjson初始化对性能的影响 kane_xie fastjson 序列化
之前在项目中序列化是用thrift，性能一般，而且需要用编译器生成新的类，在序列化和反序列化的时候感觉很繁琐，因此想转到json阵营。对比了jackson，gson等框架之后，决定用fastjson，为什么呢，因为看名字感觉很快。。。网上的说法： fastjson 是一个性能很好的 Java 语言实现的 JSON 解析器和生成器，来自阿里巴巴的工程师开发。
基于Mybatis封装的增删改查实现通用自动化sql mengqingyu DAO
1.基于map或javaBean的增删改查可实现不写dao接口和实现类以及xml，有效的提高开发速度。 2.支持自定义注解包括主键生成、列重复验证、列名、表名等 3.支持批量插入、批量更新、批量删除 <bean id="dynamicSqlSessionTemplate" class="com.mqy.mybatis.support.Dynamic
js控制input输入框的方法封装(数字，中文，字母，浮点数等) qifeifei javascript js
在项目开发的时候，经常有一些输入框，控制输入的格式，而不是等输入好了再去检查格式，格式错了就报错，体验不好。 /** 数字，中文，字母,浮点数(+/-/.) 类型输入限制，只要在input标签上加上 jInput="number,chinese,alphabet,floating" 备注：floating属性只能单独用*/ funct
java 计时器应用 tangqi609567707 java timer
mport java.util.TimerTask; import java.util.Calendar; public class MyTask extends TimerTask { private static final int
erlang输出调用栈信息 wudixiaotie erlang
在erlang otp的开发中，如果调用第三方的应用，会有有些错误会不打印栈信息，因为有可能第三方应用会catch然后输出自己的错误信息，所以对排查bug有很大的阻碍，这样就要求我们自己打印调用的栈信息。用这个函数：erlang:process_display (self (), backtrace).需要注意这个函数只会输出到标准错误输出。也可以用这个函数：erlang:get_s

mediapipe流水线分析一

object detection Graph

一流水线上游输入处理

1 Calculator

1.1 CalculatorBase

2 mediapipe 流水线

2.1 GpuBufferToImageFrameCalculator

2.1.1 GPU

2.1.2 CPU

2.2 FlowLimiterCalculator

2.2.1 FlowLimiterCalculator

3 输入变换

3.1 ImageTransformationCalculator

3.2 process

3.2.1 Cpu opencv process

3.2.2 Gpu

你可能感兴趣的:(人工智能,mediapipe)

mediapipe流水线分析 一

object detection Graph

一 流水线上游输入处理

1 Calculator

1.1 CalculatorBase

2 mediapipe 流水线

2.1 GpuBufferToImageFrameCalculator

2.1.1 GPU

2.1.2 CPU

2.2 FlowLimiterCalculator

2.2.1 FlowLimiterCalculator

3 输入变换

3.1 ImageTransformationCalculator

3.2 process

3.2.1 Cpu opencv process

3.2.2 Gpu

你可能感兴趣的:(人工智能,mediapipe)

mediapipe流水线分析一

一流水线上游输入处理