贺志国
2023.6.28
无锁数据结构意味着线程可以并发地访问数据结构而不出错。例如,一个无锁栈能同时允许一个线程压入数据,另一个线程弹出数据。不仅如此,当调度器中途挂起其中一个访问线程时,其他线程必须能够继续完成自己的工作,而无需等待挂起线程。
无锁栈的一个很大的问题在于:如何在不加锁的前提下,正确地分配和释放节点的内存,同时不引起逻辑错误和程序崩溃。
一个最朴素的想法是,使用智能指针std::shared_ptr管理节点。事实上,如果平台上std::atomic_is_lock_free(&some_shared_ptr)返回true,那么所有内存回收问题就都迎刃而解了(我在X86和Arm平台测试,均返回false)。示例代码(文件命名为 lock_free_stack.h)如下:
#pragma once
#include <atomic>
#include <memory>
template <typename T>
class LockFreeStack {
public:
LockFreeStack(): head_(nullptr) {}
~LockFreeStack() {
while (Pop()) {
// Do nothing and wait until all elements are popped.
}
}
LockFreeStack(const LockFreeStack& other) = delete;
LockFreeStack& operator=(const LockFreeStack& other) = delete;
bool IsEmpty() const { return std::atomic_load(&head_) == nullptr; }
void Push(const T& data) {
const auto new_node = std::make_shared<Node>(data);
new_node->next = std::atomic_load(&head_);
// If new_node->next is the same as head_, update head_ to new_node and
// return true.
// If new_node->next and head_ are not equal, update new_node->next to head_
// and return false.
while (
!std::atomic_compare_exchange_weak(&head_, &new_node->next, new_node)) {
// Do nothing and wait for the head_ is updated to new_node.
}
}
std::shared_ptr<T> Pop() {
std::shared_ptr<Node> old_head = std::atomic_load(&head_);
// If old_head is not a null pointer and it is the same as head_, update
// head_ to old_head->next and return true.
// If old_head is not a null pointer and it is not equal to head_, update
// old_head to head_ and return false.
while (old_head != nullptr &&
!std::atomic_compare_exchange_weak(
&head_, &old_head, std::atomic_load(&old_head->next))) {
// Do nothing and wait for the head_ is updated to old_head->next.
}
if (old_head != nullptr) {
std::atomic_store(&old_head->next, std::shared_ptr<Node>());
return old_head->data;
}
return std::shared_ptr<T>();
}
private:
struct Node {
// std::make_shared does not throw an exception.
Node(const T& input_data)
: data(std::make_shared<T>(input_data)), next(nullptr) {}
std::shared_ptr<T> data;
std::shared_ptr<Node> next;
};
std::shared_ptr<Node> head_;
};
上述代码中,希望借助std::shared_ptr<>来完成节点内存的动态分配和回收,因为其有内置的引用计数机制。不幸的是,虽然std::shared_ptr<>可以用于原子操作,但在大多数平台上并不是无锁的:C++标准库需要借助内部锁来实现这些原子操作,这会带来极大的性能开销,无法满足高并发访问的需求。
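在动手实现之前,可以先用如下最小示例检测当前平台上std::shared_ptr的原子自由函数是否无锁(这只是一个示意性的检测程序,与本文的无锁栈实现无关):
#include <atomic>
#include <iostream>
#include <memory>

int main() {
  auto sp = std::make_shared<int>(42);
  // Prints true only if the platform implements the std::shared_ptr atomic
  // free functions without an internal lock.
  std::cout << std::boolalpha << std::atomic_is_lock_free(&sp) << std::endl;
  return 0;
}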
如果编译器支持C++20标准,std::atomic<std::shared_ptr>允许用户原子地操纵std::shared_ptr,即在确保原子操作的同时,还能正确地处理引用计数。与其他原子类型一样,其实现也不一定是无锁的。使用std::atomic<std::shared_ptr>实现无锁栈(表面上看肯定无锁,实际上是否无锁取决于std::atomic<std::shared_ptr>的is_lock_free函数返回值是否为true)的示例代码(文件命名为 lock_free_stack.h)如下:
#pragma once
#include <atomic>
#include <memory>
template <typename T>
class LockFreeStack {
public:
LockFreeStack() : head_(nullptr) {}
~LockFreeStack() {
while (Pop()) {
// Do nothing and wait until all elements are popped.
}
}
LockFreeStack(const LockFreeStack& other) = delete;
LockFreeStack& operator=(const LockFreeStack& other) = delete;
bool IsEmpty() const { return head_.load() == nullptr; }
void Push(const T& data) {
const auto new_node = std::make_shared<Node>(data);
std::shared_ptr<Node> old_head = head_.load();
new_node->next = old_head;
// If old_head is the same as head_, update head_ to new_node and return
// true. If old_head and head_ are not equal, update old_head to head_ and
// return false.
while (!head_.compare_exchange_weak(old_head, new_node)) {
new_node->next = old_head;
}
}
std::shared_ptr<T> Pop() {
std::shared_ptr<Node> old_head = head_.load();
// If old_head is not a null pointer and it is the same as head_, update
// head_ to old_head->next and return true.
// If old_head is not a null pointer and it is not equal to head_, update
// old_head to head_ and return false.
while (old_head != nullptr &&
!head_.compare_exchange_weak(old_head, old_head->next.load())) {
// Do nothing and wait for the head_ is updated to old_head->next.
}
if (old_head != nullptr) {
old_head->next = std::shared_ptr<Node>();
return old_head->data;
}
return std::shared_ptr<T>();
}
private:
struct Node {
// std::make_shared does not throw an exception.
Node(const T& input_data)
: data(std::make_shared<T>(input_data)), next(nullptr) {}
std::shared_ptr<T> data;
std::atomic<std::shared_ptr<Node>> next;
};
// Compilation error: /usr/include/c++/9/atomic:191:21: error: static
// assertion failed: std::atomic requires a trivially copyable type
// static_assert(__is_trivially_copyable(_Tp),
std::atomic<std::shared_ptr<Node>> head_;
};
我的编译器目前只支持C++17标准,上述代码会出现如下编译错误:
In file included from /home/zhiguohe/code/excercise/lock_freee/lock_free_stack_with_shared_ptr_cpp/lock_free_stack_with_shared_ptr.h:3,
from /home/zhiguohe/code/excercise/lock_freee/lock_free_stack_with_shared_ptr_cpp/lock_free_stack_with_shared_ptr.cpp:1:
/usr/include/c++/9/atomic: In instantiation of ‘struct std::atomic<std::shared_ptr<LockFreeStack<int>::Node> >’:
/home/zhiguohe/code/excercise/lock_freee/lock_free_stack_with_shared_ptr_cpp/lock_free_stack_with_shared_ptr.h:61:38: required from ‘class LockFreeStack<int>’
/home/zhiguohe/code/excercise/lock_freee/lock_free_stack_with_shared_ptr_cpp/lock_free_stack_with_shared_ptr.cpp:16:22: required from here
/usr/include/c++/9/atomic:191:21: error: static assertion failed: std::atomic requires a trivially copyable type
191 | static_assert(__is_trivially_copyable(_Tp),
| ^~~~~~~~~~~~~~~~~~~~~~~~~~~~
make[2]: *** [CMakeFiles/lock_free_stack_with_shared_ptr_cpp.dir/build.make:63: CMakeFiles/lock_free_stack_with_shared_ptr_cpp.dir/lock_free_stack_with_shared_ptr.cpp.o] Error 1
make[1]: *** [CMakeFiles/Makefile2:644: CMakeFiles/lock_free_stack_with_shared_ptr_cpp.dir/all] Error 2
make: *** [Makefile:117: all] Error 2
如果编译器不支持C++20标准,我们就需要手动管理节点的内存分配和回收。一种简单的思路是:判断当前有无其他线程正在访问Pop函数,如果没有,则删除所有弹出的节点;否则将弹出的节点存入待删除列表to_be_deleted_中,等到最终没有线程访问Pop函数时,再释放to_be_deleted_。下面展示该思路的实现代码(文件命名为 lock_free_stack.h,示例来源于C++ Concurrency In Action, 2ed 2019,修复了其中的bug):
#pragma once
#include <atomic>
#include <memory>
template <typename T>
class LockFreeStack {
public:
LockFreeStack()
: head_(nullptr), to_be_deleted_(nullptr), threads_in_pop_(0) {}
~LockFreeStack() {
while (Pop()) {
// Do nothing and wait until all elements are popped.
}
}
LockFreeStack(const LockFreeStack& other) = delete;
LockFreeStack& operator=(const LockFreeStack& other) = delete;
bool IsEmpty() const { return head_.load() == nullptr; }
void Push(const T& data) {
Node* new_node = new Node(data);
new_node->next = head_.load();
// If new_node->next is the same as head_, update head_ to new_node and
// return true.
// If new_node->next and head_ are not equal, update new_node->next to head_
// and return false.
while (!head_.compare_exchange_weak(new_node->next, new_node)) {
// Do nothing and wait for the head_ is updated to new_node.
}
}
std::shared_ptr<T> Pop() {
Node* old_head = head_.load();
// If old_head is not a null pointer and it is the same as head_, update
// head_ to old_head->next and return true.
// If old_head is not a null pointer and it is not equal to head_, update
// old_head to head_ and return false.
while (old_head != nullptr &&
!head_.compare_exchange_weak(old_head, old_head->next)) {
// Do nothing and wait for the head_ is updated to old_head->next.
}
// return old_head != nullptr ? old_head->data : std::shared_ptr();
std::shared_ptr<T> res;
if (old_head != nullptr) {
++threads_in_pop_;
res.swap(old_head->data);
// Reclaim deleted nodes.
TryReclaim(old_head);
}
return res;
}
private:
// If the struct definition of Node is placed in the private section below,
// next to the data member 'head_' that uses it, the following compilation
// error will occur:
//
// error: 'Node' has not been declared ...
//
// This is not a compiler bug: unlike member function bodies, the type used in
// a data member declaration must already be declared at that point, so Node
// is defined here, before the members that refer to it.
struct Node {
// std::make_shared does not throw an exception.
Node(const T& input_data)
: data(std::make_shared<T>(input_data)), next(nullptr) {}
std::shared_ptr<T> data;
Node* next;
};
private:
static void DeleteNodes(Node* nodes) {
while (nodes != nullptr) {
Node* next = nodes->next;
delete nodes;
nodes = next;
}
}
void ChainPendingNodes(Node* first, Node* last) {
last->next = to_be_deleted_;
// If last->next is the same as to_be_deleted_, update head_ to first and
// return true.
// If last->next and to_be_deleted_ are not equal, update last->next to
// to_be_deleted_ and return false.
while (!to_be_deleted_.compare_exchange_weak(last->next, first)) {
// Do nothing and wait for the to_be_deleted_ is updated to first.
}
}
void ChainPendingNodes(Node* nodes) {
Node* last = nodes;
while (Node* next = last->next) {
last = next;
}
ChainPendingNodes(nodes, last);
}
void ChainPendingNode(Node* n) { ChainPendingNodes(n, n); }
void TryReclaim(Node* old_head) {
if (old_head == nullptr) {
return;
}
if (threads_in_pop_ == 1) {
Node* nodes_to_delete = to_be_deleted_.exchange(nullptr);
if (!--threads_in_pop_) {
DeleteNodes(nodes_to_delete);
} else if (nodes_to_delete) {
ChainPendingNodes(nodes_to_delete);
}
delete old_head;
} else {
ChainPendingNode(old_head);
--threads_in_pop_;
}
}
private:
std::atomic<Node*> head_;
std::atomic<Node*> to_be_deleted_;
std::atomic<unsigned> threads_in_pop_;
};
上述代码通过对调用Pop()的线程进行计数的方式来回收节点的内存。当栈处于低负荷状态时,这种方式没有问题。然而,删除节点是一项非常耗时的工作,并且我们希望其他线程对待删除链表做的修改越少越好。从第一次发现threads_in_pop_为1,到真正尝试删除节点,中间会耗费较长的时间,这就让其他线程有机会调用Pop(),使threads_in_pop_不再为0,从而阻止节点的删除。当栈处于高负荷状态时,几乎总有其他线程正在调用Pop(),于是待删除节点的链表to_be_deleted_将会无限增长,实质上造成内存泄漏。另一种方式是,确认没有任何线程访问给定节点之后再回收它,实现这一点最简单的机制就是风险指针(hazard pointer)和引用计数,我们将在后续示例中讲解。
上述实现代码中的所有原子操作都没有给出内存顺序参数,默认使用的是std::memory_order_seq_cst(顺序一致序)。std::memory_order_seq_cst比起其他内存序要简单得多:在顺序一致序下,所有原子操作表现为一个与代码顺序一致的全局顺序,符合人类正常的思维逻辑,但消耗的系统资源相对更高。任何一个无锁数据结构的实现,内存顺序都应当从std::memory_order_seq_cst开始,只有当基本操作正常工作之后,才可考虑放宽内存顺序。通常,放宽后的内存顺序很难保证在所有平台上都工作正常,除非性能真正成了瓶颈,否则不必考虑放宽;如果追求极致性能,则需要有针对性地放宽部分操作的内存顺序。实际上,内存顺序主要对ARM等弱内存模型的嵌入式平台产生较大的性能影响,在X86平台上几乎无影响,X86编译器为各内存序生成的指令看起来都接近std::memory_order_seq_cst(顺序一致序)的效果。放宽内存顺序的基本原则为:如果原子操作不需要与其他操作同步,可使用std::memory_order_relaxed(自由序,或称松弛序);写入操作一般使用std::memory_order_release(释放序),读取操作一般使用std::memory_order_acquire(获取序)。放宽内存顺序需要严格测试,尤其是在嵌入式平台上;如无把握,并且性能瓶颈不严重,建议一律不填写内存顺序参数,即使用默认的std::memory_order_seq_cst(顺序一致序)。下面给出参考的放宽内存顺序代码(文件命名为 lock_free_stack.h),不能保证在所有平台上都正确:
#pragma once
#include <atomic>
#include <memory>
template <typename T>
class LockFreeStack {
public:
LockFreeStack()
: head_(nullptr), to_be_deleted_(nullptr), threads_in_pop_(0) {}
~LockFreeStack() {
while (Pop()) {
// Do nothing and wait until all elements are popped.
}
}
LockFreeStack(const LockFreeStack& other) = delete;
LockFreeStack& operator=(const LockFreeStack& other) = delete;
bool IsEmpty() const {
return head_.load(std::memory_order_relaxed) == nullptr;
}
void Push(const T& data) {
Node* new_node = new Node(data);
new_node->next = head_.load(std::memory_order_relaxed);
// If new_node->next is the same as head_, update head_ to new_node and
// return true.
// If new_node->next and head_ are not equal, update new_node->next to head_
// and return false.
while (!head_.compare_exchange_weak(new_node->next, new_node,
std::memory_order_release,
std::memory_order_relaxed)) {
// Do nothing and wait for the head_ is updated to new_node.
}
}
std::shared_ptr<T> Pop() {
Node* old_head = head_.load(std::memory_order_relaxed);
// If old_head is not a null pointer and it is the same as head_, update
// head_ to old_head->next and return true.
// If old_head is not a null pointer and it is not equal to head_, update
// old_head to head_ and return false.
while (old_head != nullptr &&
!head_.compare_exchange_weak(old_head, old_head->next,
std::memory_order_acquire,
std::memory_order_relaxed)) {
// Do nothing and wait for the head_ is updated to old_head->next.
}
// return old_head != nullptr ? old_head->data : std::shared_ptr();
std::shared_ptr<T> res;
if (old_head != nullptr) {
threads_in_pop_.fetch_add(1, std::memory_order_relaxed);
res.swap(old_head->data);
// Reclaim deleted nodes.
TryReclaim(old_head);
}
return res;
}
private:
// If the struct definition of Node is placed in the private section below,
// next to the data member 'head_' that uses it, the following compilation
// error will occur:
//
// error: 'Node' has not been declared ...
//
// This is not a compiler bug: unlike member function bodies, the type used in
// a data member declaration must already be declared at that point, so Node
// is defined here, before the members that refer to it.
struct Node {
// std::make_shared does not throw an exception.
Node(const T& input_data)
: data(std::make_shared<T>(input_data)), next(nullptr) {}
std::shared_ptr<T> data;
Node* next;
};
private:
static void DeleteNodes(Node* nodes) {
while (nodes != nullptr) {
Node* next = nodes->next;
delete nodes;
nodes = next;
}
}
void ChainPendingNodes(Node* first, Node* last) {
last->next = to_be_deleted_.load(std::memory_order_relaxed);
// If last->next is the same as to_be_deleted_, update head_ to first and
// return true.
// If last->next and to_be_deleted_ are not equal, update last->next to
// to_be_deleted_ and return false.
while (!to_be_deleted_.compare_exchange_weak(last->next, first,
std::memory_order_release,
std::memory_order_relaxed)) {
// Do nothing and wait for the to_be_deleted_ is updated to first.
}
}
void ChainPendingNodes(Node* nodes) {
Node* last = nodes;
while (Node* next = last->next) {
last = next;
}
ChainPendingNodes(nodes, last);
}
void ChainPendingNode(Node* n) { ChainPendingNodes(n, n); }
void TryReclaim(Node* old_head) {
if (old_head == nullptr) {
return;
}
if (threads_in_pop_ == 1) {
Node* nodes_to_delete =
to_be_deleted_.exchange(nullptr, std::memory_order_relaxed);
if (!--threads_in_pop_) {
DeleteNodes(nodes_to_delete);
} else if (nodes_to_delete) {
ChainPendingNodes(nodes_to_delete);
}
delete old_head;
} else {
ChainPendingNode(old_head);
threads_in_pop_.fetch_sub(1, std::memory_order_relaxed);
}
}
private:
std::atomic<Node*> head_;
std::atomic<Node*> to_be_deleted_;
std::atomic<unsigned> threads_in_pop_;
};
风险指针(hazard pointer)之所以被称为“风险的”,是因为删除一个仍被其他线程引用的节点会让这些线程处于危险状态:持有已删除节点指针的线程若对其解引用,会出现未定义行为。其基本思想是:当线程要访问一个可能被其他线程删除的对象时,先为该对象设置一个风险指针,以通知其他线程——删除这个对象是有风险的行为;当不再需要这个对象时,再清除风险指针。当某个线程想要删除一个对象时,必须先检查系统中其他线程是否持有指向该对象的风险指针:如果没有,就可以安全删除;否则必须等待风险指针消失,因此线程需要周期性地检查待删除的对象是否已经可以安全删除。下面展示该思路的实现代码(文件命名为 lock_free_stack.h,示例来源于C++ Concurrency In Action, 2ed 2019,修复了其中的bug):
#pragma once
#include <atomic>
#include <memory>
#include <stdexcept>
#include <thread>
template <typename T>
class LockFreeStack {
public:
LockFreeStack() : head_(nullptr), nodes_to_reclaim_(nullptr) {}
~LockFreeStack() {
while (Pop()) {
// Do nothing and wait until all elements are popped.
}
}
LockFreeStack(const LockFreeStack& other) = delete;
LockFreeStack& operator=(const LockFreeStack& other) = delete;
bool IsEmpty() const { return head_.load() == nullptr; }
void Push(const T& data) {
Node* new_node = new Node(data);
new_node->next = head_.load();
// If new_node->next is the same as head_, update head_ to new_node and
// return true.
// If new_node->next and head_ are not equal, update new_node->next to head_
// and return false.
while (!head_.compare_exchange_weak(new_node->next, new_node)) {
// Do nothing and wait for the head_ is updated to new_node.
}
}
std::shared_ptr<T> Pop() {
std::atomic<void*>& hp = GetHazardPointerForCurrentThread();
Node* old_head = head_.load();
do {
Node* temp = nullptr;
do {
temp = old_head;
hp.store(old_head);
old_head = head_.load();
} while (old_head != temp);
} while (old_head != nullptr &&
!head_.compare_exchange_strong(old_head, old_head->next));
hp.store(nullptr);
std::shared_ptr<T> res;
if (old_head != nullptr) {
res.swap(old_head->data);
if (IsOutstandingHazardPointerForNode(old_head)) {
ReClaimLater(old_head);
} else {
delete old_head;
}
DeleteNodesWithNoHazards();
}
return res;
}
private:
// If the struct definition of Node is placed in the private section below,
// next to the data member 'head_' that uses it, the following compilation
// error will occur:
//
// error: 'Node' has not been declared ...
//
// This is not a compiler bug: unlike member function bodies, the type used in
// a data member declaration must already be declared at that point, so Node
// is defined here, before the members that refer to it.
struct Node {
// std::make_shared does not throw an exception.
Node(const T& input_data)
: data(std::make_shared<T>(input_data)), next(nullptr) {}
std::shared_ptr<T> data;
Node* next;
};
struct HazardPointer {
std::atomic<std::thread::id> id;
std::atomic<void*> pointer;
};
class HazardPointerOwner {
public:
HazardPointerOwner(const HazardPointerOwner& other) = delete;
HazardPointerOwner operator=(const HazardPointerOwner& other) = delete;
HazardPointerOwner() : hp_(nullptr) {
for (unsigned i = 0; i < kMaxHazardPointerNum; ++i) {
std::thread::id old_id;
if (hazard_pointers_[i].id.compare_exchange_strong(
old_id, std::this_thread::get_id())) {
hp_ = &hazard_pointers_[i];
break;
}
}
if (hp_ == nullptr) {
throw std::runtime_error("No hazard pointers available.");
}
}
~HazardPointerOwner() {
hp_->pointer.store(nullptr);
hp_->id.store(std::thread::id());
}
std::atomic<void*>& GetPointer() { return hp_->pointer; }
private:
HazardPointer* hp_;
};
template <typename DT>
struct DataToReclaim {
DataToReclaim(DT* p) : data(p), next(nullptr) {}
~DataToReclaim() { delete data; }
DT* data;
DataToReclaim* next;
};
private:
static std::atomic<void*>& GetHazardPointerForCurrentThread() {
static thread_local HazardPointerOwner hazard_owner;
return hazard_owner.GetPointer();
}
template <typename DT>
void ReClaimLater(DT* data) {
AddToReclaimList(new DataToReclaim<DT>(data));
}
bool IsOutstandingHazardPointerForNode(void* p) {
for (unsigned i = 0; i < kMaxHazardPointerNum; ++i) {
if (hazard_pointers_[i].pointer.load() == p) {
return true;
}
}
return false;
}
void AddToReclaimList(DataToReclaim<Node>* node) {
if (node == nullptr) {
return;
}
node->next = nodes_to_reclaim_.load();
while (!nodes_to_reclaim_.compare_exchange_weak(node->next, node)) {
// Do nothing.
}
}
void DeleteNodesWithNoHazards() {
DataToReclaim<Node>* current = nodes_to_reclaim_.exchange(nullptr);
while (current) {
DataToReclaim<Node>* next = current->next;
if (!IsOutstandingHazardPointerForNode(current->data)) {
delete current;
} else {
AddToReclaimList(current);
}
current = next;
}
}
private:
static constexpr unsigned kMaxHazardPointerNum = 200;
static HazardPointer hazard_pointers_[kMaxHazardPointerNum];
std::atomic<DataToReclaim<Node>*> nodes_to_reclaim_;
std::atomic<Node*> head_;
};
// Static member array initialization. The syntax is ugly.
template <typename T>
typename LockFreeStack<T>::HazardPointer
LockFreeStack<T>::hazard_pointers_[kMaxHazardPointerNum];
注意我们之前的比较交换操作用的是compare_exchange_weak函数,而在以下代码中使用的是compare_exchange_strong函数:
do {
Node* temp = nullptr;
do {
temp = old_head;
hp.store(old_head);
old_head = head_.load();
} while (old_head != temp);
} while (old_head != nullptr &&
!head_.compare_exchange_strong(old_head, old_head->next));
compare_exchange_weak的优点是单次比较交换消耗的资源较少,但可能出现虚假失败(spurious failure),即在期望值与实际值相等时也返回失败(例如线程在操作中途被调度打断所致);compare_exchange_strong不存在虚假失败,但单次调用消耗的资源较多。选择二者的依据是:如果while循环体中没有任何操作,或者循环体中的操作消耗的资源非常少,则使用compare_exchange_weak;反之,如果循环体中的操作消耗的资源比较多,则使用compare_exchange_strong。对应到上述代码,外层while循环中嵌套了一个内层while循环,内层循环包含三条语句,重复执行的代价较高,compare_exchange_weak虚假失败带来的重试开销会超过compare_exchange_strong本身多消耗的资源,因此选用compare_exchange_strong。如果觉得仍然难以把握,建议用大数据量反复测试两种方式的实际资源消耗,最终选出合适的版本。
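为了更直观地对比两者的典型用法,下面给出一个与本文无关的最小示例(变量与函数名均为示意):
#include <atomic>

std::atomic<int> value{0};

// compare_exchange_weak: put it in a loop; a spurious failure is simply retried.
void IncrementWithWeak() {
  int expected = value.load();
  while (!value.compare_exchange_weak(expected, expected + 1)) {
    // On failure, 'expected' has been reloaded with the current value, so we
    // just retry with 'expected + 1'.
  }
}

// compare_exchange_strong: no spurious failure, suitable when the work redone
// on each retry is expensive or when only a single attempt is wanted.
bool TrySetFromZeroToOne() {
  int expected = 0;
  return value.compare_exchange_strong(expected, 1);
}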
以下代码:
template <typename T>
typename LockFreeStack<T>::HazardPointer
LockFreeStack<T>::hazard_pointers_[kMaxHazardPointerNum];
是静态成员数组hazard_pointers_的类外定义,也就是我们通常所说的静态成员数组初始化。这种语法相当丑陋,但在C++14及更早的标准中只能这么写(C++17提供了更简洁的替代写法,见下面的示意代码)。使用风险指针实现内存回收虽然简单,也的确能安全地回收已删除的节点,但增加了很多开销:每次调用Pop()都要遍历风险指针数组,检查kMaxHazardPointerNum个原子变量,而原子操作本身很耗时,所以Pop()会成为性能瓶颈;不仅要遍历风险指针数组,还要遍历等待删除链表上的每一个节点,链表上的每个节点又都要与kMaxHazardPointerNum个已存储的风险指针逐一比对。
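顺便一提,本文工程使用C++17(见后面的CMakeLists.txt),前面提到的静态成员数组其实也可以声明为inline静态成员,从而省去类外定义。以下仅为示意写法,并非本文代码实际采用的形式:
#include <atomic>
#include <thread>

struct HazardPointer {
  std::atomic<std::thread::id> id;
  std::atomic<void*> pointer;
};

template <typename T>
class LockFreeStackSketch {
 private:
  static constexpr unsigned kMaxHazardPointerNum = 200;
  // C++17: an inline static data member can be defined in-class, so no
  // separate out-of-class definition is required.
  inline static HazardPointer hazard_pointers_[kMaxHazardPointerNum];
};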
判断弹出的节点能否被删除的另一种思路是:检查当前被弹出的节点是否还有线程在访问,如果没有就删除,否则就延后删除。该思路与智能指针的引用计数思路一致。具体做法为:对每个节点使用两个引用计数——外部计数和内部计数,两者之和就是对这个节点的总引用数。外部计数与节点指针绑定在一起(构成CountedNodePtr),节点指针每次被线程读取时,外部计数加1;当线程结束对节点的访问时,内部计数减1。当节点从栈中移除、不再需要“指针+外部计数”这一组合时,将“外部计数-2”累加到内部计数上,并丢弃外部计数(减2是因为:节点已从链表中移除,计数减1;当前线程也不再访问该节点,计数再减1)。一旦内部计数等于0,表明没有任何线程再访问该节点,可以安全地将其删除。实现代码如下(文件命名为 lock_free_stack.h):
#pragma once
#include <atomic>
#include <cstdint>
#include <memory>
template <typename T>
class LockFreeStack {
public:
LockFreeStack() : head_(CountedNodePtr()) {}
~LockFreeStack() {
while (Pop()) {
// Do nothing and wait until all elements are popped.
}
}
// Copy constructs and copy assignments are prohibited
LockFreeStack(const LockFreeStack& other) = delete;
LockFreeStack& operator=(const LockFreeStack& other) = delete;
bool IsEmpty() const {
return head_.load().ptr == 0;
}
bool IsLockFree() const {
return std::atomic<CountedNodePtr>::is_always_lock_free;
}
void Push(const T& data) {
// Construct a `CountedNodePtr` object that refers to a freshly allocated
// node with associated data and set the next value of the node to the
// current value of head_.
CountedNodePtr new_node;
new_node.ptr = reinterpret_cast<uint64_t>(new Node(data));
new_node.external_count = 1;
reinterpret_cast<Node*>(new_node.ptr)->next = head_.load();
// Use compare_exchange_weak() to set the value of `head_` to the
// `new_node`. The counts are set up so the `internal_count` is zero, and
// the `external_count` is one. Because this is a new node, there’s
// currently only one external reference to the node (the `head_` pointer
// itself).
// If new_node.ptr->next is the same as head_, update head_ to
// new_node and return true. If new_node.ptr->next and head_ are not equal,
// update new_node.ptr->next to head_ and return false.
while (!head_.compare_exchange_weak(
reinterpret_cast<Node*>(new_node.ptr)->next, new_node)) {
// Do nothing and wait for the head_ is updated to new_node.
}
}
std::shared_ptr<T> Pop() {
CountedNodePtr old_head = head_.load();
while (true) {
// Once we've loaded the value of `head_`, we must first increase the
// count of external references to the `head_` node to indicate that we’re
// referencing it and to ensure that it’s safe to dereference it. If we
// dereference the pointer before increasing the reference count, another
// thread could free the node before we access it, leaving we with a
// dangling pointer. This is the primary reason for using the split
// reference count: by incrementing the external reference count, we
// ensure that the pointer remains valid for the duration of our access.
IncreaseHeadCount(&old_head);
// Once the count has been increased, we can safely dereference the `ptr`
// field of the value loaded from `head_` in order to access the
// pointed-to node
Node* ptr = reinterpret_cast<Node*>(old_head.ptr);
// If the pointer is a null pointer, we’re at the end of the list: no more
// entries.
if (ptr == nullptr) {
return std::shared_ptr<T>();
}
// If the pointer isn’t a null pointer, we can try to remove the node by a
// compare_exchange_strong() call on the `head_`.
if (head_.compare_exchange_strong(old_head, ptr->next)) {
// If the compare_exchange_strong() succeeds, we've taken ownership of
// the node and can swap out the data in preparation for returning it.
// This ensures that the data isn’t kept alive just because other
// threads accessing the stack happen to still have pointers to its
// node.
std::shared_ptr<T> res;
res.swap(ptr->data);
// It’s important to note that the value we add is two less than the
// external count; we've removed the node from the list, so we drop
// one off the count for that, and we’re no longer accessing the node
// from this thread, so we drop another off the count for that.
const int increased_count = old_head.external_count - 2;
// Add the external count to the internal count on the node with an
// atomic `fetch_add`. If the reference count is now zero, the previous
// value (which is what fetch_add returns) was the negative of what we
// added, in which case we can delete the node.
if (ptr->internal_count.fetch_add(increased_count) ==
-increased_count) {
delete ptr;
}
// Whether or not we deleted the node, we've finished, so we can return
// the data.
return res;
// If the compare/exchange fails, another thread removed our node
// before we did, or another thread added a new node to the stack.
// Either way, we need to start again with the fresh value of head
// returned by the compare/exchange call. But first we must decrease the
// reference count on the node we were trying to remove. This thread
// won’t access it anymore. If we’re the last thread to hold a
// reference (because another thread removed it from the stack), the
// internal reference count will be 1, so subtracting 1 will set the
// count to zero. In this case, we can delete the node here before we
// loop.
} else if (ptr->internal_count.fetch_add(-1) == 1) {
delete ptr;
}
}
}
private:
// Forward class declaration
struct Node;
struct CountedNodePtr {
CountedNodePtr() : external_count(0), ptr(0) {}
// We know that the platform has spare bits in a pointer (for example,
// because the address space is only 48 bits but a pointer is 64 bits), we
// can store the count inside the spare bits of the pointer to fit it all
// back in a single machine word.
uint16_t external_count : 16;
uint64_t ptr : 48;
};
struct Node {
// std::make_shared does not throw an exception.
explicit Node(const T& input_data)
: data(std::make_shared<T>(input_data)), internal_count(0) {}
std::shared_ptr<T> data;
std::atomic<int> internal_count;
CountedNodePtr next;
};
private:
void IncreaseHeadCount(CountedNodePtr* old_counter) {
CountedNodePtr new_counter;
// The increment is done with a compare_exchange_strong() loop, which
// compares and sets the whole structure to ensure that the pointer hasn’t
// been changed by another thread in the meantime.
do {
new_counter = *old_counter;
++new_counter.external_count;
} while (!head_.compare_exchange_strong(*old_counter, new_counter));
old_counter->external_count = new_counter.external_count;
}
private:
std::atomic<CountedNodePtr> head_;
};
上述代码中,值得特别指出的是,带引用计数的节点指针结构体CountedNodePtr使用了位域的概念:
struct CountedNodePtr {
CountedNodePtr() : external_count(0), ptr(0) {}
// We know that the platform has spare bits in a pointer (for example,
// because the address space is only 48 bits but a pointer is 64 bits), we
// can store the count inside the spare bits of the pointer to fit it all
// back in a single machine word.
uint16_t external_count : 16;
uint64_t ptr : 48;
};
ptr的真实类型是Node*,但这里声明的却是只占48位的无符号整型uint64_t位域。为什么要这么做?目前主流的平台和编译器只对不超过8字节的数据类型提供无锁的原子操作,即std::atomic<CountedNodePtr>的成员函数is_lock_free只有在sizeof(CountedNodePtr) <= 8时才可能返回true。因此,必须将CountedNodePtr的大小控制在8字节以内,于是想到了位域。在主流的64位平台上,指针实际使用的地址位不超过48位(如果超过该尺寸,则必须重新设计位域大小,请查阅平台文档确认),为此给external_count分配16位(最大支持65535),给ptr分配48位,合计64位(8字节)。此时,std::atomic<CountedNodePtr>的成员函数is_lock_free在主流平台上都会返回true,是真正的无锁原子变量。为了适应上述更改,读取节点时必须用reinterpret_cast把ptr从uint64_t转换回Node*;写入时则用reinterpret_cast把Node*指针转换为uint64_t,再存入ptr中。注意:external_count的计数只自增,不自减,当节点不再被head_引用时,直接丢弃external_count;internal_count的计数在线程结束访问时只自减,另外在节点弹出时还会与“外部计数-2”相加合并一次。
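可以用如下示意代码在编译期和运行期检验上述位域假设在目标平台上是否成立(这里省略了正文结构体中的构造函数,仅用于检查):
#include <atomic>
#include <cstdint>

struct CountedNodePtr {
  uint16_t external_count : 16;
  uint64_t ptr : 48;
};

// Compile-time check: the bit fields must be packed into a single 8-byte word.
static_assert(sizeof(CountedNodePtr) <= 8,
              "CountedNodePtr must fit into 8 bytes");

int main() {
  // Run-time check: whether atomic operations on this type are lock-free here.
  std::atomic<CountedNodePtr> head{};
  return head.is_lock_free() ? 0 : 1;
}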
修改内存顺序之前,需要检查一下操作间的依赖关系,再确定适合这种关系的最宽松的内存序。为了便于分析,需要从线程的视角进行观察:最简单的场景是一个线程向栈中压入一个数据项,之后另一个线程从栈中弹出这个数据项。这里有三个重要的数据参与:用于转移数据的CountedNodePtr,即head_;head_所引用的Node节点;以及该节点指向的数据项data。
执行Push()的线程,会先构造数据项和新节点,再设置head_;执行Pop()的线程,会先加载head_,再通过“比较/交换”操作增加引用计数,然后读取对应的Node节点,获取next的指向值。next的值是非原子对象,所以为了保证读取安全,必须确定存储(压入线程)和加载(弹出线程)之间的先行(happens-before)关系。因为Push()中唯一修改head_的原子操作就是compare_exchange_weak(),要建立两个线程间的先行关系,它在成功时必须使用std::memory_order_release或更严格的内存序。不过,compare_exchange_weak()调用失败时什么都不会改变,循环会继续下去,所以失败时使用std::memory_order_relaxed就足够了。
Pop()的实现呢?为了确定先行(happens-before)关系,必须在访问next值之前使用std::memory_order_acquire或更严格的内存序操作。由于IncreaseHeadCount()中的compare_exchange_strong()会读取head_的旧值(其中的ptr随后会被解引用),它正是需要与Push()中的释放操作同步的获取操作,因此成功时需要std::memory_order_acquire。与Push()一样,交换失败时循环会继续,所以失败时可使用std::memory_order_relaxed。
compare_exchange_strong()调用成功时,读到的值(包括ptr)就被保存到old_counter中。Push()中对head_的存储是释放操作,IncreaseHeadCount()中的compare_exchange_strong()是获取操作,于是存储同步于加载,构成先行(happens-before)关系。因此,Push()中对节点(及其next)的存储先行于Pop()中对ptr->next的访问,目前的操作是安全的。
Pop()开头对head_.load()的初始加载并不影响上述分析,因此可以使用std::memory_order_relaxed。
接下来,compare_exchange_strong()把head_设置为old_head.ptr->next。是否需要做些什么来保证本线程中数据的完整性呢?交换成功后就会访问ptr->data,所以必须保证Push()线程中对ptr->data的存储先行于本线程的加载。IncreaseHeadCount()中的获取操作已经保证了与Push()线程中的存储(释放)操作同步:Push()线程中存储data先行于存储head_,调用IncreaseHeadCount()先行于对ptr->data的加载。因此,即使Pop()中这个“比较/交换”操作使用std::memory_order_relaxed,这些操作依然能正常工作。唯一会修改ptr->data的地方是随后的swap()调用,而此时没有其他线程可以对同一节点进行操作(这正是“比较/交换”操作的作用)。
compare_exchange_strong()失败时,old_head会被更新为head_的最新值,然后继续循环。由于已经确定了在IncreaseHeadCount()中使用std::memory_order_acquire足以建立所需的同步,这里失败时使用std::memory_order_relaxed即可。
其他线程呢?是否需要更严格的内存序来保证其他线程的安全?回答是“不用”。因为head_只会被“比较/交换”这类“读-改-写”操作修改,而Push()中的“比较/交换”(释放操作)构成了释放序列(release sequence)的起点。因此,即使有很多线程在同一时间对head_进行修改,Push()中的compare_exchange_weak()与IncreaseHeadCount()中读取该存储值的compare_exchange_strong()依然保持同步。
剩下的就是fetch_add()操作(用来修改引用计数的操作)了。成功返回数据项的线程可以继续执行,因为已知没有其他线程能再修改该节点的数据;而没有成功返回数据项的线程则知道,其他线程正在(或已经)通过swap()提取数据项,为了避免数据竞争,必须保证swap()先行于delete操作。一种简单的解决办法是:在“成功返回”分支中对fetch_add()使用std::memory_order_release内存序,在“再次循环”分支中对fetch_add()使用std::memory_order_acquire内存序。不过这有点矫枉过正:只有一个线程会执行delete(即把引用计数减到0的那个线程),所以只有这个线程需要获取操作。由于fetch_add()是“读-改-写”操作,属于释放序列的一部分,因此可以用一个额外的load()来完成获取:当“再次循环”分支把引用计数减为0时,用std::memory_order_acquire重新加载一次引用计数,以建立所需的同步关系,而fetch_add()本身则可以使用std::memory_order_relaxed。
完整的放宽内存顺序的代码如下(文件命名为 lock_free_stack.h),不能保证在所有平台上都正确:
#pragma once
#include <atomic>
#include <cstdint>
#include <memory>
template <typename T>
class LockFreeStack {
public:
LockFreeStack() : head_(CountedNodePtr()) {}
~LockFreeStack() {
while (Pop()) {
// Do nothing and wait until all elements are popped.
}
}
// Copy constructs and copy assignments are prohibited
LockFreeStack(const LockFreeStack& other) = delete;
LockFreeStack& operator=(const LockFreeStack& other) = delete;
bool IsEmpty() const {
return head_.load(std::memory_order_relaxed).ptr == 0;
}
bool IsLockFree() const {
return std::atomic<CountedNodePtr>::is_always_lock_free;
}
void Push(const T& data) {
// Construct a `CountedNodePtr` object that refers to a freshly allocated
// node with associated data and set the next value of the node to the
// current value of head_.
CountedNodePtr new_node;
new_node.ptr = reinterpret_cast<uint64_t>(new Node(data));
new_node.external_count = 1;
reinterpret_cast<Node*>(new_node.ptr)->next =
head_.load(std::memory_order_relaxed);
// Use compare_exchange_weak() to set the value of `head_` to the
// `new_node`. The counts are set up so the `internal_count` is zero, and
// the `external_count` is one. Because this is a new node, there’s
// currently only one external reference to the node (the `head_` pointer
// itself).
// If new_node.ptr->next is the same as head_, update head_ to
// new_node and return true. If new_node.ptr->next and head_ are not equal,
// update new_node.ptr->next to head_ and return false.
while (!head_.compare_exchange_weak(
reinterpret_cast<Node*>(new_node.ptr)->next, new_node,
std::memory_order_release, std::memory_order_relaxed)) {
// Do nothing and wait for the head_ is updated to new_node.
}
}
std::shared_ptr<T> Pop() {
CountedNodePtr old_head = head_.load(std::memory_order_relaxed);
while (true) {
// Once we've loaded the value of `head_`, we must first increase the
// count of external references to the `head_` node to indicate that we’re
// referencing it and to ensure that it’s safe to dereference it. If we
// dereference the pointer before increasing the reference count, another
// thread could free the node before we access it, leaving we with a
// dangling pointer. This is the primary reason for using the split
// reference count: by incrementing the external reference count, we
// ensure that the pointer remains valid for the duration of our access.
IncreaseHeadCount(&old_head);
// Once the count has been increased, we can safely dereference the `ptr`
// field of the value loaded from `head_` in order to access the
// pointed-to node
Node* ptr = reinterpret_cast<Node*>(old_head.ptr);
// If the pointer is a null pointer, we’re at the end of the list: no more
// entries.
if (ptr == nullptr) {
return std::shared_ptr<T>();
}
// If the pointer isn’t a null pointer, we can try to remove the node by a
// compare_exchange_strong() call on the `head_`.
if (head_.compare_exchange_strong(old_head, ptr->next,
std::memory_order_relaxed,
std::memory_order_relaxed)) {
// If the compare_exchange_strong() succeeds, we've taken ownership of
// the node and can swap out the data in preparation for returning it.
// This ensures that the data isn’t kept alive just because other
// threads accessing the stack happen to still have pointers to its
// node.
std::shared_ptr<T> res;
res.swap(ptr->data);
// It’s important to note that the value we add is two less than the
// external count; we've removed the node from the list, so we drop
// one off the count for that, and we’re no longer accessing the node
// from this thread, so we drop another off the count for that.
const int increased_count = old_head.external_count - 2;
// Add the external count to the internal count on the node with an
// atomic `fetch_add`. If the reference count is now zero, the previous
// value (which is what fetch_add returns) was the negative of what we
// added, in which case we can delete the node.
if (ptr->internal_count.fetch_add(increased_count,
std::memory_order_release) ==
-increased_count) {
delete ptr;
}
// Whether or not we deleted the node, we've finished, so we can return
// the data.
return res;
// If the compare/exchange fails, another thread removed our node
// before we did, or another thread added a new node to the stack.
// Either way, we need to start again with the fresh value of head
// returned by the compare/exchange call. But first we must decrease the
// reference count on the node we were trying to remove. This thread
// won’t access it anymore. If we’re the last thread to hold a
// reference (because another thread removed it from the stack), the
// internal reference count will be 1, so subtracting 1 will set the
// count to zero. In this case, we can delete the node here before we
// loop.
} else if (ptr->internal_count.fetch_add(-1, std::memory_order_relaxed) ==
1) {
ptr->internal_count.load(std::memory_order_acquire);
delete ptr;
}
}
}
private:
// Forward class declaration
struct Node;
struct CountedNodePtr {
CountedNodePtr() : external_count(0), ptr(0) {}
// We know that the platform has spare bits in a pointer (for example,
// because the address space is only 48 bits but a pointer is 64 bits), we
// can store the count inside the spare bits of the pointer to fit it all
// back in a single machine word.
uint16_t external_count : 16;
uint64_t ptr : 48;
};
struct Node {
// std::make_shared does not throw an exception.
explicit Node(const T& input_data)
: data(std::make_shared<T>(input_data)), internal_count(0) {}
std::shared_ptr<T> data;
std::atomic<int> internal_count;
CountedNodePtr next;
};
private:
void IncreaseHeadCount(CountedNodePtr* old_counter) {
CountedNodePtr new_counter;
// The increment is done with a compare_exchange_strong() loop, which
// compares and sets the whole structure to ensure that the pointer hasn’t
// been changed by another thread in the meantime.
do {
new_counter = *old_counter;
++new_counter.external_count;
} while (!head_.compare_exchange_strong(*old_counter, new_counter,
std::memory_order_acquire,
std::memory_order_relaxed));
old_counter->external_count = new_counter.external_count;
}
private:
std::atomic<CountedNodePtr> head_;
};
下面给出测试无锁栈工作是否正常的简单测试代码(文件命名为 lock_free_stack.cpp):
#include "lock_free_stack.h"
#include <algorithm>
#include <iostream>
#include <random>
#include <thread>
#include <vector>
namespace {
constexpr size_t kElementNum = 10;
constexpr size_t kThreadNum = 200;
constexpr size_t kLargeThreadNum = 2000;
} // namespace
int main() {
LockFreeStack<int> stack;
// Case 1: Single thread test
for (size_t i = 0; i < kElementNum; ++i) {
std::cout << "The data " << i << " is pushed in the stack.\n";
stack.Push(i);
}
std::cout << "stack.IsEmpty() == " << std::boolalpha << stack.IsEmpty()
<< std::endl;
while (auto data = stack.Pop()) {
std::cout << "Current data is : " << *data << '\n';
}
// Case 2: multi-thread test. Producers and consumers are evenly distributed
std::vector<std::thread> producers1;
std::vector<std::thread> producers2;
std::vector<std::thread> consumers1;
std::vector<std::thread> consumers2;
for (size_t i = 0; i < kThreadNum; ++i) {
producers1.emplace_back(&LockFreeStack<int>::Push, &stack, i * 10);
producers2.emplace_back(&LockFreeStack<int>::Push, &stack, i * 20);
consumers1.emplace_back(&LockFreeStack<int>::Pop, &stack);
consumers2.emplace_back(&LockFreeStack<int>::Pop, &stack);
}
for (size_t i = 0; i < kThreadNum; ++i) {
producers1[i].join();
consumers1[i].join();
producers2[i].join();
consumers2[i].join();
}
producers1.clear();
producers1.shrink_to_fit();
producers2.clear();
producers2.shrink_to_fit();
consumers1.clear();
consumers1.shrink_to_fit();
consumers2.clear();
consumers2.shrink_to_fit();
// Case 3: multi-thread test. Producers and consumers are randomly distributed
std::vector<std::thread> producers3;
std::vector<std::thread> consumers3;
for (size_t i = 0; i < kLargeThreadNum; ++i) {
producers3.emplace_back(&LockFreeStack<int>::Push, &stack, i * 30);
consumers3.emplace_back(&LockFreeStack<int>::Pop, &stack);
}
std::vector<int> random_numbers(kLargeThreadNum);
std::mt19937 gen(std::random_device{}());
std::uniform_int_distribution<int> dis(0, 100000);
auto rand_num_generator = [&gen, &dis]() mutable { return dis(gen); };
std::generate(random_numbers.begin(), random_numbers.end(),
rand_num_generator);
for (size_t i = 0; i < kLargeThreadNum; ++i) {
if (random_numbers[i] % 2) {
producers3[i].join();
consumers3[i].join();
} else {
consumers3[i].join();
producers3[i].join();
}
}
producers3.clear();
producers3.shrink_to_fit();
consumers3.clear();
consumers3.shrink_to_fit();
return 0;
}
CMake的编译配置文件CMakeLists.txt:
cmake_minimum_required(VERSION 3.0.0)
project(lock_free_stack VERSION 0.1.0)
set(CMAKE_CXX_STANDARD 17)
# If the debug option is not given, the program will not have debugging information.
SET(CMAKE_BUILD_TYPE "Debug")
add_executable(${PROJECT_NAME} ${PROJECT_NAME}.cpp)
find_package(Threads REQUIRED)
# libatomic should be linked to the program.
# Otherwise, the following link errors occurred:
# /usr/include/c++/9/atomic:254: undefined reference to `__atomic_load_16'
# /usr/include/c++/9/atomic:292: undefined reference to `__atomic_compare_exchange_16'
# target_link_libraries(${PROJECT_NAME} ${CMAKE_THREAD_LIBS_INIT} atomic)
target_link_libraries(${PROJECT_NAME} ${CMAKE_THREAD_LIBS_INIT})
include(CTest)
enable_testing()
set(CPACK_PROJECT_NAME ${PROJECT_NAME})
set(CPACK_PROJECT_VERSION ${PROJECT_VERSION})
include(CPack)
上述配置中保留了一条(已注释掉的)对原子库atomic的链接语句。最初实现的版本未使用位域,其引用计数结构体CountedNodePtr包含两个数据成员:int external_count; Node* ptr;,合计占用16字节,而对16字节数据结构的原子操作需要额外链接原子库atomic,否则会出现如下链接错误;新版本使用位域后结构体只占8字节,不再需要链接atomic:
/usr/include/c++/9/atomic:254: undefined reference to `__atomic_load_16'
/usr/include/c++/9/atomic:292: undefined reference to `__atomic_compare_exchange_16'
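作为对照,下面给出一个未使用位域的示意结构,用于说明为什么16字节的原子对象在GCC上通常需要链接libatomic(结构体与变量名均为示意):
#include <atomic>

struct Node;  // Forward declaration, for illustration only.

// Without bit fields the two members occupy 16 bytes in total after padding.
struct CountedNodePtr16 {
  int external_count;
  Node* ptr;
};

// On GCC/x86-64, atomic operations on a 16-byte type are usually provided by
// libatomic, hence the extra `atomic` entry in target_link_libraries().
std::atomic<CountedNodePtr16> head;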
VSCode调试启动配置文件.vscode/launch.json:
{
"version": "0.2.0",
"configurations": [
{
"name": "cpp_gdb_launch",
"type": "cppdbg",
"request": "launch",
"program": "${workspaceFolder}/build/${workspaceFolderBasename}",
"args": [],
"stopAtEntry": false,
"cwd": "${fileDirname}",
"environment": [],
"externalConsole": false,
"MIMode": "gdb",
"setupCommands": [
{
"description": "Enable neat printing for gdb",
"text": "-enable-pretty-printing",
"ignoreFailures": true
}
],
// "preLaunchTask": "cpp_build_task",
"miDebuggerPath": "/usr/bin/gdb"
}
]
}
使用CMake的编译命令:
cd lock_free_stack
# 只执行一次
mkdir build
cd build
cmake .. && make
运行结果如下:
./lock_free_stack
The data 0 is pushed in the stack.
The data 1 is pushed in the stack.
The data 2 is pushed in the stack.
The data 3 is pushed in the stack.
The data 4 is pushed in the stack.
The data 5 is pushed in the stack.
The data 6 is pushed in the stack.
The data 7 is pushed in the stack.
The data 8 is pushed in the stack.
The data 9 is pushed in the stack.
stack.IsEmpty() == false
Current data is : 9
Current data is : 8
Current data is : 7
Current data is : 6
Current data is : 5
Current data is : 4
Current data is : 3
Current data is : 2
Current data is : 1
Current data is : 0