Multithreading C++ Out of Core Sotring for Massive Data|多线程C++的大规模数据外部排序

先说一下，这个其实是我为实现PantaRay或者是类似Dreamworks的Out of Core点云GI的技术储备，为大规模点云光线跟踪所准备的第一步。在实际的应用中，int类型会被64bit的uint64_t所代替，代表空间中的一个hash键。所有的代码全部使用STL+boost实现了足够高层次的抽象，读者完全可以根据自己的需要改写。

This is the first step to implement the PantaRay or the GI solution from Dreamworks about Out-Core point cloud sorting. Actually the int type in the code would be replaced by he uint64_t which indices a hash key in space. All fragments code are using the STL+Boost, user can modify the code by yourself.

我们先来准备测试数据。这个测试数据有尺寸大小的限制，就是在现在x86_64环境下malloc/new分配的单个数组有1G尺寸的限制，这样就意味着内排序一次操作的数据不可能大于1G，造成了测试上的限制，所以我只生成了一个尺寸大约962M的文件测试，包含了246324610个int。

First of all, let’s prepare the test data. But as we know, there is the 1G array size limitation in x86_64, so that we can only apply qsort or std::stable_sort to a < 1G array. For this test I generate a 962M file which contains the 246324610 integers.

如下程序生成测试数据，均匀分布的Mersenne Twister 19937序列。

The following program generates the test data, using the MT19937 uniform distribution.

Multithreading C++ Out of Core Sotring for Massive Data|多线程C++的大规模数据外部排序

#include <iostream>



#include <boost/random/mersenne_twister.hpp>

#include <boost/random/uniform_int_distribution.hpp>



int main(int argc, char *argv[])

{

    -- argc, ++ argv;

    if (argc != 2)

    {

        return 1;

    }

    char * szPath = argv[0];

    int iCount = atoi(argv[1]);

    std::cout << szPath << " " << iCount << std::endl;



    boost::random::mt19937 cGen;

    boost::random::uniform_int_distribution<> cDist(0, 99999999);



    FILE * pFile = fopen(szPath, "wb");

    if (pFile)

    {

        for (int i = 0; i < iCount; ++ i)

        {

            int iRandom = cDist(cGen);

            fwrite(& iRandom, sizeof(int), 1, pFile);

        }



        fclose(pFile);

    }

    return 0;

}

View Code

然后生成内排序的结果，储存为外部独立文件为了比较。

Generate the internal sorted result to verify the data.

int main(int argc, char * argv[])

{

    PlaySTL();



    -- argc, ++ argv;

    if (argc != 2)

    {

        return EXIT_FAILURE;

    }



    FILE * pOriginalFile = fopen(argv[0], "rb");

    fseek(pOriginalFile, 0, SEEK_END);

    long lSize = ftell(pOriginalFile);

    fseek(pOriginalFile, 0, SEEK_SET);



    int iNumItems = lSize / 4;

    int * pData = new int[iNumItems];

    fread(pData, sizeof(int), iNumItems, pOriginalFile);

    fclose(pOriginalFile);

    std::stable_sort(pData, pData + iNumItems, std::less<int>());

    

    FILE * pSortedFile = fopen(argv[1], "wb");

    fwrite(pData, sizeof(int), iNumItems, pSortedFile);

    fclose(pSortedFile);



    delete [] pData;



    return EXIT_SUCCESS;

}

View Code

从设计的思路上，由于操作系统在磁盘IO上都是单线程的，每次只允许一个线程读写，所以把读取的工作部分都放在主线程中，排序线程为了让磁盘写入的时间占据总共处理的时间尽可能地小，所以尽可能的让一个工作线程处理更多的数据。

Because the disk access is synchronized at low-level IO, so that we will read the data in the main thread, the working thread process as much as data as possible to reduce the percent of time on disk writing.

先让我们定义一个名字叫做Job的类，顾名思义，代表一个计算任务，每个计算任务都有一个自己的索引，以及一堆乱序的整数int数据。

Let’s define a Job class, each Job has a index and unsorted data.

class Job

{

public:



    Job()

    :

    m_iIndex(0),

    m_iNumItems(0)

    {

    }



    Job(int iIndex, int iNumItems, const boost::shared_array<int> & aData)

    :

    m_iIndex(iIndex),

    m_iNumItems(iNumItems),

    m_aData(aData)

    {

    }



    Job(const Job & cCopy)

    :

    m_iIndex(cCopy.m_iIndex),

    m_iNumItems(cCopy.m_iNumItems),

    m_aData(cCopy.m_aData)

    {

    }



public:



    int m_iIndex;

    int m_iNumItems;

    boost::shared_array<int> m_aData;

};

View Code

然后再来一个Context，负责存储用于计算的共享数据，比如工作队列，以及Mutex等为了同步所需要的对象。

Later the Context class, to keep the queue and mutex objects.

class Context

{

public:



    Context(int iNumSortingThread)

    :

    m_iNumSortingThread(iNumSortingThread),

    m_bHasMoreData(true)

    {

    }



public:



    int m_iNumSortingThread;



    bool m_bHasMoreData;



    boost::mutex m_cMutex;

    boost::condition_variable m_cEvent;



    std::list<Job > m_lJobQueue;

};

View Code

这里是工作线程，其中有工作代码的实现。当访问Context中的队列时必须要加锁，抓一个工作包出来，当作局部数据，接下来再排序和写出为Cache，末了尽可能贪婪的告诉主线程我们需要更多的数据，如果真的是没有任何数据了则直接退出。

Here is the working thread, it will get a Job from the queue, sort the data, and write out, at the end, tell the main thread it needs more data to process, if there is no more data it will return.

class SortingThread : public boost::thread

{

public:



    SortingThread(const boost::shared_ptr<Context> & pContext)

    :

    m_pContext(pContext),

    boost::thread(boost::bind(& SortingThread::Sort, this))

    {

    }



    void Sort()

    {

        while (1)

        {

            if (! m_pContext->m_bHasMoreData)

            {

                if (! m_pContext->m_lJobQueue.size())

                {

                    break;

                }

            }



            Job cJob;

            {

                boost::unique_lock<boost::mutex> cLock(m_pContext->m_cMutex);

                if (m_pContext->m_lJobQueue.size())

                {

                    // Get a job.

                    //

                    cJob = m_pContext->m_lJobQueue.front();

                    m_pContext->m_lJobQueue.pop_front();

                }

            }



            if (cJob.m_iNumItems)

            {

                std::stable_sort(cJob.m_aData.get(), cJob.m_aData.get() + cJob.m_iNumItems, std::less<int>());

                

                // Write out the sorted data.

                //

                char aBuffer[256];

                sprintf(aBuffer, "%.06d.tmp", cJob.m_iIndex);

                std::ofstream cOutput(aBuffer, std::ios_base::binary);

                cOutput.write(reinterpret_cast<const char *>(cJob.m_aData.get()), cJob.m_iNumItems * sizeof(int));

            }



            // Tell the main thread we need more data here.

            //

            m_pContext->m_cEvent.notify_one();

        }

    }



private:



    boost::shared_ptr<Context> m_pContext;

};

View Code

把所有的线程都放入线程池，这样就可以一股脑的执行了。

The simple thread pool.

class SortingThreadGroup : public boost::thread_group

{

public:



    SortingThreadGroup(const boost::shared_ptr<Context> & pContext)

    :

    m_pContext(pContext)

    {

        for (int i = 0; i < m_pContext->m_iNumSortingThread; ++ i)

        {

            SortingThread * pSortingThread = new SortingThread(pContext);

            add_thread(pSortingThread);

        }

    }



private:



    boost::shared_ptr<Context> m_pContext;

};

View Code

主线程从外部文件读取数据填充Job对象，尽可能的把整个队列的数据控制在一定得范围内，这样内存的占用可以小一些，否则就失去了外排序的意义。

Main thread reads data from file, fills the Job, and keep the memory usage minimal.

bool Sort(const char * szPath, int iNumSortingThreads, int iNumLocalItems)

{

    try

    {

        // Calculate real size.

        //

        std::ifstream cUnSortedFile(szPath, std::ios_base::binary);

        boost::uintmax_t ullSize = boost::filesystem::file_size(szPath);

        boost::uintmax_t ullNumItems = ullSize / 4;



        int iNumBatches = ullNumItems / iNumLocalItems;

        std::vector<int> vNumItemsPerBatch(iNumBatches, iNumLocalItems);

        int iNumRestItems = ullNumItems % iNumLocalItems;

        if (iNumRestItems)

        {

            vNumItemsPerBatch.push_back(iNumRestItems);

        }

        std::cout << "Number of Items   : " << ullNumItems << std::endl

                  << "Number of Batches : " << vNumItemsPerBatch.size() << std::endl;



        boost::shared_ptr<Context> pContext(new Context(iNumSortingThreads));

        boost::scoped_ptr<SortingThreadGroup> pSortingThreadGroup(new SortingThreadGroup(pContext));



        boost::timer::auto_cpu_timer cTimer;

        for (int i = 0; i < vNumItemsPerBatch.size(); ++ i)

        {

            boost::shared_array<int> aData(new int[vNumItemsPerBatch[i]]);

            cUnSortedFile.read(reinterpret_cast<char *>(aData.get()), vNumItemsPerBatch[i] * sizeof(int));



            Job cJob(i, vNumItemsPerBatch[i], aData);



            //

            boost::unique_lock<boost::mutex> cLock(pContext->m_cMutex);

            if (pContext->m_lJobQueue.size() > iNumSortingThreads * 2)

            {

                pContext->m_cEvent.wait(cLock);

            }

            pContext->m_lJobQueue.push_back(cJob);

        }

        std::cout << std::endl;

        pContext->m_bHasMoreData = false;



        pSortingThreadGroup->join_all();



        return true;

    }

    catch(const std::exception & cE)

    {

        std::cerr << cE.what() << std::endl;

    }

    catch(...)

    {

        std::cerr << __LINE__ << std::endl;

    }



    return false;

}

View Code

第二遍就是k Way Merge Sorting了。这里的思路很简单，直接读取外部的一坨文件，以及维护一个队列，每次从活的最小数字的那一列输出候选者，然后读出下一个放入队列。如果文件读完了，则说明那一路文件流可以丢弃了，队列也相应的变小了。这里当然是单线程的。

The second pass is the single-threaded classical k-Way Merging Sorting.

bool Merge(const char * szPath, int iNumBatches)

{

    try

    {

        //TODO : There is the limitation about the max number of opened file in process.

        //

        std::vector<boost::shared_ptr<std::ifstream> > vTempFiles;

        for (int i = 0; i < iNumBatches; ++ i)

        {

            char aBuffer[256];

            sprintf(aBuffer, "%.06d.tmp", i);

            boost::shared_ptr<std::ifstream> pTempFile(new std::ifstream(aBuffer, std::ios_base::binary));

            assert(pTempFile->is_open());

            vTempFiles.push_back(pTempFile);

        }



        std::ofstream cSortedFile(szPath, std::ios_base::binary);

        if (! cSortedFile)

        {

            std::cerr << "Can't open " << szPath << " to write. " << std::endl;

            return false;

        }



        //

        boost::timer::auto_cpu_timer cTimer;



        std::vector<int> vCache;

        vCache.reserve(10 * 1024 * 1024);



        std::vector<int> vQueue;

        std::vector<boost::shared_ptr<std::ifstream> >::iterator iFile = vTempFiles.begin();

        for (; iFile != vTempFiles.end(); ++ iFile)

        {

            int iNumber = - 1;

            if ((* iFile)->read(reinterpret_cast<char *>(& iNumber), sizeof(int)))

            {

                vQueue.push_back(iNumber);

            }

        }

        do

        {

            std::vector<int>::iterator iMinPos = std::min_element(vQueue.begin(), vQueue.end());

            vCache.push_back(* iMinPos);

            if (vCache.size() == vCache.capacity())

            {

                cSortedFile.write(reinterpret_cast<const char *>(& vCache[0]), vCache.size() * sizeof(int));

                vCache.clear();

            }



            iFile = vTempFiles.begin() + (iMinPos - vQueue.begin());

            int iNumber = - 1;

            if ((* iFile)->read(reinterpret_cast<char *>(& iNumber), sizeof(int)))

            {

                (* iMinPos) = iNumber;

            }

            else

            {

                vTempFiles.erase(iFile);

                vQueue.erase(iMinPos);

            }



        } while (vQueue.size());

        cSortedFile.write(reinterpret_cast<const char *>(& vCache[0]), vCache.size() * sizeof(int));



        return true;

    }

    catch(const std::exception & cE)

    {

        std::cerr << cE.what() << std::endl;

    }

    catch(...)

    {

        std::cerr << __LINE__ << std::endl;

    }



    return false;

}

View Code

测试的环境为Xeon [email protected]，4个硬件线程，测试设置的Job中的数据长度为80M，每次工作线程需要排序20M个int。西部数据的蓝盘，非SSD，也不是混合硬盘，纯机械硬盘。

Tested by single Xeon E5-2603 CPU at 1.8G with 4 hardware threads, each thread process 20M integers. Using WD blue disk, not SSD,.

第一个Sort遍的时间为19.111468s wall, 52.369536s user + 4.243227s system = 56.612763s CPU (296.2%)，CPU效率为296.2%/300% = 98.7%，几乎所有时间都在STL中的std::stable_sort里。

Sorting pass used total 19.11 seconds with 98.7% CPU usage.

第二个Merge遍的时间为33.082600s wall, 29.874191s user + 3.010819s system = 32.885011s CPU (99.4%)，主要还是都在磁盘写入和排序。当然这里可以为每个文件流构造一个Cache，也可以显著地提高性能，不过这里有一个问题，一旦牵涉到了Cache，则必然又有内存的占用提升，如果占用过大则又失去了Merge的意义。

这里读者可能有个问题，关于主线程中的不停new，其实从Vista开始Windows的内存分配其实已经是池化的，而且这里根本不是性能瓶颈，只有磁盘IO才是，所以这里可以不需要优化。至于架构上的提升其实也不大，因为这里不是传统的多读取者+单写入着（Multiple Reader+Single Writer）而是多读取者写入者+单写入者（Multiple Reader and Writer + Single Writer），所以在结构上和传统的消费者/生产者的多线程工作方式还是有些不同。未来会尝试Lock-Free的工作方式而不用Mutex，这个是以后的内容了。

The memory allocation in the main thread is not a bottleneck compared with the disk IO and sorting, and the memory allocation is based on pool since Vista, so here we might discard the optimization. Later the Lock-Free architecture might be implemented.

这里有全套代码。

Here is the full code.

  1 /**

  2  * Multithreading C++ Out of Core Sotring for Massive Data

  3  *

  4  * Copyright (c) 2013 Bo Zhou<[email protected]>

  5  * All rights reserved.

  6  * Redistribution and use in source and binary forms, with or without

  7  * modification, are permitted provided that the following conditions are met:

  8  *

  9  *     * Redistributions of source code must retain the above copyright

 10  *       notice, this list of conditions and the following disclaimer.

 11  *     * Redistributions in binary form must reproduce the above copyright

 12  *       notice, this list of conditions and the following disclaimer in the

 13  *       documentation and/or other materials provided with the distribution.

 14  *     * Neither the name of the University of California, Berkeley nor the

 15  *       names of its contributors may be used to endorse or promote products

 16  *       derived from this software without specific prior written permission.

 17  */

 18 

 19 #include <fstream>

 20 #include <list>

 21 #include <iostream>

 22 #include <queue>

 23 

 24 #include <boost/filesystem.hpp>

 25 #include <boost/smart_ptr.hpp>

 26 #include <boost/thread.hpp>

 27 #include <boost/timer/timer.hpp>

 28 

 29 class Job

 30 {

 31 public:

 32 

 33     Job()

 34     :

 35     m_iIndex(0),

 36     m_iNumItems(0)

 37     {

 38     }

 39 

 40     Job(int iIndex, int iNumItems, const boost::shared_array<int> & aData)

 41     :

 42     m_iIndex(iIndex),

 43     m_iNumItems(iNumItems),

 44     m_aData(aData)

 45     {

 46     }

 47 

 48     Job(const Job & cCopy)

 49     :

 50     m_iIndex(cCopy.m_iIndex),

 51     m_iNumItems(cCopy.m_iNumItems),

 52     m_aData(cCopy.m_aData)

 53     {

 54     }

 55 

 56 public:

 57 

 58     int m_iIndex;

 59     int m_iNumItems;

 60     boost::shared_array<int> m_aData;

 61 };

 62 

 63 class Context

 64 {

 65 public:

 66 

 67     Context(int iNumSortingThread)

 68     :

 69     m_iNumSortingThread(iNumSortingThread),

 70     m_bHasMoreData(true)

 71     {

 72     }

 73 

 74 public:

 75 

 76     int m_iNumSortingThread;

 77 

 78     bool m_bHasMoreData;

 79 

 80     boost::mutex m_cMutex;

 81     boost::condition_variable m_cEvent;

 82 

 83     std::list<Job > m_lJobQueue;

 84 };

 85 

 86 class SortingThread : public boost::thread

 87 {

 88 public:

 89 

 90     SortingThread(const boost::shared_ptr<Context> & pContext)

 91     :

 92     m_pContext(pContext),

 93     boost::thread(boost::bind(& SortingThread::Sort, this))

 94     {

 95     }

 96 

 97     void Sort()

 98     {

 99         while (1)

100         {

101             if (! m_pContext->m_bHasMoreData)

102             {

103                 if (! m_pContext->m_lJobQueue.size())

104                 {

105                     break;

106                 }

107             }

108 

109             Job cJob;

110             {

111                 boost::unique_lock<boost::mutex> cLock(m_pContext->m_cMutex);

112                 if (m_pContext->m_lJobQueue.size())

113                 {

114                     // Get a job.

115                     //

116                     cJob = m_pContext->m_lJobQueue.front();

117                     m_pContext->m_lJobQueue.pop_front();

118                 }

119             }

120 

121             if (cJob.m_iNumItems)

122             {

123                 std::stable_sort(cJob.m_aData.get(), cJob.m_aData.get() + cJob.m_iNumItems, std::less<int>());

124                 

125                 // Write out the sorted data.

126                 //

127                 char aBuffer[256];

128                 sprintf(aBuffer, "%.06d.tmp", cJob.m_iIndex);

129                 std::ofstream cOutput(aBuffer, std::ios_base::binary);

130                 cOutput.write(reinterpret_cast<const char *>(cJob.m_aData.get()), cJob.m_iNumItems * sizeof(int));

131             }

132 

133             // Tell the main thread we need more data here.

134             //

135             m_pContext->m_cEvent.notify_one();

136         }

137     }

138 

139 private:

140 

141     boost::shared_ptr<Context> m_pContext;

142 };

143 

144 class SortingThreadGroup : public boost::thread_group

145 {

146 public:

147 

148     SortingThreadGroup(const boost::shared_ptr<Context> & pContext)

149     :

150     m_pContext(pContext)

151     {

152         for (int i = 0; i < m_pContext->m_iNumSortingThread; ++ i)

153         {

154             SortingThread * pSortingThread = new SortingThread(pContext);

155             add_thread(pSortingThread);

156         }

157     }

158 

159 private:

160 

161     boost::shared_ptr<Context> m_pContext;

162 };

163 

164 ///////////////////////////////////////////////////////////////////////////////////////////////////

165 

166 bool Sort(const char * szPath, int iNumSortingThreads, int iNumLocalItems)

167 {

168     try

169     {

170         // Calculate real size.

171         //

172         std::ifstream cUnSortedFile(szPath, std::ios_base::binary);

173         boost::uintmax_t ullSize = boost::filesystem::file_size(szPath);

174         boost::uintmax_t ullNumItems = ullSize / 4;

175 

176         int iNumBatches = ullNumItems / iNumLocalItems;

177         std::vector<int> vNumItemsPerBatch(iNumBatches, iNumLocalItems);

178         int iNumRestItems = ullNumItems % iNumLocalItems;

179         if (iNumRestItems)

180         {

181             vNumItemsPerBatch.push_back(iNumRestItems);

182         }

183         std::cout << "Number of Items   : " << ullNumItems << std::endl

184                   << "Number of Batches : " << vNumItemsPerBatch.size() << std::endl;

185 

186         boost::shared_ptr<Context> pContext(new Context(iNumSortingThreads));

187         boost::scoped_ptr<SortingThreadGroup> pSortingThreadGroup(new SortingThreadGroup(pContext));

188 

189         boost::timer::auto_cpu_timer cTimer;

190         for (int i = 0; i < vNumItemsPerBatch.size(); ++ i)

191         {

192             boost::shared_array<int> aData(new int[vNumItemsPerBatch[i]]);

193             cUnSortedFile.read(reinterpret_cast<char *>(aData.get()), vNumItemsPerBatch[i] * sizeof(int));

194 

195             Job cJob(i, vNumItemsPerBatch[i], aData);

196 

197             //

198             boost::unique_lock<boost::mutex> cLock(pContext->m_cMutex);

199             if (pContext->m_lJobQueue.size() > iNumSortingThreads * 2)

200             {

201                 pContext->m_cEvent.wait(cLock);

202             }

203             pContext->m_lJobQueue.push_back(cJob);

204         }

205         std::cout << std::endl;

206         pContext->m_bHasMoreData = false;

207 

208         pSortingThreadGroup->join_all();

209 

210         return true;

211     }

212     catch(const std::exception & cE)

213     {

214         std::cerr << cE.what() << std::endl;

215     }

216     catch(...)

217     {

218         std::cerr << __LINE__ << std::endl;

219     }

220 

221     return false;

222 }

223 

224 ///////////////////////////////////////////////////////////////////////////////////////////////////

225 

226 bool Merge(const char * szPath, int iNumBatches)

227 {

228     try

229     {

230         //TODO : There is the limitation about the max number of opened file in process.

231         //

232         std::vector<boost::shared_ptr<std::ifstream> > vTempFiles;

233         for (int i = 0; i < iNumBatches; ++ i)

234         {

235             char aBuffer[256];

236             sprintf(aBuffer, "%.06d.tmp", i);

237             boost::shared_ptr<std::ifstream> pTempFile(new std::ifstream(aBuffer, std::ios_base::binary));

238             assert(pTempFile->is_open());

239             vTempFiles.push_back(pTempFile);

240         }

241 

242         std::ofstream cSortedFile(szPath, std::ios_base::binary);

243         if (! cSortedFile)

244         {

245             std::cerr << "Can't open " << szPath << " to write. " << std::endl;

246             return false;

247         }

248 

249         //

250         boost::timer::auto_cpu_timer cTimer;

251 

252         std::vector<int> vCache;

253         vCache.reserve(10 * 1024 * 1024);

254 

255         std::vector<int> vQueue;

256         std::vector<boost::shared_ptr<std::ifstream> >::iterator iFile = vTempFiles.begin();

257         for (; iFile != vTempFiles.end(); ++ iFile)

258         {

259             int iNumber = - 1;

260             if ((* iFile)->read(reinterpret_cast<char *>(& iNumber), sizeof(int)))

261             {

262                 vQueue.push_back(iNumber);

263             }

264         }

265         do

266         {

267             std::vector<int>::iterator iMinPos = std::min_element(vQueue.begin(), vQueue.end());

268             vCache.push_back(* iMinPos);

269             if (vCache.size() == vCache.capacity())

270             {

271                 cSortedFile.write(reinterpret_cast<const char *>(& vCache[0]), vCache.size() * sizeof(int));

272                 vCache.clear();

273             }

274 

275             iFile = vTempFiles.begin() + (iMinPos - vQueue.begin());

276             int iNumber = - 1;

277             if ((* iFile)->read(reinterpret_cast<char *>(& iNumber), sizeof(int)))

278             {

279                 (* iMinPos) = iNumber;

280             }

281             else

282             {

283                 vTempFiles.erase(iFile);

284                 vQueue.erase(iMinPos);

285             }

286 

287         } while (vQueue.size());

288         cSortedFile.write(reinterpret_cast<const char *>(& vCache[0]), vCache.size() * sizeof(int));

289 

290         return true;

291     }

292     catch(const std::exception & cE)

293     {

294         std::cerr << cE.what() << std::endl;

295     }

296     catch(...)

297     {

298         std::cerr << __LINE__ << std::endl;

299     }

300 

301     return false;

302 }

303 

304 int main(int argc, char * argv[])

305 {

306     int iRet = EXIT_FAILURE;

307 

308     //

309     char * szPath = NULL;

310 

311     int iNumSortingThreads = 0;

312     int iNumLocalItems = 0;

313 

314     int iNumBatches = 0;

315 

316     //

317     -- argc, ++ argv;

318     if (argc == 3)

319     {

320         szPath = argv[0];

321         iNumSortingThreads = atoi(argv[1]);

322         iNumLocalItems = atoi(argv[2]) * 1024 * 1024;

323         if (Sort(szPath, iNumSortingThreads, iNumLocalItems))

324         {

325             iRet = EXIT_SUCCESS;

326         }

327     }

328     else if (argc == 2)

329     {

330         szPath = argv[0];

331         iNumBatches = atoi(argv[1]);

332         if (Merge(szPath, iNumBatches))

333         {

334             iRet = EXIT_SUCCESS;

335         }

336     }

337 

338     return iRet;

339 }

View Full Code

python多线程程序设计之一 IT_Beijing_BIT #Python 程序设计语言 python
python多线程程序设计之一全局解释器锁线程APIsthreading.active_count()threading.current_thread()threading.excepthook(args,/)threading.get_native_id()threading.main_thread()threading.stack_size([size])线程对象成员函数构造器start/ru
C# 自动化 TineAine C#代码片段自动化 c#自动化模拟操作
实现的方法可能很笨，但是确实很好用usingSystem;usingSystem.Collections.Generic;usingSystem.Linq;usingSystem.Runtime.InteropServices;usingSystem.Text;usingSystem.Threading;usingSystem.Threading.Tasks;/******************
python 多线程抓取xunlei磁力下载链接 weixin_53748624 python pycharm
importurllib.requestimportreimporttimeimportthreadingclassSpider(object):def__init__(self):#定义字典，用于保存影片信息self.films_dict={}self.i=1self.lock1=threading.Lock()defstart(self):#调用下载函数，获取下载连接forpageinrang
Python 课程8-多线程编程和多进程编程可愛小吉 Python教學 python 开发语言 threading multiprocessing
前言在现代编程中，处理并发任务是提高程序性能的关键之一。Python提供了多线程（threading）和多进程（multiprocessing）两种方式来实现并发编程。多线程适用于I/O密集型任务，而多进程则更适合CPU密集型任务。通过这两种技术，你可以高效地处理大规模数据、加速程序执行并优化资源利用。在本篇详细教程中，我们将讨论如何使用Python的threading模块实现多线程，以及如何使用
WPF实现简单的9宫格键盘移动方块 no longer WPF学习 wpf
实现用电脑键盘上下左右实现方块的移动demoxaml文件代码：后台代码usingSystem;usingSystem.Collections.Generic;usingSystem.Linq;usingSystem.Text;usingSystem.Threading.Tasks;usingSystem.Windows;usingSystem.Windows.Controls;usingSyste
C++新特性以及应用场景平凡而伟大(心之所向) 编程语言 c++开发语言
C++的新特性可以大致分为以下几类：模板（Templates）：提高代码复用性，包括模板函数和模板类。异常处理（ExceptionHandling）：提供了一套结构化的错误处理机制。异步编程（ConcurrencyandMultithreading）：提供了线程和原子操作等工具。智能指针（SmartPointers）：自动管理内存，如std::unique_ptr和std::shared_ptr。
放慢速度，思考含义，细化理解 chuck_study
作者|士心先生来源|程序员的读书故事（公众号：pg_reading)细化思考.jpg我对阅读很感兴趣，它是我唯一知道的学习方法。除了阅读，我不知道还能做什么。如果读了又没有记住，我就不知道该怎么办了。在读过有关学习的研究资料后，我知道了除了被动地接收信息，还必须主动地做一些事。当然重要的是想出一个办法来检索记忆汇总的信息，因为这是你在考试时被要求做的事。如果你在学习时做不到这一点，那么在考试时间同
c# 网口通讯图像处理进阶小白 C#
一、命令行客户端程序:usingSystem;usingSystem.Collections.Generic;usingSystem.Linq;usingSystem.Text;usingSystem.Threading.Tasks;usingSystem.Net;usingSystem.Net.Sockets;usingSystem.Threading; namespaceclient{ c
【Reading207】我是你爸爸树欲静96
——沟通是建立信任的前提。《我是你爸爸》的作者是王朔，发表于1991年。王朔，1958年出生于江苏南京，代表作：《玩的就是心跳》、《看上去很美》，《动物凶猛》，《无知者无畏》。这篇小说讲述了马林生离婚后和儿子马锐的二人生活，一定程度上显示了子女和父母的内心世界。鲁迅曾经在一篇文章里写到，一个人做儿子时应当记日记，如：今天我想去公园玩，爸爸没有允许我出去。当到他做父亲时，拿出来读读，这样当他的儿子提
Lt-8 Multithreading yanlingyun0210 java
IntendedLearningOutcomesTounderstandtheconceptofconcurrency.Tounderstandthedifferenceofaprocessandathread.TodefineathreadusingtheThreadclassandRunnableinterface.TocontrolthreadswithvariousThreadmethod
kubeadm升级k8s_remote version is much newer v1 2401_86367086 kubernetes 容器云原生
可以看到我们的版本可以升级到v1.24.4###显示版本差异kubeadmupgradediff1.24.4[upgrade/diff]Readingconfigurationfromthecluster…[upgrade/diff]FYI:Youcanlookatthisconfigfilewith‘kubectl-nkube-systemgetcmkubeadm-config-oyaml’—/
【Python中处理多线程的几种方法】小九不懂SAP 我的Python日记 python 开发语言多线程
一、使用threading模块*Python的标准库提供了一个`threading`模块，它允许你创建和管理线程。*你可以通过继承`threading.Thread`类并重写其`run`方法来定义线程的行为。*你也可以使用`threading.Thread`的构造函数直接传递一个目标函数和参数来启动线程。*线程之间的同步可以使用锁（如`threading.Lock`或`threading.RLoc
【Python】超详细实例讲解python多线程（threading模块）猫猫不吃Sakana #Python自动化 python 经验分享笔记 pycharm
什么是多线程?线程（thread）是操作系统中能够进行运算的最小单位，包含于进程之中，一个进程可以有多个线程，这意味着一个进程中可以并发多个线程，即为多线程。对于一个python程序，如果需要同时大量处理多个任务，有使用多进程和多线程两种方法。在python中，实现多线程主要通过threading模块，而多进程主要通过multiprocessing模块。这两个模块的主要区别是：threading模
.NetCore里使用定时任务 AitTech .netcore c#
在.NETCore中，实现定时任务可以通过多种方式，包括使用内置的System.Threading.Timer、System.Timers.Timer，或者更高级、更灵活的库，如Hangfire、Quartz.NET或.NETCore3.0及以上版本引入的IHostedService和BackgroundService。这里主要介绍IHostedService和BackgroundService的
Python实现多线程、多进程及协程闲人编程 python python 开发语言多线程多进程协程并发异步
目录Python实现多线程、多进程及协程引言1.多线程（Threading）1.1多线程的基本概念1.2多线程的优点和缺点1.3Python多线程的实现2.多进程（Multiprocessing）2.1多进程的基本概念2.2多进程的优点和缺点2.3Python多进程的实现3.协程（Coroutine）3.1协程的基本概念3.2协程的优点和缺点3.3Python协程的实现4.三种并发模型的对比与选择
【Reading132】茨威格短篇小说树欲静96
【Reading132】茨威格短篇小说茨威格小说精选》是2003年8月文化艺术出版社出版发行的图书，作者是[奥地利]斯蒂芬·茨威格。斯蒂芬·茨威格，奥地利著名小说家、传记作家，出身于富裕的犹太家庭。青年时代在维也纳和柏林攻读哲学和文学。后去世界各地游历，结识罗曼·罗兰和罗丹等人，并受到他们的影响。第一次世界大战时从事反战工作，成为著名的和平主义者。二十年代赴苏联，认识了高尔基。1934年遭纳粹驱逐
Detecting Memory Management and Threading Bugs with Valgrind Chia-Te Kuan 分析工具交叉編譯經驗談 elasticsearch 大数据搜索引擎 git
contentAboutValgrindInstallingValgrindFromSourceFromPre-compiledBinaryPrepareFWandstandardlibrarywithsymbolPrepareFWPreparesysrootonNFSSetLD_LIBRARY_PATHandcreatesymboliclinksPrepareself-implementlibr
重点来了-口译学习的正确打开方式静秋_d191
蒙特雷国际研究院会译口译专业暑期作业蒙特雷的老师每年都会重新编写summerwork。用心程度从细节之中可以看见。以下为原文。蒙特雷版权，仅为学习交流，请勿用于商业行为。READING1.Economics,popularscience,politics,societyPickatopicandreadaboutitinCandEparalleltexts.TheyneedNOTbetransla
python测试开发基础---threading 面包会有的，牛奶也会有的。 python 开发语言
1.核心概念线程（Thread）：线程是轻量级的进程，在同一进程内可以并行执行多个任务。线程共享进程的资源，如内存和文件描述符，但每个线程有自己的执行栈和局部变量。全局解释器锁（GIL）：Python中的GIL限制了同一进程中多个线程的真正并行执行。它确保同一时间只有一个线程可以执行Python字节码，这对计算密集型任务可能会影响性能，但对于I/O密集型任务效果仍然良好。2.threading模块
python压力测试_Python 压力测试脚本 weixin_39561673 python压力测试
目的是写个脚本，起多线程去call一个接口,来测试一个并发问题。实现方案是将接口做到了一个页面中，用python的httpget请求来访问查询。importurllibimportthreadingfromtimeimportctime,sleepdeft1(func):foriinrange(10):f=urllib.urlopen("http://www.mystation.com/myint
python复制单元格格式太多_线程和多处理模块之间有什么区别？ - python weixin_39782709 python复制单元格格式太多
我正在学习如何在Python中使用threading和multiprocessing模块来并行运行某些操作并加速我的代码。我发现很难理解(也许是因为我没有任何理论背景)要理解threading.Thread()对象和multiprocessing.Process()对象之间的区别。另外，对我来说，如何实例化一个作业队列并使其只有4个(例如)并行运行，而另一个则等待资源释放后再执行，对我来说也不是很
Python多线程—threading模块详解 whoamilzq Python Python编程多线程
threading模块threading模块是Python支持的多线程编程的重要模块，该模块是在底层模块_thread的基础上开发的更高层次的多线程编程接口，提供了大量的方法和类来支持多线程编程。threading模块常用方法如下：方法功能说明threading.active_count()返回当前处于active状态的Thread对象threading.current_thread()返回当前T
2021-09-13 微笑的旗子萝卜
GeorgiaReports946NewCasesofCoronavirus,48DeathsReadingTime:1minreadSourceofphoto:news10.com946newcasesofcoronavirushavebeenregisteredinGeorgia,48peoplehavedied,4351peoplehaverecovered.Thenewlyconfirme
[HFE] U4L3 Homework TimmySHENX
本次学习内容：U4L3灰色练习册：Workbook-P50,ActivityA.Readanddraw.Workbook-P50,ActivityB.Completethesentences网络练习：Online-Spelling:Listenandtypethewordstocompletethesentences.Online-Reading:Listenandsortthewordsbyho
2021-08-23 微笑的旗子萝卜
Coronavirus:GeorgiaReports2354NewCases,60DeathsReadingTime:1minreadImagesource:GettyImages2354newcasesofcoronavirushavebeenregisteredinGeorgia,60peoplehavedied,5200peoplehaverecovered.Atotalof517,098c
C#实现文件的上传幽兰的天空 c#开发语言
usingSystem;usingSystem.IO;usingSystem.Net.Http;usingSystem.Threading.Tasks;classProgram{staticasyncTaskMain(string[]args){stringapiUrl="http://example.com/upload";//替换为你的上传API地址stringfilePath=@"C:\pa
复盘‖意识三境界－走近RIA标签阅读法 MaxTZ
之前听过经过拆书帮也参加过线下两次的活动，对RIA拆书法有过初步的了解。所以进入阅读营对便签阅读的拆书法充满着期待与憧憬。首先认识一下拆书帮的阅读大法RIA——RIA首字母提取法的字母组合。分别代表三个英文单词Reading阅读Interpretation解释说明Appropriation应用融合生活换言之，将书上的阅读内容，用自己的话来复述，回忆自己过往，是否触碰痛点？激发灵感。从而引发自己进一
Cannot read properties of undefined (reading ‘_android’) 久违的小技巧 qrcodejs2
记录：问题Cannotreadpropertiesofundefined(reading‘_android’)vue3+ts使用qrcodejs2插件生成二维码报错Cannotreadpropertiesofundefined(reading‘_android’)替换qrcodejs2使用qrcodejs2-fix1.卸载npmuninstallqrcodejs22.安装npmiqrcodejs2
【Reading111】领导力树欲静96
【Reading111】领导力——一个古老的原理，给予、索取……再给予、再索取。我们生活的这个环境无疑是地球上节奏最快也最错综复杂的环境。为了适应这种环境，我们需要有捷径。因此，我们必须要经常使用我们从经验中得来的方法，按照事物的特征将其归类。然后当某一种触发特征出现时，我们就会不加思索地做出相应反应。人们时常会说，某个人有牵着别人鼻子走的能力或者说是某个人总是倾向于根据一件事情或者一个行为对别人
继承庵下桃花仙
classCar():"""一次模拟汽车的简单尝试"""def__init__(self,make,model,year):"""初始化描述汽车属性"""self.make=makeself.model=modelself.year=yearself.odometer_reading=0defget_descriptive_name(self):"""返回整洁的描述性信息"""long_name=
Spring4.1新特性——综述 jinnianshilongnian spring 4.1
目录 Spring4.1新特性——综述 Spring4.1新特性——Spring核心部分及其他 Spring4.1新特性——Spring缓存框架增强 Spring4.1新特性——异步调用和事件机制的异常处理 Spring4.1新特性——数据库集成测试脚本初始化 Spring4.1新特性——Spring MVC增强 Spring4.1新特性——页面自动化测试框架Spring MVC T
Schema与数据类型优化 annan211 数据结构 mysql
目前商城的数据库设计真是一塌糊涂，表堆叠让人不忍直视，无脑的架构师，说了也不听。在数据库设计之初，就应该仔细揣摩可能会有哪些查询，有没有更复杂的查询，而不是仅仅突出很表面的业务需求，这样做会让你的数据库性能成倍提高，当然，丑陋的架构师是不会这样去考虑问题的。选择优化的数据类型 1 更小的通常更好更小的数据类型通常更快，因为他们占用更少的磁盘、内存和cpu缓存，
第一节 HTML概要学习 chenke html Web css
第一节 HTML概要学习 1. 什么是HTML HTML是英文Hyper Text Mark-up Language(超文本标记语言)的缩写，它规定了自己的语法规则，用来表示比“文本”更丰富的意义，比如图片，表格，链接等。浏览器（IE,FireFox等）软件知道HTML语言的语法，可以用来查看HTML文档。目前互联网上的绝大部分网页都是使用HTML编写的。打开记事本输入一下内
MyEclipse里部分习惯的更改 Array_06 eclipse
继续补充中---------------------- 1.更改自己合适快捷键windows-->prefences-->java-->editor-->Content Assist--> Activation triggers for java的右侧“.”就可以改变常用的快捷键选中 Text
近一个月的面试总结 cugfy 面试
本文是在学习中的总结，欢迎转载但请注明出处：http://blog.csdn.net/pistolove/article/details/46753275 前言打算换个工作，近一个月面试了不少的公司，下面将一些面试经验和思考分享给大家。另外校招也快要开始了，为在校的学生提供一些经验供参考，希望都能找到满意的工作。
HTML5一个小迷宫游戏 357029540 html5
通过《HTML5游戏开发》摘抄了一个小迷宫游戏，感觉还不错，可以画画，写字，把摘抄的代码放上来分享下，喜欢的同学可以拿来玩玩！ <html> <head> <title>创建运行迷宫</title> <script type="text/javascript"
10步教你上传githib数据张亚雄 git
官方的教学还有其他博客里教的都是给懂的人说得，对已我们这样对我大菜鸟只能这么来锻炼，下面先不玩什么深奥的，先暂时用着10步干净利索。等玩顺溜了再用其他的方法。操作过程（查看本目录下有哪些文件NO.1）ls （跳转到子目录NO.2）cd+空格+目录（继续NO.3）ls （匹配到子目录NO.4）cd+ 目录首写字母+tab键+（首写字母“直到你所用文件根就不再按TAB键了”）（查看文件
MongoDB常用操作命令大全 adminjun mongodb 操作命令
成功启动MongoDB后，再打开一个命令行窗口输入mongo，就可以进行数据库的一些操作。输入help可以看到基本操作命令，只是MongoDB没有创建数据库的命令，但有类似的命令如：如果你想创建一个“myTest”的数据库，先运行use myTest命令，之后就做一些操作（如：db.createCollection('user')）,这样就可以创建一个名叫“myTest”的数据库。一
bat调用jar包并传入多个参数 aijuans
下面的主程序是通过eclipse写的： 1.在Main函数接收bat文件传递的参数（String[] args）如： String ip =args[0]; String user=args[1]; &nbs
Java中对类的主动引用和被动引用 ayaoxinchao java 主动引用对类的引用被动引用类初始化
在Java代码中，有些类看上去初始化了，但其实没有。例如定义一定长度某一类型的数组，看上去数组中所有的元素已经被初始化，实际上一个都没有。对于类的初始化，虚拟机规范严格规定了只有对该类进行主动引用时，才会触发。而除此之外的所有引用方式称之为对类的被动引用，不会触发类的初始化。虚拟机规范严格地规定了有且仅有四种情况是对类的主动引用，即必须立即对类进行初始化。四种情况如下：1.遇到ne
导出数据库提示 outfile disabled BigBird2012 mysql
在windows控制台下，登陆mysql，备份数据库： mysql>mysqldump -u root -p test test > D:\test.sql 使用命令 mysqldump 格式如下： mysqldump -u root -p *** DBNAME > E:\\test.sql。注意：执行该命令的时候不要进入mysql的控制台再使用，这样会报
Javascript 中的 && 和 || bijian1013 JavaScript &&||
准备两个对象用于下面的讨论 var alice = { name: "alice", toString: function () { return this.name; } } var smith = { name: "smith",
[Zookeeper学习笔记之四]Zookeeper Client Library会话重建 bit1129 zookeeper
为了说明问题，先来看个简单的示例代码： package com.tom.zookeeper.book; import com.tom.Host; import org.apache.zookeeper.WatchedEvent; import org.apache.zookeeper.ZooKeeper; import org.apache.zookeeper.Wat
【Scala十一】Scala核心五：case模式匹配 bit1129 scala
package spark.examples.scala.grammars.caseclasses object CaseClass_Test00 { def simpleMatch(arg: Any) = arg match { case v: Int => "This is an Int" case v: (Int, String)
运维的一些面试题 yuxianhua linux
1、Linux挂载Winodws共享文件夹 mount -t cifs //1.1.1.254/ok /var/tmp/share/ -o username=administrator,password=yourpass 或 mount -t cifs -o username=xxx,password=xxxx //1.1.1.1/a /win
Java lang包-Boolean BrokenDreams boolean
Boolean类是Java中基本类型boolean的包装类。这个类比较简单，直接看源代码吧。 public final class Boolean implements java.io.Serializable,
读《研磨设计模式》-代码笔记-命令模式-Command bylijinnan java 设计模式
声明：本文只为方便我个人查阅和理解，详细的分析以及源代码请移步原作者的博客http://chjavach.iteye.com/ import java.util.ArrayList; import java.util.Collection; import java.util.List; /** * GOF 在《设计模式》一书中阐述命令模式的意图：“将一个请求封装
matlab下GPU编程笔记 cherishLC matlab
不多说，直接上代码 gpuDevice % 查看系统中的gpu,,其中的DeviceSupported会给出matlab支持的GPU个数。 g=gpuDevice(1); %会清空 GPU 1中的所有数据,,将GPU1 设为当前GPU reset(g) %也可以清空GPU中数据。 a=1; a=gpuArray(a); %将a从CPU移到GPU中 onGP
SVN安装过程 crabdave SVN
SVN安装过程 subversion-1.6.12 ./configure --prefix=/usr/local/subversion --with-apxs=/usr/local/apache2/bin/apxs --with-apr=/usr/local/apr --with-apr-util=/usr/local/apr --with-openssl=/
sql　行列转换 daizj sql 行列转换行转列列转行
行转列的思想是通过case when 来实现列转行的思想是通过union all 来实现下面具体例子：假设有张学生成绩表(tb)如下: Name Subject Result 张三语文　　74 张三数学　　83 张三物理　　93 李四语文　　74 李四数学　　84 李四物理　　94 */ /* 想变成姓名 &
MySQL--主从配置 dcj3sjt126com mysql
linux下的mysql主从配置：说明：由于MySQL不同版本之间的(二进制日志)binlog格式可能会不一样，因此最好的搭配组合是Master的MySQL版本和Slave的版本相同或者更低， Master的版本肯定不能高于Slave版本。（版本向下兼容） mysql1 : 192.168.100.1 //master mysq
关于yii 数据库添加新字段之后model类的修改 dcj3sjt126com Model
rules: array('新字段','safe','on'=>'search') 1、array('新字段', 'safe')//这个如果是要用户输入的话，要加一下， 2、array('新字段', 'numerical'),//如果是数字的话 3、array('新字段', 'length', 'max'=>100),//如果是文本 1、2、3适当的最少要加一条，新字段才会被
sublime text3 中文乱码解决 dyy_gusi Sublime Text
sublime text3中文乱码解决原因：缺少转换为UTF-8的插件目的：安装ConvertToUTF8插件包第一步：安装能自动安装插件的插件，百度“Codecs33”，然后按照步骤可以得到以下一段代码： import urllib.request,os,hashlib; h = 'eb2297e1a458f27d836c04bb0cbaf282' + 'd0e7a30980927
概念了解：CGI，FastCGI，PHP-CGI与PHP-FPM geeksun PHP
CGI CGI全称是“公共网关接口”(Common Gateway Interface)，HTTP服务器与你的或其它机器上的程序进行“交谈”的一种工具，其程序须运行在网络服务器上。 CGI可以用任何一种语言编写，只要这种语言具有标准输入、输出和环境变量。如php,perl,tcl等。 FastCGI FastCGI像是一个常驻(long-live)型的CGI，它可以一直执行着，只要激活后，不
Git push 报错 "error: failed to push some refs to " 解决 hongtoushizi git
Git push 报错 "error: failed to push some refs to " . 此问题出现的原因是：由于远程仓库中代码版本与本地不一致冲突导致的。由于我在第一次git pull --rebase 代码后，准备push的时候，有别人往线上又提交了代码。所以出现此问题。解决方案： 1： git pull 2：
第四章 Lua模块开发 jinnianshilongnian nginx lua
在实际开发中，不可能把所有代码写到一个大而全的lua文件中，需要进行分模块开发；而且模块化是高性能Lua应用的关键。使用require第一次导入模块后，所有Nginx 进程全局共享模块的数据和代码，每个Worker进程需要时会得到此模块的一个副本（Copy-On-Write），即模块可以认为是每Worker进程共享而不是每Nginx Server共享；另外注意之前我们使用init_by_lua中初
java.lang.reflect.Proxy liyonghui160com
1.简介 Proxy 提供用于创建动态代理类和实例的静态方法（1）动态代理类的属性代理类是公共的、最终的，而不是抽象的未指定代理类的非限定名称。但是，以字符串 "$Proxy" 开头的类名空间应该为代理类保留代理类扩展 java.lang.reflect.Proxy 代理类会按同一顺序准确地实现其创建时指定的接口
Java中getResourceAsStream的用法 pda158 java
1.Java中的getResourceAsStream有以下几种： 1. Class.getResourceAsStream(String path) ： path 不以’/'开头时默认是从此类所在的包下取资源，以’/'开头则是从ClassPath根下获取。其只是通过path构造一个绝对路径，最终还是由ClassLoader获取资源。　　2. Class.getClassLoader.get
spring 包官方下载地址（非maven） sinnk spring
SPRING官方网站改版后，建议都是通过 Maven和Gradle下载，对不使用Maven和Gradle开发项目的，下载就非常麻烦，下给出Spring Framework jar官方直接下载路径: http://repo.springsource.org/libs-release-local/org/springframework/spring/ s
Oracle学习笔记(7) 开发PLSQL子程序和包 vipbooks oracle sql 编程
哈哈，清明节放假回去了一下，真是太好了，回家的感觉真好啊！现在又开始出差之旅了，又好久没有来了，今天继续Oracle的学习！这是第七章的学习笔记，学习完第六章的动态SQL之后，开始要学习子程序和包的使用了……，希望大家能多给俺一些支持啊！编程时使用的工具是PLSQL

Multithreading C++ Out of Core Sotring for Massive Data|多线程C++的大规模数据外部排序

你可能感兴趣的:(reading)