ice_elephant

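An Analysis of the Source Architecture and Implementation of the Core Storage Engine of TFS, Taobao's Distributed File System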

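Contents

Background
Design overview
Project basics
- File system interface
- Sectors
- File structure
- About inodes
- Why Taobao does not store its data as small files
- Why doesn't Taobao use ordinary files to store massive amounts of small data?
Design
- Key data structure: the hash table
Code log
- mmap_file.h
- mmap_file.cpp
- file_op.h
- main_mmap_op_file.cpp
- index_handle.cpp
- blockwritetest.cpp
Summary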

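Project basics

File system interface

A file system organizes data into files and directories, provides a file-based access interface, and protects access through permissions.

Sectors

The sector is the smallest unit of disk reads and writes; a sector is typically 512 bytes (0.5 KB).

The block is the basic unit of a file, that is, the smallest unit of file access. The most common block size is 4 KB, so eight consecutive sectors make up one block.

On Linux, the stat command shows this information for a file.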
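The same information that the stat command prints is available to a program through the POSIX stat() call. The sketch below is only an illustration: the struct FileFacts and the function read_file_facts are names made up for this example and are not part of the TFS code.

```cpp
#include <sys/stat.h>
#include <cassert>
#include <cstdint>

// Illustrative container for the fields discussed above (hypothetical name).
struct FileFacts {
    std::uint64_t inode;       // inode number, the value `ls -i` prints
    std::int64_t  size;        // logical file size in bytes
    std::int64_t  io_block;    // preferred I/O block size of the file system
    std::int64_t  units512;    // allocated space, counted in 512-byte units
};

// Fills `out` from stat(); returns false if the path cannot be examined.
bool read_file_facts(const char* path, FileFacts& out) {
    assert(path != nullptr);
    struct stat st;
    if (::stat(path, &st) != 0) {
        return false;
    }
    out.inode    = static_cast<std::uint64_t>(st.st_ino);
    out.size     = static_cast<std::int64_t>(st.st_size);
    out.io_block = static_cast<std::int64_t>(st.st_blksize);
    out.units512 = static_cast<std::int64_t>(st.st_blocks);
    return true;
}
```

On a typical Linux file system io_block comes back as 4096, which matches the "eight sectors per block" figure above, although the exact value depends on the file system.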

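File structure

Directory entry area: stores the list of files in each directory
File data: stores the contents of files
inode area (inode table): stores the information held in inodes

About inodes

inode ("index node"): stores a file's metadata, such as the file's creator, its creation date, and its size. Every inode has a number, and the operating system uses the inode number to tell files apart. Use ls -i to see inode numbers.
inode size: usually 128 or 256 bytes. The total number of inodes is fixed when the disk is formatted, typically one inode per 1 KB or 2 KB of capacity. On a 1 GB disk with 128-byte inodes and one inode per 1 KB, the inode table reaches 128 MB, about 12.5% of the whole disk (12.8% if 1 GB is counted as 1000 MB).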
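The inode table figure can be checked with a few lines of arithmetic. This is a small sketch with made-up function names; it takes 1 GB as 2^30 bytes, in which case the table is 128 MB and 12.5% of the disk (counting 1 GB as 1000 MB gives the 12.8% that is often quoted).

```cpp
#include <cassert>
#include <cstdint>

// Bytes taken by the inode table when one inode of `inode_size` bytes is
// reserved for every `bytes_per_inode` bytes of disk capacity.
std::uint64_t inode_table_bytes(std::uint64_t disk_bytes,
                                std::uint64_t bytes_per_inode,
                                std::uint64_t inode_size) {
    assert(bytes_per_inode > 0);
    return disk_bytes / bytes_per_inode * inode_size;
}

// The same quantity expressed as a fraction of the whole disk.
double inode_table_fraction(std::uint64_t disk_bytes,
                            std::uint64_t bytes_per_inode,
                            std::uint64_t inode_size) {
    assert(disk_bytes > 0);
    return static_cast<double>(
               inode_table_bytes(disk_bytes, bytes_per_inode, inode_size)) /
           static_cast<double>(disk_bytes);
}
```

The fraction is simply inode_size / bytes_per_inode, so it does not depend on the disk size: 128 / 1024 = 0.125 for any capacity.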

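Why Taobao does not store its data as small files

When a very large number of small files is accessed, the disk head has to seek and switch tracks frequently, which easily leads to long read latency.

Why doesn't Taobao use ordinary files to store massive amounts of small data?

With one ordinary file per item, inodes take up a large amount of disk space and make caching less effective.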

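Design

Data is stored in block files (usually 64 MB per block), referred to below as "blocks". Every block has a unique integer ID, and the storage a block uses is preallocated and initialized before the block is put into service.

Each block consists of an index file, a main block file, and a number of extension blocks. "Small files" are stored mainly in the main block; extension blocks mainly hold overflow data.

Each index file stores the information of its block together with the index of the "small files" in it. The index file is mapped into memory (mmap) when the service starts, which greatly speeds up file lookup. The "small file" index is implemented as a chained hash table stored inside the index file.

Every file has a file number; numbering starts at 1 and increases by one each time. The file number also serves as the key of the hash lookup that locates a "small file" by its offset in the main block or an extension block. The file number and the block ID, combined by some algorithm, yield the file name of the "small file".

Key data structure: the hash table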
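The chained hash table described in the design can be modelled in memory before looking at the on-disk version. The sketch below is an assumption-laden simplification: MetaNode and ChainedIndex are hypothetical names, nodes live in a std::vector instead of the mmap'ed index file, and a node's position (1-based, 0 meaning "none") stands in for its byte offset in that file.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical stand-in for one index record of a "small file".
struct MetaNode {
    std::uint64_t file_id;  // hash key: the file number
    std::int32_t  offset;   // where the small file starts inside the block
    std::int32_t  size;     // length of the small file in bytes
    std::int32_t  next;     // 1-based position of the next node, 0 = end of chain
};

class ChainedIndex {
public:
    explicit ChainedIndex(std::int32_t bucket_count) : buckets_(bucket_count, 0) {
        assert(bucket_count > 0);
    }

    // Appends a record at the tail of its chain; returns false if the key exists.
    bool insert(std::uint64_t file_id, std::int32_t offset, std::int32_t size) {
        std::int32_t prev = 0;
        if (find_pos(file_id, prev) != 0) return false;
        nodes_.push_back(MetaNode{file_id, offset, size, 0});
        const std::int32_t pos = static_cast<std::int32_t>(nodes_.size());
        if (prev == 0) buckets_[slot(file_id)] = pos;  // empty bucket: new head
        else nodes_[prev - 1].next = pos;              // link behind the old tail
        return true;
    }

    // Returns the record or nullptr. The pointer is invalidated by later inserts.
    const MetaNode* find(std::uint64_t file_id) const {
        std::int32_t prev = 0;
        const std::int32_t pos = find_pos(file_id, prev);
        return pos == 0 ? nullptr : &nodes_[pos - 1];
    }

private:
    std::size_t slot(std::uint64_t id) const { return id % buckets_.size(); }

    // Walks one chain; `prev` ends up as the node before the match, or the tail.
    std::int32_t find_pos(std::uint64_t id, std::int32_t& prev) const {
        prev = 0;
        for (std::int32_t pos = buckets_[slot(id)]; pos != 0; pos = nodes_[pos - 1].next) {
            if (nodes_[pos - 1].file_id == id) return pos;
            prev = pos;
        }
        return 0;
    }

    std::vector<std::int32_t> buckets_;  // head position per bucket, 0 = empty
    std::vector<MetaNode> nodes_;
};
```

Using 0 as the "no node" marker works in the real index file as well, because offset 0 is occupied by the index header and can never be the position of a record.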

Code log

File mapping class

mmap_file.h

#ifndef MY_LARGE_FILE_H
#define MY_LARGE_FILE_H

#include "Common.h"

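#define DEBUG  1
// The code is organized in layers: namespace xiaozhu, then namespace largefile

namespace xiaozhu {

	namespace largefile {
		
		struct MMapOption
		{
	
			int32_t max_mmap_size_;	   // upper limit of the mapping, in bytes
			int32_t first_mmap_size_;  // size of the first mapping, in bytes
			int32_t per_mmap_size_;	   // bytes added by each remap
		};

		class MMapfile {
		
		public:
		 
		  MMapfile();
		  explicit MMapfile(const int fd);	// explicit: no implicit conversion from int
		  MMapfile(const MMapOption & mmap_option, const int fd);
		  ~MMapfile();

		 // Ask the kernel to write the mapped memory back to disk (implemented with MS_ASYNC).
		  bool sync_file();
		  bool map_file(const bool write = false);// map the file into memory and set the access permission
		  void* get_data() const;  // start address of the mapping
		  int32_t get_size()const; // size of the mapping, in bytes
		  bool munmap_file();      // remove the mapping
		  bool remap_file();       // grow the mapping by per_mmap_size_

		private:
			bool ensure_file_size(const int32_t size); // grow the file to at least size bytes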
			
		private:
			int32_t size_;
			int fd_;
			void* data_;
			struct MMapOption  mmap_file_option_;

		};
	
	}

}

#endif

mmap_file.cpp

#include "mmap_file.h"
#include <stdio.h>
#include <string.h>
#include <errno.h>
#include <unistd.h>

#include <sys/types.h>
#include <sys/stat.h>
#include <sys/mman.h>


namespace xiaozhu {
	
	namespace largefile {
	
		// No file attached yet: -1 marks an invalid descriptor, and the options start zeroed.
		MMapfile::MMapfile() :size_(0), fd_(-1), data_(nullptr), mmap_file_option_() {

		}

		MMapfile::MMapfile(const int fd) : size_(0), fd_(fd), data_(nullptr), mmap_file_option_() {

		}

		MMapfile::MMapfile(const struct MMapOption& mmap_option, const int fd) : size_(0), fd_(fd), data_(nullptr), mmap_file_option_(mmap_option) {

		}

		MMapfile::~MMapfile() {
			
			if (data_) {
				
				if (DEBUG) printf("mmap file destruct, fd: %d, mmap_size: %d, data: %p\n", fd_, size_, data_);
				// Flush synchronously before the mapping goes away.
				msync(data_, size_, MS_SYNC);
				munmap(data_, size_);

				size_ = 0;
				data_ = nullptr;
				fd_ = -1;
				memset(&mmap_file_option_, 0, sizeof(mmap_file_option_));
			}
		}

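		bool MMapfile::sync_file() {
			
			// Only sync when something is mapped; MS_ASYNC schedules the write-back without blocking.
			if (data_ != nullptr && size_ > 0) return msync(data_, size_, MS_ASYNC) == 0;

			// Nothing is mapped, so there is nothing to sync.
			return true;
		}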

		bool MMapfile::map_file(const bool write) {// create the mapping
			int flags = PROT_READ;

			if (write) {
				flags |= PROT_WRITE;
			}

			if (fd_ < 0) {
				return false;
			}

			if (0 == mmap_file_option_.max_mmap_size_) {
				return false;
			}

			if (size_ < mmap_file_option_.max_mmap_size_) {
				
				size_ = mmap_file_option_.first_mmap_size_;

			}else {
			
				size_ = mmap_file_option_.max_mmap_size_;

			}

			if (!ensure_file_size(size_)) {
				
				fprintf(stderr, "ensure file size failed in mmap_file,size :%d\n", size_);
				return false;
			}

			data_ = mmap(0, size_, flags, MAP_SHARED, fd_, 0);

			if (data_ == MAP_FAILED) {

				fprintf(stderr, "mmap file failed :%s\n",strerror(errno));

				size_ = 0;
				fd_ = -1;
				data_ = nullptr;
				return false;
			}

			if (DEBUG) printf("mmap file successful,fd :%d mmaped size:%d data_:%p\n", fd_, size_, data_);

			return true;
		}

		void* MMapfile::get_data()const {

			return data_;
		}

		int32_t MMapfile::get_size() const{
			return size_;
		}

		bool MMapfile::munmap_file() {
		
			if (data_ == nullptr) {
				return true;	// nothing is mapped
			}

			if (munmap(data_, size_) != 0) {
				return false;
			}

			// Forget the mapping so the destructor does not unmap it a second time.
			data_ = nullptr;
			size_ = 0;
			return true;
		}
		// Grow the file so that it is at least size bytes long.
		bool MMapfile::ensure_file_size(const int32_t size) {
			struct stat s;
			if (fstat(fd_, &s) < 0) {
			
				fprintf(stderr, "fstat error, error desc: %s\n", strerror(errno));
				return false;
			
			}

			if (s.st_size < size) { // the file is shorter than requested
				if (ftruncate(fd_, size) < 0) {	
					
					fprintf(stderr, "ftruncate error, size: %d, error desc: %s\n", size, strerror(errno));
					return false;
				}
			
			}

			return true;

		}

		bool MMapfile::remap_file() {// grow the mapping

			// A remap is needed whenever the mapped size of the file changes;
			// each call grows the mapping by per_mmap_size_ bytes.
			if (fd_ < 0 || !data_) {
				fprintf(stderr, "mremap: file is not mapped yet\n");
				return false;
			}

			if (size_ == mmap_file_option_.max_mmap_size_) {
				fprintf(stderr, "already mapped to max size: %d\n", size_);
				return false;
			}
			
			int32_t new_size = mmap_file_option_.per_mmap_size_ + size_;

			if (new_size > mmap_file_option_.max_mmap_size_) {
				
				fprintf(stderr, "new size %d exceeds max size %d\n", new_size, mmap_file_option_.max_mmap_size_);
				return false;
			}

			if (!ensure_file_size(new_size)) {
				
				fprintf(stderr, "mremap failed because of ensure_file_size\n");
				return false;
			}

			if (DEBUG) printf("mremap start, fd: %d, now size: %d, new size: %d, data: %p\n", fd_, size_, new_size, data_);

			// MREMAP_MAYMOVE lets the kernel relocate the mapping if it cannot grow in place.
			void* m_remap = mremap(data_, size_, new_size, MREMAP_MAYMOVE);

			if (m_remap == MAP_FAILED) {
				
				fprintf(stderr, "mremap failed: %s\n", strerror(errno));
				return false;
			}

			if (DEBUG) printf("mremap success, fd: %d, old size: %d, new size: %d, old data: %p, new data: %p\n", fd_, size_, new_size, data_, m_remap);

			data_ = m_remap;
			size_ = new_size;
			
			return true;

		}	

	}	

}
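The grow-then-remap sequence that remap_file performs can be tried in isolation. The following is a minimal Linux-only sketch, not the article's class: the function name mmap_grow_roundtrip and the temporary file path are made up for the example, and it assumes both sizes are multiples of the page size.

```cpp
#ifndef _GNU_SOURCE
#define _GNU_SOURCE  // mremap is a Linux extension
#endif
#include <sys/mman.h>
#include <sys/types.h>
#include <unistd.h>
#include <cassert>
#include <cstddef>
#include <cstdlib>
#include <cstring>

// Maps a temporary file, grows file and mapping, and checks that data written
// before the growth is still visible afterwards. Returns true on success.
bool mmap_grow_roundtrip(std::size_t first_size, std::size_t grown_size) {
    assert(first_size > 0 && grown_size > first_size);

    char path[] = "/tmp/mmap_sketch_XXXXXX";
    const int fd = ::mkstemp(path);
    if (fd < 0) return false;
    ::unlink(path);  // the file vanishes once fd is closed

    bool ok = false;
    void* data = MAP_FAILED;
    // A file must be at least as long as the region that gets touched,
    // otherwise accessing the mapping raises SIGBUS.
    if (::ftruncate(fd, static_cast<off_t>(first_size)) == 0) {
        data = ::mmap(nullptr, first_size, PROT_READ | PROT_WRITE, MAP_SHARED, fd, 0);
    }
    if (data != MAP_FAILED) {
        std::size_t mapped = first_size;
        std::memset(data, 'A', first_size);
        if (::ftruncate(fd, static_cast<off_t>(grown_size)) == 0) {
            void* moved = ::mremap(data, first_size, grown_size, MREMAP_MAYMOVE);
            if (moved != MAP_FAILED) {
                data = moved;  // the address may have changed
                mapped = grown_size;
                char* bytes = static_cast<char*>(data);
                bytes[grown_size - 1] = 'Z';
                ok = bytes[0] == 'A' && bytes[grown_size - 1] == 'Z' &&
                     ::msync(data, mapped, MS_SYNC) == 0;
            }
        }
        ::munmap(data, mapped);
    }
    ::close(fd);
    return ok;
}
```

Calling mmap_grow_roundtrip(4096, 8192) mirrors the first_mmap_size_ and per_mmap_size_ values of 4096 used by the tests in this article. Any pointer obtained before the remap must be discarded, because MREMAP_MAYMOVE allows the kernel to move the mapping.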

File operation class

file_op.h

#ifndef FILE_OP_H
#define FILE_OP_H
#include "Common.h"

namespace xiaozhu {
		
	namespace largefile {
	
		class FileOperation {
		public:
			FileOperation(const std::string &file_Name,const int open_flags = O_RDWR |O_LARGEFILE);
			~FileOperation();

			int open_file();
			void close_file();

			int flush_file();// write the file to disk immediately (a single wrong line here once caused a nasty bug)
			
			
			// positional reads and writes at an explicit offset
			int pread_file(char* buf, const int32_t nbytes,int64_t offset);
			int pwrite_file(char* buf, const int32_t nbytes, int64_t offset);

			int write_file(char* buf, const int32_t nbytes);
			//int read_file(char* buf, const int32_t nbytes);

			int64_t get_file_size();

			int unlink_file();// delete the file
			int ftruncate_file(const int64_t length);

			int seek_file(const int64_t offset);

			int get_fd() { return fd_; }

		protected:
			int fd_;
			char* filename_;
			int open_flags_;

		protected:
			static const mode_t OPEN_MODE = 0644;
			static const int MAX_DISK_TIMES = 5;// maximum number of disk I/O attempts
		protected:
			int check_file();

		};

	}

}


#endif

file_op.cpp

#include "file_op.h"

namespace xiaozhu {
	namespace largefile {
		FileOperation::FileOperation(const std::string& file_Name, const int open_flags):fd_(-1), open_flags_(open_flags)
		{
			filename_ = strdup(file_Name.c_str());// keep a private copy of the name
		
		}


		FileOperation::~FileOperation()
		{
			if (fd_ > 0) {
			
				::close(fd_);
			}

			// strdup allocates with malloc, so the copy is released with free.
			if (filename_) {
				free(filename_);
				filename_ = nullptr;
			}
		
		}

		int FileOperation::open_file() {
			
			if (fd_ > 0) {
				close(fd_);
				fd_ = -1;
			}
			fd_ = ::open(filename_, open_flags_, OPEN_MODE);
		
			return fd_;
		}

		void FileOperation::close_file() {
			
			if (fd_ < 0) {
				return;
			}

			close(fd_);
			fd_ = -1;

		}
		int FileOperation::check_file()
		{
			if (fd_ < 0) {
				fd_ = open_file();
			}
			return fd_;
		}

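		int64_t FileOperation::get_file_size() {
			
			int fd = check_file();
			if (fd < 0) {
				return -1;
			}

			struct stat statbuf;

			// fstat returns 0 on success.
			if (fstat(fd, &statbuf) != 0) {
				return -1;
			}
			return statbuf.st_size;	// st_size is a data member, not a function
			
		}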

		int FileOperation::ftruncate_file(const int64_t length) {

			int fd = check_file();
			if (fd < 0) {
				return fd;
			}

			return ftruncate(fd, length);
		}

		int FileOperation::seek_file(const int64_t offset) {

			int fd = check_file();
			if (fd < 0) {
				return fd;
			}

			return lseek(fd, offset,SEEK_SET);
		}

		int FileOperation::flush_file() {
			
		
			if (open_flags_ & O_SYNC) {
			// With O_SYNC every write already reaches the disk, so no explicit flush is needed.
				return 0;
			}

			int fd = check_file();
			if (fd < 0) {
				return fd;
			}

			return fsync(fd);	// write the kernel buffers to disk
				
		}
		// Read exactly nbytes bytes starting at offset.
		int FileOperation::pread_file(char* buf, const int32_t nbytes, int64_t offset)
		{
			if (nbytes < 0) return 0;

			int32_t need_read = nbytes;
			int64_t cur_offset = offset;	// 64 bits, so large offsets are not truncated
			char* tmp_buf = buf;

			int i = 0;
			while (need_read > 0) {
				
				// Give up after MAX_DISK_TIMES attempts instead of looping forever.
				if (i >= MAX_DISK_TIMES) {
					break;
				}
				++i;

				if (check_file() < 0) {
					return -errno;
				}
					
				int readlen = pread64(fd_, tmp_buf, need_read, cur_offset);
				
				if (readlen < 0) {
					readlen = -errno;

					if (-readlen == EINTR || -readlen == EAGAIN) {
						continue;	// transient error: retry
					}
					else if (EBADF == -readlen) {
						fd_ = -1;	// bad descriptor: check_file() reopens it on the next pass
						continue;
					}
					else {
						continue;
					}
				}
				else if (readlen == 0) {
					break;	// end of file
				}
				else {

					need_read -= readlen;	 // bytes still to read
					tmp_buf += readlen;
					cur_offset += readlen;   // next position in the file
				}			
			}

			if (need_read != 0 ) {
				return xiaozhu::largefile::EXIT_DISK_OPER_INCOMPLETE;
			}

			return xiaozhu::largefile::TFS_SUCCESS;
		}

		// Write exactly nbytes bytes starting at offset.
		int FileOperation::pwrite_file(char *buf,const int32_t nbytes,int64_t offset) {
			if (nbytes < 0) return 0;

			int32_t need_write = nbytes;	// bytes still to write
			int64_t cur_offset = offset;	// 64 bits, so large offsets are not truncated
			char* tmp_buf = buf;

			int i = 0;
			while (need_write > 0) {

				// Give up after MAX_DISK_TIMES attempts instead of looping forever.
				if (i >= MAX_DISK_TIMES) {
					break;
				}
				++i;

				if (check_file() < 0) {
					return -errno;
				}

				int writelen = ::pwrite64(fd_, tmp_buf, need_write, cur_offset);

				if (writelen < 0) {
					writelen = -errno;

					if (-writelen == EINTR || -writelen == EAGAIN) {
						continue;	// transient error: retry
					}
					else if (EBADF == -writelen) {
						fd_ = -1;	// bad descriptor: check_file() reopens it on the next pass
						continue;
					}
					else {
						continue;
					}
				}
				else if (writelen == 0) {
					break;
				}
				else {

					need_write -= writelen;	 // bytes still to write
					tmp_buf += writelen;     // advance past the bytes already written
					cur_offset += writelen;  // next position in the file
				}
			}

			if (need_write != 0) {
				return xiaozhu::largefile::EXIT_DISK_OPER_INCOMPLETE;
			}

			return xiaozhu::largefile::TFS_SUCCESS;
		
		}


		// Write nbytes bytes at the current file position (no explicit offset).
		int FileOperation::write_file(char* buf, const int32_t nbytes)
		{
			int needwrite = nbytes;
			char* tmp_buf = buf;

			int i = 0;
			while (needwrite > 0) {

				if (i >= MAX_DISK_TIMES) {
					break;
				}
				++i;
				if (check_file() < 0) {
					
					return -errno;
				}
				int write_len = ::write(fd_, tmp_buf, needwrite);

				if (write_len < 0) {
					
					write_len = -errno;

					if (-write_len == EINTR || -write_len == EAGAIN) {
					
						continue;	// transient error: retry
					}
					else if (EBADF == -write_len) {
						
						fd_ = -1;
						return write_len;
					}
					else {
						continue;
					}
				}

				needwrite -= write_len;
				tmp_buf += write_len;	// advance the pointer past the bytes already written

			}


			if (needwrite != 0) {
				return xiaozhu::largefile::EXIT_DISK_OPER_INCOMPLETE;
			}

			return xiaozhu::largefile::TFS_SUCCESS;
		}

		int FileOperation::unlink_file() {
			
			close_file();
			return unlink(filename_);
		}
	
	}
}
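The retry loop in pread_file and pwrite_file follows a common pattern: keep calling the system call until every byte is transferred, retry transient errors a bounded number of times, and report anything else. A compact sketch of that pattern is shown below. It is not the article's class; pwrite_fully and pwrite_roundtrip_demo are hypothetical names, and the error convention (0 on success, negative errno on failure) is an assumption of this example.

```cpp
#include <sys/types.h>
#include <unistd.h>
#include <cassert>
#include <cerrno>
#include <cstdint>
#include <cstdlib>
#include <cstring>

// Writes exactly nbytes at offset. Returns 0 on success or a negative errno.
int pwrite_fully(int fd, const char* buf, std::int32_t nbytes,
                 std::int64_t offset, int max_retries = 5) {
    assert(buf != nullptr && nbytes >= 0);
    int retries = 0;
    while (nbytes > 0) {
        const ssize_t n = ::pwrite(fd, buf, static_cast<size_t>(nbytes),
                                   static_cast<off_t>(offset));
        if (n < 0) {
            const int err = errno;  // save errno before anything can overwrite it
            if ((err == EINTR || err == EAGAIN) && retries < max_retries) {
                ++retries;          // transient error: bounded retry
                continue;
            }
            return -err;
        }
        if (n == 0) return -EIO;    // no progress: give up
        buf += n;                   // a short write is normal, so advance and loop
        offset += n;
        nbytes -= static_cast<std::int32_t>(n);
    }
    return 0;
}

// Writes a short message into a temporary file and reads it back.
bool pwrite_roundtrip_demo() {
    char path[] = "/tmp/pwrite_sketch_XXXXXX";
    const int fd = ::mkstemp(path);
    if (fd < 0) return false;
    ::unlink(path);
    const char msg[] = "hello";
    char back[sizeof(msg)] = {0};
    const bool ok =
        pwrite_fully(fd, msg, static_cast<std::int32_t>(sizeof(msg)), 3) == 0 &&
        ::pread(fd, back, sizeof(back), 3) == static_cast<ssize_t>(sizeof(back)) &&
        std::memcmp(msg, back, sizeof(msg)) == 0;
    ::close(fd);
    return ok;
}
```

Only failed calls count against the retry budget here, so a large write that completes in many short pieces is not cut off early. That is a deliberate difference from a loop that counts every iteration.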

Unit test

main_mmap_op_file.cpp

#include "mmap_file_op.h"

using namespace xiaozhu;
using namespace largefile;


largefile::MMapOption map_option = { 1024 * 1000,4096,4096 };

int main(void) {


	const char* file_Name = "./test.txt";
	char write_buffer[1024 + 1];
	char read_buffer[1024 + 1];

	MMapFileOperation* mpt = new MMapFileOperation(file_Name);
	
	int ret = mpt->mmap_file(map_option);

	int fd = mpt->open_file();

	if (fd < 0) {
		fprintf(stderr, "file is not open !\n");
		exit(-1);
	}
	
	write_buffer[1024] = '\0';


	
	
	if (ret == largefile::TFS_EEROR) {

		fprintf(stderr, "largefile::TFS_ERROR mmap_file failed\n");
		exit(-1);
	}
	memset(write_buffer, '4', 1024);
	// write the buffer at offset 0
	 ret = mpt->pwrite_file(write_buffer, 1024, 0);
	if (ret == largefile::TFS_EEROR) {

		fprintf(stderr, "largefile::TFS_EEROR pwrite_file failed\n");
		exit(-1);

	}

	ret = mpt->pread_file(read_buffer, 1024, 0);
	
	if (ret == largefile::TFS_EEROR) {

		fprintf(stderr, "largefile::failed pread_file failed\n");
		exit(-1);
	}

	read_buffer[1024] = '\0';
	printf("read from buffer:%s\n", read_buffer);

	ret = mpt->flush_file();
	if (ret == largefile::TFS_EEROR) {
		fprintf(stderr, "largefile::TFS_ERROR flush_file failed\n");
		exit(-1);
	}

	ret = mpt->mumap_file();

	mpt->close_file();
	delete mpt;	// release the object allocated with new

	return 0;
}

Test results:

Fourth unit test

main_index_init_test.cpp

#include "indexHandle.h"
#include "Common.h"
#include "file_op.h"

#include <iostream>
#include <sstream>
#include <string>

static int debug = 1;

using namespace std;

using namespace xiaozhu;

const static largefile::MMapOption map_option = { 1024 * 1000,4096,4096 };// memory mapping parameters: max, first, per-remap size
const static int32_t bucket_size = 1000;
const static int32_t main_blocksize = 1024 * 1024 * 64;
static int32_t block_id = 1;

int main(int argc, char** argv) {

	std::string mainbock_path;
	std::string index_path;
	std::cout << "Please input block id: ";

	cin >> block_id;
	if (block_id < 0) {
		cerr << "Invalid blockid. exit" << endl;
		exit(-1);
	}

	std::stringstream tmp_stream;
	tmp_stream << "." << largefile::MAINBLOCK_DIR_PREFIX << block_id;
	tmp_stream >> mainbock_path;

	largefile::FileOperation* mainblock = new largefile::FileOperation(mainbock_path, O_CREAT | O_RDWR | O_LARGEFILE);

	int ret = mainblock->ftruncate_file(main_blocksize);
	if (ret != 0) {
		fprintf(stderr, "create main block %s failed. reason: %s\n", mainbock_path.c_str(), strerror(errno));
		exit(-2);
	}

	// create the index file
	largefile::IndexHandle* index_handle = new largefile::IndexHandle(".", block_id);
	if (debug) printf("init index ...\n");
	//if(index_handle->)
	ret = index_handle->create(block_id, bucket_size, map_option);

	if (ret != largefile::TFS_SUCCESS) {

		fprintf(stderr, "create index %d failed\n", block_id);
		exit(-3);
	}

	//mainblock->flush_file();
	//index_handle->

	delete mainblock;
	delete index_handle;
	return 0;
}

Header file after adding the delete and write modules: index_handle.h

index_handle.cpp

Writing a block: int IndexHandle::write_segment_meta(const uint64_t key, Meltainfo& meta)

#ifndef HANDLE_INDEX_H
#define HANDLE_INDEX_H

#include "Common.h"
#include "mmap_file_op.h"

namespace xiaozhu {
	
	namespace largefile {
		
		struct IndexHeader {
		    public:

				IndexHeader()
				{
					memset(this, 0, sizeof(IndexHeader));
				}
			
			BlockInfo block_info_;
			int32_t bucket_size_;
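			int32_t data_offset_;// next free offset in the main block, which is also the number of data bytes stored so far
			int32_t index_file_size_; // trades space for time: index_header + all buckets, which is also where the next new meta node is appended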
			int32_t free_head_offset_;
			
		};

		class IndexHandle {
			
		public :

			IndexHandle(const std::string& base_path, const uint32_t main_block_id);

			~IndexHandle();

			int create(const uint32_t logic_block_id,const int32_t bucket_size,const MMapOption map_option);// bucket_size: number of hash buckets
			int load(const uint32_t logic_block_id, const int32_t bucket_size, const MMapOption map_option);
			
			// remove: unmap the index and unlink its file
			int remove(const uint32_t logic_block_id);
			int flush();
			
			void commit_block_offset_data(const int file_size) const{
			
				reinterpret_cast<IndexHeader*>(file_op_->get_map_data())->data_offset_ += file_size;
				
			}
			
			int updata_block_info(const OperType  oper_type,const uint32_t modify_size);

			IndexHeader* index_header() {

			return reinterpret_cast< IndexHeader* >(file_op_->get_map_data());
			}

			BlockInfo* block_info() {

				return reinterpret_cast<BlockInfo*>(file_op_->get_map_data());
			}

			int32_t bucket_sizes()const{
				
				return  reinterpret_cast<IndexHeader*>(file_op_->get_map_data())->bucket_size_;	// same value as index_header()->bucket_size_
			}
			
			int32_t get_block_data_offset()const{

				return reinterpret_cast<IndexHeader*>(file_op_->get_map_data())->data_offset_;
			}

			int32_t free_head_offset() {
				return reinterpret_cast<IndexHeader*>(file_op_->get_map_data())->free_head_offset_;
			}

			int32_t* bucket_slot() {
				
			return reinterpret_cast<int32_t*>(reinterpret_cast<char*> (file_op_->get_map_data())+ sizeof(IndexHeader));
			
			}
			int write_segment_meta(const uint64_t key,Meltainfo &meta);
			int read_sengment_meta(const uint64_t key, Meltainfo& meta);
			int32_t delete_segment_meta(const uint64_t key);
			
			int hash_find(const uint64_t key, int32_t& current_offset, int32_t& previous_offset);
			int32_t hash_insert(const uint64_t key,int32_t previous,Meltainfo &meta);

		private:
			MMapFileOperation* file_op_;
			bool is_load_;
			bool hash_compare(int64_t left,int64_t right);
	
		};
			

	}
}

#endif
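The accessors above imply a fixed layout for the index file: the IndexHeader comes first, the array of bucket slots follows it directly, and meta nodes are appended after the last slot. The arithmetic can be sketched as follows. BlockInfo is not shown in this article, so the struct below is an assumption: seven 32-bit fields inferred from the names that the debug output prints. The suffix Sketch marks everything here as hypothetical.

```cpp
#include <cassert>
#include <cstdint>

// Assumed layout of BlockInfo (not taken from the original Common.h).
struct BlockInfoSketch {
    std::uint32_t block_id_;
    std::int32_t  version_;
    std::int32_t  file_count_;
    std::int32_t  size_;
    std::int32_t  del_file_count_;
    std::int32_t  del_size_;
    std::uint32_t seq_no_;
};

// Mirrors the fields of IndexHeader shown above.
struct IndexHeaderSketch {
    BlockInfoSketch block_info_;
    std::int32_t bucket_size_;
    std::int32_t data_offset_;
    std::int32_t index_file_size_;
    std::int32_t free_head_offset_;
};

// Byte offset of bucket slot number `slot` inside the index file.
inline std::int32_t bucket_slot_offset(std::int32_t slot) {
    assert(slot >= 0);
    return static_cast<std::int32_t>(sizeof(IndexHeaderSketch)) +
           slot * static_cast<std::int32_t>(sizeof(std::int32_t));
}

// Size of a freshly created index file, which is also the offset at which
// the first meta node will be appended.
inline std::int32_t initial_index_file_size(std::int32_t bucket_count) {
    return bucket_slot_offset(bucket_count);
}
```

Under this assumed layout the header is 44 bytes, so an index with 1000 buckets starts out at 44 + 1000 * 4 = 4044 bytes and fits inside the 4096-byte first mapping that the tests request. Because BlockInfo is the first member of the header, the same mapped address can be read either as an IndexHeader or as a BlockInfo, which is what index_header() and block_info() rely on.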

Unit test

blockwritetest.cpp

#include "indexHandle.h"
#include "Common.h"
#include "file_op.h"

#include <iostream>
#include <sstream>
#include <string>

static int debug = 1;

using namespace std;

using namespace xiaozhu;

const static largefile::MMapOption map_option = { 1024 * 1000,4096,4096 };// memory mapping parameters: max, first, per-remap size
const static int32_t bucket_size = 1000;
const static int32_t main_blocksize = 1024 * 1024 * 64;
static int32_t block_id = 1;

int main(int argc, char** argv) {

	std::string mainbock_path;
	std::string index_path;
	std::cout << "Please input block id: ";

	cin >> block_id;
	if (block_id < 0) {
		cerr << "Invalid blockid. exit" << endl;
		exit(-1);
	}
	int ret;
	// Open the index file. Creation stays commented out: this test loads an existing index.
	largefile::IndexHandle* index_handle = new largefile::IndexHandle(".", block_id);
	//if (debug) printf("create index...\n");
	//ret = index_handle->create(block_id, bucket_size, map_option);
	//if (ret != largefile::TFS_SUCCESS) {
	//	fprintf(stderr, "create index %d failed\n", block_id);
	//	exit(-3);
	//}


	if (debug) printf("load index ...\n");
	ret = index_handle->load(block_id, bucket_size, map_option);

	if (ret != largefile::TFS_SUCCESS) {

		fprintf(stderr, "load index %d failed\n", block_id);
		exit(-2);
	}

	// write the file into the main block file
	std::stringstream tmp_stream;
	tmp_stream << "." << largefile::MAINBLOCK_DIR_PREFIX << block_id;
	tmp_stream >> mainbock_path;

	//cout << "mainblock_path:" << mainbock_path << endl;
	largefile::FileOperation* mainblock = new largefile::FileOperation(mainbock_path, O_CREAT | O_RDWR | O_LARGEFILE);
	mainblock->ftruncate_file(main_blocksize);


	char buffer[4096];
	memset(buffer, '3', sizeof(buffer));
	buffer[4095] = '\0';

	int32_t data_offset = index_handle->get_block_data_offset();


	uint32_t file_no = index_handle->block_info()->seq_no_;

	ret = mainblock->pwrite_file(buffer, sizeof(buffer), data_offset);
	if (ret != largefile::TFS_SUCCESS) {
		fprintf(stderr, "write to main block failed. reason: %s\n", strerror(errno));
		delete mainblock;
		delete index_handle;
		return ret;
	}
	// write the meta info
	largefile::Meltainfo meta;
	meta.set_filed(file_no);
	meta.set_offset(data_offset);
	meta.set_size(sizeof(buffer));
	//meta.set_key(block_id);
	ret = index_handle->write_segment_meta(meta.get_key(), meta);
	//index_handle->index_header()->data_offset_;
	if (ret == largefile::TFS_SUCCESS) {
		index_handle->commit_block_offset_data(sizeof(buffer));
		// update the block info stored in the index
		index_handle->updata_block_info(largefile::C_OPER_INSERT, sizeof(buffer));
		ret = index_handle->flush();
		if (ret != largefile::TFS_SUCCESS) {
			fprintf(stderr, "flush failed. mainblock: %d, file no: %u\n", block_id, file_no);
		}

	}
	else {
		fprintf(stderr, "write_segment_meta failed. mainblock: %d, file no: %u\n", block_id, file_no);
	}

	if (ret != largefile::TFS_SUCCESS)
	{// the write failed

		fprintf(stderr, "write to mainblock: %d failed. file no: %u\n", block_id, file_no);
	}
	else {
		printf("write successfully. file no: %u, block id: %d\n", file_no, block_id);

	}

	//index_handle->flush();
	mainblock->close_file();

	delete mainblock;
	delete index_handle;

	return 0;
}

indexhandle.cpp after the additions

#include "indexHandle.h"

#include <sstream>

namespace xiaozhu {

	namespace largefile {

		IndexHandle::IndexHandle(const std::string& base_path, const uint32_t main_block_id) {

			// build the index file path and create file_op_
			std::stringstream tmp_stream;
			tmp_stream << base_path << INDEX_DIR_PREFIX << main_block_id;

			std::string index_path;
			tmp_stream >> index_path;

			file_op_ = new MMapFileOperation(index_path, O_CREAT | O_RDWR | O_LARGEFILE);
			is_load_ = false;
		}


		IndexHandle::~IndexHandle()
		{
			if (file_op_) {
				delete file_op_;
				file_op_ = nullptr;
			}
		}

		int IndexHandle::create(const uint32_t logic_block_id, const int32_t bucket_size, const MMapOption map_option)
		{

			int ret;
			if (DEBUG) {
				printf("logic_block_id:%u,bucket_size:%d,mmap_option.max_mmap_size:%d,mmap_option.first_mmap_size:%d,mmap_option.per_mmap_size:%d\n", logic_block_id, bucket_size, map_option.max_mmap_size_, map_option.first_mmap_size_
					, map_option.per_mmap_size_);
			}

			if (is_load_) {
				return xiaozhu::largefile::EXIT_INDEX_ALREADY_LOAD;
			}

			int64_t file_size = file_op_->get_file_size();

			if (file_size < 0) {
				return TFS_EEROR;
			}
			else if (file_size == 0) {
				// build the index header
				IndexHeader i_header;
				i_header.block_info_.block_id_ = logic_block_id;
				i_header.block_info_.seq_no_ = 1;
				i_header.bucket_size_ = bucket_size;	// number of hash buckets

				i_header.index_file_size_ = sizeof(IndexHeader) + bucket_size * sizeof(int32_t);

				char* init_data = new char[i_header.index_file_size_];
				memcpy(init_data, &i_header, sizeof(IndexHeader));
				memset(init_data + sizeof(IndexHeader), 0, i_header.index_file_size_ - sizeof(IndexHeader));


				ret = file_op_->pwrite_file(init_data, i_header.index_file_size_, 0);

				delete [] init_data;
				init_data = nullptr;

				if (ret != largefile::TFS_SUCCESS) {
					return ret;
				}

				ret = file_op_->flush_file();
				if (ret != largefile::TFS_SUCCESS) {
					return ret;
				}
			}
			else {

				return largefile::EXIT_META_UNEXPECT_FOUND_ERROR;

			}

			ret = file_op_->mmap_file(map_option);

			// Check the result before touching the mapped header.
			if (ret != largefile::TFS_SUCCESS) {

				return ret;
			}

			if (DEBUG) printf("bucket_sizes():%d,index_header bucket_size_:%d\n", bucket_sizes(), index_header()->bucket_size_);

			is_load_ = true;
			if (DEBUG) {
				printf("init block_id:%u index successful. index file size:%d,bucket_size:%d,free head offset:%d,seqno:%d,size:%d,filecount:%d,del_size:%d,del_file_count:%d,version:%d\n",
					logic_block_id, index_header()->index_file_size_,
					index_header()->bucket_size_, index_header()->free_head_offset_, block_info()->seq_no_, block_info()->size_,
					block_info()->file_count_, block_info()->del_size_, block_info()->del_file_count_, block_info()->version_);

			}

			return ret;
		}

		int IndexHandle::load(const uint32_t logic_block_id, const int32_t bucket_size, const MMapOption map_option)
		{
			int ret = largefile::TFS_SUCCESS;
			if (is_load_) {
				printf("EXIT_INDEX_ALREADY_LOAD \n");
				return EXIT_INDEX_ALREADY_LOAD;
			}

			int64_t file_size = file_op_->get_file_size();

			if (file_size < 0)
			{
				return file_size;
			}
			else if (file_size == 0) {
				
				printf("file_size equal zero\n");
				
				return EXIT_INDEX_CORRUPT_EEROR;

			}

			MMapOption tmp_option = map_option;
			// If the index file has outgrown the first mapping size, map the whole file in one go.
			if (tmp_option.first_mmap_size_ < file_size && file_size <= map_option.max_mmap_size_)
			{
				tmp_option.first_mmap_size_ = static_cast<int32_t>(file_size);
			}
			ret = file_op_->mmap_file(tmp_option);
			if (ret != TFS_SUCCESS) {
				return ret;
			}
			if (0 == block_info()->block_id_ || 0 == (bucket_sizes())) {

				fprintf(stderr, "index corrupt. blockid:%u,bucket_size:%d\n", block_info()->block_id_, index_header()->bucket_size_);
				return EXIT_INDEX_CORRUPT_EEROR;
			}

			int index_file_size = sizeof(IndexHeader) + bucket_sizes() * sizeof(int32_t);


			if (file_size < index_file_size) {

				fprintf(stderr, "index corrupt. file size:%lld is smaller than the minimum index size:%d\n", static_cast<long long>(file_size), index_file_size);
				return EXIT_INDEX_CORRUPT_EEROR;
			}

			if (logic_block_id != block_info()->block_id_) {

				fprintf(stderr, "block id conflict. logic_block_id:%u, block_info()->block_id_:%u\n", logic_block_id, block_info()->block_id_);
			}

			if (bucket_sizes() != bucket_size) {
			
				fprintf(stderr, "bucket size mismatch. stored bucket_size:%d, requested bucket_size:%d\n", bucket_sizes(), bucket_size);
			
			}

			is_load_ = true;

			if (DEBUG) {
				printf("load block_id:%u index successful. index file size:%d,bucket_size:%d,free head offset:%d,seqno:%d,size:%d,filecount:%d,del_size:%d,del_file_count:%d,version:%d\n",
					logic_block_id, index_header()->index_file_size_,
					index_header()->bucket_size_, index_header()->free_head_offset_, block_info()->seq_no_, block_info()->size_,
					block_info()->file_count_, block_info()->del_size_, block_info()->del_file_count_, block_info()->version_);

			}
			return TFS_SUCCESS;

		}

		int IndexHandle::remove(const uint32_t logic_block_id)
		{
			if (logic_block_id != block_info()->block_id_) {

				fprintf(stderr, "logic_block_id:%u is not equal to the saved block_id_:%u\n",
					logic_block_id, block_info()->block_id_);
			}

			int ret = file_op_->mumap_file();

			if (ret != TFS_SUCCESS) {

				return ret;
			}

			ret = file_op_->unlink_file();

			return ret;

		}

		int IndexHandle::flush()
		{
			int ret = file_op_->flush_file();
			if (ret != largefile::TFS_SUCCESS) {

				fprintf(stderr, "index flush fail,ret :%d ,error desc:%s\n", ret, strerror(errno));
			}
			return ret;
		}

		int IndexHandle::updata_block_info(const OperType  oper_type, const uint32_t modify_size)
		{
			if (block_info()->block_id_ == 0) {

				return EXIT_BLOCK_ID_ZERO_ERROR;
			}
			else if (oper_type == OperType::C_OPER_INSERT) {

				++block_info()->file_count_;
				++block_info()->version_;
				++block_info()->seq_no_;
				block_info()->size_ += modify_size;

			}
			else if (oper_type == OperType::C_OPER_DELET) {
				--block_info()->file_count_;
				++block_info()->version_;
				// seq_no_ is left unchanged on delete
				block_info()->size_ -= modify_size;
				++block_info()->del_file_count_;
				block_info()->del_size_ += modify_size;
			}

	
			if (DEBUG) {
				printf("update blockinfo()\n");
				printf("block_id:%d update successful. data_offset_:%d,bucket_size:%d,free head offset:%d,seqno:%d,size:%d,filecount:%d,del_size:%d,del_file_count:%d,version:%d,oper_type:%d\n",
					block_info()->block_id_, index_header()->data_offset_, 
					index_header()->bucket_size_, index_header()->free_head_offset_, block_info()->seq_no_, block_info()->size_,
					block_info()->file_count_, block_info()->del_size_, block_info()->del_file_count_, block_info()->version_, oper_type);

			}
			return TFS_SUCCESS;
		}

		// Insert the meta info for key into the index; an existing key is rejected.
		int IndexHandle::write_segment_meta(const uint64_t key, Meltainfo& meta)
		{
			int32_t current_offset = 0, previous_offset = 0;

			int ret = hash_find(key, current_offset, previous_offset);

			// The key already exists, so it is not inserted again.
			if (ret == TFS_SUCCESS) {
				fprintf(stderr, "write_segment_meta: key already exists\n");
				return EXIT_META_UNEXPECT_FOUND_ERROR;
			}
			else if (ret != EXIT_META_INFO_IS_NOT_EXIT) {
				fprintf(stderr, "write_segment_meta: hash_find failed, ret:%d\n", ret);
				return ret;

			}

			// Not found: previous_offset is the tail of the chain (0 for an empty bucket).
			ret = hash_insert(key, previous_offset, meta);
			return ret;
		}


		int IndexHandle::read_sengment_meta(const uint64_t key, Meltainfo& meta)
		{
			int32_t current_offset = 0, previous_offset = 0;

			int ret = hash_find(key, current_offset, previous_offset);

			if (ret != TFS_SUCCESS) {	
				fprintf(stderr, "key does not exist\n");
				return largefile::EXIT_META_INFO_IS_NOT_EXIT;
			}

			// Return the read result so that an I/O failure is not reported as success.
			return file_op_->pread_file(reinterpret_cast<char*>(&meta), sizeof(meta), current_offset);
		}

		int32_t IndexHandle::delete_segment_meta(const uint64_t key)
		{
			int32_t current_offset, previous_offset = 0;

			int ret = hash_find(key, current_offset, previous_offset);

			if (ret != TFS_SUCCESS) {
				
			
				return ret;
			}

			Meltainfo meta_info;

			ret = file_op_->pread_file(reinterpret_cast<char*>(&meta_info), sizeof(meta_info), current_offset);
			if (ret != TFS_SUCCESS) {
				
				return ret;
			}

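			int next_pos = meta_info.get_next_meta_info(); // offset of the node after the current one

			if (previous_offset == 0) {
				
				// The node is the head of its chain: point the bucket at the next node.
				// The slot is computed the same way as in hash_insert.
				int32_t slot = static_cast<uint32_t>(key) % bucket_sizes();
				bucket_slot()[slot] = next_pos;
			}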
			else {
				Meltainfo pre_meta_info;
				ret = file_op_->pread_file(reinterpret_cast<char*>(&pre_meta_info), sizeof(pre_meta_info),previous_offset);

				if (TFS_SUCCESS != ret) {
					
					return ret;
				}

				pre_meta_info.set_next_meta_offset(next_pos);
				ret = file_op_->pwrite_file(reinterpret_cast<char*>(&pre_meta_info), sizeof(Meltainfo), previous_offset);
				if (TFS_SUCCESS != ret) {
					
					return ret;
				}
			}

			// Push the deleted node onto the free list so that a later insert can reuse it.
			meta_info.set_next_meta_offset(free_head_offset());
			ret = file_op_->pwrite_file(reinterpret_cast<char*>(&meta_info), sizeof(Meltainfo),current_offset);
			if (TFS_SUCCESS != ret) {

				return ret;
			}
			
			index_header()->free_head_offset_ = current_offset;

			updata_block_info(C_OPER_DELET, meta_info.get_size());

			if (DEBUG) printf("delete_segment_meta: metainfo at offset %d is now reusable\n", current_offset);

			return TFS_SUCCESS;
		}

		int IndexHandle::hash_find(const uint64_t key, int32_t& current_offset, int32_t& previous_offset)
		{
			current_offset = 0;
			previous_offset = 0;
			Meltainfo meta;
			int ret = TFS_SUCCESS;
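			// Find the bucket, then walk its chain.

			int32_t slot = static_cast<uint32_t>(key) % bucket_sizes();	// same slot computation as hash_insert
			int32_t pos = bucket_slot()[slot];	// offset of the first node in the chain, 0 if the bucket is empty

			// Read each stored meta info by its offset.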

			for (; pos != 0;) {

				ret = file_op_->pread_file(reinterpret_cast<char*>(&meta), sizeof(Meltainfo), pos);

				if (ret != TFS_SUCCESS) {
					return ret;
				}

				if (hash_compare(meta.get_key(), key)) {

					current_offset = pos;
					return TFS_SUCCESS;
				}

				previous_offset = pos;
				pos = meta.get_next_meta_info();
			}

			return EXIT_META_INFO_IS_NOT_EXIT;
		}

		int32_t IndexHandle::hash_insert(const uint64_t key, int32_t previous_offset, Meltainfo& meta)
		{
			int32_t slot = static_cast<uint32_t> (key) % bucket_sizes();//const 类型强转
			//printf("slot:%d\n", slot);
			int ret;
			int current_offset;
			Meltainfo tmp_meta;
			//确定 metainfo 存储在文件中的偏移量
			if (free_head_offset() != 0) {
				ret = file_op_->pread_file(reinterpret_cast<char*>(&tmp_meta), sizeof(Meltainfo), free_head_offset());
				if (ret != TFS_SUCCESS) {
					printf("free_head_offset failed\n");
					return ret;

				}
				current_offset = index_header()->free_head_offset_;
				if (DEBUG) printf("reuse metainfo,current_offset:%d \n", current_offset);
				index_header()->free_head_offset_ = tmp_meta.get_next_meta_info();		
			}
			else {
				current_offset = index_header()->index_file_size_;
				index_header()->index_file_size_ += sizeof(Meltainfo);
			}

	
			printf("------------------------hash_insert index_header()->index_file_size_:%d--------------------\n", index_header()->index_file_size_);
			//第三步将 matainfo 写入索引文件
			meta.set_next_meta_offset(0);
			
			ret = file_op_->pwrite_file(reinterpret_cast<char*> (&meta), sizeof(Meltainfo), current_offset);
			//拿到上一个mate
			
			if (ret != TFS_SUCCESS) {

				index_header()->index_file_size_ -= sizeof(Meltainfo);
				return ret;
			}

			// link the new node into the hash chain
			if (0 != previous_offset) {
				ret = file_op_->pread_file(reinterpret_cast<char*>(&tmp_meta), sizeof(Meltainfo), previous_offset);

				if (ret != TFS_SUCCESS) {
					index_header()->index_file_size_ -= sizeof(Meltainfo);
					return ret;
				}

				// point the previous node (not the new one) at the new node, then write it back
				tmp_meta.set_next_meta_offset(current_offset);

				ret = file_op_->pwrite_file(reinterpret_cast<char*>(&tmp_meta), sizeof(Meltainfo), previous_offset);

				if (ret != TFS_SUCCESS) {

					index_header()->index_file_size_ -= sizeof(Meltainfo);
					return ret;

				}

			}
			else {
				printf("index_header()->index_file_size_:%d\n", index_header()->index_file_size_);
				printf("bucket_slot():%d  slot:%d\n",bucket_slot()[slot],slot);

				bucket_slot()[slot] = current_offset;
			}
			return TFS_SUCCESS;

		}

		bool IndexHandle::hash_compare(int64_t left, int64_t right)
		{

			return left == right;
		}

	}
}
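
The chain logic in hash_find and hash_insert is easier to follow without the file I/O around it. Below is a minimal sketch, assuming an in-memory byte vector in place of the index file; ChainIndex, Node, find, insert and remove are hypothetical names chosen for illustration, not the article's classes. As in the listing above, offset 0 means "no node", a miss leaves the chain tail in prev, deleted nodes go onto a free list, and insert reuses a free node before growing the file.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <vector>

// Hypothetical stand-ins, not the article's classes: ChainIndex plays the role
// of IndexHandle, Node the role of the meta record, and file_ the index file.
struct Node {
    uint64_t key;
    int32_t  next;  // offset of the next node in the chain; 0 means "end"
};

class ChainIndex {
public:
    explicit ChainIndex(std::size_t buckets)
        : buckets_(buckets, 0), file_(sizeof(Node), '\0') {}  // offset 0 is reserved

    // On a hit returns true with cur = node offset. On a miss, prev is the
    // chain tail (0 if the bucket is empty), which insert() links from.
    bool find(uint64_t key, int32_t& cur, int32_t& prev) const {
        cur = 0;
        prev = 0;
        for (int32_t pos = buckets_[slot(key)]; pos != 0;) {
            Node n = read(pos);
            if (n.key == key) { cur = pos; return true; }
            prev = pos;
            pos = n.next;
        }
        return false;
    }

    int32_t insert(uint64_t key) {
        int32_t cur, prev;
        if (find(key, cur, prev)) return cur;  // already present
        int32_t off;
        if (free_head_ != 0) {                 // reuse a deleted node first
            off = free_head_;
            free_head_ = read(off).next;
        } else {                               // otherwise append to the file
            off = static_cast<int32_t>(file_.size());
            file_.resize(file_.size() + sizeof(Node));
        }
        write(off, Node{key, 0});
        if (prev != 0) {                       // patch the previous node, not the new one
            Node p = read(prev);
            p.next = off;
            write(prev, p);
        } else {
            buckets_[slot(key)] = off;         // first node in this bucket
        }
        return off;
    }

    bool remove(uint64_t key) {
        int32_t cur, prev;
        if (!find(key, cur, prev)) return false;
        Node n = read(cur);
        if (prev != 0) {
            Node p = read(prev);
            p.next = n.next;
            write(prev, p);
        } else {
            buckets_[slot(key)] = n.next;
        }
        n.next = free_head_;                   // push the node onto the free list
        write(cur, n);
        free_head_ = cur;
        return true;
    }

private:
    std::size_t slot(uint64_t key) const {
        return static_cast<uint32_t>(key) % buckets_.size();
    }
    Node read(int32_t off) const {
        assert(off > 0 && static_cast<std::size_t>(off) + sizeof(Node) <= file_.size());
        Node n;
        std::memcpy(&n, file_.data() + off, sizeof(Node));
        return n;
    }
    void write(int32_t off, const Node& n) {
        assert(off > 0 && static_cast<std::size_t>(off) + sizeof(Node) <= file_.size());
        std::memcpy(file_.data() + off, &n, sizeof(Node));
    }

    std::vector<int32_t> buckets_;  // bucket -> offset of first node
    std::vector<char> file_;        // in-memory stand-in for the index file
    int32_t free_head_ = 0;         // head of the list of reusable nodes
};
```

With four buckets, keys 1, 5 and 9 all land in the same bucket, so inserting 1 and 5, removing 1, and then inserting 9 exercises chaining, unlinking and node reuse in turn. The real engine does the same steps through pread_file and pwrite_file on the mapped index file, with error handling at each call.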

Testing reads and deletion with reusable nodes: mainblockwrite.cpp

Summary

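At a high level, this project, the core storage engine of the Taobao distributed file system (TFS), is about managing files with files. Put that bluntly it sounds abstract, and when I started I had my doubts: why manage files with files? Isn't it fine to let the operating system manage them for us? Why write our own program? Later I learned the reason. Taobao's data volume is enormous. Accessing data on disk is far slower than accessing memory (roughly one ten-thousandth of the speed), yet the data cannot all be kept in memory, because memory is limited in size and expensive. Disk access is slow because reading a file means moving the disk head. Head movement is time-consuming, but the head has to be positioned on the file, so the seek cost cannot be avoided, and with huge numbers of small files it is paid again and again. The engineers at Alibaba who designed TFS therefore stopped relying on the operating system to locate every small file: small files are packed into large block files, and a dedicated index file, mapped into memory, records where each one lives. This avoids the per-file lookups and head movement that the file system would otherwise incur. In essence the idea trades space for time: relatively inexpensive disk space is spent to buy faster file access. In my view this large-file design is among the best in the industry, and it is remarkably elegant.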
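
The "index file manages the files" idea can be illustrated with a toy example. This is a minimal sketch, assuming a single in-memory string in place of the main block file and an unordered_map in place of the memory-mapped index; PackedBlock, Location, put and get are hypothetical names, not part of the project's code.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <string>
#include <unordered_map>

// Hypothetical names for illustration; not the article's classes.
struct Location {
    std::size_t offset;  // where the small file starts inside the block
    std::size_t size;    // how many bytes it occupies
};

class PackedBlock {
public:
    // Appends one small file to the block and returns its file id.
    uint64_t put(const std::string& content) {
        const uint64_t id = next_id_++;
        index_[id] = Location{data_.size(), content.size()};
        data_ += content;
        return id;
    }

    // Looks the id up in the in-memory index, then reads one contiguous range.
    bool get(uint64_t id, std::string& out) const {
        const auto it = index_.find(id);
        if (it == index_.end()) return false;
        assert(it->second.offset + it->second.size <= data_.size());
        out = data_.substr(it->second.offset, it->second.size);
        return true;
    }

private:
    std::string data_;                              // stands in for the main block file
    std::unordered_map<uint64_t, Location> index_;  // stands in for the mmap'ed index file
    uint64_t next_id_ = 1;                          // ids start at 1 and increase
};
```

A read costs one in-memory lookup plus one contiguous read from the block, instead of a directory and inode lookup for each small file. The sketch leaves out what the real engine adds on top: a fixed block size, extension blocks for overflow, deletion, and persistence of the index.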

