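Git is an open-source distributed source code management (SCM) system, developed by Linus Torvalds in 2005. At the time, the vendor of BitKeeper, the SCM tool used by the Linux kernel project, stopped offering it free of charge. Linus could not find an SCM tool that could replace BitKeeper and meet his requirements, so he designed and developed git himself. As for the name, Linus explains it in the README of the initial version as follows. In other words, it has no special meaning :-)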
GIT - the stupid content tracker
"git" can mean anything, depending on your mood.
- random three-letter combination that is pronounceable, and not
actually used by any common UNIX command. The fact that it is a
mispronounciation of "get" may or may not be relevant.
- stupid. contemptible and despicable. simple. Take your pick from the
dictionary of slang.
- "global information tracker": you're in a good mood, and it actually
works for you. Angels sing, and a light suddenly fills the room.
- "goddamn idiotic truckload of sh*t": when it breaks
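Compared with other SCM tools, git has the following notable features and advantages:
1) After cloning from a repository, the clone contains the complete history of changes.
2) Efficient, high-performance branch management.
3) Efficient distributed development.
Of course, git also has many other powerful and convenient features. For the git source tree, see: https://github.com/git/git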
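The initial version of git is git-0.1. It established the framework design of the object database and the cache, and implemented only the low-level commands. The commands we use today, such as git add or git rm, all invoke some low-level command with specific arguments. Each low-level command in git is compiled into its own executable. The directory structure and files of the initial version are shown in the figure below: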
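The cache.h header file contains the types and interfaces involved in the cache processing flow. These types are explained in the flow analysis. The file's contents are as follows: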
#ifndef CACHE_H
#define CACHE_H
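#include <stdio.h>
#include <sys/stat.h>
#include <fcntl.h>
#include <stddef.h>
#include <stdlib.h>
#include <stdarg.h>
#include <errno.h>
#include <sys/mman.h>
#include <openssl/sha.h>
#include <zlib.h>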
/*
 * Basic data structures for the directory cache
 *
 * NOTE NOTE NOTE! This is all in the native CPU byte format. It's
 * not even trying to be portable. It's trying to be efficient. It's
 * just a cache, after all.
 */
#define CACHE_SIGNATURE 0x44495243 /* "DIRC" */
struct cache_header {
	unsigned int signature;
	unsigned int version;
	unsigned int entries;
	unsigned char sha1[20];
};
/*
 * The "cache_time" is just the low 32 bits of the
 * time. It doesn't matter if it overflows - we only
 * check it for equality in the 32 bits we save.
 */
struct cache_time {
	unsigned int sec;
	unsigned int nsec;
};
/*
 * dev/ino/uid/gid/size are also just tracked to the low 32 bits
 * Again - this is just a (very strong in practice) heuristic that
 * the inode hasn't changed.
 */
struct cache_entry {
	struct cache_time ctime;
	struct cache_time mtime;
	unsigned int st_dev;
	unsigned int st_ino;
	unsigned int st_mode;
	unsigned int st_uid;
	unsigned int st_gid;
	unsigned int st_size;
	unsigned char sha1[20];
	unsigned short namelen;
	unsigned char name[0];
};
const char *sha1_file_directory;
struct cache_entry **active_cache;
unsigned int active_nr, active_alloc;
#define DB_ENVIRONMENT "SHA1_FILE_DIRECTORY"
#define DEFAULT_DB_ENVIRONMENT ".dircache/objects"
#define cache_entry_size(len) ((offsetof(struct cache_entry,name) + (len) + 8) & ~7)
#define ce_size(ce) cache_entry_size((ce)->namelen)
#define alloc_nr(x) (((x)+16)*3/2)
/* Initialize the cache information */
extern int read_cache(void);
/* Return a statically allocated filename matching the sha1 signature */
extern char *sha1_file_name(unsigned char *sha1);
/* Write a memory buffer out to the sha file */
extern int write_sha1_buffer(unsigned char *sha1, void *buf, unsigned int size);
/* Read and unpack a sha1 file into memory, write memory to a sha1 file */
extern void * read_sha1_file(unsigned char *sha1, char *type, unsigned long *size);
extern int write_sha1_file(char *buf, unsigned len);
/* Convert to/from hex/sha1 representation */
extern int get_sha1_hex(char *hex, unsigned char *sha1);
extern char *sha1_to_hex(unsigned char *sha1); /* static buffer! */
/* General helper functions */
extern void usage(const char *err);
#endif /* CACHE_H */
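The three macros CACHE_SIGNATURE, cache_entry_size and alloc_nr are easier to follow with concrete numbers. The sketch below repeats the relevant definitions from cache.h so that it compiles on its own. The helper functions entry_size_for, entry_padding_for, next_alloc and signature_to_text are hypothetical names introduced only for this illustration and are not part of git. It also uses a C99 flexible array member (name[]) in place of the original name[0], and assumes a platform where unsigned int is 32 bits wide.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Definitions repeated from cache.h so this sketch is self-contained. */
#define CACHE_SIGNATURE 0x44495243 /* "DIRC" */

struct cache_time {
	unsigned int sec;
	unsigned int nsec;
};

struct cache_entry {
	struct cache_time ctime;
	struct cache_time mtime;
	unsigned int st_dev;
	unsigned int st_ino;
	unsigned int st_mode;
	unsigned int st_uid;
	unsigned int st_gid;
	unsigned int st_size;
	unsigned char sha1[20];
	unsigned short namelen;
	unsigned char name[]; /* assumption: C99 spelling of name[0] */
};

#define cache_entry_size(len) ((offsetof(struct cache_entry,name) + (len) + 8) & ~7)
#define alloc_nr(x) (((x)+16)*3/2)

/* Hypothetical helper: bytes occupied by an entry whose name has len bytes.
 * Adding 8 and masking with ~7 rounds up to the next multiple of 8 that is
 * strictly greater than header + len, so at least one spare byte follows
 * the name. */
static size_t entry_size_for(size_t len)
{
	return cache_entry_size(len);
}

/* Hypothetical helper: spare bytes after the name (always 1 to 8). */
static size_t entry_padding_for(size_t len)
{
	return entry_size_for(len) - offsetof(struct cache_entry, name) - len;
}

/* Hypothetical helper: next capacity when the active_cache array is full. */
static unsigned int next_alloc(unsigned int current)
{
	return alloc_nr(current);
}

/* Hypothetical helper: spell out the signature, most significant byte first.
 * Assumes a 32-bit signature value. */
static void signature_to_text(unsigned int sig, char out[5])
{
	out[0] = (char)((sig >> 24) & 0xff);
	out[1] = (char)((sig >> 16) & 0xff);
	out[2] = (char)((sig >> 8) & 0xff);
	out[3] = (char)(sig & 0xff);
	out[4] = '\0';
}
```

Calling signature_to_text(CACHE_SIGNATURE, buf) fills buf with the four characters of the comment next to the macro, and next_alloc(0) gives the first capacity used for the entry array. The exact value of entry_size_for depends on the platform's struct layout, so the sketch only relies on the result being a multiple of 8 with at least one byte left over after the name.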
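The source code flow analysis mainly covers the following parts:
Processing flow of the init-db command
Processing flow of the update-cache command
Processing flow of the write-tree command
Processing flow of the show-diff command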