Linux,unix在内核中设有 缓冲区高速缓冲或页面高速缓冲,大多数磁盘I/O都通过缓冲进行,采用延迟写技术。
sync:将所有修改过的快缓存区排入写队列,然后返回,并不等待实际写磁盘操作结束
fsync:只对有文件描述符制定的单一文件起作用,并且等待些磁盘操作结束,然后返回。
fdatasync:类似fsync,但它只影响文件的数据部分。fsync还会同步更新文件的属性。
fflush:标准I/O函数(如:fread,fwrite)会在内存建立缓冲,该函数刷新内存缓冲,将内容写入内核缓冲,要想将其写入磁盘,还需要调用fsync。(先调用fflush后调用fsync,否则不起作用)
---------------------------------------------------------------------------
主要涉及到 setbuf, fflush, fsync,sync等函数。
首先来说输入输出库的缓冲。
The three types of buffering available are unbuffered, block buffered, and line buffered. When an output stream is unbuffered, information appears on the destination file or terminal as soon as written; when it is
block buffered many characters are saved up and written as a block; when it is line buffered characters are
saved up until a newline is output or input is read from any stream attached to a terminal device (typically
stdin). The function fflush(3) may be used to force the block out early. (See fclose(3).) Normally all files
are block buffered. When the first I/O operation occurs on a file, malloc(3) is called, and a buffer is
obtained. If a stream refers to a terminal (as stdout normally does) it is line buffered. The standard error
stream stderr is always unbuffered by default.
一般来说,block buffered的效率高些,将多次的操作合并成一次操作。现在标准库里缓存一部分,
直到该缓冲区满了,或者程序显示的调用fflush时,将进行更新操作。
而setbuf 则可以设置该缓冲区的大小。
#include <stdio.h>
void setbuf(FILE *stream, char *buf);
这个函数应该必须在如何输出被写到该文件之前调用。一般放在main里靠前面的语句!
但是setbuf有个经典的错误,man手册上也提到了,c陷阱和缺陷上也提到了
You must make sure that both buf and the space it points to still exist by the time stream is closed, which also happens at program termination. For example, the following is illegal:
#include <stdio.h>
int main()
{
char buf[BUFSIZ];
setbuf(stdin, buf);
printf("Hello, world!/n");
return 0;
}
这个程序是错误的。buf缓冲区最后一次清空应该在main函数结束之后,程序交回控制给操作系统
之前C运行库所必须进行的清理工作的一部分,但是此时buf字符数组已经释放。
修改的方法是 将buf设置为static,或者全局变量; 或者调用malloc来动态申请内存。
char * malloc();
setbuf(stdout,malloc(BUFSIZE));
这里不需要判断malloc的返回值,如果malloc调用失败,将返回一个null指针,setbuf的第二个参数可以
是null,此时不进行缓冲!
对应的,fflush函数则刷新缓冲区,将缓冲区上的内容更新到文件里。
#include <stdio.h>
int fflush(FILE *stream);
The function fflush forces a write of all user-space buffered data for the given output or update stream via the stream underlying write function. The open status of the stream is unaffected.
If the stream argument is NULL, fflush flushes all open output streams.
但是fflush仅仅刷新C库里的缓冲。
其他的一些数据的刷新需要调用fsync或者sync!!
Note that fflush() only flushes the user space buffers provided by the C library. To ensure that the data is physically stored on disk the kernel buffers must be flushed too, e.g. with sync(2) or fsync(2).
fsync和sync最终将缓冲的数据更新到文件里。
#include <unistd.h>
int fsync(int fd);
fsync copies all in-core parts of a file to disk, and waits until the device reports that all parts are on stable storage. It also updates metadata stat information. It does not necessarily ensure that the entry in the directory containing the file has also reached disk. For that an explicit fsync on the file descriptor of the directory is also needed.
NAME
sync - commit buffer cache to disk
SYNOPSIS
#include <unistd.h>
void sync(void);
DESCRIPTION
sync first commits inodes to buffers, and then buffers to disk.
ERRORS
This function is always successful.
同步命令sync就直接调用了sync函数来更新磁盘上的缓冲!!
-------------------------------------------------------------------------------------------------------------