python difflib 比较2个文件, 自动生成html

1. 起因介绍

偶然读到 difflib 的源码,发现这个还挺有用的。

difflib.py 源码里面写到:

See tools/scripts/diff.py for an example usage of this class.

这里例子的实际位置是:

C:\Users\Administrator\AppData\Local\Programs\Python\Python311\Lib\diff.py

(我把 diff.py 复制一份,然后重命名为  get_diff.py )
 内容如下:

#!/usr/bin/env python3
""" Command line interface to difflib.py providing diffs in four formats:

* ndiff:    lists every line and highlights interline changes.
* context:  highlights clusters of changes in a before/after format.
* unified:  highlights clusters of changes in an inline format.
* html:     generates side by side comparison with change highlights.

"""

import sys, os, difflib, argparse
from datetime import datetime, timezone

def file_mtime(path):
    t = datetime.fromtimestamp(os.stat(path).st_mtime,
                               timezone.utc)
    return t.astimezone().isoformat()

def main():

    parser = argparse.ArgumentParser()
    parser.add_argument('-c', action='store_true', default=False,
                        help='Produce a context format diff (default)')
    parser.add_argument('-u', action='store_true', default=False,
                        help='Produce a unified format diff')
    parser.add_argument('-m', action='store_true', default=False,
                        help='Produce HTML side by side diff '
                             '(can use -c and -l in conjunction)')
    parser.add_argument('-n', action='store_true', default=False,
                        help='Produce a ndiff format diff')
    parser.add_argument('-l', '--lines', type=int, default=3,
                        help='Set number of context lines (default 3)')
    parser.add_argument('fromfile')
    parser.add_argument('tofile')
    options = parser.parse_args()

    n = options.lines
    fromfile = options.fromfile
    tofile = options.tofile

    fromdate = file_mtime(fromfile)
    todate = file_mtime(tofile)
    with open(fromfile) as ff:
        fromlines = ff.readlines()
    with open(tofile) as tf:
        tolines = tf.readlines()

    if options.u:
        diff = difflib.unified_diff(fromlines, tolines, fromfile, tofile, fromdate, todate, n=n)
    elif options.n:
        diff = difflib.ndiff(fromlines, tolines)
    elif options.m:
        diff = difflib.HtmlDiff().make_file(fromlines,tolines,fromfile,tofile,context=options.c,numlines=n)
    else:
        diff = difflib.context_diff(fromlines, tolines, fromfile, tofile, fromdate, todate, n=n)

    sys.stdout.writelines(diff)

if __name__ == '__main__':
    main()

2. 使用例子:

假设是有2个文件(代码文件,txt文本都行) g1.py, g2.py:

g1.py:

import os
import requests


print("ok")

g2.py:

import re
import requests

print("yes!")
print("nice!")

运行命令:

python get_diff.py g1.py g2.py -m  > diff.html

会生成一个 diff.html 文件, 用默认浏览器打开, 效果如下
python difflib 比较2个文件, 自动生成html_第1张图片

你可能感兴趣的:(python,开发语言)