python读取和写入EXIF信息

什么是EXIF信息呢?
百度百科:Exif是一种图象文件格式,它的数据存储与JPEG格式是完全相同的。实际上Exif格式就是在JPEG格式头部插入了数码照片的信息,包括拍摄时的光圈、快门、白平衡、ISO、焦距、日期时间等各种和拍摄条件以及相机品牌、型号、色彩编码、拍摄时录制的声音以及全球定位系统(GPS)、缩略图等。所有的JPEG文件以字符串“0xFFD8”开头,并以字符串“0xFFD9”结束。文件头中有一系列“0xFF??”格式的字符串,称为“标识”,用来标记JPEG文件的信息段。“0xFFD8”表示图像信息开始,“0xFFD9”表示图像信息结束,这两个标识后面没有信息,而其它标识紧跟一些信息字符。0xFFE0 — 0xFFEF之间的标识符称为“应用标记”,没有被常规JPEG文件利用,Exif正是利用这些信息串记录拍摄信息的。
逛摄影论坛时经常会看到,照片的底部包含很多其他信息,如:曝光度,光圈,焦距,快门,机身等等,这些信息就是EXIF信息,摄影爱好者可以参考这些信息提高自己的摄影技术。本文主要涉及的是如何把信息隐藏到图片中,比如一个电影地址。

首先实现一个最简单的方式,把信息直接添加到图片的头部或者尾部,直接添加到头部由于破坏了图片的数据,所以头部会出现一块黑色的区域比较明显,所以别人一下子就看出来了,效果最差。添加到尾部只是简单的增加了图片的大小,图片的数据区域并没有改变,所以如果信息量不是很大,基本是看不出来的,缺点是传到其他网站时容易被裁剪掉。下面的代码实现了把种子隐藏到图片尾部的1024字节区域。

import sys  
 
def add_info(origin_file, data_file, output_file):  
    container = open(origin_file, "rb").read()  
    data = open(data_file, "rb").read()  
    f = open(output_file, "wb")  
    f.write(container)  
    if len(data) <= 1024:  
        data = '%s%s' %(data,' '*(1024 - len(data)))  
    else:  
        raise Exception("flag data too long")  
 
    f.write(data)  
    f.close()  
 
def read_info(filename):  
    container = open(filename,"r").read()  
    print container[len(container) - 1024:len(container)].rstrip()  
 
if "__main__" == __name__:  
    try:  
        if len(sys.argv) == 4:  
            add_info(sys.argv[1], sys.argv[2], sys.argv[3])  
            read_info(sys.argv[3])  
        else :    
            print "arguments error" 
    except Exception,err :  
        print err
2. 接下来这种方式是把信息写到exif信息中,操作起来比较麻烦,也存在被裁剪的风险。但比上面风险要小很多,一般的网站不会清除图片的exif信息。网上有很多读取EXIF信息的demo,但是写入EXIF信息的比较少,很多人推荐使用pyexif2,但是这个源码安装和配置相当麻烦,直接pass。我需要的是一个文件就能搞定读和写的库,找了半天终于发现了 pexif ,操作起来十分方便。废话少说,直接贴代码。我添加了set_copyright和read_copyright函数,把电影地址信息添加到Copyright这个标识上,并尝试读取出来。这样就可以非常方便的实现在后台上传图片的时候把电影信息添加到图片里了。
#coding=utf-8  
""" 
pexif is a module which allows you to view and modify meta-data in 
JPEG/JFIF/EXIF files. 
 
The main way to use this is to create an instance of the JpegFile class. 
This should be done using one of the static factory methods fromFile, 
fromString or fromFd. 
 
After manipulating the object you can then write it out using one of the 
writeFile, writeString or writeFd methods. 
 
The get_exif() method on JpegFile returns the ExifSegment if one exists. 
 
Example: 
 
jpeg = pexif.JpegFile.fromFile("foo.jpg") 
exif = jpeg.get_exif() 
.... 
jpeg.writeFile("new.jpg") 
 
For photos that don't currently have an exef segment you can specify 
an argument which will create the exef segment if it doesn't exist. 
 
Example: 
 
jpeg = pexif.JpegFile.fromFile("foo.jpg") 
exif = jpeg.get_exif(create=True) 
.... 
jpeg.writeFile("new.jpg") 
 
The JpegFile class handles file that are formatted in something 
approach the JPEG specification (ISO/IEC 10918-1) Annex B 'Compressed 
Data Formats', and JFIF and EXIF standard. 
 
a JPEG file is made of a series of segments followed by the image 
data. In particular it should look something like: 
 
[ SOI | <arbitrary segments> | SOS | image data | EOI ] 
 
So, the library expects a Start-of-Image marker, followed 
by an arbitrary number of segment (assuming that a segment 
has the format: 
 
[ <0xFF> <segment-id> <size-byte0> <size-byte1> <data> ] 
 
and that there are no gaps between segments. 
 
The last segment must be the Start-of-Scan header, and the library 
assumes that following Start-of-Scan comes the image data, finally 
followed by the End-of-Image marker. 
 
This is probably not sufficient to handle arbitrary files conforming 
to the JPEG specs, but it should handle files that conform to 
JFIF or EXIF, as well as files that conform to neither but 
have both JFIF and EXIF application segment (which is the majority 
of files in existence!).  
 
When writing out files all segment will be written out in the order 
in which they were read. Any 'unknown' segment will be written out 
as is. Note: This may or may not corrupt the data. If the segment 
format relies on absolute references then this library may still 
corrupt that segment! 
 
Can have a JpegFile in two modes: Read Only and Read Write. 
 
Read Only mode: trying to access missing elements will result in 
an AttributeError. 
 
Read Write mode: trying to access missing elements will automatically 
create them. 
 
E.g:  
 
img.exif.primary.<tagname> 
             .geo 
             .interop 
             .exif.<tagname> 
             .exif.makernote.<tagname> 
 
        .thumbnail 
img.flashpix.<...> 
img.jfif.<tagname> 
img.xmp 
 
E.g:  
 
try: 
 print img.exif.tiff.exif.FocalLength 
except AttributeError: 
 print "No Focal Length data" 
 
""" 
 
import StringIO  
import sys  
from struct import unpack, pack  
 
MAX_HEADER_SIZE = 64 * 1024 
DELIM = 0xff 
EOI = 0xd9 
SOI_MARKER = chr(DELIM) + '\xd8' 
EOI_MARKER = chr(DELIM) + '\xd9' 
 
EXIF_OFFSET = 0x8769 
GPSIFD = 0x8825 
 
TIFF_OFFSET = 6 
TIFF_TAG = 0x2a 
 
DEBUG = 0 
 
def debug(*debug_string):  
    """Used for print style debugging. Enable by setting the global 
    DEBUG to 1.""" 
    if DEBUG:  
        for each in debug_string:  
            print each,  
        print 
 
class DefaultSegment:  
    """DefaultSegment represents a particluar segment of a JPEG file. 
    This class is instantiated by JpegFile when parsing Jpeg files 
    and is not intended to be used directly by the programmer. This 
    base class is used as a default which doesn't know about the internal 
    structure of the segment. Other classes subclass this to provide 
    extra information about a particular segment. 
    """ 
 
    def __init__(self, marker, fd, data, mode):  
        """The constructor for DefaultSegment takes the marker which 
        identifies the segments, a file object which is currently positioned 
        at the end of the segment. This allows any subclasses to potentially 
        extract extra data from the stream. Data contains the contents of the 
        segment.""" 
        self.marker = marker  
        self.data = data  
        self.mode = mode  
        self.fd = fd  
        assert mode in ["rw", "ro"]  
        if not self.data is None:  
            self.parse_data(data)  
 
    class InvalidSegment(Exception):  
        """This exception may be raised by sub-classes in cases when they 
        can't correctly identify the segment.""" 
        pass 
 
    def write(self, fd):  
        """This method is called by JpegFile when writing out the file. It 
        must write out any data in the segment. This shouldn't in general be 
        overloaded by subclasses, they should instead override the get_data() 
        method.""" 
        fd.write('\xff')  
        fd.write(pack('B', self.marker))  
        data = self.get_data()  
        fd.write(pack('>H', len(data) + 2))  
        fd.write(data)  
 
    def get_data(self):  
        """This method is called by write to generate the data for this segment. 
        It should be overloaded by subclasses.""" 
        return self.data  
 
    def parse_data(self, data):  
        """This method is called be init to parse any data for the segment. It 
        should be overloaded by subclasses rather than overloading __init__""" 
        pass 
 
    def dump(self, fd):  
        """This is called by JpegFile.dump() to output a human readable 
        representation of the segment. Subclasses should overload this to provide 
        extra information.""" 
        print >> fd, " Section: [%5s] Size: %6d" % \  
              (jpeg_markers[self.marker][0], len(self.data))  
 
class StartOfScanSegment(DefaultSegment):  
    """The StartOfScan segment needs to be treated specially as the actual 
    image data directly follows this segment, and that data is not included 
    in the size as reported in the segment header. This instances of this class 
    are created by JpegFile and it should not be subclassed. 
    """ 
    def __init__(self, marker, fd, data, mode):  
        DefaultSegment.__init__(self, marker, fd, data, mode)  
 
        # For SOS we also pull out the actual data  
        img_data = fd.read()  
        # -2 accounts for the EOI marker at the end of the file  
        self.img_data = img_data[:-2]  
        fd.seek(-2, 1)  
 
    def write(self, fd):  
        """Write segment data to a given file object""" 
        DefaultSegment.write(self, fd)  
        fd.write(self.img_data)  
 
    def dump(self, fd):  
        """Dump as ascii readable data to a given file object""" 
        print >> fd, " Section: [  SOS] Size: %6d Image data size: %6d" % \  
              (len(self.data), len(self.img_data))  
 
class ExifType:  
    """The ExifType class encapsulates the data types used 
    in the Exif spec. These should really be called TIFF types 
    probably. This could be replaced by named tuples in python 2.6.""" 
    lookup = {}  
 
    def __init__(self, type_id, name, size):  
        """Create an ExifType with a given name, size and type_id""" 
        self.id = type_id  
        self.name = name  
        self.size = size  
        ExifType.lookup[type_id] = self 
 
BYTE = ExifType(1, "byte", 1).id 
ASCII = ExifType(2, "ascii", 1).id 
SHORT = ExifType(3, "short", 2).id 
LONG = ExifType(4, "long", 4).id 
RATIONAL = ExifType(5, "rational", 8).id 
UNDEFINED = ExifType(7, "undefined", 1).id 
SLONG = ExifType(9, "slong", 4).id 
SRATIONAL = ExifType(10, "srational", 8).id 
 
def exif_type_size(exif_type):  
    """Return the size of a type""" 
    return ExifType.lookup.get(exif_type).size  
 
class Rational:  
    """A simple fraction class. Python 2.6 could use the inbuilt Fraction class.""" 
 
    def __init__(self, num, den):  
        """Create a number fraction num/den.""" 
        self.num = num  
        self.den = den  
 
    def __repr__(self):  
        """Return a string representation of the fraction.""" 
        return "%s / %s" % (self.num, self.den)  
 
    def as_tuple(self):  
        """Return the fraction a numerator, denominator tuple.""" 
        return (self.num, self.den)  
 
class IfdData:  
    """Base class for IFD""" 
 
    name = "Generic Ifd" 
    tags = {}  
    embedded_tags = {}  
 
    def special_handler(self, tag, data):  
        """special_handler method can be over-ridden by subclasses 
        to specially handle the conversion of tags from raw format 
        into Python data types.""" 
        pass 
 
    def ifd_handler(self, data):  
        """ifd_handler method can be over-ridden by subclasses to 
        specially handle conversion of the Ifd as a whole into a 
        suitable python representation.""" 
        pass 
 
    def extra_ifd_data(self, offset):  
        """extra_ifd_data method can be over-ridden by subclasses 
        to specially handle conversion of the Python Ifd representation 
        back into a byte stream.""" 
        return ""  
 
    def has_key(self, key):  
        return self[key] != None 
 
    def __setattr__(self, name, value):  
        for key, entry in self.tags.items():  
            if entry[1] == name:  
                self[key] = value  
        self.__dict__[name] = value  
 
    def __delattr__(self, name):  
        for key, entry in self.tags.items():  
            if entry[1] == name:  
                del self[key]  
        del self.__dict__[name]  
 
    def __getattr__(self, name):  
        for key, entry in self.tags.items():  
            if entry[1] == name:  
                x = self[key]  
                if x is None:  
                    raise AttributeError  
                return x  
        for key, entry in self.embedded_tags.items():  
            if entry[0] == name:  
                if self.has_key(key):  
                    return self[key]  
                else:  
                    if self.mode == "rw":  
                        new = entry[1](self.e, 0, "rw", self.exif_file)  
                        self[key] = new  
                        return new  
                    else:  
                        raise AttributeError  
        raise AttributeError, "%s not found.. %s" % (name, self.embedded_tags)  
 
    def __getitem__(self, key):  
        if type(key) == type(""):  
            try:  
                return self.__getattr__(key)  
            except AttributeError:  
                return None 
        for entry in self.entries:  
            if key == entry[0]:  
                if entry[1] == ASCII and not entry[2] is None:  
                    return entry[2].strip('\0')  
                else:  
                    return entry[2]  
        return None 
 
    def __delitem__(self, key):  
        if type(key) == type(""):  
            try:  
                return self.__delattr__(key)  
            except AttributeError:  
                return None 
        for entry in self.entries:  
            if key == entry[0]:  
                self.entries.remove(entry)  
 
    def __setitem__(self, key, value):  
        if type(key) == type(""):  
            return self.__setattr__(key, value)  
        found = 0 
        if len(self.tags[key]) < 3:  
            raise "Error: Tags aren't set up correctly, should have tag type." 
        if self.tags[key][2] == ASCII:  
            if not value is None and not value.endswith('\0'):  
                value = value + '\0' 
        for i in range(len(self.entries)):  
            if key == self.entries[i][0]:  
                found = 1 
                entry = list(self.entries[i])  
                if value is None:  
                    del self.entries[i]  
                else:  
                    entry[2] = value  
                    self.entries[i] = tuple(entry)  
                break 
        if not found:  
            # Find type...  
            # Not quite enough yet...  
            self.entries.append((key, self.tags[key][2], value))  
        return 
 
    def __init__(self, e, offset, exif_file, mode, data = None):  
        self.exif_file = exif_file  
        self.mode = mode  
        self.e = e  
        self.entries = []  
        if data is None:  
            return 
        num_entries = unpack(e + 'H', data[offset:offset+2])[0]  
        next = unpack(e + "I", data[offset+2+12*num_entries:  
                                    offset+2+12*num_entries+4])[0]  
        debug("OFFSET %s - %s" % (offset, next))  
 
        for i in range(num_entries):  
            start = (i * 12) + 2 + offset  
            debug("START: ", start)  
            entry = unpack(e + "HHII", data[start:start+12])  
            tag, exif_type, components, the_data = entry  
 
            debug("%s %s %s %s %s" % (hex(tag), exif_type,  
                                      exif_type_size(exif_type), components,  
                                      the_data))  
            byte_size = exif_type_size(exif_type) * components  
 
            if tag in self.embedded_tags:  
                actual_data = self.embedded_tags[tag][1](e, the_data,  
                                                         exif_file, self.mode, data)  
            else:  
                if byte_size > 4:  
                    debug(" ...offset %s" % the_data)  
                    the_data = data[the_data:the_data+byte_size]  
                else:  
                    the_data = data[start+8:start+8+byte_size]  
 
                if exif_type == BYTE or exif_type == UNDEFINED:  
                    actual_data = list(the_data)  
                elif exif_type == ASCII:  
                    if the_data[-1] != '\0':  
                        actual_data = the_data + '\0' 
                        #raise JpegFile.InvalidFile("ASCII tag '%s' not   
                        # NULL-terminated: %s [%s]" % (self.tags.get(tag,   
                        # (hex(tag), 0))[0], the_data, map(ord, the_data)))  
                        #print "ASCII tag '%s' not NULL-terminated:   
                        # %s [%s]" % (self.tags.get(tag, (hex(tag), 0))[0],   
                        # the_data, map(ord, the_data))  
                    actual_data = the_data  
                elif exif_type == SHORT:  
                    actual_data = list(unpack(e + ("H" * components), the_data))  
                elif exif_type == LONG:  
                    actual_data = list(unpack(e + ("I" * components), the_data))  
                elif exif_type == SLONG:  
                    actual_data = list(unpack(e + ("i" * components), the_data))  
                elif exif_type == RATIONAL or exif_type == SRATIONAL:  
                    if exif_type == RATIONAL: t = "II" 
                    else: t = "ii" 
                    actual_data = []  
                    for i in range(components):  
                        actual_data.append(Rational(*unpack(e + t,  
                                                            the_data[i*8:  
                                                                     i*8+8])))  
                else:  
                    raise "Can't handle this" 
 
                if (byte_size > 4):  
                    debug("%s" % actual_data)  
 
                self.special_handler(tag, actual_data)  
            entry = (tag, exif_type, actual_data)  
            self.entries.append(entry)  
 
            debug("%-40s %-10s %6d %s" % (self.tags.get(tag, (hex(tag), 0))[0],  
                                          ExifType.lookup[exif_type],  
                                          components, actual_data))  
        self.ifd_handler(data)  
 
    def isifd(self, other):  
        """Return true if other is an IFD""" 
        return issubclass(other.__class__, IfdData)  
 
    def getdata(self, e, offset, last = 0):  
        data_offset = offset+2+len(self.entries)*12+4 
        output_data = ""  
 
        out_entries = []  
 
        # Add any specifc data for the particular type  
        extra_data = self.extra_ifd_data(data_offset)  
        data_offset += len(extra_data)  
        output_data += extra_data  
 
        for tag, exif_type, the_data in self.entries:  
            magic_type = exif_type  
            if (self.isifd(the_data)):  
                debug("-> Magic..")  
                sub_data, next_offset = the_data.getdata(e, data_offset, 1)  
                the_data = [data_offset]  
                debug("<- Magic", next_offset, data_offset, len(sub_data),  
                      data_offset + len(sub_data))  
                data_offset += len(sub_data)  
                assert(next_offset == data_offset)  
                output_data += sub_data  
                magic_type = exif_type  
                if exif_type != 4:  
                    magic_components = len(sub_data)  
                else:  
                    magic_components = 1 
                exif_type = 4 # LONG  
                byte_size = 4 
                components = 1 
            else:  
                magic_components = components = len(the_data)  
                byte_size = exif_type_size(exif_type) * components  
 
            if exif_type == BYTE or exif_type == UNDEFINED:  
                actual_data = "".join(the_data)  
            elif exif_type == ASCII:  
                actual_data = the_data   
            elif exif_type == SHORT:  
                actual_data = pack(e + ("H" * components), *the_data)  
            elif exif_type == LONG:  
                actual_data = pack(e + ("I" * components), *the_data)  
            elif exif_type == SLONG:  
                actual_data = pack(e + ("i" * components), *the_data)  
            elif exif_type == RATIONAL or exif_type == SRATIONAL:  
                if exif_type == RATIONAL: t = "II" 
                else: t = "ii" 
                actual_data = ""  
                for i in range(components):  
                    actual_data += pack(e + t, *the_data[i].as_tuple())  
            else:  
                raise "Can't handle this", exif_type  
            if (byte_size) > 4:  
                output_data += actual_data  
                actual_data = pack(e + "I", data_offset)   
                data_offset += byte_size  
            else:  
                actual_data = actual_data + '\0' * (4 - len(actual_data))  
            out_entries.append((tag, magic_type,  
                                magic_components, actual_data))  
 
        data = pack(e + 'H', len(self.entries))  
        for entry in out_entries:  
            data += pack(self.e + "HHI", *entry[:3])  
            data += entry[3]  
 
        next_offset = data_offset  
        if last:  
            data += pack(self.e + "I", 0)  
        else:  
            data += pack(self.e + "I", next_offset)  
        data += output_data  
 
        assert (next_offset == offset+len(data))  
 
        return data, next_offset  
 
    def dump(self, f, indent = ""):  
        """Dump the IFD file""" 
        print >> f, indent + "<--- %s start --->" % self.name  
        for entry in self.entries:  
            tag, exif_type, data = entry  
            if exif_type == ASCII:  
                data = data.strip('\0')  
            if (self.isifd(data)):  
                data.dump(f, indent + "    ")  
            else:  
                if data and len(data) == 1:  
                    data = data[0]  
                print >> f, indent + "  %-40s %s" % \  
                      (self.tags.get(tag, (hex(tag), 0))[0], data)  
        print >> f, indent + "<--- %s end --->" % self.name  
 
class IfdInterop(IfdData):  
    name = "Interop" 
    tags = {  
        # Interop stuff  
        0x0001: ("Interoperability index", "InteroperabilityIndex"),  
        0x0002: ("Interoperability version", "InteroperabilityVersion"),  
        0x1000: ("Related image file format", "RelatedImageFileFormat"),  
        0x1001: ("Related image file width", "RelatedImageFileWidth"),  
        0x1002: ("Related image file length", "RelatedImageFileLength"),  
        }  
 
class CanonIFD(IfdData):  
    tags = {  
        0x0006: ("Image Type", "ImageType"),  
        0x0007: ("Firmware Revision", "FirmwareRevision"),  
        0x0008: ("Image Number", "ImageNumber"),  
        0x0009: ("Owner Name", "OwnerName"),  
        0x000c: ("Camera serial number", "SerialNumber"),  
        0x000f: ("Customer functions", "CustomerFunctions")  
        }  
    name = "Canon" 
 
class FujiIFD(IfdData):  
    tags = {  
        0x0000: ("Note version", "NoteVersion"),  
        0x1000: ("Quality", "Quality"),  
        0x1001: ("Sharpness", "Sharpness"),  
        0x1002: ("White balance", "WhiteBalance"),  
        0x1003: ("Color", "Color"),  
        0x1004: ("Tone", "Tone"),  
        0x1010: ("Flash mode", "FlashMode"),  
        0x1011: ("Flash strength", "FlashStrength"),  
        0x1020: ("Macro", "Macro"),  
        0x1021: ("Focus mode", "FocusMode"),  
        0x1030: ("Slow sync", "SlowSync"),  
        0x1031: ("Picture mode", "PictureMode"),  
        0x1100: ("Motor or bracket", "MotorOrBracket"),  
        0x1101: ("Sequence number", "SequenceNumber"),  
        0x1210: ("FinePix Color", "FinePixColor"),  
        0x1300: ("Blur warning", "BlurWarning"),  
        0x1301: ("Focus warning", "FocusWarning"),  
        0x1302: ("AE warning", "AEWarning")  
        }  
    name = "FujiFilm" 
 
    def getdata(self, e, offset, last = 0):  
        pre_data = "FUJIFILM" 
        pre_data += pack("<I", 12)  
        data, next_offset = IfdData.getdata(self, e, 12, last)  
        return pre_data + data, next_offset + offset  
 
def ifd_maker_note(e, offset, exif_file, mode, data):  
    """Factory function for creating MakeNote entries""" 
    if exif_file.make == "Canon":  
        # Canon maker note appears to always be in Little-Endian  
        return CanonIFD('<', offset, exif_file, mode, data)  
    elif exif_file.make == "FUJIFILM":  
        # The FujiFILM maker note is special.  
        # See http://www.ozhiker.com/electronics/pjmt/jpeg_info/fujifilm_mn.html  
 
        # First it has an extra header  
        header = data[offset:offset+8]  
        # Which should be FUJIFILM  
        if header != "FUJIFILM":  
            raise JpegFile.InvalidFile("This is FujiFilm JPEG. " \  
                                       "Expecting a makernote header "\  
                                       "<FUJIFILM>. Got <%s>." % header)  
        # The it has its own offset  
        ifd_offset = unpack("<I", data[offset+8:offset+12])[0]  
        # and it is always litte-endian  
        e = "<" 
        # and the data is referenced from the start the Ifd data, not the  
        # TIFF file.  
        ifd_data = data[offset:]  
        return FujiIFD(e, ifd_offset, exif_file, mode, ifd_data)  
    else:  
        raise JpegFile.InvalidFile("Unknown maker: %s. Can't "\  
                                   "currently handle this." % exif_file.make)  
 
class IfdGPS(IfdData):  
    name = "GPS" 
    tags = {  
        0x0: ("GPS tag version", "GPSVersionID", BYTE, 4),  
        0x1: ("North or South Latitude", "GPSLatitudeRef", ASCII, 2),  
        0x2: ("Latitude", "GPSLatitude", RATIONAL, 3),  
        0x3: ("East or West Longitude", "GPSLongitudeRef", ASCII, 2),  
        0x4: ("Longitude", "GPSLongitude", RATIONAL, 3),  
        0x5: ("Altitude reference", "GPSAltitudeRef", BYTE, 1),  
        0x6: ("Altitude", "GPSAltitude", RATIONAL, 1)  
        }  
 
    def __init__(self, e, offset, exif_file, mode, data = None):  
        IfdData.__init__(self, e, offset, exif_file, mode, data)  
        if data is None:  
            self.GPSVersionID = ['\x02', '\x02', '\x00', '\x00']  
 
class IfdExtendedEXIF(IfdData):  
    tags = {  
        # Exif IFD Attributes  
        # A. Tags relating to version  
        0x9000: ("Exif Version", "ExifVersion"),  
        0xA000: ("Supported Flashpix version", "FlashpixVersion"),  
        # B. Tag relating to Image Data Characteristics  
        0xA001: ("Color Space Information", "ColorSpace"),  
        # C. Tags relating to Image Configuration  
        0x9101: ("Meaning of each component", "ComponentConfiguration"),  
        0x9102: ("Image compression mode", "CompressedBitsPerPixel"),  
        0xA002: ("Valid image width", "PixelXDimension"),  
        0xA003: ("Valid image height", "PixelYDimension"),  
        # D. Tags relatin to User informatio  
        0x927c: ("Manufacturer notes", "MakerNote"),  
        0x9286: ("User comments", "UserComment"),  
        # E. Tag relating to related file information  
        0xA004: ("Related audio file", "RelatedSoundFile"),  
        # F. Tags relating to date and time  
        0x9003: ("Date of original data generation", "DateTimeOriginal", ASCII),  
        0x9004: ("Date of digital data generation", "DateTimeDigitized", ASCII),  
        0x9290: ("DateTime subseconds", "SubSecTime"),  
        0x9291: ("DateTime original subseconds", "SubSecTimeOriginal"),  
        0x9292: ("DateTime digitized subseconds", "SubSecTimeDigitized"),  
        # G. Tags relating to Picture taking conditions  
        0x829a: ("Exposure Time", "ExposureTime"),  
        0x829d: ("F Number", "FNumber"),  
        0x8822: ("Exposure Program", "ExposureProgram"),      
        0x8824: ("Spectral Sensitivity", "SpectralSensitivity"),  
        0x8827: ("ISO Speed Rating", "ISOSpeedRatings"),  
        0x8829: ("Optoelectric conversion factor", "OECF"),  
        0x9201: ("Shutter speed", "ShutterSpeedValue"),  
        0x9202: ("Aperture", "ApertureValue"),  
        0x9203: ("Brightness", "BrightnessValue"),  
        0x9204: ("Exposure bias", "ExposureBiasValue"),  
        0x9205: ("Maximum lens apeture", "MaxApertureValue"),  
        0x9206: ("Subject Distance", "SubjectDistance"),  
        0x9207: ("Metering mode", "MeteringMode"),  
        0x9208: ("Light mode", "LightSource"),  
        0x9209: ("Flash", "Flash"),  
        0x920a: ("Lens focal length", "FocalLength"),  
        0x9214: ("Subject area", "Subject area"),  
        0xa20b: ("Flash energy", "FlashEnergy"),  
        0xa20c: ("Spatial frequency results", "SpatialFrquencyResponse"),  
        0xa20e: ("Focal plane X resolution", "FocalPlaneXResolution"),  
        0xa20f: ("Focal plane Y resolution", "FocalPlaneYResolution"),  
        0xa210: ("Focal plane resolution unit", "FocalPlaneResolutionUnit"),  
        0xa214: ("Subject location", "SubjectLocation"),  
        0xa215: ("Exposure index", "ExposureIndex"),  
        0xa217: ("Sensing method", "SensingMethod"),  
        0xa300: ("File source", "FileSource"),  
        0xa301: ("Scene type", "SceneType"),  
        0xa302: ("CFA pattern", "CFAPattern"),  
        0xa401: ("Customer image processing", "CustomerRendered"),  
        0xa402: ("Exposure mode", "ExposureMode"),  
        0xa403: ("White balance", "WhiteBalance"),  
        0xa404: ("Digital zoom ratio", "DigitalZoomRation"),  
        0xa405: ("Focal length in 35mm film", "FocalLengthIn35mmFilm"),  
        0xa406: ("Scene capture type", "SceneCaptureType"),  
        0xa407: ("Gain control", "GainControl"),  
        0xa40a: ("Sharpness", "Sharpness"),  
        0xa40c: ("Subject distance range", "SubjectDistanceRange"),  
 
        # H. Other tags  
        0xa420: ("Unique image ID", "ImageUniqueID"),  
        }  
    embedded_tags = {  
        0x927c: ("MakerNote", ifd_maker_note),  
        }  
    name = "Extended EXIF" 
 
class IfdTIFF(IfdData):  
    """ 
    """ 
 
    tags = {  
        # Private Tags  
        0x8769: ("Exif IFD Pointer", "ExifOffset", LONG),   
        0xA005: ("Interoparability IFD Pointer", "InteroparabilityIFD", LONG),  
        0x8825: ("GPS Info IFD Pointer", "GPSIFD", LONG),  
        # TIFF stuff used by EXIF  
 
        # A. Tags relating to image data structure  
        0x100: ("Image width", "ImageWidth", LONG),  
        0x101: ("Image height", "ImageHeight", LONG),  
        0x102: ("Number of bits per component", "BitsPerSample", SHORT),  
        0x103: ("Compression Scheme", "Compression", SHORT),  
        0x106: ("Pixel Composition", "PhotometricInterpretion", SHORT),  
        0x112: ("Orientation of image", "Orientation", SHORT),  
        0x115: ("Number of components", "SamplesPerPixel", SHORT),  
        0x11c: ("Image data arrangement", "PlanarConfiguration", SHORT),  
        0x212: ("Subsampling ration of Y to C", "YCbCrSubsampling", SHORT),  
        0x213: ("Y and C positioning", "YCbCrCoefficients", SHORT),  
        0x11a: ("X Resolution", "XResolution", RATIONAL),  
        0x11b: ("Y Resolution", "YResolution", RATIONAL),  
        0x128: ("Unit of X and Y resolution", "ResolutionUnit", SHORT),  
 
        # B. Tags relating to recording offset  
        0x111: ("Image data location", "StripOffsets", LONG),  
        0x116: ("Number of rows per strip", "RowsPerStrip", LONG),  
        0x117: ("Bytes per compressed strip", "StripByteCounts", LONG),  
        0x201: ("Offset to JPEG SOI", "JPEGInterchangeFormat", LONG),  
        0x202: ("Bytes of JPEG data", "JPEGInterchangeFormatLength", LONG),  
 
        # C. Tags relating to image data characteristics  
 
        # D. Other tags  
        0x132: ("File change data and time", "DateTime", ASCII),  
        0x10e: ("Image title", "ImageDescription", ASCII),  
        0x10f: ("Camera Make", "Make", ASCII),  
        0x110: ("Camera Model", "Model", ASCII),  
        0x131: ("Camera Software", "Software", ASCII),  
        0x13B: ("Artist", "Artist", ASCII),  
        0x8298: ("Copyright holder", "Copyright", ASCII),  
    }  
 
    embedded_tags = {  
        0xA005: ("Interoperability", IfdInterop),   
        EXIF_OFFSET: ("ExtendedEXIF", IfdExtendedEXIF),  
        0x8825: ("GPS", IfdGPS),  
        }  
 
    name = "TIFF Ifd" 
 
    def special_handler(self, tag, data):  
        if self.tags[tag][1] == "Make":  
            self.exif_file.make = data.strip('\0')  
 
    def new_gps(self):  
        if self.has_key(GPSIFD):  
            raise ValueError, "Already have a GPS Ifd"  
        assert self.mode == "rw" 
        gps = IfdGPS(self.e, 0, self.mode, self.exif_file)  
        self[GPSIFD] = gps  
        return gps  
 
class IfdThumbnail(IfdTIFF):  
    name = "Thumbnail" 
 
    def ifd_handler(self, data):  
        size = None 
        offset = None 
        for (tag, exif_type, val) in self.entries:  
            if (tag == 0x201):  
                offset = val[0]  
            if (tag == 0x202):  
                size = val[0]  
        if size is None or offset is None:  
            raise JpegFile.InvalidFile("Thumbnail doesn't have an offset "\  
                                       "and/or size")  
        self.jpeg_data = data[offset:offset+size]  
        if len(self.jpeg_data) != size:  
            raise JpegFile.InvalidFile("Not enough data for JPEG thumbnail."\  
                                       "Wanted: %d got %d" % 
                                       (size, len(self.jpeg_data)))  
 
    def extra_ifd_data(self, offset):  
        for i in range(len(self.entries)):  
            entry = self.entries[i]  
            if entry[0] == 0x201:  
                # Print found field and updating  
                new_entry = (entry[0], entry[1], [offset])  
                self.entries[i] = new_entry  
        return self.jpeg_data  
 
class ExifSegment(DefaultSegment):  
    """ExifSegment encapsulates the Exif data stored in a JpegFile. An 
    ExifSegment contains two Image File Directories (IFDs). One is attribute 
    information and the other is a thumbnail. This module doesn't provide 
    any useful functions for manipulating the thumbnail, but does provide 
    a get_attributes returns an AttributeIfd instances which allows you to 
    manipulate the attributes in a Jpeg file.""" 
 
    def __init__(self, marker, fd, data, mode):  
        self.ifds = []  
        self.e = '<' 
        self.tiff_endian = 'II' 
        DefaultSegment.__init__(self, marker, fd, data, mode)  
 
    def parse_data(self, data):  
        """Overloads the DefaultSegment method to parse the data of 
        this segment. Can raise InvalidFile if we don't get what we expect.""" 
        exif = unpack("6s", data[:6])[0]  
        exif = exif.strip('\0')  
 
        if (exif != "Exif"):  
            raise self.InvalidSegment("Bad Exif Marker. Got <%s>, "\  
                                       "expecting <Exif>" % exif)  
 
        tiff_data = data[TIFF_OFFSET:]  
        data = None # Don't need or want data for now on..  
 
        self.tiff_endian = tiff_data[:2]  
        if self.tiff_endian == "II":  
            self.e = "<" 
        elif self.tiff_endian == "MM":  
            self.e = ">" 
        else:  
            raise JpegFile.InvalidFile("Bad TIFF endian header. Got <%s>, " 
                                       "expecting <II> or <MM>" %  
                                       self.tiff_endian)  
 
        tiff_tag, tiff_offset = unpack(self.e + 'HI', tiff_data[2:8])  
 
        if (tiff_tag != TIFF_TAG):  
            raise JpegFile.InvalidFile("Bad TIFF tag. Got <%x>, expecting "\  
                                       "<%x>" % (tiff_tag, TIFF_TAG))  
 
        # Ok, the header parse out OK. Now we parse the IFDs contained in  
        # the APP1 header.  
 
        # We use this loop, even though we can really only expect and support  
        # two IFDs, the Attribute data and the Thumbnail data  
        offset = tiff_offset  
        count = 0 
 
        while offset:  
            count += 1 
            num_entries = unpack(self.e + 'H', tiff_data[offset:offset+2])[0]  
            start = 2 + offset + (num_entries*12)  
            if (count == 1):  
                ifd = IfdTIFF(self.e, offset, self, self.mode, tiff_data)  
            elif (count == 2):  
                ifd = IfdThumbnail(self.e, offset, self, self.mode, tiff_data)  
            else:  
                raise JpegFile.InvalidFile()  
            self.ifds.append(ifd)  
 
            # Get next offset  
            offset = unpack(self.e + "I", tiff_data[start:start+4])[0]  
 
    def dump(self, fd):  
        print >> fd, " Section: [ EXIF] Size: %6d" % \  
              (len(self.data))  
        for ifd in self.ifds:  
            ifd.dump(fd)  
 
    def get_data(self):  
        ifds_data = ""  
        next_offset = 8 
        for ifd in self.ifds:  
            debug("OUT IFD")  
            new_data, next_offset = ifd.getdata(self.e, next_offset,  
                                                ifd == self.ifds[-1])  
            ifds_data += new_data  
 
        data = ""  
        data += "Exif\0\0" 
        data += self.tiff_endian  
        data += pack(self.e + "HI", 42, 8)  
        data += ifds_data  
 
        return data  
 
    def get_primary(self, create=False):  
        """Return the attributes image file descriptor. If it doesn't 
        exit return None, unless create is True in which case a new 
        descriptor is created.""" 
        if len(self.ifds) > 0:  
            return self.ifds[0]  
        else:  
            if create:  
                assert self.mode == "rw" 
                new_ifd = IfdTIFF(self.e, None, self, "rw")  
                self.ifds.insert(0, new_ifd)  
                return new_ifd  
            else:  
                return None 
 
    def _get_property(self):  
        if self.mode == "rw":  
            return self.get_primary(True)  
        else:  
            primary = self.get_primary()  
            if primary is None:  
                raise AttributeError  
            return primary  
 
    primary = property(_get_property)  
 
jpeg_markers = {  
    0xc0: ("SOF0", []),  
    0xc2: ("SOF2", []),  
    0xc4: ("DHT", []),  
 
    0xda: ("SOS", [StartOfScanSegment]),  
    0xdb: ("DQT", []),  
    0xdd: ("DRI", []),  
 
    0xe0: ("APP0", []),  
    0xe1: ("APP1", [ExifSegment]),  
    0xe2: ("APP2", []),  
    0xe3: ("APP3", []),  
    0xe4: ("APP4", []),  
    0xe5: ("APP5", []),  
    0xe6: ("APP6", []),  
    0xe7: ("APP7", []),  
    0xe8: ("APP8", []),  
    0xe9: ("APP9", []),  
    0xea: ("APP10", []),  
    0xeb: ("APP11", []),  
    0xec: ("APP12", []),  
    0xed: ("APP13", []),  
    0xee: ("APP14", []),  
    0xef: ("APP15", []),  
 
    0xfe: ("COM", []),  
    }  
 
APP1 = 0xe1 
 
class JpegFile:  
    """JpegFile object. You should create this using one of the static methods 
    fromFile, fromString or fromFd. The JpegFile object allows you to examine and 
    modify the contents of the file. To write out the data use one of the methods 
    writeFile, writeString or writeFd. To get an ASCII dump of the data in a file 
    use the dump method.""" 
 
    def fromFile(filename, mode="rw"):  
        """Return a new JpegFile object from a given filename.""" 
        return JpegFile(open(filename, "rb"), filename=filename, mode=mode)  
    fromFile = staticmethod(fromFile)  
 
    def fromString(str, mode="rw"):  
        """Return a new JpegFile object taking data from a string.""" 
        return JpegFile(StringIO.StringIO(str), "from buffer", mode=mode)  
    fromString = staticmethod(fromString)  
 
    def fromFd(fd, mode="rw"):  
        """Return a new JpegFile object taking data from a file object.""" 
        return JpegFile(fd, None, mode=mode)  
    fromFd = staticmethod(fromFd)  
 
    class InvalidFile(Exception):  
        """This exception is raised if a given file is not able to be parsed.""" 
        pass 
 
    class NoSection(Exception):  
        """This exception is raised if a section is unable to be found.""" 
        pass 
 
    def __init__(self, input, filename=None, mode="rw"):  
        """JpegFile Constructor. input is a file object, and filename 
        is a string used to name the file. (filename is used only for 
        display functions).  You shouldn't use this function directly, 
        but rather call one of the static methods fromFile, fromString 
        or fromFd.""" 
        self.filename = filename  
        self.mode = mode  
        # input is the file descriptor  
        soi_marker = input.read(len(SOI_MARKER))  
 
        # The very first thing should be a start of image marker  
        if (soi_marker != SOI_MARKER):  
            raise self.InvalidFile("Error reading soi_marker. Got <%s> "\  
                                   "should be <%s>" % (soi_marker, SOI_MARKER))  
 
        # Now go through and find all the blocks of data  
        segments = []  
        while 1:  
            head = input.read(2)  
            delim, mark  =  unpack(">BB", head)  
            if (delim != DELIM):  
                raise self.InvalidFile("Error, expecting delmiter. "\  
                                       "Got <%s> should be <%s>" % 
                                       (delim, DELIM))  
            if mark == EOI:  
                # Hit end of image marker, game-over!  
                break 
            head2 = input.read(2)  
            size = unpack(">H", head2)[0]  
            data = input.read(size-2)  
            possible_segment_classes = jpeg_markers[mark][1] + [DefaultSegment]  
            # Try and find a valid segment class to handle  
            # this data  
            for segment_class in possible_segment_classes:  
                try:  
                    # Note: Segment class may modify the input file   
                    # descriptor. This is expected.  
                    attempt = segment_class(mark, input, data, self.mode)  
                    segments.append(attempt)  
                    break 
                except DefaultSegment.InvalidSegment:  
                    # It wasn't this one so we try the next type.  
                    # DefaultSegment will always work.  
                    continue 
 
        self._segments = segments  
 
    def writeString(self):  
        """Write the JpegFile out to a string. Returns a string.""" 
        f = StringIO.StringIO()  
        self.writeFd(f)  
        return f.getvalue()  
 
    def writeFile(self, filename):  
        """Write the JpegFile out to a file named filename.""" 
        output = open(filename, "wb")  
        self.writeFd(output)  
 
    def writeFd(self, output):  
        """Write the JpegFile out on the file object output.""" 
        output.write(SOI_MARKER)  
        for segment in self._segments:  
            segment.write(output)  
        output.write(EOI_MARKER)  
 
    def dump(self, f = sys.stdout):  
        """Write out ASCII representation of the file on a given file 
        object. Output default to stdout.""" 
        print >> f, "<Dump of JPEG %s>" % self.filename  
        for segment in self._segments:  
            segment.dump(f)  
 
    def get_exif(self, create=False):  
        """get_exif returns a ExifSegment if one exists for this file. 
        If the file does not have an exif segment and the create is 
        false, then return None. If create is true, a new exif segment is 
        added to the file and returned.""" 
        for segment in self._segments:  
            if segment.__class__ == ExifSegment:  
                return segment  
        if create:  
            return self.add_exif()  
        else:  
            return None 
 
    def add_exif(self):  
        """add_exif adds a new ExifSegment to a file, and returns 
        it. When adding an EXIF segment is will add it at the start of 
        the list of segments.""" 
        assert self.mode == "rw" 
        new_segment = ExifSegment(APP1, None, None, "rw")  
        self._segments.insert(0, new_segment)  
        return new_segment  
 
    def _get_exif(self):  
        """Exif Attribute property""" 
        if self.mode == "rw":  
            return self.get_exif(True)  
        else:  
            exif = self.get_exif(False)  
            if exif is None:  
                raise AttributeError  
            return exif  
 
    exif = property(_get_exif)  
 
    def get_geo(self):  
        """Return a tuple of (latitude, longitude).""" 
        def convert(x):  
            (deg, min, sec) = x  
            return (float(deg.num) / deg.den) +  \  
                (1/60.0 * float(min.num) / min.den) + \  
                (1/3600.0 * float(sec.num) / sec.den)  
        if not self.exif.primary.has_key(GPSIFD):  
            raise self.NoSection, "File %s doesn't have a GPS section." % \  
                self.filename  
 
        gps = self.exif.primary.GPS  
        lat = convert(gps.GPSLatitude)  
        lng = convert(gps.GPSLongitude)  
        if gps.GPSLatitudeRef == "S":  
            lat = -lat  
        if gps.GPSLongitudeRef == "W":  
            lng = -lng  
 
        return lat, lng  
 
    SEC_DEN = 50000000 
 
    def _parse(val):  
        sign = 1 
        if val < 0:  
            val  = -val  
            sign = -1 
 
        deg = int(val)  
        other = (val - deg) * 60 
        minutes = int(other)  
        secs = (other - minutes) * 60 
        secs = long(secs * JpegFile.SEC_DEN)  
        return (sign, deg, minutes, secs)  
 
    _parse = staticmethod(_parse)  
 
    def set_geo(self, lat, lng):  
        """Set the GeoLocation to a given lat and lng""" 
        if self.mode != "rw":  
            raise RWError  
 
        gps = self.exif.primary.GPS  
 
        sign, deg, min, sec = JpegFile._parse(lat)  
        ref = "N" 
        if sign < 0:  
            ref = "S" 
 
        gps.GPSLatitudeRef = ref  
        gps.GPSLatitude = [Rational(deg, 1), Rational(min, 1),  
                            Rational(sec, JpegFile.SEC_DEN)]  
 
        sign, deg, min, sec = JpegFile._parse(lng)  
        ref = "E" 
        if sign < 0:  
            ref = "W" 
        gps.GPSLongitudeRef = ref  
        gps.GPSLongitude = [Rational(deg, 1), Rational(min, 1),  
                             Rational(sec, JpegFile.SEC_DEN)]  
 
    def set_copyright(self, copyright):  
        """Set the copyright to a given copyright string""" 
        if self.mode != "rw":  
            raise RWError  
 
        self.exif.primary.Copyright = copyright  
 
def read_copyright(imgFile):  
    jpeg = JpegFile.fromFd(imgFile)  
    exif = jpeg.get_exif()  
    print exif.get_primary().Copyright.decode("gbk")  
 
if __name__ == "__main__":  
    import StringIO  
    import sys  
    reload(sys)  
    sys.setdefaultencoding('utf-8')  
 
    f = open("test.jpg")  
    read_copyright(f)  
    f.seek(0)  
    jpeg = JpegFile.fromFd(f)  
    copyright = u"qvod://fuck小日本".encode("gbk")  
    jpeg.set_copyright(copyright)  
    buf = StringIO.StringIO()  
    jpeg.writeFd(buf)  
    buf.seek(0)  
    read_copyright(buf)  
    f.close()


转载请注明来自:Alex Zhou,本文链接:http://codingnow.cn/python/612.html

你可能感兴趣的:(python读取和写入EXIF信息)