Project

General

Profile

The Metadata in TIFF files » History » Version 12

Robin Mills, 13 Nov 2016 09:03

1 1 Robin Mills
h1. The Metadata in TIFF files
2
3
The Tagged Image File Format is a container.  It's very flexible and can deal with multiple pages, different colour spaces, frame configurations as well as metadata.  The specification is available from:  https://partners.adobe.com/public/developer/en/tiff/TIFF6.pdf
4
5 9 Robin Mills
!Tiff.png!
6 3 Robin Mills
7 8 Robin Mills
The format consists of an 8 byte header which provides a 2 byte endian flag (II or MM), a 2 byte type "magic number" (42) and a 4 byte offset to an _*Image File Directory*_.
8
9
The _*IFD*_ _(Image File Directory)_  has a two byte header which contains the length of the directory followed by 12 byte field or "tag".  The trailing 4 bytes of the the directory is always the offset to the next dictionary _*or*_ zero to terminate the directory chain.
10 1 Robin Mills
11 7 Robin Mills
A tag consists of a 12 byte record: TagID, TagType, Count and Offset which are 2,2,4 and 4 bytes respectively.  The TagID defines the purpose of the record (Width, Height, ColorSpace etc) and the TagType defines the data format.  Count and Offset are used to contain the raw data _*or*_ provide an offset in the file at which to read the raw data for this tag.
12 1 Robin Mills
13 7 Robin Mills
This is described in the specification.  The code in tiff image.cpp/ TiffImage::printStructure() decodes the dictionaries.
14 1 Robin Mills
15
The TIFF container is so flexible it is used as the structure for most RAW formats including Adobe's DNG.  Additionally the TIFF container is used to store the metadata that lies inside the Exif blocks embedded in JPEG and PNG files.
16 7 Robin Mills
17
There is a 64bit version of Tiff called _*BigTiff*_. This is currently not supported by exiv2.
18 1 Robin Mills
19
*Example:*
20 10 Robin Mills
The version of exiv2(.exe) which ships with v0.25 provides options _*<code>-pS</code>*_ to reveals the structure of the TIFF and option _*<code>-pX</code>*_ is used to extract the raw XMP/xml data.
21
22
The option _*<code>-pa</code>*_ is used to print the metadata in human readable format.<pre>$ exiv2 -pa test/data/test.tiff
23
$ exiv2 -pa test/data/test.tiff 
24
Exif.Image.NewSubfileType                    Long        1  Primary image
25
Exif.Image.ImageWidth                        Short       1  282
26
Exif.Image.ImageLength                       Short       1  212
27 1 Robin Mills
Exif.Image.BitsPerSample                     Short       3  8 8 8
28
Exif.Image.Compression                       Short       1  LZW
29
Exif.Image.PhotometricInterpretation         Short       1  RGB
30 10 Robin Mills
Exif.Image.ImageDescription                  Ascii      32                                 
31
Exif.Image.Make                              Ascii       5  SONY
32
Exif.Image.Model                             Ascii       9  DSC-S600
33
Exif.Image.StripOffsets                      Long       16  4372 ...
34
Exif.Image.Orientation                       Short       1  top, left
35 1 Robin Mills
Exif.Image.SamplesPerPixel                   Short       1  3
36 10 Robin Mills
Exif.Image.RowsPerStrip                      Long        1  14
37
Exif.Image.StripByteCounts                   Long       16  10492 ...
38
Exif.Image.XResolution                       Rational    1  96
39
Exif.Image.YResolution                       Rational    1  96
40 1 Robin Mills
Exif.Image.PlanarConfiguration               Short       1  1
41 10 Robin Mills
Exif.Image.ResolutionUnit                    Short       1  inch
42
Exif.Image.DateTime                          Ascii      20  2009:04:26 12:38:18
43
Exif.Image.Predictor                         Short       1  No prediction scheme used
44 1 Robin Mills
Exif.Image.SampleFormat                      Short       3  Unsigned integer data
45 10 Robin Mills
Exif.Image.XMLPacket                         Byte      2505  (Binary value suppressed)
46
Exif.Image.InterColorProfile                 Undefined 1352  (Binary value suppressed)
47
Xmp.dc.title                                 LangAlt     1  lang="x-default" this is a test image
48 11 Robin Mills
$ exiv2 -pS test/data/test.tiff
49 10 Robin Mills
STRUCTURE OF TIFF FILE (II): test/data/test.tiff
50
 address |    tag                           |      type |    count |    offset | value
51
      10 | 0x00fe NewSubfileType            |      LONG |        1 |         0 | 0
52
      22 | 0x0100 ImageWidth                |     SHORT |        1 |       282 | 282
53
      34 | 0x0101 ImageLength               |     SHORT |        1 |       212 | 212
54
      46 | 0x0102 BitsPerSample             |     SHORT |        3 |       290 | 8 8 8
55
      58 | 0x0103 Compression               |     SHORT |        1 |         5 | 5
56
      70 | 0x0106 PhotometricInterpretation |     SHORT |        1 |         2 | 2
57
      82 | 0x010e ImageDescription          |     ASCII |       32 |       296 |                                
58
      94 | 0x010f Make                      |     ASCII |        5 |       328 | SONY
59
     106 | 0x0110 Model                     |     ASCII |        9 |       334 | DSC-S600
60
     118 | 0x0111 StripOffsets              |      LONG |       16 |       344 | 4372 14864 25634 37110 50496 ...
61
     130 | 0x0112 Orientation               |     SHORT |        1 |         1 | 1
62
     142 | 0x0115 SamplesPerPixel           |     SHORT |        1 |         3 | 3
63
     154 | 0x0116 RowsPerStrip              |      LONG |        1 |        14 | 14
64
     166 | 0x0117 StripByteCounts           |      LONG |       16 |       408 | 10492 10770 11476 13385 13348 ...
65
     178 | 0x011a XResolution               |  RATIONAL |        1 |       472 | 96/1
66
     190 | 0x011b YResolution               |  RATIONAL |        1 |       480 | 96/1
67
     202 | 0x011c PlanarConfiguration       |     SHORT |        1 |         1 | 1
68
     214 | 0x0128 ResolutionUnit            |     SHORT |        1 |         2 | 2
69
     226 | 0x0132 DateTime                  |     ASCII |       20 |       488 | 2009:04:26 12:38:18
70
     238 | 0x013d Predictor                 |     SHORT |        1 |         1 | 1
71
     250 | 0x0153 SampleFormat              |     SHORT |        3 |       508 | 1 1 1
72
     262 | 0x02bc XMLPacket                 |      BYTE |     2505 |       514 | <?xpacket begin="..." id="W5M0Mp ...
73
     274 | 0x8773 InterColorProfile         | UNDEFINED |     1352 |      3020 | ...Happl. ..scnrRGB XYZ ....... ...
74
END test/data/test.tiff
75
$ exiv2 -pX test/data/test.tiff  | xmllint --format -
76 1 Robin Mills
<?xml version="1.0"?>
77
<?xpacket begin="" id="W5M0MpCehiHzreSzNTczkc9d"?>
78
<x:xmpmeta xmlns:x="adobe:ns:meta/" x:xmptk="XMP Core 4.4.0-Exiv2">
79
  <rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">
80
    <rdf:Description xmlns:dc="http://purl.org/dc/elements/1.1/" rdf:about="">
81
      <dc:title>
82
        <rdf:Alt>
83 10 Robin Mills
          <rdf:li xml:lang="x-default">this is a test image</rdf:li>
84 1 Robin Mills
        </rdf:Alt>
85
      </dc:title>
86
    </rdf:Description>
87
  </rdf:RDF>
88
</x:xmpmeta>
89
<?xpacket end="w"?>
90 10 Robin Mills
$ </pre>You can use the output from _*<code>-pS</code>*_ to determine the location of the XMLPacket:<pre>$ exiv2 -pS --grep XML test/data/test.tiff | grep XMLPacket 
91
     262 | 0x02bc XMLPacket                 |      BYTE |     2505 |       514 | <?xpacket begin="..." id="W5M0Mp ...
92
$ </pre>You can see the XMLPacket of 2505 bytes stored at offset 514.   You can extract the XMP with the following command which says:  set the block size to 1 byte, skip 514 bytes and extract 2505 bytes:
93
<pre>$ dd bs=1 count=2505 skip=514 if=test/data/test.tiff | xmllint --format - 
94
2505+0 records in
95
2505+0 records out
96
2505 bytes (2.5 kB) copied, 0.0064 s, 391 kB/s
97 1 Robin Mills
<?xml version="1.0"?>
98
<?xpacket begin="" id="W5M0MpCehiHzreSzNTczkc9d"?>
99
<x:xmpmeta xmlns:x="adobe:ns:meta/" x:xmptk="XMP Core 4.4.0-Exiv2">
100 4 Robin Mills
  <rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">
101 2 Robin Mills
    <rdf:Description xmlns:dc="http://purl.org/dc/elements/1.1/" rdf:about="">
102
      <dc:title>
103
        <rdf:Alt>
104 10 Robin Mills
          <rdf:li xml:lang="x-default">this is a test image</rdf:li>
105 2 Robin Mills
        </rdf:Alt>
106
      </dc:title>
107
    </rdf:Description>
108
  </rdf:RDF>
109
</x:xmpmeta>
110
<?xpacket end="w"?>
111 10 Robin Mills
</pre>The option _*<code>-pX</code>*_ doesn't actually use dd to achieve the same result and is much more convenient to use.
112 2 Robin Mills
113 10 Robin Mills
To demonstrate that the metadata block in a JPEG is a TIFF file, extract and print the structure.<pre>$ exiv2 -pS test/data/Reagan.jpg 
114
$ exiv2 -pS test/data/Reagan.jpg
115 2 Robin Mills
STRUCTURE OF JPEG FILE: test/data/Reagan.jpg
116
 address | marker       |  length | data
117
       0 | 0xffd8 SOI  
118
       2 | 0xffe1 APP1  |    5718 | Exif..MM.*......................
119 10 Robin Mills
    5722 | 0xffed APP13 |    3038 | Photoshop 3.0.8BIM..........Z...
120
    8762 | 0xffe1 APP1  |    5329 | http://ns.adobe.com/xap/1.0/.<?x
121
   14093 | 0xffe2 APP2  |     576 | ICC_PROFILE......0ADBE....mntrRG chunk 1/1
122
   14671 | 0xffee APP14 |      14 | Adobe.d@......
123
   14687 | 0xffdb DQT   |     132 
124
   14821 | 0xffc0 SOF0  |      17 
125
   14840 | 0xffdd DRI   |       4 
126
   14846 | 0xffc4 DHT   |     418 
127
   15266 | 0xffda SOS  
128
$ </pre>  We can extract the APP1/Exif data as follows:<pre>$ dd bs=1 skip=$((2+2+8)) count=$((5718-8)) if=test/data/Reagan.jpg of=ReaganExif.tiff
129
5710+0 records in
130
5710+0 records out
131
5710 bytes (5.7 kB) copied, 0.026688 s, 214 kB/s
132
$ exiv2 -pS ReaganExif.tif 
133
STRUCTURE OF TIFF FILE (MM): ReaganExif.tif
134
 address |    tag                           |      type |    count |    offset | value
135
      10 | 0x0100 ImageWidth                |     SHORT |        1 |  13107200 | 200
136
      22 | 0x0101 ImageLength               |     SHORT |        1 |   8519680 | 130
137
      34 | 0x0102 BitsPerSample             |     SHORT |        4 |       242 | 8 8 8 8
138
      46 | 0x0103 Compression               |     SHORT |        1 |     65536 | 1
139
      58 | 0x0106 PhotometricInterpretation |     SHORT |        1 |    131072 | 2
140
      70 | 0x010e ImageDescription          |     ASCII |      403 |       250 | 040621-N-6536T-062.USS Ronald Re ...
141
      82 | 0x010f Make                      |     ASCII |       18 |       653 | NIKON CORPORATION
142
      94 | 0x0110 Model                     |     ASCII |       10 |       671 | NIKON D1X
143
     106 | 0x0112 Orientation               |     SHORT |        1 |     65536 | 1
144
     118 | 0x0115 SamplesPerPixel           |     SHORT |        1 |    262144 | 4
145
     130 | 0x011a XResolution               |  RATIONAL |        1 |       681 | 3000000/10000
146
     142 | 0x011b YResolution               |  RATIONAL |        1 |       689 | 3000000/10000
147
     154 | 0x011c PlanarConfiguration       |     SHORT |        1 |     65536 | 1
148
     166 | 0x0128 ResolutionUnit            |     SHORT |        1 |    131072 | 2
149
     178 | 0x0131 Software                  |     ASCII |       40 |       697 | Adobe Photoshop Elements 12.0 Ma ...
150
     190 | 0x0132 DateTime                  |     ASCII |       20 |       737 | 2016:09:13 11:58:16
151
     202 | 0x013b Artist                    |     ASCII |       34 |       757 | Photographer..s Mate 3rd Class ( ...
152
     214 | 0x8769 ExifTag                   |      LONG |        1 |       792 | 792
153
     226 | 0x8825 GPSTag                    |      LONG |        1 |      1400 | 1400
154
    1422 | 0x0103 Compression               |     SHORT |        1 |    393216 | 6
155
    1434 | 0x011a XResolution               |  RATIONAL |        1 |      1498 | 72/1
156
    1446 | 0x011b YResolution               |  RATIONAL |        1 |      1506 | 72/1
157
    1458 | 0x0128 ResolutionUnit            |     SHORT |        1 |    131072 | 2
158
    1470 | 0x0201 JPEGInterchangeFormat     |      LONG |        1 |      1514 | 1514
159
    1482 | 0x0202 JPEGInterchangeFormatLeng |      LONG |        1 |      4196 | 4196
160
END ReaganExif.tif
161 12 Robin Mills
$ </pre>The version of the exiv2 utility that ships with v0.26 later has an option _*<code>-pR</code>*_ which _Recursively_ prints the structure of an image file.  *_Recursively*_ means  the utility prints structures embedded within other structures.  For example, the Exif metadata in a JPEG is encoded as a TIFF embedded in the APP1/Exif segment of the JPEG.<pre>$ exiv2 -pR test/data/Reagan.jpg 
162 10 Robin Mills
STRUCTURE OF JPEG FILE: test/data/Reagan.jpg
163
 address | marker       |  length | data
164
       0 | 0xffd8 SOI  
165
       2 | 0xffe1 APP1  |    5718 | Exif..MM.*......................
166 4 Robin Mills
  STRUCTURE OF TIFF FILE (MM): MemIo
167
   address |    tag                           |      type |    count |    offset | value
168
        10 | 0x0100 ImageWidth                |     SHORT |        1 |  13107200 | 200
169
        22 | 0x0101 ImageLength               |     SHORT |        1 |   8519680 | 130
170
        34 | 0x0102 BitsPerSample             |     SHORT |        4 |       242 | 8 8 8 8
171
        46 | 0x0103 Compression               |     SHORT |        1 |     65536 | 1
172
        58 | 0x0106 PhotometricInterpretation |     SHORT |        1 |    131072 | 2
173
        70 | 0x010e ImageDescription          |     ASCII |      403 |       250 | 040621-N-6536T-062.USS Ronald Re ...
174
        82 | 0x010f Make                      |     ASCII |       18 |       653 | NIKON CORPORATION
175
        94 | 0x0110 Model                     |     ASCII |       10 |       671 | NIKON D1X
176
       106 | 0x0112 Orientation               |     SHORT |        1 |     65536 | 1
177
       118 | 0x0115 SamplesPerPixel           |     SHORT |        1 |    262144 | 4
178
       130 | 0x011a XResolution               |  RATIONAL |        1 |       681 | 681/0
179
       142 | 0x011b YResolution               |  RATIONAL |        1 |       689 | 689/0
180
       154 | 0x011c PlanarConfiguration       |     SHORT |        1 |     65536 | 1
181
       166 | 0x0128 ResolutionUnit            |     SHORT |        1 |    131072 | 2
182
       178 | 0x0131 Software                  |     ASCII |       40 |       697 | Adobe Photoshop Elements 12.0 Ma ...
183
       190 | 0x0132 DateTime                  |     ASCII |       20 |       737 | 2016:09:13 11:58:16
184
       202 | 0x013b Artist                    |     ASCII |       34 |       757 | Photographer..s Mate 3rd Class ( ...
185
       214 | 0x8769 ExifTag                   |      LONG |        1 |       792 | 792
186
    STRUCTURE OF TIFF FILE (MM): MemIo
187
     address |    tag                           |      type |    count |    offset | value
188
         794 | 0x829a ExposureTime              |  RATIONAL |        1 |      1254 | 1254/0
189
         806 | 0x829d FNumber                   |  RATIONAL |        1 |      1262 | 1262/0
190
         818 | 0x8822 ExposureProgram           |     SHORT |        1 |     65536 | 1
191
         830 | 0x9000 ExifVersion               | UNDEFINED |        4 | 808596016 | 0220
192
         842 | 0x9003 DateTimeOriginal          |     ASCII |       20 |      1270 | 2004:06:21 23:37:53
193
         854 | 0x9004 DateTimeDigitized         |     ASCII |       20 |      1290 | 2004:06:21 23:37:53
194
         866 | 0x9101 ComponentsConfiguration   | UNDEFINED |        4 |  16909056 | ...
195
         878 | 0x9102 CompressedBitsPerPixel    |  RATIONAL |        1 |      1310 | 1310/0
196
         890 | 0x9201 ShutterSpeedValue         | SRATIONAL |        1 |      1318 | 1318/0
197
         902 | 0x9202 ApertureValue             |  RATIONAL |        1 |      1326 | 1326/0
198
         914 | 0x9204 ExposureBiasValue         | SRATIONAL |        1 |      1334 | 1334/0
199
         926 | 0x9205 MaxApertureValue          |  RATIONAL |        1 |      1342 | 1342/0
200
         938 | 0x9207 MeteringMode              |     SHORT |        1 |    131072 | 2
201
         950 | 0x9208 LightSource               |     SHORT |        1 |    655360 | 10
202 1 Robin Mills
         962 | 0x9209 Flash                     |     SHORT |        1 |         0 | 0
203 4 Robin Mills
         974 | 0x920a FocalLength               |  RATIONAL |        1 |      1350 | 1350/0
204
         986 | 0x9290 SubSecTime                |     ASCII |        3 | 808845312 | 06
205
         998 | 0x9291 SubSecTimeOriginal        |     ASCII |        3 | 808845312 | 06
206
        1010 | 0x9292 SubSecTimeDigitized       |     ASCII |        3 | 808845312 | 06
207
        1022 | 0xa000 FlashpixVersion           | UNDEFINED |        4 | 808529968 | 0100
208
        1034 | 0xa001 ColorSpace                |     SHORT |        1 |4294901760 | 65535
209
        1046 | 0xa002 PixelXDimension           |      LONG |        1 |       200 | 200
210
        1058 | 0xa003 PixelYDimension           |      LONG |        1 |       130 | 130
211
        1070 | 0xa217 SensingMethod             |     SHORT |        1 |    131072 | 2
212
        1082 | 0xa300 FileSource                | UNDEFINED |        1 |  50331648 | .
213
        1094 | 0xa301 SceneType                 | UNDEFINED |        1 |  16777216 | .
214
        1106 | 0xa401 CustomRendered            |     SHORT |        1 |         0 | 0
215
        1118 | 0xa402 ExposureMode              |     SHORT |        1 |     65536 | 1
216
        1130 | 0xa403 WhiteBalance              |     SHORT |        1 |     65536 | 1
217
        1142 | 0xa404 DigitalZoomRatio          |  RATIONAL |        1 |      1358 | 1358/0
218
        1154 | 0xa405 FocalLengthIn35mmFilm     |     SHORT |        1 |   4128768 | 63
219
        1166 | 0xa406 SceneCaptureType          |     SHORT |        1 |         0 | 0
220
        1178 | 0xa407 GainControl               |     SHORT |        1 |         0 | 0
221
        1190 | 0xa408 Contrast                  |     SHORT |        1 |         0 | 0
222
        1202 | 0xa409 Saturation                |     SHORT |        1 |         0 | 0
223
        1214 | 0xa40a Sharpness                 |     SHORT |        1 |         0 | 0
224
        1226 | 0xa40c SubjectDistanceRange      |     SHORT |        1 |         0 | 0
225
        1238 | 0xa420 ImageUniqueID             |     ASCII |       33 |      1366 | 127c1377b054a3f65bf2754ebb24e7f2 ...
226
    END MemIo
227
       226 | 0x8825 GPSTag                    |      LONG |        1 |      1400 | 1400
228
      1422 | 0x0103 Compression               |     SHORT |        1 |    393216 | 6
229
      1434 | 0x011a XResolution               |  RATIONAL |        1 |      1498 | 1498/0
230
      1446 | 0x011b YResolution               |  RATIONAL |        1 |      1506 | 1506/0
231
      1458 | 0x0128 ResolutionUnit            |     SHORT |        1 |    131072 | 2
232
      1470 | 0x0201 JPEGInterchangeFormat     |      LONG |        1 |      1514 | 1514
233
      1482 | 0x0202 JPEGInterchangeFormatLeng |      LONG |        1 |      4196 | 4196
234
  END MemIo
235
    5722 | 0xffed APP13 |    3038 | Photoshop 3.0.8BIM..........Z...
236
  Record | DataSet | Name                     | Length | Data
237
       1 |      90 | CharacterSet             |      3 | .%G
238
       1 |      90 | CharacterSet             |      3 | .%G
239
       2 |       0 | RecordVersion            |      2 | ..
240
       2 |     120 | Caption                  |    402 | 040621-N-6536T-062.USS Ronald Reagan (CV...
241
       2 |     122 | Writer                   |      9 | Dir. NVNS
242
       2 |      40 | SpecialInstructions      |     49 | Credit as U.S. Navy photo by Elizabeth T...
243
       2 |      80 | Byline                   |     32 | Photographer..s Mate 3rd Class (
244
       2 |      85 | BylineTitle              |     21 | U.S Navy Photographer
245
       2 |     110 | Credit                   |      8 | U.S Navy
246
       2 |     115 | Source                   |     24 | Navy Visual News Service
247
       2 |       5 | ObjectName               |     18 | 040621-N-6536T-062
248
       2 |      55 | DateCreated              |      8 | 20040621
249
       2 |      60 | TimeCreated              |     11 | 000000+0000
250
       2 |      62 | DigitizationDate         |      8 | 20040621
251
       2 |      63 | DigitizationTime         |     11 | 233753-0400
252
       2 |      90 | City                     |     19 | Straits of Magellan
253
       2 |     101 | CountryName              |     13 | South America
254
       2 |      15 | Category                 |      1 | N
255 1 Robin Mills
       2 |      20 | SuppCategory             |     12 | 703-614-9154
256
       2 |      20 | SuppCategory             |     23 | navyvisualnews@navy.mil
257
       2 |      20 | SuppCategory             |     11 | UNCLASSFIED
258
       2 |      10 | Urgency                  |      1 | 5
259
       2 |      25 | Keywords                 |     13 | ronald reagan
260
       2 |      25 | Keywords                 |      6 | reagan
261
       2 |      25 | Keywords                 |      6 | cvn 76
262
       2 |      25 | Keywords                 |      6 | cvn-76
263
       2 |      25 | Keywords                 |     18 | straights magellan
264
       2 |      25 | Keywords                 |      8 | magellan
265
       2 |      25 | Keywords                 |      7 | carrier
266
       2 |      25 | Keywords                 |     12 | nimitz-class
267
       2 |      25 | Keywords                 |      4 | ship
268
       2 |      25 | Keywords                 |      8 | underway
269
    8762 | 0xffe1 APP1  |    5329 | http://ns.adobe.com/xap/1.0/.<?x
270
   14093 | 0xffe2 APP2  |     576 | ICC_PROFILE......0ADBE....mntrRG chunk 1/1
271
   14671 | 0xffee APP14 |      14 | Adobe.d@......
272
   14687 | 0xffdb DQT   |     132 
273
   14821 | 0xffc0 SOF0  |      17 
274
   14840 | 0xffdd DRI   |       4 
275
   14846 | 0xffc4 DHT   |     418 
276
   15266 | 0xffda SOS  
277 10 Robin Mills
$ </pre>