Project

General

Profile

The Metadata in TIFF files » History » Version 15

Robin Mills, 13 Nov 2016 09:19

1 1 Robin Mills
h1. The Metadata in TIFF files
2
3 14 Robin Mills
The Tagged Image File Format is a container.  It's very flexible and can deal with multiple pages, different colour spaces, different frame configurations as well as metadata.  The specification is available from:  https://partners.adobe.com/public/developer/en/tiff/TIFF6.pdf
4 1 Robin Mills
5 14 Robin Mills
The TIFF container is so flexible it is used as the structure for most RAW formats including Adobe's DNG.  Additionally the TIFF container is used to store the metadata that lies inside the Exif blocks embedded in JPEG and PNG files.
6
7
There is a 64bit version of Tiff called _*BigTiff*_. This is currently not supported by exiv2.
8
9 3 Robin Mills
!Tiff.png!
10 7 Robin Mills
11 1 Robin Mills
The format consists of an 8 byte header which provides a 2 byte endian flag (II or MM), a 2 byte type "magic number" (42) and a 4 byte offset to an _*Image File Directory*_.
12
13 15 Robin Mills
The _*IFD*_ _(Image File Directory)_  has a two byte header which contains the length of the directory followed by 12 byte "fields" or "tags".  The trailing 4 bytes of the directory is the offset to the next directory _*or*_ zero to terminate the directory chain.
14 7 Robin Mills
15
A tag consists of a 12 byte record: TagID, TagType, Count and Offset which are 2,2,4 and 4 bytes respectively.  The TagID defines the purpose of the record (Width, Height, ColorSpace etc) and the TagType defines the data format.  Count and Offset are used to contain the raw data _*or*_ provide an offset in the file at which to read the raw data for this tag.
16 1 Robin Mills
17
*Example:*
18 10 Robin Mills
The version of exiv2(.exe) which ships with v0.25 provides options _*<code>-pS</code>*_ to reveals the structure of the TIFF and option _*<code>-pX</code>*_ is used to extract the raw XMP/xml data.
19
20
The option _*<code>-pa</code>*_ is used to print the metadata in human readable format.<pre>$ exiv2 -pa test/data/test.tiff
21
$ exiv2 -pa test/data/test.tiff 
22
Exif.Image.NewSubfileType                    Long        1  Primary image
23
Exif.Image.ImageWidth                        Short       1  282
24
Exif.Image.ImageLength                       Short       1  212
25 1 Robin Mills
Exif.Image.BitsPerSample                     Short       3  8 8 8
26
Exif.Image.Compression                       Short       1  LZW
27
Exif.Image.PhotometricInterpretation         Short       1  RGB
28 10 Robin Mills
Exif.Image.ImageDescription                  Ascii      32                                 
29
Exif.Image.Make                              Ascii       5  SONY
30
Exif.Image.Model                             Ascii       9  DSC-S600
31
Exif.Image.StripOffsets                      Long       16  4372 ...
32
Exif.Image.Orientation                       Short       1  top, left
33 1 Robin Mills
Exif.Image.SamplesPerPixel                   Short       1  3
34 10 Robin Mills
Exif.Image.RowsPerStrip                      Long        1  14
35
Exif.Image.StripByteCounts                   Long       16  10492 ...
36
Exif.Image.XResolution                       Rational    1  96
37
Exif.Image.YResolution                       Rational    1  96
38 1 Robin Mills
Exif.Image.PlanarConfiguration               Short       1  1
39 10 Robin Mills
Exif.Image.ResolutionUnit                    Short       1  inch
40
Exif.Image.DateTime                          Ascii      20  2009:04:26 12:38:18
41
Exif.Image.Predictor                         Short       1  No prediction scheme used
42 1 Robin Mills
Exif.Image.SampleFormat                      Short       3  Unsigned integer data
43 10 Robin Mills
Exif.Image.XMLPacket                         Byte      2505  (Binary value suppressed)
44
Exif.Image.InterColorProfile                 Undefined 1352  (Binary value suppressed)
45
Xmp.dc.title                                 LangAlt     1  lang="x-default" this is a test image
46 11 Robin Mills
$ exiv2 -pS test/data/test.tiff
47 10 Robin Mills
STRUCTURE OF TIFF FILE (II): test/data/test.tiff
48
 address |    tag                           |      type |    count |    offset | value
49
      10 | 0x00fe NewSubfileType            |      LONG |        1 |         0 | 0
50
      22 | 0x0100 ImageWidth                |     SHORT |        1 |       282 | 282
51
      34 | 0x0101 ImageLength               |     SHORT |        1 |       212 | 212
52
      46 | 0x0102 BitsPerSample             |     SHORT |        3 |       290 | 8 8 8
53
      58 | 0x0103 Compression               |     SHORT |        1 |         5 | 5
54
      70 | 0x0106 PhotometricInterpretation |     SHORT |        1 |         2 | 2
55
      82 | 0x010e ImageDescription          |     ASCII |       32 |       296 |                                
56
      94 | 0x010f Make                      |     ASCII |        5 |       328 | SONY
57
     106 | 0x0110 Model                     |     ASCII |        9 |       334 | DSC-S600
58
     118 | 0x0111 StripOffsets              |      LONG |       16 |       344 | 4372 14864 25634 37110 50496 ...
59
     130 | 0x0112 Orientation               |     SHORT |        1 |         1 | 1
60
     142 | 0x0115 SamplesPerPixel           |     SHORT |        1 |         3 | 3
61
     154 | 0x0116 RowsPerStrip              |      LONG |        1 |        14 | 14
62
     166 | 0x0117 StripByteCounts           |      LONG |       16 |       408 | 10492 10770 11476 13385 13348 ...
63
     178 | 0x011a XResolution               |  RATIONAL |        1 |       472 | 96/1
64
     190 | 0x011b YResolution               |  RATIONAL |        1 |       480 | 96/1
65
     202 | 0x011c PlanarConfiguration       |     SHORT |        1 |         1 | 1
66
     214 | 0x0128 ResolutionUnit            |     SHORT |        1 |         2 | 2
67
     226 | 0x0132 DateTime                  |     ASCII |       20 |       488 | 2009:04:26 12:38:18
68
     238 | 0x013d Predictor                 |     SHORT |        1 |         1 | 1
69
     250 | 0x0153 SampleFormat              |     SHORT |        3 |       508 | 1 1 1
70
     262 | 0x02bc XMLPacket                 |      BYTE |     2505 |       514 | <?xpacket begin="..." id="W5M0Mp ...
71
     274 | 0x8773 InterColorProfile         | UNDEFINED |     1352 |      3020 | ...Happl. ..scnrRGB XYZ ....... ...
72
END test/data/test.tiff
73
$ exiv2 -pX test/data/test.tiff  | xmllint --format -
74 1 Robin Mills
<?xml version="1.0"?>
75
<?xpacket begin="" id="W5M0MpCehiHzreSzNTczkc9d"?>
76
<x:xmpmeta xmlns:x="adobe:ns:meta/" x:xmptk="XMP Core 4.4.0-Exiv2">
77
  <rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">
78
    <rdf:Description xmlns:dc="http://purl.org/dc/elements/1.1/" rdf:about="">
79
      <dc:title>
80
        <rdf:Alt>
81 10 Robin Mills
          <rdf:li xml:lang="x-default">this is a test image</rdf:li>
82 1 Robin Mills
        </rdf:Alt>
83
      </dc:title>
84
    </rdf:Description>
85
  </rdf:RDF>
86
</x:xmpmeta>
87
<?xpacket end="w"?>
88 13 Robin Mills
$ </pre>You can use the output from _*<code>-pS</code>*_ to determine the location of the XMLPacket:<pre>$ exiv2 -pS test/data/test.tiff | grep XMLPacket 
89 10 Robin Mills
     262 | 0x02bc XMLPacket                 |      BYTE |     2505 |       514 | <?xpacket begin="..." id="W5M0Mp ...
90
$ </pre>You can see the XMLPacket of 2505 bytes stored at offset 514.   You can extract the XMP with the following command which says:  set the block size to 1 byte, skip 514 bytes and extract 2505 bytes:
91
<pre>$ dd bs=1 count=2505 skip=514 if=test/data/test.tiff | xmllint --format - 
92
2505+0 records in
93
2505+0 records out
94
2505 bytes (2.5 kB) copied, 0.0064 s, 391 kB/s
95 1 Robin Mills
<?xml version="1.0"?>
96
<?xpacket begin="" id="W5M0MpCehiHzreSzNTczkc9d"?>
97
<x:xmpmeta xmlns:x="adobe:ns:meta/" x:xmptk="XMP Core 4.4.0-Exiv2">
98 4 Robin Mills
  <rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">
99 2 Robin Mills
    <rdf:Description xmlns:dc="http://purl.org/dc/elements/1.1/" rdf:about="">
100
      <dc:title>
101
        <rdf:Alt>
102 10 Robin Mills
          <rdf:li xml:lang="x-default">this is a test image</rdf:li>
103 2 Robin Mills
        </rdf:Alt>
104
      </dc:title>
105
    </rdf:Description>
106
  </rdf:RDF>
107
</x:xmpmeta>
108
<?xpacket end="w"?>
109 13 Robin Mills
</pre>The option _*<code>-pX</code>*_ doesn't use dd to achieve the same result and is much more convenient to use.
110 2 Robin Mills
111 10 Robin Mills
To demonstrate that the metadata block in a JPEG is a TIFF file, extract and print the structure.<pre>$ exiv2 -pS test/data/Reagan.jpg 
112
$ exiv2 -pS test/data/Reagan.jpg
113 2 Robin Mills
STRUCTURE OF JPEG FILE: test/data/Reagan.jpg
114
 address | marker       |  length | data
115
       0 | 0xffd8 SOI  
116
       2 | 0xffe1 APP1  |    5718 | Exif..MM.*......................
117 10 Robin Mills
    5722 | 0xffed APP13 |    3038 | Photoshop 3.0.8BIM..........Z...
118
    8762 | 0xffe1 APP1  |    5329 | http://ns.adobe.com/xap/1.0/.<?x
119
   14093 | 0xffe2 APP2  |     576 | ICC_PROFILE......0ADBE....mntrRG chunk 1/1
120
   14671 | 0xffee APP14 |      14 | Adobe.d@......
121
   14687 | 0xffdb DQT   |     132 
122
   14821 | 0xffc0 SOF0  |      17 
123
   14840 | 0xffdd DRI   |       4 
124
   14846 | 0xffc4 DHT   |     418 
125
   15266 | 0xffda SOS  
126
$ </pre>  We can extract the APP1/Exif data as follows:<pre>$ dd bs=1 skip=$((2+2+8)) count=$((5718-8)) if=test/data/Reagan.jpg of=ReaganExif.tiff
127
5710+0 records in
128
5710+0 records out
129
5710 bytes (5.7 kB) copied, 0.026688 s, 214 kB/s
130
$ exiv2 -pS ReaganExif.tif 
131
STRUCTURE OF TIFF FILE (MM): ReaganExif.tif
132
 address |    tag                           |      type |    count |    offset | value
133
      10 | 0x0100 ImageWidth                |     SHORT |        1 |  13107200 | 200
134
      22 | 0x0101 ImageLength               |     SHORT |        1 |   8519680 | 130
135
      34 | 0x0102 BitsPerSample             |     SHORT |        4 |       242 | 8 8 8 8
136
      46 | 0x0103 Compression               |     SHORT |        1 |     65536 | 1
137
      58 | 0x0106 PhotometricInterpretation |     SHORT |        1 |    131072 | 2
138
      70 | 0x010e ImageDescription          |     ASCII |      403 |       250 | 040621-N-6536T-062.USS Ronald Re ...
139
      82 | 0x010f Make                      |     ASCII |       18 |       653 | NIKON CORPORATION
140
      94 | 0x0110 Model                     |     ASCII |       10 |       671 | NIKON D1X
141
     106 | 0x0112 Orientation               |     SHORT |        1 |     65536 | 1
142
     118 | 0x0115 SamplesPerPixel           |     SHORT |        1 |    262144 | 4
143
     130 | 0x011a XResolution               |  RATIONAL |        1 |       681 | 3000000/10000
144
     142 | 0x011b YResolution               |  RATIONAL |        1 |       689 | 3000000/10000
145
     154 | 0x011c PlanarConfiguration       |     SHORT |        1 |     65536 | 1
146
     166 | 0x0128 ResolutionUnit            |     SHORT |        1 |    131072 | 2
147
     178 | 0x0131 Software                  |     ASCII |       40 |       697 | Adobe Photoshop Elements 12.0 Ma ...
148
     190 | 0x0132 DateTime                  |     ASCII |       20 |       737 | 2016:09:13 11:58:16
149
     202 | 0x013b Artist                    |     ASCII |       34 |       757 | Photographer..s Mate 3rd Class ( ...
150
     214 | 0x8769 ExifTag                   |      LONG |        1 |       792 | 792
151
     226 | 0x8825 GPSTag                    |      LONG |        1 |      1400 | 1400
152
    1422 | 0x0103 Compression               |     SHORT |        1 |    393216 | 6
153
    1434 | 0x011a XResolution               |  RATIONAL |        1 |      1498 | 72/1
154
    1446 | 0x011b YResolution               |  RATIONAL |        1 |      1506 | 72/1
155
    1458 | 0x0128 ResolutionUnit            |     SHORT |        1 |    131072 | 2
156
    1470 | 0x0201 JPEGInterchangeFormat     |      LONG |        1 |      1514 | 1514
157
    1482 | 0x0202 JPEGInterchangeFormatLeng |      LONG |        1 |      4196 | 4196
158
END ReaganExif.tif
159 12 Robin Mills
$ </pre>The version of the exiv2 utility that ships with v0.26 later has an option _*<code>-pR</code>*_ which _Recursively_ prints the structure of an image file.  *_Recursively*_ means  the utility prints structures embedded within other structures.  For example, the Exif metadata in a JPEG is encoded as a TIFF embedded in the APP1/Exif segment of the JPEG.<pre>$ exiv2 -pR test/data/Reagan.jpg 
160 10 Robin Mills
STRUCTURE OF JPEG FILE: test/data/Reagan.jpg
161
 address | marker       |  length | data
162
       0 | 0xffd8 SOI  
163
       2 | 0xffe1 APP1  |    5718 | Exif..MM.*......................
164 4 Robin Mills
  STRUCTURE OF TIFF FILE (MM): MemIo
165
   address |    tag                           |      type |    count |    offset | value
166
        10 | 0x0100 ImageWidth                |     SHORT |        1 |  13107200 | 200
167
        22 | 0x0101 ImageLength               |     SHORT |        1 |   8519680 | 130
168
        34 | 0x0102 BitsPerSample             |     SHORT |        4 |       242 | 8 8 8 8
169
        46 | 0x0103 Compression               |     SHORT |        1 |     65536 | 1
170
        58 | 0x0106 PhotometricInterpretation |     SHORT |        1 |    131072 | 2
171
        70 | 0x010e ImageDescription          |     ASCII |      403 |       250 | 040621-N-6536T-062.USS Ronald Re ...
172
        82 | 0x010f Make                      |     ASCII |       18 |       653 | NIKON CORPORATION
173
        94 | 0x0110 Model                     |     ASCII |       10 |       671 | NIKON D1X
174
       106 | 0x0112 Orientation               |     SHORT |        1 |     65536 | 1
175
       118 | 0x0115 SamplesPerPixel           |     SHORT |        1 |    262144 | 4
176
       130 | 0x011a XResolution               |  RATIONAL |        1 |       681 | 681/0
177
       142 | 0x011b YResolution               |  RATIONAL |        1 |       689 | 689/0
178
       154 | 0x011c PlanarConfiguration       |     SHORT |        1 |     65536 | 1
179
       166 | 0x0128 ResolutionUnit            |     SHORT |        1 |    131072 | 2
180
       178 | 0x0131 Software                  |     ASCII |       40 |       697 | Adobe Photoshop Elements 12.0 Ma ...
181
       190 | 0x0132 DateTime                  |     ASCII |       20 |       737 | 2016:09:13 11:58:16
182
       202 | 0x013b Artist                    |     ASCII |       34 |       757 | Photographer..s Mate 3rd Class ( ...
183
       214 | 0x8769 ExifTag                   |      LONG |        1 |       792 | 792
184
    STRUCTURE OF TIFF FILE (MM): MemIo
185
     address |    tag                           |      type |    count |    offset | value
186
         794 | 0x829a ExposureTime              |  RATIONAL |        1 |      1254 | 1254/0
187
         806 | 0x829d FNumber                   |  RATIONAL |        1 |      1262 | 1262/0
188
         818 | 0x8822 ExposureProgram           |     SHORT |        1 |     65536 | 1
189
         830 | 0x9000 ExifVersion               | UNDEFINED |        4 | 808596016 | 0220
190
         842 | 0x9003 DateTimeOriginal          |     ASCII |       20 |      1270 | 2004:06:21 23:37:53
191
         854 | 0x9004 DateTimeDigitized         |     ASCII |       20 |      1290 | 2004:06:21 23:37:53
192
         866 | 0x9101 ComponentsConfiguration   | UNDEFINED |        4 |  16909056 | ...
193
         878 | 0x9102 CompressedBitsPerPixel    |  RATIONAL |        1 |      1310 | 1310/0
194
         890 | 0x9201 ShutterSpeedValue         | SRATIONAL |        1 |      1318 | 1318/0
195
         902 | 0x9202 ApertureValue             |  RATIONAL |        1 |      1326 | 1326/0
196
         914 | 0x9204 ExposureBiasValue         | SRATIONAL |        1 |      1334 | 1334/0
197
         926 | 0x9205 MaxApertureValue          |  RATIONAL |        1 |      1342 | 1342/0
198
         938 | 0x9207 MeteringMode              |     SHORT |        1 |    131072 | 2
199
         950 | 0x9208 LightSource               |     SHORT |        1 |    655360 | 10
200 1 Robin Mills
         962 | 0x9209 Flash                     |     SHORT |        1 |         0 | 0
201 4 Robin Mills
         974 | 0x920a FocalLength               |  RATIONAL |        1 |      1350 | 1350/0
202
         986 | 0x9290 SubSecTime                |     ASCII |        3 | 808845312 | 06
203
         998 | 0x9291 SubSecTimeOriginal        |     ASCII |        3 | 808845312 | 06
204
        1010 | 0x9292 SubSecTimeDigitized       |     ASCII |        3 | 808845312 | 06
205
        1022 | 0xa000 FlashpixVersion           | UNDEFINED |        4 | 808529968 | 0100
206
        1034 | 0xa001 ColorSpace                |     SHORT |        1 |4294901760 | 65535
207
        1046 | 0xa002 PixelXDimension           |      LONG |        1 |       200 | 200
208
        1058 | 0xa003 PixelYDimension           |      LONG |        1 |       130 | 130
209
        1070 | 0xa217 SensingMethod             |     SHORT |        1 |    131072 | 2
210
        1082 | 0xa300 FileSource                | UNDEFINED |        1 |  50331648 | .
211
        1094 | 0xa301 SceneType                 | UNDEFINED |        1 |  16777216 | .
212
        1106 | 0xa401 CustomRendered            |     SHORT |        1 |         0 | 0
213
        1118 | 0xa402 ExposureMode              |     SHORT |        1 |     65536 | 1
214
        1130 | 0xa403 WhiteBalance              |     SHORT |        1 |     65536 | 1
215
        1142 | 0xa404 DigitalZoomRatio          |  RATIONAL |        1 |      1358 | 1358/0
216
        1154 | 0xa405 FocalLengthIn35mmFilm     |     SHORT |        1 |   4128768 | 63
217
        1166 | 0xa406 SceneCaptureType          |     SHORT |        1 |         0 | 0
218
        1178 | 0xa407 GainControl               |     SHORT |        1 |         0 | 0
219
        1190 | 0xa408 Contrast                  |     SHORT |        1 |         0 | 0
220
        1202 | 0xa409 Saturation                |     SHORT |        1 |         0 | 0
221
        1214 | 0xa40a Sharpness                 |     SHORT |        1 |         0 | 0
222
        1226 | 0xa40c SubjectDistanceRange      |     SHORT |        1 |         0 | 0
223
        1238 | 0xa420 ImageUniqueID             |     ASCII |       33 |      1366 | 127c1377b054a3f65bf2754ebb24e7f2 ...
224
    END MemIo
225
       226 | 0x8825 GPSTag                    |      LONG |        1 |      1400 | 1400
226
      1422 | 0x0103 Compression               |     SHORT |        1 |    393216 | 6
227
      1434 | 0x011a XResolution               |  RATIONAL |        1 |      1498 | 1498/0
228
      1446 | 0x011b YResolution               |  RATIONAL |        1 |      1506 | 1506/0
229
      1458 | 0x0128 ResolutionUnit            |     SHORT |        1 |    131072 | 2
230
      1470 | 0x0201 JPEGInterchangeFormat     |      LONG |        1 |      1514 | 1514
231
      1482 | 0x0202 JPEGInterchangeFormatLeng |      LONG |        1 |      4196 | 4196
232
  END MemIo
233
    5722 | 0xffed APP13 |    3038 | Photoshop 3.0.8BIM..........Z...
234
  Record | DataSet | Name                     | Length | Data
235
       1 |      90 | CharacterSet             |      3 | .%G
236
       1 |      90 | CharacterSet             |      3 | .%G
237
       2 |       0 | RecordVersion            |      2 | ..
238
       2 |     120 | Caption                  |    402 | 040621-N-6536T-062.USS Ronald Reagan (CV...
239
       2 |     122 | Writer                   |      9 | Dir. NVNS
240
       2 |      40 | SpecialInstructions      |     49 | Credit as U.S. Navy photo by Elizabeth T...
241
       2 |      80 | Byline                   |     32 | Photographer..s Mate 3rd Class (
242
       2 |      85 | BylineTitle              |     21 | U.S Navy Photographer
243
       2 |     110 | Credit                   |      8 | U.S Navy
244
       2 |     115 | Source                   |     24 | Navy Visual News Service
245
       2 |       5 | ObjectName               |     18 | 040621-N-6536T-062
246
       2 |      55 | DateCreated              |      8 | 20040621
247
       2 |      60 | TimeCreated              |     11 | 000000+0000
248
       2 |      62 | DigitizationDate         |      8 | 20040621
249
       2 |      63 | DigitizationTime         |     11 | 233753-0400
250
       2 |      90 | City                     |     19 | Straits of Magellan
251
       2 |     101 | CountryName              |     13 | South America
252
       2 |      15 | Category                 |      1 | N
253 1 Robin Mills
       2 |      20 | SuppCategory             |     12 | 703-614-9154
254
       2 |      20 | SuppCategory             |     23 | navyvisualnews@navy.mil
255
       2 |      20 | SuppCategory             |     11 | UNCLASSFIED
256
       2 |      10 | Urgency                  |      1 | 5
257
       2 |      25 | Keywords                 |     13 | ronald reagan
258
       2 |      25 | Keywords                 |      6 | reagan
259
       2 |      25 | Keywords                 |      6 | cvn 76
260
       2 |      25 | Keywords                 |      6 | cvn-76
261
       2 |      25 | Keywords                 |     18 | straights magellan
262
       2 |      25 | Keywords                 |      8 | magellan
263
       2 |      25 | Keywords                 |      7 | carrier
264
       2 |      25 | Keywords                 |     12 | nimitz-class
265
       2 |      25 | Keywords                 |      4 | ship
266
       2 |      25 | Keywords                 |      8 | underway
267
    8762 | 0xffe1 APP1  |    5329 | http://ns.adobe.com/xap/1.0/.<?x
268
   14093 | 0xffe2 APP2  |     576 | ICC_PROFILE......0ADBE....mntrRG chunk 1/1
269
   14671 | 0xffee APP14 |      14 | Adobe.d@......
270
   14687 | 0xffdb DQT   |     132 
271
   14821 | 0xffc0 SOF0  |      17 
272
   14840 | 0xffdd DRI   |       4 
273
   14846 | 0xffc4 DHT   |     418 
274
   15266 | 0xffda SOS  
275 10 Robin Mills
$ </pre>