Tuan Nhu, 22 Jul 2013 12:14

The Metadata in JPEG files

This article shares my investigation of the metadata structure of JPEG files. It also briefly introduces the algorithms used in Exiv2 to read and write Exif, IPTC and XMP data and image comments in JPEG files.

1. Background

The JPEG ISO standard defines a commonly used method of lossy compression for digital photographs. The name "JPEG" stands for Joint Photographic Experts Group, the committee that created the standard.

JPEG refers only to a class of compression algorithms, not to a specific file format. To produce files with embedded JPEG streams, a number of file format standards have been adapted or devised, including JPEG/JFIF (JPEG File Interchange Format), JPEG/SPIFF (Still Picture Interchange File Format), JPEG/CIFF (Camera Image File Format) and JPEG/Exif (Exchangeable image file format).

Among them, the most common types are JPEG/Exif and JPEG/JFIF.

JPEG/Exif is the most common image format used by digital cameras and other photographic image capture devices.
JPEG/JFIF is the most common format for storing and transmitting photographic images on the World Wide Web.

2. The metadata structure in JPEG

A JPEG file consists of a sequence of segments, each holding a different kind of data and each introduced by a two-byte code called a marker. A marker begins with the byte 0xFF, followed by one byte that identifies the kind of marker.

Some markers consist of just those two bytes; others are followed by two bytes indicating the length of marker-specific payload data that follows. The length includes the two bytes for the length, but not the two bytes for the marker.
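The length rule above can be sketched in a few lines of Python. This is only an illustration, not Exiv2 code: the helper name read_segment is hypothetical, and the sketch assumes the length field is stored big-endian, as the JPEG standard specifies but the text above does not state.

```python
import struct

# Standalone markers: the two marker bytes are the whole segment.
# SOI (0xD8), EOI (0xD9) and RST0..RST7 (0xD0..0xD7).
STANDALONE = {0xD8, 0xD9} | set(range(0xD0, 0xD8))


def read_segment(data: bytes, offset: int):
    """Return (marker_code, payload, next_offset) for the segment at offset."""
    if data[offset] != 0xFF:
        raise ValueError("expected 0xFF at start of marker")
    code = data[offset + 1]
    if code in STANDALONE:
        return code, b"", offset + 2
    # Big-endian 16-bit length; it counts itself but not the marker bytes.
    (length,) = struct.unpack_from(">H", data, offset + 2)
    if length < 2:
        raise ValueError("segment length must be at least 2")
    payload = data[offset + 4 : offset + 2 + length]
    return code, payload, offset + 2 + length


# SOI, then a COM segment holding the 5 bytes "hello" (length = 2 + 5), then EOI.
sample = b"\xFF\xD8" + b"\xFF\xFE\x00\x07hello" + b"\xFF\xD9"
code, payload, pos = read_segment(sample, 0)       # SOI, no payload
code2, payload2, pos2 = read_segment(sample, pos)  # COM with its payload
```

Note that the payload starts four bytes after the marker begins (two marker bytes plus two length bytes), while the next segment starts at marker offset + 2 + length.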

Short name	Bytes	Payload	Name and Comments
SOI	0xFF, 0xD8	None	Start Of Image.
SOF0	0xFF, 0xC0	Variable size	Start Of Frame (Baseline DCT). Indicates that this is a baseline DCT-based JPEG, and specifies the width, height, number of components, and component subsampling.
SOF2	0xFF, 0xC2	Variable size	Start Of Frame (Progressive DCT). Indicates that this is a progressive DCT-based JPEG, and specifies the width, height, number of components, and component subsampling.
DHT	0xFF, 0xC4	Variable size	Define Huffman Table(s).
DQT	0xFF, 0xDB	Variable size	Define Quantization Table(s).
DRI	0xFF, 0xDD	4 bytes (2-byte length, 2-byte interval)	Define Restart Interval. Specifies the interval between RSTn markers, in macroblocks. This marker is followed by two bytes indicating the fixed size, so it can be treated like any other variable-size segment.
SOS	0xFF, 0xDA	Variable size	Start Of Scan. Begins a top-to-bottom scan of the image. In baseline DCT JPEG images, there is generally a single scan. Progressive DCT JPEG images usually contain multiple scans. This marker specifies which slice of data it will contain, and is immediately followed by entropy-coded data.
RSTn	0xFF, 0xDn (n=0..7)	None	Restart. Inserted every r macroblocks, where r is the restart interval set by a DRI marker. Not used if there was no DRI marker. The low 3 bits of the marker code cycle in value from 0 to 7.
APPn	0xFF, 0xEn	Variable size	Application-specific. For example, an Exif JPEG file uses an APP1 marker to store metadata, laid out in a structure based closely on TIFF.
COM	0xFF, 0xFE	Variable size	Comment.
EOI	0xFF, 0xD9	None	End Of Image.

Fig.1. The common JPEG markers. From Wikipedia, https://en.wikipedia.org/wiki/JPEG
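As a rough illustration of Fig.1, the marker codes can be turned into a small lookup. The function names marker_name and has_payload are hypothetical helpers chosen for this sketch; only the codes listed in the table are covered.

```python
# Marker codes with a single fixed value, taken from Fig.1.
FIXED = {
    0xD8: "SOI", 0xC0: "SOF0", 0xC2: "SOF2", 0xC4: "DHT", 0xDB: "DQT",
    0xDD: "DRI", 0xDA: "SOS", 0xFE: "COM", 0xD9: "EOI",
}


def marker_name(code: int) -> str:
    """Map the second marker byte to the short name used in Fig.1."""
    if code in FIXED:
        return FIXED[code]
    if 0xD0 <= code <= 0xD7:
        return f"RST{code & 0x07}"   # low 3 bits cycle from 0 to 7
    if 0xE0 <= code <= 0xEF:
        return f"APP{code & 0x0F}"   # low 4 bits give n in APPn
    return f"0x{code:02X}"           # not listed in Fig.1


def has_payload(code: int) -> bool:
    """True unless the marker is SOI, EOI or RSTn, which stand alone."""
    return code not in (0xD8, 0xD9) and not 0xD0 <= code <= 0xD7


names = [marker_name(c) for c in (0xD8, 0xE0, 0xE1, 0xED, 0xFE, 0xD3)]
```

With this mapping, 0xE1 reads as APP1 and 0xED as APP13, the two segment types discussed in the rest of this section.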

Metadata in a JPEG file is stored in APPn segments (0xFF, 0xEn), and the comment is stored in a COM segment (0xFF, 0xFE). Several vendors may use the same APPn marker type for their own information, so the payload of these segments often begins with a vendor name (e.g. "Exif" or "Adobe") or some other identifying string.

Exiv2 provides fast and easy read and write access to Exif, IPTC and XMP metadata. Hence, this article focuses only on where the Exif, IPTC and XMP data are located in JPEG files.
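A minimal sketch of how a reader might list the APPn and COM segments and their identifying strings is shown below. It is not the Exiv2 implementation: the function names are hypothetical, the scan stops at the first SOS or EOI marker, fill bytes between segments are not handled, and the identifier is assumed to be NUL-terminated, which does not hold for every vendor.

```python
import struct


def list_metadata_segments(data: bytes):
    """Yield (marker_code, payload) for APPn and COM segments before SOS."""
    if data[:2] != b"\xFF\xD8":
        raise ValueError("not a JPEG stream: missing SOI")
    pos = 2
    while pos + 4 <= len(data):
        if data[pos] != 0xFF:
            raise ValueError(f"expected marker at offset {pos}")
        code = data[pos + 1]
        if code in (0xDA, 0xD9):  # SOS or EOI: no more metadata segments
            break
        (length,) = struct.unpack_from(">H", data, pos + 2)
        if 0xE0 <= code <= 0xEF or code == 0xFE:
            yield code, data[pos + 4 : pos + 2 + length]
        pos += 2 + length  # other segments are skipped unread


def identifier(payload: bytes) -> str:
    """Return the leading NUL-terminated identifier string, or "" if none."""
    head, sep, _ = payload.partition(b"\x00")
    return head.decode("ascii", "replace") if sep else ""


def make_segment(code: int, payload: bytes) -> bytes:
    """Build a segment for the demonstration below."""
    return bytes([0xFF, code]) + struct.pack(">H", len(payload) + 2) + payload


sample = (
    b"\xFF\xD8"
    + make_segment(0xE0, b"JFIF\x00\x01\x02")
    + make_segment(0xE1, b"Exif\x00\x00MM\x00\x2A")
    + make_segment(0xFE, b"shot on holiday")
    + make_segment(0xDA, b"\x00\x00")
    + b"\xFF\xD9"
)
found = [(code, identifier(p)) for code, p in list_metadata_segments(sample)]
```

Here the COM segment yields an empty identifier because a comment is free text rather than a tagged vendor payload.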

2.1 Exif

An Exif JPEG file uses an APP1 segment to store its metadata (and multiple APP2 segments for FlashPix extension data). The Exif APP1 segment stores a large amount of information about the photographic parameters of digital cameras and is currently the preferred place to store thumbnail images. It can also host an additional section with GPS data. Full details of Exif are available at http://www.exif.org/Exif2-2.PDF

According to the Exif specification, the Exif APP1 segment is recorded immediately after the SOI marker (the marker indicating the beginning of the file). However, this makes the Exif and JFIF standards incompatible, because each specifies that its own application segment (APP0 for JFIF, APP1 for Exif) must be the first in the image file. In practice, most JPEG files contain a JFIF marker segment (APP0) that precedes the Exif APP1 segment. This allows older readers to handle the JFIF segment correctly, while newer readers also decode the Exif segment that follows, being less strict about requiring it to appear first. This arrangement does not affect image decoding for most decoders, but poorly designed JFIF or Exif parsers may not recognize the file properly.

The Exif APP1 segment consists of the APP1 marker (0xFFE1), the two-byte segment length, the Exif identifier string ("Exif\0\0"), and the attribute information itself. The identifier string "Exif\0\0" is used to avoid a conflict with other applications that use APP1 (e.g. XMP).
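The layout just described can be sketched as a pair of helpers that build and recognize an Exif APP1 segment. The names build_exif_app1 and exif_payload are hypothetical, and the eight TIFF header bytes used as sample data are an assumption based on the TIFF format ("MM" for big-endian, the number 42, and the offset of the first IFD), not something spelled out in this article.

```python
import struct

EXIF_ID = b"Exif\x00\x00"


def build_exif_app1(attribute_info: bytes) -> bytes:
    """Wrap attribute information in an Exif APP1 segment."""
    body = EXIF_ID + attribute_info
    length = len(body) + 2  # the length field counts itself
    if length > 0xFFFF:
        raise ValueError("attribute information too large for one segment")
    return b"\xFF\xE1" + struct.pack(">H", length) + body


def exif_payload(segment: bytes):
    """Return the attribute information of an Exif APP1 segment, else None."""
    if segment[:2] != b"\xFF\xE1":
        return None
    (length,) = struct.unpack_from(">H", segment, 2)
    body = segment[4 : 2 + length]
    if not body.startswith(EXIF_ID):
        return None  # some other APP1 user, e.g. XMP
    return body[len(EXIF_ID):]


# Assumed sample data: a big-endian TIFF header with the first IFD at offset 8.
tiff_header = b"MM\x00\x2A\x00\x00\x00\x08"
segment = build_exif_app1(tiff_header)

# An APP1 segment carrying the XMP identifier is not mistaken for Exif.
xmp_body = b"http://ns.adobe.com/xap/1.0/\x00<x/>"
xmp_segment = b"\xFF\xE1" + struct.pack(">H", len(xmp_body) + 2) + xmp_body
```

Checking the identifier before touching the payload is what lets Exif and XMP share the APP1 marker without ambiguity.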

Fig.2. Basic Structure of JPEG Files. From Exif.org, http://www.exif.org/Exif2-2.PDF

Exif does not use any APPn segments other than APP1 and APP2, plus the COM segment. However, other APPn segments may still be present in the file, and Exif readers should be designed to skip over them.

2.2 XMP

In a typical edited JPEG file, XMP (Extensible Metadata Platform) information is included alongside Exif and IPTC (Information Interchange Model) data. XMP uses an APP1 segment to store its metadata; the storage format is RDF (Resource Description Framework), implemented as an application of XML.

The XMP APP1 segment consists of the APP1 marker (0xFFE1), the two-byte segment length, the XMP identifier string ("http://ns.adobe.com/xap/1.0/\x00"), and a Unicode XMP packet (the encoding is usually UTF-8, but it can also be UTF-16 or UTF-32). The packet cannot be split across multiple segments, so its size is limited to just under 64 KB: the 16-bit length field caps a segment at 2^16-1 = 65535 bytes, and that total also covers the length field and the identifier string.
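The size limit can be made concrete with a short sketch. The helper names are hypothetical, the packet is assumed to be UTF-8, and MAX_PACKET_BYTES is only the upper bound implied by the 16-bit length field under this arithmetic; a real writer may apply a stricter limit.

```python
import struct

XMP_ID = b"http://ns.adobe.com/xap/1.0/\x00"
MAX_SEGMENT_LENGTH = 0xFFFF  # largest value the 16-bit length field can hold
# The length counts its own 2 bytes and the identifier, but not the marker.
MAX_PACKET_BYTES = MAX_SEGMENT_LENGTH - 2 - len(XMP_ID)


def build_xmp_app1(packet: str) -> bytes:
    """Wrap a UTF-8 XMP packet in a single APP1 segment."""
    data = packet.encode("utf-8")
    if len(data) > MAX_PACKET_BYTES:
        raise ValueError("XMP packet does not fit into one APP1 segment")
    body = XMP_ID + data
    return b"\xFF\xE1" + struct.pack(">H", len(body) + 2) + body


def xmp_packet(segment: bytes):
    """Return the XMP packet text of an XMP APP1 segment, else None."""
    if segment[:2] != b"\xFF\xE1":
        return None
    (length,) = struct.unpack_from(">H", segment, 2)
    body = segment[4 : 2 + length]
    if not body.startswith(XMP_ID):
        return None
    return body[len(XMP_ID):].decode("utf-8")  # assumes a UTF-8 packet


packet = '<x:xmpmeta xmlns:x="adobe:ns:meta/"></x:xmpmeta>'
segment = build_xmp_app1(packet)
round_trip = xmp_packet(segment)
```

Because the identifier is 29 bytes long including its terminating NUL, it takes a noticeable share of the budget, which is why the limit is described as approximately, rather than exactly, 64 KB.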

The structure of the packet content can be found at http://www.w3.org/TR/REC-rdf-syntax/.
The reference document for XMP 3.2, published by Adobe Systems Incorporated, can be downloaded from http://xml.coverpages.org/xmp.html

2.3 IPTC

Adobe Photoshop uses the APP13 segment to store non-graphic information, such as layers, paths, IPTC data and more. An APP13 segment consists of the APP13 marker (0xFFED), the two-byte segment length, an identifier string (usually "Photoshop 3.0\0", but also "Adobe_Photoshop2.5:", used by earlier versions), and a sequence of resource data blocks. Most resource blocks contain only a few bytes, but the important IPTC block can be quite large. The IPTC block may not fit into one APP13 segment, so it can be split across multiple APP13 segments.
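The following sketch shows one way to reassemble split APP13 payloads and locate the IPTC block. Several details are assumptions taken from the Photoshop file format rather than from this article: each resource block starts with the signature "8BIM", followed by a 2-byte resource ID, a Pascal-string name padded to an even size, a 4-byte data size and the data padded to an even size; the IPTC block is assumed to use resource ID 0x0404; and split segments are assumed to join by simple concatenation once the identifier is removed. All function names are hypothetical.

```python
import struct

PS3_ID = b"Photoshop 3.0\x00"
IPTC_RESOURCE_ID = 0x0404  # assumption: resource ID of the IPTC block


def join_app13(payloads):
    """Concatenate APP13 payloads, dropping the identifier from each one."""
    return b"".join(p[len(PS3_ID):] for p in payloads if p.startswith(PS3_ID))


def resource_blocks(data: bytes):
    """Yield (resource_id, block_data) for each '8BIM' resource block."""
    pos = 0
    while pos + 12 <= len(data) and data[pos : pos + 4] == b"8BIM":
        (rid,) = struct.unpack_from(">H", data, pos + 4)
        name_len = data[pos + 6]
        # Pascal string (length byte + name), padded to an even total size.
        pos += 6 + ((1 + name_len + 1) & ~1)
        (size,) = struct.unpack_from(">I", data, pos)
        pos += 4
        yield rid, data[pos : pos + size]
        pos += (size + 1) & ~1  # block data is padded to an even size


def make_block(rid: int, payload: bytes) -> bytes:
    """Build a resource block with an empty name for the demonstration."""
    pad = b"\x00" if len(payload) % 2 else b""
    return (b"8BIM" + struct.pack(">H", rid) + b"\x00\x00"
            + struct.pack(">I", len(payload)) + payload + pad)


iptc = b"\x1C\x02\x05\x00\x05Title"  # sample bytes standing in for IPTC data
stream = make_block(0x03ED, b"\x00\x00\x00") + make_block(IPTC_RESOURCE_ID, iptc)
# Simulate a writer that split the resource data across two APP13 segments.
payloads = [PS3_ID + stream[:10], PS3_ID + stream[10:]]

blocks = list(resource_blocks(join_app13(payloads)))
iptc_found = next((d for rid, d in blocks if rid == IPTC_RESOURCE_ID), None)
```

Joining the payloads before parsing matters because a split can fall in the middle of a resource block, as it does in this sample.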

The reference document for the Photoshop file format is available at http://www.adobe.com/devnet-apps/photoshop/fileformatashtml/

3. Exiv2 JPEG read/write metadata algorithms

Files (7)

test.jpg (285 KB) test.jpg	from http://www.flickr.com/photos/canderson/5470671552/	Tuan Nhu, 23 Jul 2013 09:28
fig2.jpg (155 KB) fig2.jpg	Basic Structure of JPEG Files, From Exif.org	Tuan Nhu, 24 Jul 2013 00:34
fig3.jpg (343 KB) fig3.jpg	Flowchart of readMetadata	Tuan Nhu, 24 Jul 2013 00:37
fig4.jpg (443 KB) fig4.jpg	Flowchart of writeMetadata	Tuan Nhu, 24 Jul 2013 00:38
fig2d.jpg (155 KB) fig2d.jpg		Robin Mills, 17 Aug 2013 13:33
fig3d.jpg (70.4 KB) fig3d.jpg		Robin Mills, 17 Aug 2013 13:33
fig4d.jpg (81 KB) fig4d.jpg		Robin Mills, 17 Aug 2013 13:33
