• The JPEG algorithm is a lossy data compression algorithm

    The good old JPEG, for all its undeniable advantages, still has significant limitations. A new image compression method, long in development, was called upon to remove them. Now that JPEG2000 has become an officially recognized format, this should mark the beginning of its active support by software manufacturers.

    Surely many who work with graphics on a computer have wondered: how can an image that occupies a very impressive amount of space in a PC's memory be squeezed into a much smaller size on disk? I remember that at the dawn of my publishing career the word "compression" seemed mysterious and surprising to me... How, in fact, does image compression work - after all, without it the Web, digital photography and color printing would now be unthinkable?

    So, compression. It may or may not lead to a loss of quality. The latter case covers methods such as RLE (Run Length Encoding, which produces pairs of the form (skip, value), where skip is the number of consecutive zeros and value is the value that follows them) and LZW (compression by the Lempel-Ziv-Welch method), implemented in the PSD, GIF and TIFF formats. They are also widely used by archivers such as RAR and ZIP. The average degree of lossless compression is 2-3 times.

    If you need to compress an image further, you cannot do without losing quality. What are the principles? First, any image contains a certain redundancy, the removal of which does not lead to a noticeable change in image quality. Second, the human eye is more sensitive to changes in brightness than in color, so different degrees of compression can be applied to different image channels - information is lost, but it is not visually noticeable. In addition, the eye's sensitivity to small image elements is low, which makes it possible to remove them without compromising quality. In this way an image can be compressed (even to the point where the deterioration becomes noticeable) up to an acceptable threshold. The acceptable degree of quality degradation is determined for each specific case: for printing only minimal distortions are allowed, while for posting on the Internet (depending on the purpose) much more is tolerable.

    The most popular among lossy compression methods is JPEG, which retains sufficient image quality even at thirtyfold compression. Incidentally, most modern data compression methods (for example, MPEG-1 Audio Layer 3, known as MP3, as well as MPEG video) use mechanisms similar to JPEG. Let's take a closer look at this format, especially since its newest incarnation, JPEG2000, was finally approved not long ago, incorporating all the additions made to JPEG over the ten years of its development.

    JPEG

    The name of the compression algorithm is an abbreviation of Joint Photographic Experts Group, an initiative group formed from experts of the ITU (International Telecommunication Union) and ISO (International Organization for Standardization) - hence the word Joint in its name. In 1992, JPEG was declared an international standard for graphics.

    When compressing using the JPEG method, quality is always lost. In this case, there is always a choice: to give preference to quality at the expense of volume (the file size will be compressed by approximately three times) or, on the contrary, to achieve a minimum image size at which it will still remain recognizable (the degree of compression can reach 100). Compression, in which the difference in quality between the resulting image and the original is still unnoticeable, results in a 10-20 times reduction in file size.

    Scope of application

    JPEG is the best compressor for photographic-quality full-color and monochrome images. If you want to save an image with an index palette, it is first converted to full color. When compressing using the JPEG method, you need to keep in mind that everything depends on the nature of the images: much less volume will be occupied by those where the color changes are insignificant and there are no sharp color transitions. JPEG is used wherever photographic images need to be stored: in digital cameras, printing (EPS DCS 2.0), and the Internet is unthinkable without it.

    There are several types of JPEG compression, but we will consider only the two used in Adobe Photoshop, the standard package for working with raster images: baseline and progressive. The other two methods - arithmetic and lossless - are exotic and, for a number of reasons, have not become widespread.

    How does compression occur?

    1. The first stage is conversion of the image's color model (usually RGB) into one where the brightness and color components are separated (for example, YCbCr or YUV), which allows the compression level for each channel to be chosen optimally, taking into account the characteristics of visual perception. The conversion is performed as follows:

    Y  = 0.299*R + 0.587*G + 0.114*B
    Cb = (B - Y)/0.886/2 + 128
    Cr = (R - Y)/0.701/2 + 128
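    A minimal sketch of this forward conversion for a single pixel, written directly from the formulas above; a real codec would operate on whole arrays and clamp the results to the 0-255 range:

        def rgb_to_ycbcr(r, g, b):
            """Convert one 8-bit RGB pixel to YCbCr using the formulas above."""
            y = 0.299 * r + 0.587 * g + 0.114 * b
            cb = (b - y) / 0.886 / 2 + 128
            cr = (r - y) / 0.701 / 2 + 128
            return y, cb, cr

        print(rgb_to_ycbcr(255, 0, 0))   # pure red: low Cb, high Cr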

    2. The next stage is the so-called prefiltering (chroma subsampling): in each of the Cb and Cr channels, neighboring pixels are grouped into 2x2 blocks (pairs in the horizontal and vertical directions), while the brightness channel Y is left unchanged. Each group of four pixels then receives the average value of its Cb and Cr components. For brevity, such a scheme can be designated 4:1:1 (this is the notation used, for example, in CorelDRAW's JPEG export window), although strictly speaking 2x2 averaging corresponds to the 4:2:0 scheme. Taking into account that each pixel is encoded by 3 bytes (256 levels for each of the three channels), the amount of data is automatically halved: instead of 12 bytes for 4 pixels, it is enough to transfer only 4 + 1 + 1 = 6 bytes. From a mathematical point of view such a transformation loses a significant amount of information, but the human eye does not perceive the loss, since ordinary photographic images contain considerable redundancy.
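    A minimal sketch of this averaging step for one chroma channel, assuming the channel dimensions are even (a NumPy array stands in for the Cb or Cr plane):

        import numpy as np

        def subsample_chroma(channel):
            """Average a chroma channel over 2x2 blocks, as described above."""
            h, w = channel.shape
            blocks = channel.reshape(h // 2, 2, w // 2, 2)
            return blocks.mean(axis=(1, 3))

        cb = np.arange(16, dtype=float).reshape(4, 4)
        print(subsample_chroma(cb))   # one averaged value per 2x2 block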

    3. The information that has passed this primary "cleaning" is again grouped, separately in each channel, into blocks, this time 8x8 in size, after which the basic compression step is applied to them: the discrete cosine transform, DCT for short. As a result, information about the spatial distribution of pixel brightness within the block is converted into another form, in which it is described by the contributions of different spatial frequencies. DCT has a number of advantages over other transforms (for example, the Fourier transform), providing better reconstruction of the information.

    Instead of an array of 64 values (8x8 pixels) for each block that makes up the image, we get an array of 64 frequency coefficients. Let's look at how DCT works using an example. Suppose the brightness of the pixels in one block of our image has the form shown in Fig. 1 on the left; then the result of the transformation will be as shown on the right.
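    A minimal sketch of the forward 8x8 DCT, implemented directly from the DCT-II definition (a real codec would use a fast factorized version); the sample block is hypothetical:

        import numpy as np

        def dct2_8x8(block):
            """Forward 2-D DCT-II of an 8x8 block with orthonormal scaling."""
            n = 8
            c = np.array([np.sqrt(1.0 / n)] + [np.sqrt(2.0 / n)] * (n - 1))
            basis = np.array([[np.cos((2 * x + 1) * u * np.pi / (2 * n))
                               for x in range(n)] for u in range(n)])
            d = c[:, None] * basis
            return d @ block @ d.T

        block = np.arange(64, dtype=float).reshape(8, 8)   # a simple brightness ramp
        coeffs = dct2_8x8(block - 128)                     # level shift, then transform
        print(np.round(coeffs, 1))   # non-zero values cluster in the first row and column,
                                     # with the largest magnitudes at low frequencies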

    Despite the high accuracy of the transform itself, some loss of information does occur at this stage - this is one reason why JPEG always degrades quality. The main purpose of the transformation is to reveal the overall distribution of large-scale features (top left in the figure) and small ones (bottom right), which is useful later when discarding unimportant information.

    4. The next stage is the removal from the block of information that is barely noticeable to the eye, or quantization. All components are divided by coefficients that determine the significance of each of them for high-quality restoration of the original image, and the result is rounded to an integer value. It is this procedure that introduces the greatest loss of quality while reducing the final size of the image. High-frequency components are quantized coarsely, while low-frequency ones, being the most noticeable, are quantized more precisely. To smooth out the loss of quality somewhat, the luminance channel uses smaller divisors than the chrominance channels. More often, though (to speed up calculations), instead of specially selected values a single factor is used - the one derived from the compression level chosen by the user.
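    A minimal sketch of the quantization step; the quantization matrix below is hypothetical, chosen only so that the divisors grow toward the high-frequency corner:

        import numpy as np

        # hypothetical quantization matrix: coarser divisors toward high frequencies
        Q = np.add.outer(np.arange(8), np.arange(8)) * 4 + 16   # values 16 .. 72

        def quantize(coeffs, q):
            """Divide DCT coefficients by the quantization matrix and round."""
            return np.round(coeffs / q).astype(int)

        coeffs = np.random.default_rng(0).normal(0.0, 50.0, (8, 8))
        print(quantize(coeffs, Q))   # high-frequency entries are more likely to collapse to zero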

    Here, for example, is what the Photoshop window looks like when saving an image via Save for Web, where the Quality parameter (or rather, a value derived from it) is precisely that rounding factor (Fig. 2).

    As a result of quantization, a set of components is obtained, from which the original image is reconstructed with a given accuracy (Fig. 3).

    Fig. 4 shows the result of reconstructing a black-and-white square using one, four and fifteen components, respectively.

    5. After the main work of compression has been done, the remaining transformations are secondary: the components are arranged in a sequence such that those responsible for large-scale details come first, followed by ever smaller ones. If you look at the picture, the encoder's path through the block looks like a zigzag line, which is why this stage is called ZigZag (Fig. 5).

    Then the resulting sequence is compressed: first with the usual RLE, then with the Huffman method.
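    A minimal sketch of the zigzag traversal and a simple (zero-run, value) packing of the quantized coefficients; real JPEG codecs combine the run lengths with Huffman coding of the values, which is not shown here:

        import numpy as np

        def zigzag_indices(n=8):
            """Return (row, col) pairs of an n x n block in zigzag order."""
            return sorted(((r, c) for r in range(n) for c in range(n)),
                          key=lambda rc: (rc[0] + rc[1],
                                          rc[0] if (rc[0] + rc[1]) % 2 else rc[1]))

        def run_length_pairs(values):
            """Pack a sequence into (number_of_preceding_zeros, value) pairs."""
            pairs, zeros = [], 0
            for v in values:
                if v == 0:
                    zeros += 1
                else:
                    pairs.append((zeros, v))
                    zeros = 0
            pairs.append("EOB")   # end-of-block marker covering the trailing zeros
            return pairs

        block = np.zeros((8, 8), dtype=int)
        block[0, 0], block[0, 1], block[1, 0] = 42, -3, 5
        scan = [block[r, c] for r, c in zigzag_indices()]
        print(run_length_pairs(scan))   # [(0, 42), (0, -3), (0, 5), 'EOB']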

    6. And finally comes a purely technical stage: the data is wrapped in a container and supplied with a header that lists all the compression parameters so that the image can be restored. Sometimes, however, this information is not included in the header, which gives an additional gain in compression, but then you must be sure that the application reading the file knows where to find it.

    That, in general terms, is the whole transformation. Now let's calculate the compression achieved in our example. We kept 7 values from which the original 8x8 block will be restored, so the gain from the DCT stage in each of the two color channels is 8*8/7, roughly 9 times. Let's allocate not seven but 11 coefficients to the brightness channel, which gives 8*8/11, roughly 6 times. Averaged over the three channels this is (9+9+6)/3 = 8 times. The "thinning" of the image at the second stage (the 4:1:1 scheme, taking into account that the brightness component is left intact) gives an additional factor of two, for a final result of about 16 times. This is a rough calculation that ignores some details, but it reflects the real picture. To get a thirty-fold reduction in file size, you would need to keep only 3-4 components.

    The image restoration process runs in reverse order: first the components are multiplied by the values from the quantization table, producing approximate coefficients for the inverse cosine transform. The higher the quality chosen during compression, the closer these are to the original coefficients, and the more accurately the image is restored. One final step remains: just before completion, slight adjustments (noise) are applied to the boundary pixels of neighboring blocks in order to remove sharp seams between them.
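    A minimal sketch of the decoder-side steps just described (dequantization followed by the inverse DCT), reusing the same orthonormal DCT matrix as in the forward-transform sketch above:

        import numpy as np

        def dct_matrix(n=8):
            """Orthonormal DCT-II matrix."""
            c = np.array([np.sqrt(1.0 / n)] + [np.sqrt(2.0 / n)] * (n - 1))
            basis = np.array([[np.cos((2 * x + 1) * u * np.pi / (2 * n))
                               for x in range(n)] for u in range(n)])
            return c[:, None] * basis

        def decode_block(quantized, q):
            """Multiply back by the quantization table, then apply the inverse 2-D DCT."""
            d = dct_matrix()
            coeffs = quantized * q        # approximate DCT coefficients
            return d.T @ coeffs @ d       # inverse of  d @ block @ d.T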

    Disadvantages of JPEG

    1. Inability to achieve high compression ratios due to block size restrictions (8x8 only).
    2. Blocky structure at high levels of compression.
    3. Rounding sharp corners and blurring of subtle elements in an image.
    4. Only RGB images are supported (JPEG for CMYK images can only be used in EPS format via DCS).
    5. The image cannot be displayed until it has completely loaded.

    It's been ten years since JPEG was approved as a standard. During this time, groups of researchers proposed a number of significant additions to the original version, which resulted in the emergence of a new standard at the end of last year.

    JPEG2000

    In 1997, work began on a universal encoding system that would remove all the limitations of JPEG and could work effectively with all types of images - black and white, grayscale, full color and multi-component - regardless of content (be it photographs, fairly small text or even line drawings). Along with the international standards organizations, industry giants such as Agfa, Canon, Fujifilm, Hewlett-Packard, Kodak, LuraTech, Motorola, Ricoh, Sony and others took part in its development.

    Since the new algorithm claimed to be universal, it was additionally required to support various methods of data delivery (in real time and over narrow bandwidth), which is especially critical in multimedia applications, for example in real-time broadcasts over the Internet.

    Basic requirements for the JPEG2000 format:

    1. A higher degree of compression than JPEG.
    2. Support for monochrome images, allowing it to be used for compressing images containing text.
    3. The possibility of completely lossless compression.
    4. Output of images with gradual improvement in detail (as in progressive GIF).
    5. The use of priority areas in the image, for which the quality can be set higher than for the rest of the image.
    6. Real-time decoding (no delays).

    Compression principle

    The main compression mechanism in JPEG2000, unlike JPEG, is the wavelet transform - a system of filters applied to the entire image. Without going into the details of the compression, we will note only the main points.

    First, just as with JPEG, the image is converted into the YCbCr system, after which the initial removal of redundant information takes place (by the already familiar combining of neighboring pixels into 2x2 blocks). Then the entire image is divided into tiles of equal size, each of which undergoes further transformations independently of the others (this reduces the memory and computing requirements). Next, each channel is filtered with low-pass and high-pass filters, separately along rows and along columns, so that after the first pass four smaller images (subbands) are formed in each tile. All of them carry information about the original image, but their information content is very different (Fig. 6).

    For example, the image obtained after low-pass filtering along both rows and columns (top left) carries the greatest amount of information, while the one obtained after high-pass filtering in both directions contains the least. The information content of the images obtained after low-pass filtering of the rows and high-pass filtering of the columns (and vice versa) is intermediate. The most informative subband is then filtered again, and the resulting components, as with JPEG compression, are quantized. This happens several times: for lossless compression the cycle is usually repeated 3 times; with losses, 10 iterations are considered a reasonable compromise between size, quality and decompression speed. The result is one small image plus a set of pictures with fine details that successively, and with a certain accuracy, restore it to full size. Obviously, the highest degree of compression is obtained on large images, since a larger number of cycles can be applied.
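    JPEG2000 itself uses the CDF 5/3 and 9/7 wavelets; the single-level Haar decomposition below is only a simplified sketch of how one filtering pass splits a channel into four subbands:

        import numpy as np

        def haar_subbands(channel):
            """One level of a 2-D Haar wavelet transform: returns LL, LH, HL, HH."""
            a = channel.astype(float)
            # filter along rows: low = average of pixel pairs, high = their difference
            low = (a[:, 0::2] + a[:, 1::2]) / 2
            high = (a[:, 0::2] - a[:, 1::2]) / 2
            # filter along columns in the same way
            ll = (low[0::2, :] + low[1::2, :]) / 2
            lh = (low[0::2, :] - low[1::2, :]) / 2
            hl = (high[0::2, :] + high[1::2, :]) / 2
            hh = (high[0::2, :] - high[1::2, :]) / 2
            return ll, lh, hl, hh

        img = np.random.default_rng(1).integers(0, 256, (16, 16))
        ll, lh, hl, hh = haar_subbands(img)
        print(ll.shape)   # (8, 8): each subband is a quarter of the original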

    Practical implementation

    Since the foundations of JPEG2000 compression were laid, a number of companies have developed fairly effective algorithms for its implementation.

    Among the major software developers, Corel is worth noting (incidentally, it was one of the first to introduce support for the wavelet-based WI format into its packages, for which it deserves credit): all images supplied on the CDs with the CorelDRAW package, up to version 9, were compressed in exactly this way.

    Later, Adobe joined in. Some of the ideas behind JPEG2000 were applied by the developers of Photoshop 6 in the form of advanced options for saving an image in the ordinary, cosine-transform-based JPEG format. Among them is progressive JPEG (the Progressive parameter in the Save for Web window). This algorithm is intended primarily for real-time systems and works much like progressive GIF: first a rough copy of the image appears, consisting of just a few large blocks, and then, as the remaining data arrives, the structure becomes clearer and clearer until the final image is completely restored. Unlike GIF, this algorithm places a greater load on the viewing program, since it has to run the complete transformation cycle for each transmitted version.

    Other additions include the ability to store several JPEG-compressed images with different compression levels, resolutions and even color models in a single file. Accordingly, Photoshop 6 makes it possible to select individual areas of an image and apply different compression settings to them (Region of Interest, a mechanism first proposed back in 1995) by using lower values in the quantization table. To do this, define the required area (for example, as a new channel in the image) and click the mask icon next to the Quality item. In the window that appears, you can experiment with the image by moving the sliders: the result is displayed on screen, allowing you to quickly find the necessary compromise between quality and size.

    Specialized converters and viewers

    Since the standard does not stipulate specific implementations of the compression/decompression methods, this gives scope to third-party developers of compression algorithms: you can use either a simplified wavelet transform and thereby speed up compression, or, conversely, a more complex one that demands correspondingly greater system resources.

    Tailored solutions from other companies are available as commercial products. Some are implemented as stand-alone programs (JPEG 2000 from Aware), others as plug-in modules for the most common raster editors (ImagePress JPEG2000 from Pegasus Imaging and the LEAD JPEG2000 module from LEAD Technologies). LuraTech, which has been working on this problem for a long time, stands out among them: it promotes its LuraWave technology in the self-contained product LuraWave SmartCompress (already in its third version) and offers modules for Photoshop, Paintshop and Photopaint. A distinctive feature is the higher speed of operation (almost instant conversion) even with images several megabytes in size; accordingly, the price of this module is the highest - $79.

    To view JPEG2000 images in browsers, you need to install a special viewer module (all developers offer one for free). Inserting an image into an HTML document, as with any plug-in, comes down to using the EMBED construct with additional parameters, which can specify, for example, that a progressive transmission method is to be used. In our example (a file of 139 KB), only 250 bytes are transferred first, on the basis of which a rough image is built; then, after 500 bytes have been loaded, the image is refined, and this continues until the LIMIT value is reached.

    If you want a better image, select the Improve item from the menu that pops up on a right-click (Fig. 9). In four downloads the entire image will be loaded in full.

    Conclusions

    So, JPEG2000 objectively shows better results than JPEG only at high compression levels; at 10-20 times compression there is not much difference. Will it be able to displace, or at least compete with, the widespread format? In the near future, probably not: in most cases the quality/size ratio provided by JPEG is quite acceptable, and the extra 10-20% of compression that JPEG2000 delivers at visually the same quality is unlikely to boost its popularity.

    But digital camera manufacturers are showing keen interest in the new format, since the size of image sensors grows steadily every year and storing pictures in memory is becoming increasingly difficult. Then the new format will become more widespread, and who knows, perhaps in time JPEG2000 will stand on an equal footing with JPEG. In any case, Analog Micro Devices recently released a specialized chip that implements compression and decompression with the new technology in hardware, and the US Department of Defense is already actively using the new format to record photographs obtained from reconnaissance satellites.

    Facts and speculation

    1. JPEG loses quality when opening and resaving the file.

    Not true. Quality is lost only if a compression level higher (that is, a quality setting lower) than the one with which the image was saved is selected.

    2. JPEG loses quality when editing the file.

    True. When you save the modified file, all transformations are performed anew, so avoid frequent re-editing of the image. This applies only when the file is closed; as long as the file remains open, there is no cause for concern.

    3. The result of compression with the same parameters in different programs will be the same.

    Not true. Different programs interpret user input differently. For example, in one program the quality of the saved image is indicated (as, for example, in Photoshop), in another - the degree of its compression (the inverse value).

    4. When setting the maximum quality, the image is saved without any loss of quality.

    Not true. JPEG always compresses with losses. But setting, for example, 90% quality instead of 100% results in a reduction in file size greater than the deterioration in quality perceived by the eye.

    5. Any JPEG file can be opened in any editor that understands the JPEG format.

    Not true. For example, progressive JPEG files are not understood by some editors.

    6. JPEG does not support transparency.

    True. Sometimes it may seem that some part of the image is transparent, but in fact its color is simply chosen to match the background color of the HTML page.

    7. JPEG compresses better than GIF.

    Not true. They have different areas of application. In general, a typical “GIF” image after conversion to JPEG will have a larger volume.

    JPEG2000 vs JPEG

    1. With twenty to thirty times compression, JPEG2000 and JPEG give approximately the same quality (by the way, Photoshop cannot compress an ordinary photograph more than this limit).

    2. At higher compression, the quality of JPEG2000 is significantly higher than that of JPEG, which makes it possible to compress up to 50 times with virtually no visible loss, and with some loss (we are talking about images for the Internet) up to 100 and even 200 times.

    3. At high levels of compression in those areas where a smooth color change occurs, the image does not acquire the block structure characteristic of a simple JPEG. JPEG2000 also somewhat smears and rounds out sharp edges - see photographs (Fig. 7 and 8).

    They show the results of compressing a test file at different compression levels (on the left, saved in Photoshop as JPEG; on the right, as JPEG2000). For the image in Fig. 7 the compression levels 20, 40, 70 and 145 were chosen (they can be specified explicitly when saving to JPEG2000); the JPEG compression level was chosen so that the file size matched that of the JPEG2000 version. As they say, the results speak for themselves. For clarity, a second experiment was carried out on an image with sharper details (compression levels 10, 20, 40 and 80). The advantage is again on JPEG2000's side (Fig. 8).

    4. Since copies of the image at different resolutions are, in effect, stored in a single JPEG2000 file, those who make image galleries for the Internet no longer need to create thumbnails.

    5. Of particular interest is lossless compression. Thus, a test file that took 827 KB with LZW compression from Photoshop took 473 KB as JPEG2000.

    6. Compared to JPEG, its more advanced namesake consumes significantly more system resources. But the power of computers, which has increased significantly over the past couple of years, makes it possible to successfully solve image compression problems using a new method.

    7. Lack of JPEG2000 support in browsers: to view such images, you need to download a fairly large additional module (1.2 MB).

    8. Lack of free software for saving images in the new format.

    Scope of application

    The JPEG algorithm is most suitable for compressing photographs and paintings containing realistic scenes with smooth transitions of brightness and color. JPEG is most widespread in digital photography and for storing and transmitting images using the Internet.

    On the other hand, JPEG is unsuitable for compressing drawings, text and character graphics, where the sharp contrast between adjacent pixels leads to noticeable artifacts. It is advisable to save such images in lossless formats such as TIFF, GIF or PNG.

    JPEG (like other lossy compression methods) is not suitable for compressing images in multi-stage processing, since distortions are introduced into the image each time intermediate processing results are saved.

    JPEG should not be used in cases where even minimal losses are unacceptable, for example, when compressing astronomical or medical images. In such cases, the Lossless JPEG compression mode provided by the JPEG standard (which, however, is not supported by most popular codecs) or the JPEG-LS compression standard may be recommended.

    Compression

    During compression, the image is converted from the RGB color space to YCbCr (YUV). It should be noted that the JPEG standard (ISO/IEC 10918-1) does not regulate the choice of YCbCr in any way, allowing other types of conversion (for example, with a number of components other than three) as well as compression without conversion (directly in RGB); however, the JFIF specification (JPEG File Interchange Format, proposed in 1991 by specialists from C-Cube Microsystems and now a de facto standard) prescribes the use of the RGB->YCbCr conversion.

    After the RGB->YCbCr conversion, "subsampling" can be performed for the image channels Cb and Cr, which are responsible for color: each block of 4 pixels (2x2) of the brightness channel Y is assigned averaged values of Cb and Cr (the "4:2:0" decimation scheme). In this case, for each 2x2 block, instead of 12 values (4 Y, 4 Cb and 4 Cr), only 6 are used (4 Y plus one averaged Cb and one averaged Cr). If increased demands are placed on the quality of the image restored after compression, decimation can be performed in only one direction - vertically (the "4:4:0" scheme) or horizontally ("4:2:2") - or not performed at all ("4:4:4").

    The standard also allows decimation with averaging of Cb and Cr not over a 2x2 block but over four consecutively located pixels (vertically or horizontally), that is, over 1x4 and 4x1 blocks (the "4:1:1" scheme), as well as 2x4 and 4x2 blocks (the "4:1:0" scheme). It is also possible to use different types of decimation for Cb and Cr, but in practice such schemes are used extremely rarely.

    Next, the brightness component Y and the color components Cb and Cr are divided into blocks of 8x8 pixels. Each such block is subjected to a discrete cosine transform (DCT). The resulting DCT coefficients are quantized (in general, different quantization matrices are used for Y, Cb and Cr) and packed using run-length and Huffman coding. The JPEG standard also allows the use of the much more efficient arithmetic coding; however, because of patent restrictions (the patent for the arithmetic QM coder described in the JPEG standard belongs to IBM), it is rarely used in practice. Recent versions of the popular libjpeg library include support for arithmetic coding, but viewing images compressed with it may be problematic because many viewers do not support decoding them.

    The matrices used to quantize the DCT coefficients are stored in the header part of the JPEG file. They are usually constructed so that high-frequency coefficients are subject to stronger quantization than low-frequency ones. This results in coarsening of small details in the image. The higher the compression ratio, the more strongly all coefficients are quantized.

    When saving an image as a JPEG file, a quality parameter is specified in some arbitrary units, for example from 1 to 100 or from 1 to 10. A higher number usually corresponds to better quality (and a larger compressed file). However, even at the highest quality (corresponding to a quantization matrix consisting entirely of ones), the reconstructed image will not coincide exactly with the original, which is due both to the finite accuracy of the DCT implementation and to the need to round the values of Y, Cb, Cr and the DCT coefficients to the nearest integer. The Lossless JPEG compression mode, which does not use DCT, provides an exact match between the restored and original images, but its low efficiency (the compression ratio rarely exceeds 2) and lack of support from software developers have not contributed to the popularity of Lossless JPEG.
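    As an illustration of how an abstract quality setting can be mapped onto a quantization matrix, here is a sketch of the scaling rule used by libjpeg-style encoders (cf. libjpeg's jpeg_quality_scaling): the base table is scaled and clamped. The four base values below are only an excerpt used for illustration:

        def scale_quant_table(base_table, quality):
            """Scale a base quantization table for a quality setting in 1..100:
            lower quality -> larger divisors -> coarser quantization."""
            quality = max(1, min(100, quality))
            scale = 5000 // quality if quality < 50 else 200 - 2 * quality
            return [max(1, min(255, (q * scale + 50) // 100)) for q in base_table]

        base = [16, 11, 10, 16]              # excerpt of a luminance table
        print(scale_quant_table(base, 90))   # [3, 2, 2, 3]: little extra loss
        print(scale_quant_table(base, 10))   # [80, 55, 50, 80]: heavy quantization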

    Varieties of JPEG compression schemes

    The JPEG standard provides two main ways to represent encoded data.

    The most common, supported by most available codecs, is the sequential JPEG data representation, which involves sequential traversal of the encoded image block by block from left to right, from top to bottom. The operations described above are performed on each encoded image block, and the encoding results are placed in the output stream in the form of a single “scan,” that is, an array of encoded data corresponding to the sequentially passed (“scanned”) image. The main or "baseline" encoding mode allows only this representation. Extended mode, along with sequential mode, also allows progressive JPEG data presentation.

    In the case of progressive JPEG, the compressed data is written to the output stream as a set of scans, each of which describes the entire image with an increasing degree of detail. This is achieved either by recording in each scan not the full set of DCT coefficients, but only some part of them: first - low-frequency ones, in subsequent scans - high-frequency ones (the “spectral selection” method, that is, spectral samples), or by sequential, from scan to scan, refinement of DCT coefficients (method of “successive approximation”, that is, successive approximations). This progressive representation of data is especially useful when transmitting compressed images using low-speed communication channels, since it allows you to get an overview of the entire image after only a small part of the JPEG file has been transmitted.

    Both described schemes (sequential and progressive JPEG) are based on DCT and fundamentally cannot produce a reconstructed image absolutely identical to the original. However, the standard also allows compression that does not use DCT but is built on a linear predictor (lossless JPEG), guaranteeing a complete, bit-for-bit match between the original and restored images. The compression ratio for photographic images then rarely reaches 2, but the guaranteed absence of distortion is in demand in some cases. Noticeably higher compression ratios can be obtained with the JPEG-LS compression method, which, despite the similarity of names, is not directly related to the JPEG standard ISO/IEC 10918-1 (ITU-T Recommendation T.81) and is described by ISO/IEC 14495-1 (ITU-T Recommendation T.87).

    Syntax and structure

    A JPEG file contains a sequence of markers, each of which begins with the byte 0xFF, indicating the start of the marker, followed by an identifier byte. Some markers consist of just this pair of bytes, while others carry additional data: a two-byte field with the length of the informative part of the marker (including the length of this field itself but excluding the two bytes of the marker start, that is, 0xFF and the identifier) followed by the data itself. This file structure makes it possible to quickly find a marker with the necessary data (for example, the line length, number of lines and number of color components of the compressed image). A small parsing sketch follows the marker table below.

    Basic JPEG markers
    Marker | Bytes | Length | Purpose | Comments
    SOI | 0xFFD8 | none | Start of image |
    SOF0 | 0xFFC0 | variable | Start of frame (baseline, DCT) | Indicates that the image was encoded in baseline mode using DCT and Huffman coding. The marker contains the number of lines and the line length of the image (two-byte fields at offsets 5 and 7 from the start of the marker, respectively), the number of components (byte field at offset 8), the number of bits per component (byte field at offset 4), and the component ratio (for example, 4:2:0).
    SOF1 | 0xFFC1 | variable | Start of frame (extended, DCT, Huffman coding) | Indicates that the image was encoded in extended mode using DCT and Huffman coding. The marker contains the number of lines and line length of the image, the number of components, the number of bits per component, and the component ratio (for example, 4:2:0).
    SOF2 | 0xFFC2 | variable | Start of frame (progressive, DCT, Huffman coding) | Indicates that the image was encoded in progressive mode using DCT and Huffman coding. The marker contains the number of lines and line length of the image, the number of components, the number of bits per component, and the component ratio (for example, 4:2:0).
    DHT | 0xFFC4 | variable | Huffman tables | Specifies one or more Huffman tables.
    DQT | 0xFFDB | variable | Quantization tables | Specifies one or more quantization tables.
    DRI | 0xFFDD | 4 bytes | Restart interval | Sets the interval between RSTn markers, in macroblocks.
    SOS | 0xFFDA | variable | Start of scan | The beginning of the first or next scan of the image, traversed from left to right, top to bottom. In baseline mode a single scan is used; in progressive modes several scans are used. The SOS marker separates the informative (header) part of the image from the encoded (compressed) data.
    RSTn | 0xFFDn | none | Restart | Inserted every r macroblocks, where r is the restart interval set by the DRI marker; not used if there is no DRI marker. n, the low 3 bits of the marker code, cycles from 0 to 7.
    APPn | 0xFFEn | variable | Application-specific | For example, EXIF metadata in a JPEG file is stored in the APP1 marker using a TIFF-based structure.
    COM | 0xFFFE | variable | Comment | Contains the text of a comment.
    EOI | 0xFFD9 | none | End of the encoded part of the image. |
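    A minimal sketch of walking this marker sequence up to the start of scan; the file name is hypothetical, and the name table covers only a few markers from the list above:

        import struct

        NAMES = {0xD8: "SOI", 0xC0: "SOF0", 0xC2: "SOF2", 0xC4: "DHT",
                 0xDB: "DQT", 0xDA: "SOS", 0xD9: "EOI", 0xFE: "COM"}

        def list_markers(path):
            """Print the markers of a JPEG file up to the start of scan (SOS)."""
            with open(path, "rb") as f:
                data = f.read()
            pos = 0
            while pos + 1 < len(data):
                if data[pos] != 0xFF:
                    break                      # malformed input; stop
                marker = data[pos + 1]
                name = NAMES.get(marker, "0xFF%02X" % marker)
                if marker in (0xD8, 0xD9) or 0xD0 <= marker <= 0xD7:
                    print(name, "(no payload)")
                    pos += 2
                else:
                    # two-byte big-endian length that includes these two bytes
                    (length,) = struct.unpack(">H", data[pos + 2:pos + 4])
                    print(name, "length", length)
                    if marker == 0xDA:         # entropy-coded data follows SOS
                        break
                    pos += 2 + length

        list_markers("example.jpg")            # hypothetical file name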

    Advantages and disadvantages

    The disadvantages of compression according to the JPEG standard include the appearance of characteristic artifacts in restored images at high compression ratios: the image breaks up into 8x8 pixel blocks (this effect is especially noticeable in areas with smooth changes in brightness), while in areas with high spatial frequency (for example, contrasting contours and boundaries) artifacts appear in the form of noise halos. It should be noted that the JPEG standard (ISO/IEC 10918-1, Annex K, clause K.8) provides for the use of special filters to suppress blocking artifacts, but in practice such filters, despite their high efficiency, are hardly ever used. Nevertheless, despite its shortcomings, JPEG has become very widespread thanks to its fairly high (relative to the alternatives that existed at the time of its appearance) compression ratio, support for compression of full-color images, and relatively low computational complexity.

    JPEG compression performance

    To speed up compression according to the JPEG standard, parallelization of the calculations is traditionally used, in particular when computing the DCT. Historically, one of the first attempts to speed up compression in this way is described in a 1993 paper by Kasperovich and Babkin, which proposed an original DCT approximation that allows the calculations to be parallelized efficiently using the 32-bit general-purpose registers of Intel 80386 processors. Later, faster schemes used the SIMD extensions of the x86 instruction set. Significantly better results are achieved by schemes that use the computing capabilities of graphics accelerators (NVIDIA CUDA and AMD FireStream technologies) to parallelize not only the DCT but also the other stages of JPEG compression (color space conversion, run-level coding, statistical coding, etc.), and to do so for each 8x8 block of the encoded or decoded image. An implementation parallelizing all stages of the JPEG algorithm with CUDA has also been reported, significantly accelerating compression and decoding.

    In 2010, scientists from the PLANETS project placed instructions for reading the JPEG format in a special capsule, which was placed in a special bunker in the Swiss Alps. This was done with the aim of preserving for posterity information about digital formats popular at the beginning of the 21st century.

    1. The Joint Photographic Experts Group, a joint group of experts in the field of photography;

    2. An image compression method developed by this group, and the corresponding graphic format, often used on the WWW. It is characterized by compact files and, accordingly, fast transfer, as well as a loss of image quality. It is used primarily for photographs, since for them the loss of quality is less critical. Color is stored on the basis of the RGB model (converted to YCbCr during compression).

    JPEG (pronounced "jay-peg"; English: Joint Photographic Experts Group, after the name of the organization that developed it) is one of the popular graphic formats used for storing photographs and similar images. Files containing JPEG data usually have the extension .jpeg, .jfif, .jpg, .JPG or .JPE; of these, .jpg is the most popular extension on all platforms. The MIME type is image/jpeg.

    The JPEG algorithm is a lossy data compression algorithm.

    JPEG compression of a digital image

    One of the most comprehensive and popular image compression standards is the JPEG standard.

    The compression process itself consists of three successive steps:

    a) calculation of the discrete cosine transform (DCT) for the 8*8 blocks obtained after the standard partitioning of the image matrix;

    b) quantization of the DCT coefficients;

    c) variable-length coding.

    First, the image data is divided into separate blocks of 8*8 elements, which are processed sequentially from left to right and top to bottom. The processing of each block begins with a brightness shift of all its 64 elements, achieved by subtracting 2^(k-1), where 2^k is the number of brightness levels. Then the two-dimensional DCT of the block elements is calculated. The obtained coefficient values T(u,v) are quantized according to the formula

    T^(u,v) = round( T(u,v) / Z(u,v) ),

    where T^(u,v) is the result of quantizing the DCT coefficient T(u,v), and Z(u,v) is the corresponding element of the quantization matrix.

    (It should be noted that before the quantized DCT coefficients can be subjected to the inverse DCT to reconstruct an image block, they must be multiplied back by the corresponding elements of the quantization matrix:

    T'(u,v) = T^(u,v) * Z(u,v). (2.5)

    Obviously, the inverse transform of the values T'(u,v) yields only an approximation of the reconstructed image block.)

    The quantized coefficients are then reordered by a zigzag scan, which traverses the 8*8 array along its anti-diagonals from the top-left (low-frequency) corner to the bottom-right (high-frequency) corner. The result is a one-dimensional sequence of quantized coefficients.

    Because the one-dimensional array obtained after the zigzag scan is ordered by increasing spatial frequency, it usually contains long runs of zeros, which is effectively exploited by the JPEG encoding procedure. For the luminance channel, the quantization matrix recommended in the JPEG standard is commonly used.
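    A sketch of the quantization and dequantization formulas using the example luminance quantization table from Annex K of the JPEG standard; the block of DCT coefficients below is hypothetical:

        import numpy as np

        # example luminance quantization table from Annex K of the JPEG standard
        Z = np.array([
            [16, 11, 10, 16,  24,  40,  51,  61],
            [12, 12, 14, 19,  26,  58,  60,  55],
            [14, 13, 16, 24,  40,  57,  69,  56],
            [14, 17, 22, 29,  51,  87,  80,  62],
            [18, 22, 37, 56,  68, 109, 103,  77],
            [24, 35, 55, 64,  81, 104, 113,  92],
            [49, 64, 78, 87, 103, 121, 120, 101],
            [72, 92, 95, 98, 112, 100, 103,  99],
        ])

        def quantize(T, Z):
            """T^(u,v) = round(T(u,v) / Z(u,v)) -- the formula above."""
            return np.round(T / Z).astype(int)

        def dequantize(T_hat, Z):
            """T'(u,v) = T^(u,v) * Z(u,v) -- relation (2.5)."""
            return T_hat * Z

        T = np.zeros((8, 8))
        T[0, 0], T[0, 1], T[7, 7] = -415.0, -30.0, 12.0   # hypothetical coefficients
        print(quantize(T, Z))   # the small high-frequency value at (7,7) becomes 0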

    Example. Sequential JPEG encoding and decoding. Consider compressing and decompressing a block of 8*8 pixels according to the sequential JPEG encoding standard.

    The source pixels can take 256 = 2^8 brightness levels, so the encoding process begins by shifting the range of values: 2^7 = 128 is subtracted from each pixel value. The shifted block is then subjected to the forward DCT, and the resulting coefficients are quantized using the quantization matrix given above. The quantization procedure produces a significant number of zero elements. After the coefficients are reordered by the zigzag scan, the following sequence is obtained:

    (-26 -31 -3 -2 -6 2 -4 1 -4 1 1 5 0 2 0 0 -1 2 0 0 0 0 0 -1 -1 EOB)

    The code word EOB (end of block) indicates that all remaining coefficients in the reordered sequence are equal to 0. The resulting array is then encoded with standard Huffman codes, which convert it into a continuous stream of bits.

    When reconstructing a compressed JPEG block, the decoder must first recreate the quantized DCT coefficients from the continuous bit stream. Since the binary Huffman code is uniquely decodable, this step is easily implemented by table lookup. The quantized values are then multiplied by the quantization coefficients according to (2.5), the inverse DCT is applied to the resulting array, and the range of values is shifted back by +2^7 = +128, which yields the fully restored block.

    All differences between the values of the elements of the original and restored blocks arise from the very nature of lossy compression, which is the essence of the JPEG compression and recovery procedures. In this example, the reconstruction errors range from -14 to 11.

    Characteristic features of the singular values of digital image matrix blocks under JPEG compression. Let the original grayscale digital image, stored in some lossless format (for example, TIF), be divided in the standard way into 8*8 blocks. If for each block we determine the set of all its singular values (the singular spectrum), it turns out that on average only 2.40% of the total number of blocks have zero singular values.

    This fact is not accidental. The rank of any matrix equals the number of its non-zero singular values, so the presence of zeros in the singular spectrum indicates that the number of linearly independent rows (columns) of the block is less than its size. For an arbitrary real image, however, even taking into account the correlation of pixel brightness values, the probability that the rows (columns) of a given block turn out to be linearly dependent is small.

    Quantization of the DCT coefficients, which occurs when a digital image is saved in the (lossy) JPEG format, is an irreversible procedure and leads to characteristic perturbations of the singular values of the blocks.

    Let the original digital image be subjected to JPEG compression. Let us perform a partial recovery (PR) operation on it, which includes: 1) entropy decoding; 2) multiplying the obtained coefficients by the corresponding elements of the normalization array (the quantization matrix); 3) applying the inverse DCT, but without subsequent rounding.

    In the resulting matrix, almost all blocks contain zero singular values, and there are quite a lot of such values in each block (Table 2.1). This situation is natural: after quantization and rounding, many of the DCT coefficients of a block corresponding to high and medium frequencies are set to zero and remain zero after partial recovery. Given the correspondence between the transform coefficients and the singular triplets of the block matrix (a singular value together with its left and right singular vectors), this leads to the zeroing of the smallest (and possibly medium-sized) singular values of the block matrices.

    Table 2.1. Results of the singular value decomposition of blocks of partially reconstructed images. Rows: POUT, CAMERAMAN, TIRE, MOON, CELL; columns m = 8, ..., 0. The entries give the number of blocks with more than two zero singular values as a percentage of the total number of blocks; the numerical values are not reproduced here.

    Note that the fewer zero singular values a block has, the more contour lines it contains. Indeed, the presence of contours in a block indicates a significant high-frequency component in the corresponding signal. The DCT coefficients corresponding to high and medium frequencies will then be relatively large and may remain non-zero after quantization and partial recovery, and will therefore contribute not only to the largest singular value.

    To illustrate this, consider the CELL.TIF image (Fig. 2.5(a)). Figure 2.5(b) shows the matrix of numbers of zero singular values of the blocks of the partially recovered image, each element of which equals the number of zero singular values in the corresponding block. The elements with the smallest values are highlighted in the figure, which clearly shows the correspondence between the contours of the original image and the blocks containing the fewest zero singular values.

    Let the original JPEG-compressed image now be completely restored. This means that after partial recovery all pixel brightness values are rounded to integers and brought into the range [0, 255]. This action perturbs the image matrix obtained after partial recovery, and the number of zero singular values in the blocks changes in a certain way (Table 2.2). Where partial recovery produced no elements significantly less than 0 or greater than 255, the perturbation of the matrix is small. According to the relation

    |s_i(B + E) - s_i(B)| <= ||E||_2, (2.6)

    which holds for an arbitrary matrix B, where s_i(B) and s_i(B + E) are the singular values of the original and perturbed matrices respectively, E is the perturbation matrix of the block, and ||.||_2 is the spectral matrix norm, the singular values are insensitive to perturbations. If some zero singular values of the partially recovered image matrix become non-zero after full recovery (FR), their values will be comparable to the rounding error, which is not typical for blocks of the original image.

    Fig. 2.5. Original CELL.TIF image (a); matrix of numbers of zero singular values of the blocks after partial recovery (b); the same matrix after full recovery (c)

    The most noticeable difference between the uncompressed original image and the one fully restored after JPEG compression appears when their matrices of numbers of zero singular values are compared. A typical picture is shown in Fig. 2.5(c): for CELL.TIF this matrix contains only zero values after full recovery.
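    A minimal sketch of counting (near-)zero singular values in each 8x8 block of an image; the tolerance and the random test image are assumptions made for illustration:

        import numpy as np

        def zero_singular_values_per_block(image, block=8, tol=1e-6):
            """Count singular values below tol in each non-overlapping block."""
            h, w = image.shape
            counts = np.zeros((h // block, w // block), dtype=int)
            for i in range(0, h - h % block, block):
                for j in range(0, w - w % block, block):
                    s = np.linalg.svd(image[i:i + block, j:j + block], compute_uv=False)
                    counts[i // block, j // block] = int(np.sum(s < tol))
            return counts

        img = np.random.default_rng(2).integers(0, 256, (64, 64)).astype(float)
        print(zero_singular_values_per_block(img))   # random blocks are almost never rank-deficient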

    Table 2.2. Results of the singular value decomposition of blocks of fully reconstructed images originally stored in a lossless format (TIF). Rows: POUT, CAMERAMAN, TIRE, MOON, CELL; columns: the number of blocks with zero singular values and, for m = 8, ..., 0, the number of blocks with more than two zero singular values relative to the total number of blocks (in %); the numerical values are not reproduced here.

    Questions

    1. What does data compression mean? What is data redundancy?
    2. The main types of data redundancy.
    3. How is compression implemented through quantization?
    4. What is a low-rank image approximation? How is compression implemented using low-rank image approximations?
    5. What is the singular value decomposition of a matrix?
    6. What is the spectral decomposition of a matrix?
    7. The correspondence between digital image parameters in the spatial and frequency domains.
    8. The basic steps of JPEG digital image compression. Quantization matrices.
    9. Characteristic features of the singular values of digital image matrix blocks under JPEG compression.
    10. Partial and full recovery of a digital image after compression.

    It is easy to calculate that an uncompressed full-color image with a size of 2000 * 1000 pixels will have a size of about 6 megabytes. If we talk about images obtained from professional cameras or high-resolution scanners, their size can be even larger. Despite the rapid growth in the capacity of storage devices, various image compression algorithms are still very relevant.
    All existing algorithms can be divided into two large classes:

    • Lossless compression algorithms;
    • Lossy compression algorithms.
    When we talk about lossless compression, we mean that there is an algorithm inverse to the compression algorithm that restores the original image exactly. For lossy compression no such inverse algorithm exists; instead there is a reconstruction algorithm that produces an image which does not necessarily match the original exactly. The compression and reconstruction algorithms are chosen so as to achieve a high compression ratio while preserving the visual quality of the image.

    Lossless compression algorithms

    RLE algorithm
    All algorithms of the RLE family are based on a very simple idea: repeating groups of elements are replaced by a pair (number of repetitions, repeated element). Let us consider this algorithm using the example of a bit sequence in which groups of zeros and ones alternate, and the groups often contain more than one element. The sequence 11111 000000 11111111 00 then corresponds to the set of run lengths 5 6 8 2 (counting starts from ones). These run lengths also have to be encoded; we will assume that the number of repetitions ranges from 0 to 7, so 3 bits are enough to encode it. A run longer than 7 is split by inserting a zero-length run of the opposite bit, so the sequence above is encoded as 5 6 7 0 1 2. It is easy to calculate that the original sequence requires 21 bits, while in RLE-compressed form it takes 18 bits.
    Although this algorithm is very simple, its efficiency is relatively low. Moreover, in some cases its use leads not to a decrease but to an increase in the length of the sequence. Consider, for example, the sequence 111 0000 11111111 00. The corresponding RLE code is 3 4 7 0 1 2: the original sequence is 17 bits long, while the compressed one takes 18 bits.
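    To make the scheme above concrete, here is a minimal Python sketch (not part of the original text). It assumes, as in the examples, that counting starts from ones and that runs longer than 7 are split by inserting zero-length runs of the opposite bit.

    def rle_encode_bits(bits, max_run=7):
        """Encode a bit string as alternating run lengths, starting with ones."""
        runs = []
        current = '1'          # counting starts from the run of ones
        count = 0
        for b in bits:
            if b == current:
                count += 1
            else:
                runs.append(count)
                current = b
                count = 1
        runs.append(count)

        encoded = []
        for r in runs:
            while r > max_run:                 # split runs that do not fit into 3 bits
                encoded.extend([max_run, 0])   # max_run of this bit, 0 of the other
                r -= max_run
            encoded.append(r)
        return encoded

    print(rle_encode_bits('111110000001111111100'))   # -> [5, 6, 7, 0, 1, 2]
    print(rle_encode_bits('11100001111111100'))       # -> [3, 4, 7, 0, 1, 2]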
    This algorithm is most effective for black-and-white images. It is also often used as one of the intermediate stages in more complex compression algorithms.

    Dictionary algorithms

    The idea behind dictionary algorithms is to encode chains of elements of the original sequence using a special dictionary built from that sequence.
    There is a whole family of dictionary algorithms, but we will look at the most common one, LZW, named after its developers Lempel, Ziv and Welch.
    The dictionary in this algorithm is a table that is filled with coding chains as the algorithm runs. When the compressed code is decoded, the dictionary is restored automatically, so there is no need to transmit the dictionary along with the compressed code.
    The dictionary is initialized with all single-character chains, i.e. its first entries represent the alphabet in which we encode. During compression, the longest chain already present in the dictionary is searched for. Each time a chain is encountered that is not yet in the dictionary, it is added there, and the compressed code of the longest chain already present in the dictionary is output. In theory no restrictions are imposed on the size of the dictionary, but in practice it makes sense to limit it, since over time chains begin to appear that no longer occur in the text; in addition, every time the size of the table doubles, an extra bit has to be allocated for storing the compressed codes. To handle such situations, a special code is introduced that signals re-initialization of the table with all single-element chains.
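    As an illustration, here is a minimal Python sketch of the compression side (not taken from the source); the dictionary-size limit and the reset code mentioned above are omitted for brevity.

    def lzw_encode(text, alphabet):
        """LZW compression: the dictionary starts with all single characters
        and grows by one chain every time an unknown chain is met."""
        dictionary = {ch: i for i, ch in enumerate(alphabet)}
        result = []
        current = ''
        for ch in text:
            candidate = current + ch
            if candidate in dictionary:
                current = candidate                      # keep extending the chain
            else:
                result.append(dictionary[current])       # emit code of the longest known chain
                dictionary[candidate] = len(dictionary)  # remember the new chain
                current = ch
        if current:
            result.append(dictionary[current])
        return result

    print(lzw_encode('abcabcabc', 'abc'))   # -> [0, 1, 2, 3, 5, 4]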
    Let us look at an example of the compression algorithm. We will compress the string cuckoocuckoocuckoohood. Assume that the dictionary contains 32 positions, which means that each of its codes occupies 5 bits. Initially, the dictionary is filled as follows:

    This table exists both on the side of the one who compresses the information and on the side of the one who decompresses it. Now we will look at the compression process.

    The table shows how the dictionary is filled in. It is easy to calculate that the resulting compressed code takes 105 bits, while the original text (assuming 4 bits per character) takes 116 bits.
    In essence, decoding comes down to a direct lookup of the codes; it is important that the table be initialized in exactly the same way as during encoding. Now let us look at the decoding algorithm.


    We can fully determine the chain added to the dictionary at the i-th step only at step i+1: obviously, the i-th chain must end with the first character of the (i+1)-th chain. Thus, we have just figured out how to restore the dictionary. Of some interest is the situation when a sequence of the form cScSc is encoded, where c is a single character, S is a string, and the word cS is already in the dictionary. At first glance it may seem that the decoder cannot resolve this situation, but in fact all chains of this type always end with the same character they begin with.
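    A matching sketch of the decoding side (again only an illustration, not the source's code), with the cScSc case handled exactly as explained above:

    def lzw_decode(codes, alphabet):
        """LZW decompression: the dictionary is rebuilt one step behind the encoder."""
        dictionary = {i: ch for i, ch in enumerate(alphabet)}
        prev = dictionary[codes[0]]
        output = [prev]
        for code in codes[1:]:
            if code in dictionary:
                entry = dictionary[code]
            else:
                entry = prev + prev[0]   # the cScSc case: the chain ends with its first character
            output.append(entry)
            dictionary[len(dictionary)] = prev + entry[0]   # the chain the encoder added one step earlier
            prev = entry
        return ''.join(output)

    print(lzw_decode([0, 1, 2, 3, 5, 4], 'abc'))   # -> 'abcabcabc'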

    Statistical coding algorithms
    Algorithms in this family assign the shortest compressed codes to the most frequently occurring elements of the sequence. That is, sequences of the same length are encoded with compressed codes of different lengths, and the more often a sequence occurs, the shorter its compressed code.
    Huffman algorithm
    The Huffman algorithm allows you to construct prefix codes. We can think of prefix codes as paths in a binary tree: going from a node to its left child corresponds to a 0 in the code, and to its right child corresponds to a 1. If we label the leaves of the tree with the symbols to be encoded, we get a binary tree representation of the prefix code.
    Let us describe the algorithm for constructing a Huffman tree and obtaining Huffman codes.
    1. The characters of the input alphabet form a list of free nodes. Each leaf has a weight equal to the frequency of occurrence of the symbol.
    2. Two free tree nodes with the smallest weights are selected
    3. Their parent is created with a weight equal to their total weight
    4. The parent is added to the list of free nodes, and its two children are removed from this list
    5. One arc leaving the parent is assigned bit 1, the other is assigned bit 0
    6. Steps starting from the second are repeated until only one free node remains in the list of free nodes. This will be considered the root of the tree.
    Using this algorithm, we can obtain Huffman codes for a given alphabet, taking into account the frequency of occurrence of characters.
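    As an illustration of the steps above, here is a minimal Python sketch (not from the source). It builds the tree with a priority queue and then walks it, labelling left edges with 0 and right edges with 1; the frequencies are simply counted from the input string.

    import heapq
    from collections import Counter

    def huffman_codes(text):
        """Build Huffman codes by repeatedly merging the two lightest free nodes."""
        # heap items: (weight, tie-breaker, node); a node is a symbol or a (left, right) pair
        heap = [(w, i, ch) for i, (ch, w) in enumerate(Counter(text).items())]
        heapq.heapify(heap)
        tie = len(heap)
        while len(heap) > 1:
            w1, _, left = heapq.heappop(heap)
            w2, _, right = heapq.heappop(heap)
            heapq.heappush(heap, (w1 + w2, tie, (left, right)))
            tie += 1

        codes = {}
        def walk(node, code):
            if isinstance(node, tuple):          # internal node: descend left (0) and right (1)
                walk(node[0], code + '0')
                walk(node[1], code + '1')
            else:                                # leaf: store the accumulated code
                codes[node] = code or '0'        # single-symbol alphabet edge case
        _, _, root = heap[0]
        walk(root, '')
        return codes

    print(huffman_codes('abracadabra'))   # the frequent 'a' gets the shortest code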
    Arithmetic coding
    Arithmetic coding algorithms encode a string of elements as a fraction, taking into account the frequency distribution of the elements. At the moment, arithmetic coding algorithms are protected by patents, so we will only look at the basic idea.
    Let our alphabet consist of N symbols a1, ..., aN with frequencies of occurrence p1, ..., pN, respectively. Let us split the half-interval [0, 1)