• Home
  • Line#
  • Scopes#
  • Navigate#
  • Raw
  • Download
12.1.1
2=====
3
4### Significant changes relative to 2.1.0
5
61. Fixed a regression introduced in 2.1.0 that caused build failures with
7non-GCC-compatible compilers for Un*x/Arm platforms.
8
92. Fixed a regression introduced by 2.1 beta1[13] that prevented the Arm 32-bit
10(AArch32) Neon SIMD extensions from building unless the C compiler flags
11included `-mfloat-abi=softfp` or `-mfloat-abi=hard`.
12
133. Fixed an issue in the AArch32 Neon SIMD Huffman encoder whereby reliance on
14undefined C compiler behavior led to crashes ("SIGBUS: illegal alignment") on
15Android systems when running AArch32/Thumb builds of libjpeg-turbo built with
16recent versions of Clang.
17
18
192.1.0
20=====
21
22### Significant changes relative to 2.1 beta1
23
241. Fixed a regression introduced by 2.1 beta1[6(b)] whereby attempting to
25decompress certain progressive JPEG images with one or more component planes of
26width 8 or less caused a buffer overrun.
27
282. Fixed a regression introduced by 2.1 beta1[6(b)] whereby attempting to
29decompress a specially-crafted malformed progressive JPEG image caused the
30block smoothing algorithm to read from uninitialized memory.
31
323. Fixed an issue in the Arm Neon SIMD Huffman encoders that caused the
33encoders to generate incorrect results when using the Clang compiler with
34Visual Studio.
35
364. Fixed a floating point exception (CVE-2021-20205) that occurred when
37attempting to compress a specially-crafted malformed GIF image with a specified
38image width of 0 using cjpeg.
39
405. Fixed a regression introduced by 2.0 beta1[15] whereby attempting to
41generate a progressive JPEG image on an SSE2-capable CPU using a scan script
42containing one or more scans with lengths divisible by 32 and non-zero
43successive approximation low bit positions would, under certain circumstances,
44result in an error ("Missing Huffman code table entry") and an invalid JPEG
45image.
46
476. Introduced a new flag (`TJFLAG_LIMITSCANS` in the TurboJPEG C API and
48`TJ.FLAG_LIMIT_SCANS` in the TurboJPEG Java API) and a corresponding TJBench
49command-line argument (`-limitscans`) that causes the TurboJPEG decompression
50and transform functions/operations to return/throw an error if a progressive
51JPEG image contains an unreasonably large number of scans.  This allows
52applications that use the TurboJPEG API to guard against an exploit of the
53progressive JPEG format described in the report
54["Two Issues with the JPEG Standard"](https://libjpeg-turbo.org/pmwiki/uploads/About/TwoIssueswiththeJPEGStandard.pdf).
55
567. The PPM reader now throws an error, rather than segfaulting (due to a buffer
57overrun) or generating incorrect pixels, if an application attempts to use the
58`tjLoadImage()` function to load a 16-bit binary PPM file (a binary PPM file
59with a maximum value greater than 255) into a grayscale image buffer or to load
60a 16-bit binary PGM file into an RGB image buffer.
61
628. Fixed an issue in the PPM reader that caused incorrect pixels to be
63generated when using the `tjLoadImage()` function to load a 16-bit binary PPM
64file into an extended RGB image buffer.
65
669. Fixed an issue whereby, if a JPEG buffer was automatically re-allocated by
67one of the TurboJPEG compression or transform functions and an error
68subsequently occurred during compression or transformation, the JPEG buffer
69pointer passed by the application was not updated when the function returned.
70
71
722.0.90 (2.1 beta1)
73==================
74
75### Significant changes relative to 2.0.6:
76
771. The build system, x86-64 SIMD extensions, and accelerated Huffman codec now
78support the x32 ABI on Linux, which allows for using x86-64 instructions with
7932-bit pointers.  The x32 ABI is generally enabled by adding `-mx32` to the
80compiler flags.
81
82     Caveats:
83     - CMake 3.9.0 or later is required in order for the build system to
84automatically detect an x32 build.
85     - Java does not support the x32 ABI, and thus the TurboJPEG Java API will
86automatically be disabled with x32 builds.
87
882. Added Loongson MMI SIMD implementations of the RGB-to-grayscale, 4:2:2 fancy
89chroma upsampling, 4:2:2 and 4:2:0 merged chroma upsampling/color conversion,
90and fast integer DCT/IDCT algorithms.  Relative to libjpeg-turbo 2.0.x, this
91speeds up:
92
93     - the compression of RGB source images into grayscale JPEG images by
94approximately 20%
95     - the decompression of 4:2:2 JPEG images by approximately 40-60% when
96using fancy upsampling
97     - the decompression of 4:2:2 and 4:2:0 JPEG images by approximately
9815-20% when using merged upsampling
99     - the compression of RGB source images by approximately 30-45% when using
100the fast integer DCT
101     - the decompression of JPEG images into RGB destination images by
102approximately 2x when using the fast integer IDCT
103
104    The overall decompression speedup for RGB images is now approximately
1052.3-3.7x (compared to 2-3.5x with libjpeg-turbo 2.0.x.)
106
1073. 32-bit (Armv7 or Armv7s) iOS builds of libjpeg-turbo are no longer
108supported, and the libjpeg-turbo build system can no longer be used to package
109such builds.  32-bit iOS apps cannot run in iOS 11 and later, and the App Store
110no longer allows them.
111
1124. 32-bit (i386) OS X/macOS builds of libjpeg-turbo are no longer supported,
113and the libjpeg-turbo build system can no longer be used to package such
114builds.  32-bit Mac applications cannot run in macOS 10.15 "Catalina" and
115later, and the App Store no longer allows them.
116
1175. The SSE2 (x86 SIMD) and C Huffman encoding algorithms have been
118significantly optimized, resulting in a measured average overall compression
119speedup of 12-28% for 64-bit code and 22-52% for 32-bit code on various Intel
120and AMD CPUs, as well as a measured average overall compression speedup of
1210-23% on platforms that do not have a SIMD-accelerated Huffman encoding
122implementation.
123
1246. The block smoothing algorithm that is applied by default when decompressing
125progressive Huffman-encoded JPEG images has been improved in the following
126ways:
127
128     - The algorithm is now more fault-tolerant.  Previously, if a particular
129scan was incomplete, then the smoothing parameters for the incomplete scan
130would be applied to the entire output image, including the parts of the image
131that were generated by the prior (complete) scan.  Visually, this had the
132effect of removing block smoothing from lower-frequency scans if they were
133followed by an incomplete higher-frequency scan.  libjpeg-turbo now applies
134block smoothing parameters to each iMCU row based on which scan generated the
135pixels in that row, rather than always using the block smoothing parameters for
136the most recent scan.
137     - When applying block smoothing to DC scans, a Gaussian-like kernel with a
1385x5 window is used to reduce the "blocky" appearance.
139
1407. Added SIMD acceleration for progressive Huffman encoding on Arm platforms.
141This speeds up the compression of full-color progressive JPEGs by about 30-40%
142on average (relative to libjpeg-turbo 2.0.x) when using modern Arm CPUs.
143
1448. Added configure-time and run-time auto-detection of Loongson MMI SIMD
145instructions, so that the Loongson MMI SIMD extensions can be included in any
146MIPS64 libjpeg-turbo build.
147
1489. Added fault tolerance features to djpeg and jpegtran, mainly to demonstrate
149methods by which applications can guard against the exploits of the JPEG format
150described in the report
151["Two Issues with the JPEG Standard"](https://libjpeg-turbo.org/pmwiki/uploads/About/TwoIssueswiththeJPEGStandard.pdf).
152
153     - Both programs now accept a `-maxscans` argument, which can be used to
154limit the number of allowable scans in the input file.
155     - Both programs now accept a `-strict` argument, which can be used to
156treat all warnings as fatal.
157
15810. CMake package config files are now included for both the libjpeg and
159TurboJPEG API libraries.  This facilitates using libjpeg-turbo with CMake's
160`find_package()` function.  For example:
161
162        find_package(libjpeg-turbo CONFIG REQUIRED)
163
164        add_executable(libjpeg_program libjpeg_program.c)
165        target_link_libraries(libjpeg_program PUBLIC libjpeg-turbo::jpeg)
166
167        add_executable(libjpeg_program_static libjpeg_program.c)
168        target_link_libraries(libjpeg_program_static PUBLIC
169          libjpeg-turbo::jpeg-static)
170
171        add_executable(turbojpeg_program turbojpeg_program.c)
172        target_link_libraries(turbojpeg_program PUBLIC
173          libjpeg-turbo::turbojpeg)
174
175        add_executable(turbojpeg_program_static turbojpeg_program.c)
176        target_link_libraries(turbojpeg_program_static PUBLIC
177          libjpeg-turbo::turbojpeg-static)
178
17911. Since the Unisys LZW patent has long expired, cjpeg and djpeg can now
180read/write both LZW-compressed and uncompressed GIF files (feature ported from
181jpeg-6a and jpeg-9d.)
182
18312. jpegtran now includes the `-wipe` and `-drop` options from jpeg-9a and
184jpeg-9d, as well as the ability to expand the image size using the `-crop`
185option.  Refer to jpegtran.1 or usage.txt for more details.
186
18713. Added a complete intrinsics implementation of the Arm Neon SIMD extensions,
188thus providing SIMD acceleration on Arm platforms for all of the algorithms
189that are SIMD-accelerated on x86 platforms.  This new implementation is
190significantly faster in some cases than the old GAS implementation--
191depending on the algorithms used, the type of CPU core, and the compiler.  GCC,
192as of this writing, does not provide a full or optimal set of Neon intrinsics,
193so for performance reasons, the default when building libjpeg-turbo with GCC is
194to continue using the GAS implementation of the following algorithms:
195
196     - 32-bit RGB-to-YCbCr color conversion
197     - 32-bit fast and accurate inverse DCT
198     - 64-bit RGB-to-YCbCr and YCbCr-to-RGB color conversion
199     - 64-bit accurate forward and inverse DCT
200     - 64-bit Huffman encoding
201
202    A new CMake variable (`NEON_INTRINSICS`) can be used to override this
203default.
204
205    Since the new intrinsics implementation includes SIMD acceleration
206for merged upsampling/color conversion, 1.5.1[5] is no longer necessary and has
207been reverted.
208
20914. The Arm Neon SIMD extensions can now be built using Visual Studio.
210
21115. The build system can now be used to generate a universal x86-64 + Armv8
212libjpeg-turbo SDK package for both iOS and macOS.
213
214
2152.0.6
216=====
217
218### Significant changes relative to 2.0.5:
219
2201. Fixed "using JNI after critical get" errors that occurred on Android
221platforms when using any of the YUV encoding/compression/decompression/decoding
222methods in the TurboJPEG Java API.
223
2242. Fixed or worked around multiple issues with `jpeg_skip_scanlines()`:
225
226     - Fixed segfaults or "Corrupt JPEG data: premature end of data segment"
227errors in `jpeg_skip_scanlines()` that occurred when decompressing 4:2:2 or
2284:2:0 JPEG images using merged (non-fancy) upsampling/color conversion (that
229is, when setting `cinfo.do_fancy_upsampling` to `FALSE`.)  2.0.0[6] was a
230similar fix, but it did not cover all cases.
231     - `jpeg_skip_scanlines()` now throws an error if two-pass color
232quantization is enabled.  Two-pass color quantization never worked properly
233with `jpeg_skip_scanlines()`, and the issues could not readily be fixed.
234     - Fixed an issue whereby `jpeg_skip_scanlines()` always returned 0 when
235skipping past the end of an image.
236
2373. The Arm 64-bit (Armv8) Neon SIMD extensions can now be built using MinGW
238toolchains targetting Arm64 (AArch64) Windows binaries.
239
2404. Fixed unexpected visual artifacts that occurred when using
241`jpeg_crop_scanline()` and interblock smoothing while decompressing only the DC
242scan of a progressive JPEG image.
243
2445. Fixed an issue whereby libjpeg-turbo would not build if 12-bit-per-component
245JPEG support (`WITH_12BIT`) was enabled along with libjpeg v7 or libjpeg v8
246API/ABI emulation (`WITH_JPEG7` or `WITH_JPEG8`.)
247
248
2492.0.5
250=====
251
252### Significant changes relative to 2.0.4:
253
2541. Worked around issues in the MIPS DSPr2 SIMD extensions that caused failures
255in the libjpeg-turbo regression tests.  Specifically, the
256`jsimd_h2v1_downsample_dspr2()` and `jsimd_h2v2_downsample_dspr2()` functions
257in the MIPS DSPr2 SIMD extensions are now disabled until/unless they can be
258fixed, and other functions that are incompatible with big endian MIPS CPUs are
259disabled when building libjpeg-turbo for such CPUs.
260
2612. Fixed an oversight in the `TJCompressor.compress(int)` method in the
262TurboJPEG Java API that caused an error ("java.lang.IllegalStateException: No
263source image is associated with this instance") when attempting to use that
264method to compress a YUV image.
265
2663. Fixed an issue (CVE-2020-13790) in the PPM reader that caused a buffer
267overrun in cjpeg, TJBench, or the `tjLoadImage()` function if one of the values
268in a binary PPM/PGM input file exceeded the maximum value defined in the file's
269header and that maximum value was less than 255.  libjpeg-turbo 1.5.0 already
270included a similar fix for binary PPM/PGM files with maximum values greater
271than 255.
272
2734. The TurboJPEG API library's global error handler, which is used in functions
274such as `tjBufSize()` and `tjLoadImage()` that do not require a TurboJPEG
275instance handle, is now thread-safe on platforms that support thread-local
276storage.
277
278
2792.0.4
280=====
281
282### Significant changes relative to 2.0.3:
283
2841. Fixed a regression in the Windows packaging system (introduced by
2852.0 beta1[2]) whereby, if both the 64-bit libjpeg-turbo SDK for GCC and the
28664-bit libjpeg-turbo SDK for Visual C++ were installed on the same system, only
287one of them could be uninstalled.
288
2892. Fixed a signed integer overflow and subsequent segfault that occurred when
290attempting to decompress images with more than 715827882 pixels using the
29164-bit C version of TJBench.
292
2933. Fixed out-of-bounds write in `tjDecompressToYUV2()` and
294`tjDecompressToYUVPlanes()` (sometimes manifesting as a double free) that
295occurred when attempting to decompress grayscale JPEG images that were
296compressed with a sampling factor other than 1 (for instance, with
297`cjpeg -grayscale -sample 2x2`).
298
2994. Fixed a regression introduced by 2.0.2[5] that caused the TurboJPEG API to
300incorrectly identify some JPEG images with unusual sampling factors as 4:4:4
301JPEG images.  This was known to cause a buffer overflow when attempting to
302decompress some such images using `tjDecompressToYUV2()` or
303`tjDecompressToYUVPlanes()`.
304
3055. Fixed an issue (CVE-2020-17541), detected by ASan, whereby attempting to
306losslessly transform a specially-crafted malformed JPEG image containing an
307extremely-high-frequency coefficient block (junk image data that could never be
308generated by a legitimate JPEG compressor) could cause the Huffman encoder's
309local buffer to be overrun. (Refer to 1.4.0[9] and 1.4beta1[15].)  Given that
310the buffer overrun was fully contained within the stack and did not cause a
311segfault or other user-visible errant behavior, and given that the lossless
312transformer (unlike the decompressor) is not generally exposed to arbitrary
313data exploits, this issue did not likely pose a security risk.
314
3156. The Arm 64-bit (Armv8) Neon SIMD assembly code now stores constants in a
316separate read-only data section rather than in the text section, to support
317execute-only memory layouts.
318
319
3202.0.3
321=====
322
323### Significant changes relative to 2.0.2:
324
3251. Fixed "using JNI after critical get" errors that occurred on Android
326platforms when passing invalid arguments to certain methods in the TurboJPEG
327Java API.
328
3292. Fixed a regression in the SIMD feature detection code, introduced by
330the AVX2 SIMD extensions (2.0 beta1[1]), that was known to cause an illegal
331instruction exception, in rare cases, on CPUs that lack support for CPUID leaf
33207H (or on which the maximum CPUID leaf has been limited by way of a BIOS
333setting.)
334
3353. The 4:4:0 (h1v2) fancy (smooth) chroma upsampling algorithm in the
336decompressor now uses a similar bias pattern to that of the 4:2:2 (h2v1) fancy
337chroma upsampling algorithm, rounding up or down the upsampled result for
338alternate pixels rather than always rounding down.  This ensures that,
339regardless of whether a 4:2:2 JPEG image is rotated or transposed prior to
340decompression (in the frequency domain) or after decompression (in the spatial
341domain), the final image will be similar.
342
3434. Fixed an integer overflow and subsequent segfault that occurred when
344attempting to compress or decompress images with more than 1 billion pixels
345using the TurboJPEG API.
346
3475. Fixed a regression introduced by 2.0 beta1[15] whereby attempting to
348generate a progressive JPEG image on an SSE2-capable CPU using a scan script
349containing one or more scans with lengths divisible by 16 would result in an
350error ("Missing Huffman code table entry") and an invalid JPEG image.
351
3526. Fixed an issue whereby `tjDecodeYUV()` and `tjDecodeYUVPlanes()` would throw
353an error ("Invalid progressive parameters") or a warning ("Inconsistent
354progression sequence") if passed a TurboJPEG instance that was previously used
355to decompress a progressive JPEG image.
356
357
3582.0.2
359=====
360
361### Significant changes relative to 2.0.1:
362
3631. Fixed a regression introduced by 2.0.1[5] that prevented a runtime search
364path (rpath) from being embedded in the libjpeg-turbo shared libraries and
365executables for macOS and iOS.  This caused a fatal error of the form
366"dyld: Library not loaded" when attempting to use one of the executables,
367unless `DYLD_LIBRARY_PATH` was explicitly set to the location of the
368libjpeg-turbo shared libraries.
369
3702. Fixed an integer overflow and subsequent segfault (CVE-2018-20330) that
371occurred when attempting to load a BMP file with more than 1 billion pixels
372using the `tjLoadImage()` function.
373
3743. Fixed a buffer overrun (CVE-2018-19664) that occurred when attempting to
375decompress a specially-crafted malformed JPEG image to a 256-color BMP using
376djpeg.
377
3784. Fixed a floating point exception that occurred when attempting to
379decompress a specially-crafted malformed JPEG image with a specified image
380width or height of 0 using the C version of TJBench.
381
3825. The TurboJPEG API will now decompress 4:4:4 JPEG images with 2x1, 1x2, 3x1,
383or 1x3 luminance and chrominance sampling factors.  This is a non-standard way
384of specifying 1x subsampling (normally 4:4:4 JPEGs have 1x1 luminance and
385chrominance sampling factors), but the JPEG format and the libjpeg API both
386allow it.
387
3886. Fixed a regression introduced by 2.0 beta1[7] that caused djpeg to generate
389incorrect PPM images when used with the `-colors` option.
390
3917. Fixed an issue whereby a static build of libjpeg-turbo (a build in which
392`ENABLE_SHARED` is `0`) could not be installed using the Visual Studio IDE.
393
3948. Fixed a severe performance issue in the Loongson MMI SIMD extensions that
395occurred when compressing RGB images whose image rows were not 64-bit-aligned.
396
397
3982.0.1
399=====
400
401### Significant changes relative to 2.0.0:
402
4031. Fixed a regression introduced with the new CMake-based Un*x build system,
404whereby jconfig.h could cause compiler warnings of the form
405`"HAVE_*_H" redefined` if it was included by downstream Autotools-based
406projects that used `AC_CHECK_HEADERS()` to check for the existence of locale.h,
407stddef.h, or stdlib.h.
408
4092. The `jsimd_quantize_float_dspr2()` and `jsimd_convsamp_float_dspr2()`
410functions in the MIPS DSPr2 SIMD extensions are now disabled at compile time
411if the soft float ABI is enabled.  Those functions use instructions that are
412incompatible with the soft float ABI.
413
4143. Fixed a regression in the SIMD feature detection code, introduced by
415the AVX2 SIMD extensions (2.0 beta1[1]), that caused libjpeg-turbo to crash on
416Windows 7 if Service Pack 1 was not installed.
417
4184. Fixed out-of-bounds read in cjpeg that occurred when attempting to compress
419a specially-crafted malformed color-index (8-bit-per-sample) Targa file in
420which some of the samples (color indices) exceeded the bounds of the Targa
421file's color table.
422
4235. Fixed an issue whereby installing a fully static build of libjpeg-turbo
424(a build in which `CFLAGS` contains `-static` and `ENABLE_SHARED` is `0`) would
425fail with "No valid ELF RPATH or RUNPATH entry exists in the file."
426
427
4282.0.0
429=====
430
431### Significant changes relative to 2.0 beta1:
432
4331. The TurboJPEG API can now decompress CMYK JPEG images that have subsampled M
434and Y components (not to be confused with YCCK JPEG images, in which the C/M/Y
435components have been transformed into luma and chroma.)   Previously, an error
436was generated ("Could not determine subsampling type for JPEG image") when such
437an image was passed to `tjDecompressHeader3()`, `tjTransform()`,
438`tjDecompressToYUVPlanes()`, `tjDecompressToYUV2()`, or the equivalent Java
439methods.
440
4412. Fixed an issue (CVE-2018-11813) whereby a specially-crafted malformed input
442file (specifically, a file with a valid Targa header but incomplete pixel data)
443would cause cjpeg to generate a JPEG file that was potentially thousands of
444times larger than the input file.  The Targa reader in cjpeg was not properly
445detecting that the end of the input file had been reached prematurely, so after
446all valid pixels had been read from the input, the reader injected dummy pixels
447with values of 255 into the JPEG compressor until the number of pixels
448specified in the Targa header had been compressed.  The Targa reader in cjpeg
449now behaves like the PPM reader and aborts compression if the end of the input
450file is reached prematurely.  Because this issue only affected cjpeg and not
451the underlying library, and because it did not involve any out-of-bounds reads
452or other exploitable behaviors, it was not believed to represent a security
453threat.
454
4553. Fixed an issue whereby the `tjLoadImage()` and `tjSaveImage()` functions
456would produce a "Bogus message code" error message if the underlying bitmap and
457PPM readers/writers threw an error that was specific to the readers/writers
458(as opposed to a general libjpeg API error.)
459
4604. Fixed an issue (CVE-2018-1152) whereby a specially-crafted malformed BMP
461file, one in which the header specified an image width of 1073741824 pixels,
462would trigger a floating point exception (division by zero) in the
463`tjLoadImage()` function when attempting to load the BMP file into a
4644-component image buffer.
465
4665. Fixed an issue whereby certain combinations of calls to
467`jpeg_skip_scanlines()` and `jpeg_read_scanlines()` could trigger an infinite
468loop when decompressing progressive JPEG images that use vertical chroma
469subsampling (for instance, 4:2:0 or 4:4:0.)
470
4716. Fixed a segfault in `jpeg_skip_scanlines()` that occurred when decompressing
472a 4:2:2 or 4:2:0 JPEG image using the merged (non-fancy) upsampling algorithms
473(that is, when setting `cinfo.do_fancy_upsampling` to `FALSE`.)
474
4757. The new CMake-based build system will now disable the MIPS DSPr2 SIMD
476extensions if it detects that the compiler does not support DSPr2 instructions.
477
4788. Fixed out-of-bounds read in cjpeg (CVE-2018-14498) that occurred when
479attempting to compress a specially-crafted malformed color-index
480(8-bit-per-sample) BMP file in which some of the samples (color indices)
481exceeded the bounds of the BMP file's color table.
482
4839. Fixed a signed integer overflow in the progressive Huffman decoder, detected
484by the Clang and GCC undefined behavior sanitizers, that could be triggered by
485attempting to decompress a specially-crafted malformed JPEG image.  This issue
486did not pose a security threat, but removing the warning made it easier to
487detect actual security issues, should they arise in the future.
488
489
4901.5.90 (2.0 beta1)
491==================
492
493### Significant changes relative to 1.5.3:
494
4951. Added AVX2 SIMD implementations of the colorspace conversion, chroma
496downsampling and upsampling, integer quantization and sample conversion, and
497accurate integer DCT/IDCT algorithms.  When using the accurate integer DCT/IDCT
498algorithms on AVX2-equipped CPUs, the compression of RGB images is
499approximately 13-36% (avg. 22%) faster (relative to libjpeg-turbo 1.5.x) with
50064-bit code and 11-21% (avg. 17%) faster with 32-bit code, and the
501decompression of RGB images is approximately 9-35% (avg. 17%) faster with
50264-bit code and 7-17% (avg. 12%) faster with 32-bit code.  (As tested on a
5033 GHz Intel Core i7.  Actual mileage may vary.)
504
5052. Overhauled the build system to use CMake on all platforms, and removed the
506autotools-based build system.  This decision resulted from extensive
507discussions within the libjpeg-turbo community.  libjpeg-turbo traditionally
508used CMake only for Windows builds, but there was an increasing amount of
509demand to extend CMake support to other platforms.  However, because of the
510unique nature of our code base (the need to support different assemblers on
511each platform, the need for Java support, etc.), providing dual build systems
512as other OSS imaging libraries do (including libpng and libtiff) would have
513created a maintenance burden.  The use of CMake greatly simplifies some aspects
514of our build system, owing to CMake's built-in support for various assemblers,
515Java, and unit testing, as well as generally fewer quirks that have to be
516worked around in order to implement our packaging system.  Eliminating
517autotools puts our project slightly at odds with the traditional practices of
518the OSS community, since most "system libraries" tend to be built with
519autotools, but it is believed that the benefits of this move outweigh the
520risks.  In addition to providing a unified build environment, switching to
521CMake allows for the use of various build tools and IDEs that aren't supported
522under autotools, including XCode, Ninja, and Eclipse.  It also eliminates the
523need to install autotools via MacPorts/Homebrew on OS X and allows
524libjpeg-turbo to be configured without the use of a terminal/command prompt.
525Extensive testing was conducted to ensure that all features provided by the
526autotools-based build system are provided by the new build system.
527
5283. The libjpeg API in this version of libjpeg-turbo now includes two additional
529functions, `jpeg_read_icc_profile()` and `jpeg_write_icc_profile()`, that can
530be used to extract ICC profile data from a JPEG file while decompressing or to
531embed ICC profile data in a JPEG file while compressing or transforming.  This
532eliminates the need for downstream projects, such as color management libraries
533and browsers, to include their own glueware for accomplishing this.
534
5354. Improved error handling in the TurboJPEG API library:
536
537     - Introduced a new function (`tjGetErrorStr2()`) in the TurboJPEG C API
538that allows compression/decompression/transform error messages to be retrieved
539in a thread-safe manner.  Retrieving error messages from global functions, such
540as `tjInitCompress()` or `tjBufSize()`, is still thread-unsafe, but since those
541functions will only throw errors if passed an invalid argument or if a memory
542allocation failure occurs, thread safety is not as much of a concern.
543     - Introduced a new function (`tjGetErrorCode()`) in the TurboJPEG C API
544and a new method (`TJException.getErrorCode()`) in the TurboJPEG Java API that
545can be used to determine the severity of the last
546compression/decompression/transform error.  This allows applications to
547choose whether to ignore warnings (non-fatal errors) from the underlying
548libjpeg API or to treat them as fatal.
549     - Introduced a new flag (`TJFLAG_STOPONWARNING` in the TurboJPEG C API and
550`TJ.FLAG_STOPONWARNING` in the TurboJPEG Java API) that causes the library to
551immediately halt a compression/decompression/transform operation if it
552encounters a warning from the underlying libjpeg API (the default behavior is
553to allow the operation to complete unless a fatal error is encountered.)
554
5555. Introduced a new flag in the TurboJPEG C and Java APIs (`TJFLAG_PROGRESSIVE`
556and `TJ.FLAG_PROGRESSIVE`, respectively) that causes the library to use
557progressive entropy coding in JPEG images generated by compression and
558transform operations.  Additionally, a new transform option
559(`TJXOPT_PROGRESSIVE` in the C API and `TJTransform.OPT_PROGRESSIVE` in the
560Java API) has been introduced, allowing progressive entropy coding to be
561enabled for selected transforms in a multi-transform operation.
562
5636. Introduced a new transform option in the TurboJPEG API (`TJXOPT_COPYNONE` in
564the C API and `TJTransform.OPT_COPYNONE` in the Java API) that allows the
565copying of markers (including EXIF and ICC profile data) to be disabled for a
566particular transform.
567
5687. Added two functions to the TurboJPEG C API (`tjLoadImage()` and
569`tjSaveImage()`) that can be used to load/save a BMP or PPM/PGM image to/from a
570memory buffer with a specified pixel format and layout.  These functions
571replace the project-private (and slow) bmp API, which was previously used by
572TJBench, and they also provide a convenient way for first-time users of
573libjpeg-turbo to quickly develop a complete JPEG compression/decompression
574program.
575
5768. The TurboJPEG C API now includes a new convenience array (`tjAlphaOffset[]`)
577that contains the alpha component index for each pixel format (or -1 if the
578pixel format lacks an alpha component.)  The TurboJPEG Java API now includes a
579new method (`TJ.getAlphaOffset()`) that returns the same value.  In addition,
580the `tjRedOffset[]`, `tjGreenOffset[]`, and `tjBlueOffset[]` arrays-- and the
581corresponding `TJ.getRedOffset()`, `TJ.getGreenOffset()`, and
582`TJ.getBlueOffset()` methods-- now return -1 for `TJPF_GRAY`/`TJ.PF_GRAY`
583rather than 0.  This allows programs to easily determine whether a pixel format
584has red, green, blue, and alpha components.
585
5869. Added a new example (tjexample.c) that demonstrates the basic usage of the
587TurboJPEG C API.  This example mirrors the functionality of TJExample.java.
588Both files are now included in the libjpeg-turbo documentation.
589
59010. Fixed two signed integer overflows in the arithmetic decoder, detected by
591the Clang undefined behavior sanitizer, that could be triggered by attempting
592to decompress a specially-crafted malformed JPEG image.  These issues did not
593pose a security threat, but removing the warnings makes it easier to detect
594actual security issues, should they arise in the future.
595
59611. Fixed a bug in the merged 4:2:0 upsampling/dithered RGB565 color conversion
597algorithm that caused incorrect dithering in the output image.  This algorithm
598now produces bitwise-identical results to the unmerged algorithms.
599
60012. The SIMD function symbols for x86[-64]/ELF, MIPS/ELF, macOS/x86[-64] (if
601libjpeg-turbo is built with YASM), and iOS/Arm[64] builds are now private.
602This prevents those symbols from being exposed in applications or shared
603libraries that link statically with libjpeg-turbo.
604
60513. Added Loongson MMI SIMD implementations of the RGB-to-YCbCr and
606YCbCr-to-RGB colorspace conversion, 4:2:0 chroma downsampling, 4:2:0 fancy
607chroma upsampling, integer quantization, and accurate integer DCT/IDCT
608algorithms.  When using the accurate integer DCT/IDCT, this speeds up the
609compression of RGB images by approximately 70-100% and the decompression of RGB
610images by approximately 2-3.5x.
611
61214. Fixed a build error when building with older MinGW releases (regression
613caused by 1.5.1[7].)
614
61515. Added SIMD acceleration for progressive Huffman encoding on SSE2-capable
616x86 and x86-64 platforms.  This speeds up the compression of full-color
617progressive JPEGs by about 85-90% on average (relative to libjpeg-turbo 1.5.x)
618when using modern Intel and AMD CPUs.
619
620
6211.5.3
622=====
623
624### Significant changes relative to 1.5.2:
625
6261. Fixed a NullPointerException in the TurboJPEG Java wrapper that occurred
627when using the YUVImage constructor that creates an instance backed by separate
628image planes and allocates memory for the image planes.
629
6302. Fixed an issue whereby the Java version of TJUnitTest would fail when
631testing BufferedImage encoding/decoding on big endian systems.
632
6333. Fixed a segfault in djpeg that would occur if an output format other than
634PPM/PGM was selected along with the `-crop` option.  The `-crop` option now
635works with the GIF and Targa formats as well (unfortunately, it cannot be made
636to work with the BMP and RLE formats due to the fact that those output engines
637write scanlines in bottom-up order.)  djpeg will now exit gracefully if an
638output format other than PPM/PGM, GIF, or Targa is selected along with the
639`-crop` option.
640
6414. Fixed an issue (CVE-2017-15232) whereby `jpeg_skip_scanlines()` would
642segfault if color quantization was enabled.
643
6445. TJBench (both C and Java versions) will now display usage information if any
645command-line argument is unrecognized.  This prevents the program from silently
646ignoring typos.
647
6486. Fixed an access violation in tjbench.exe (Windows) that occurred when the
649program was used to decompress an existing JPEG image.
650
6517. Fixed an ArrayIndexOutOfBoundsException in the TJExample Java program that
652occurred when attempting to decompress a JPEG image that had been compressed
653with 4:1:1 chrominance subsampling.
654
6558. Fixed an issue whereby, when using `jpeg_skip_scanlines()` to skip to the
656end of a single-scan (non-progressive) image, subsequent calls to
657`jpeg_consume_input()` would return `JPEG_SUSPENDED` rather than
658`JPEG_REACHED_EOI`.
659
6609. `jpeg_crop_scanline()` now works correctly when decompressing grayscale JPEG
661images that were compressed with a sampling factor other than 1 (for instance,
662with `cjpeg -grayscale -sample 2x2`).
663
664
6651.5.2
666=====
667
668### Significant changes relative to 1.5.1:
669
6701. Fixed a regression introduced by 1.5.1[7] that prevented libjpeg-turbo from
671building with Android NDK platforms prior to android-21 (5.0).
672
6732. Fixed a regression introduced by 1.5.1[1] that prevented the MIPS DSPR2 SIMD
674code in libjpeg-turbo from building.
675
6763. Fixed a regression introduced by 1.5 beta1[11] that prevented the Java
677version of TJBench from outputting any reference images (the `-nowrite` switch
678was accidentally enabled by default.)
679
6804. libjpeg-turbo should now build and run with full AltiVec SIMD acceleration
681on PowerPC-based AmigaOS 4 and OpenBSD systems.
682
6835. Fixed build and runtime errors on Windows that occurred when building
684libjpeg-turbo with libjpeg v7 API/ABI emulation and the in-memory
685source/destination managers.  Due to an oversight, the `jpeg_skip_scanlines()`
686and `jpeg_crop_scanline()` functions were not being included in jpeg7.dll when
687libjpeg-turbo was built with `-DWITH_JPEG7=1` and `-DWITH_MEMSRCDST=1`.
688
6896. Fixed "Bogus virtual array access" error that occurred when using the
690lossless crop feature in jpegtran or the TurboJPEG API, if libjpeg-turbo was
691built with libjpeg v7 API/ABI emulation.  This was apparently a long-standing
692bug that has existed since the introduction of libjpeg v7/v8 API/ABI emulation
693in libjpeg-turbo v1.1.
694
6957. The lossless transform features in jpegtran and the TurboJPEG API will now
696always attempt to adjust the EXIF image width and height tags if the image size
697changed as a result of the transform.  This behavior has always existed when
698using libjpeg v8 API/ABI emulation.  It was supposed to be available with
699libjpeg v7 API/ABI emulation as well but did not work properly due to a bug.
700Furthermore, there was never any good reason not to enable it with libjpeg v6b
701API/ABI emulation, since the behavior is entirely internal.  Note that
702`-copy all` must be passed to jpegtran in order to transfer the EXIF tags from
703the source image to the destination image.
704
7058. Fixed several memory leaks in the TurboJPEG API library that could occur
706if the library was built with certain compilers and optimization levels
707(known to occur with GCC 4.x and clang with `-O1` and higher but not with
708GCC 5.x or 6.x) and one of the underlying libjpeg API functions threw an error
709after a TurboJPEG API function allocated a local buffer.
710
7119. The libjpeg-turbo memory manager will now honor the `max_memory_to_use`
712structure member in jpeg\_memory\_mgr, which can be set to the maximum amount
713of memory (in bytes) that libjpeg-turbo should use during decompression or
714multi-pass (including progressive) compression.  This limit can also be set
715using the `JPEGMEM` environment variable or using the `-maxmemory` switch in
716cjpeg/djpeg/jpegtran (refer to the respective man pages for more details.)
717This has been a documented feature of libjpeg since v5, but the
718`malloc()`/`free()` implementation of the memory manager (jmemnobs.c) never
719implemented the feature.  Restricting libjpeg-turbo's memory usage is useful
720for two reasons:  it allows testers to more easily work around the 2 GB limit
721in libFuzzer, and it allows developers of security-sensitive applications to
722more easily defend against one of the progressive JPEG exploits (LJT-01-004)
723identified in
724[this report](http://www.libjpeg-turbo.org/pmwiki/uploads/About/TwoIssueswiththeJPEGStandard.pdf).
725
72610. TJBench will now run each benchmark for 1 second prior to starting the
727timer, in order to improve the consistency of the results.  Furthermore, the
728`-warmup` option is now used to specify the amount of warmup time rather than
729the number of warmup iterations.
730
73111. Fixed an error (`short jump is out of range`) that occurred when assembling
732the 32-bit x86 SIMD extensions with NASM versions prior to 2.04.  This was a
733regression introduced by 1.5 beta1[12].
734
735
7361.5.1
737=====
738
739### Significant changes relative to 1.5.0:
740
7411. Previously, the undocumented `JSIMD_FORCE*` environment variables could be
742used to force-enable a particular SIMD instruction set if multiple instruction
743sets were available on a particular platform.  On x86 platforms, where CPU
744feature detection is bulletproof and multiple SIMD instruction sets are
745available, it makes sense for those environment variables to allow forcing the
746use of an instruction set only if that instruction set is available.  However,
747since the ARM implementations of libjpeg-turbo can only use one SIMD
748instruction set, and since their feature detection code is less bulletproof
749(parsing /proc/cpuinfo), it makes sense for the `JSIMD_FORCENEON` environment
750variable to bypass the feature detection code and really force the use of NEON
751instructions.  A new environment variable (`JSIMD_FORCEDSPR2`) was introduced
752in the MIPS implementation for the same reasons, and the existing
753`JSIMD_FORCENONE` environment variable was extended to that implementation.
754These environment variables provide a workaround for those attempting to test
755ARM and MIPS builds of libjpeg-turbo in QEMU, which passes through
756/proc/cpuinfo from the host system.
757
7582. libjpeg-turbo previously assumed that AltiVec instructions were always
759available on PowerPC platforms, which led to "illegal instruction" errors when
760running on PowerPC chips that lack AltiVec support (such as the older 7xx/G3
761and newer e5500 series.)  libjpeg-turbo now examines /proc/cpuinfo on
762Linux/Android systems and enables AltiVec instructions only if the CPU supports
763them.  It also now provides two environment variables, `JSIMD_FORCEALTIVEC` and
764`JSIMD_FORCENONE`, to force-enable and force-disable AltiVec instructions in
765environments where /proc/cpuinfo is an unreliable means of CPU feature
766detection (such as when running in QEMU.)  On OS X, libjpeg-turbo continues to
767assume that AltiVec support is always available, which means that libjpeg-turbo
768cannot be used with G3 Macs unless you set the environment variable
769`JSIMD_FORCENONE` to `1`.
770
7713. Fixed an issue whereby 64-bit ARM (AArch64) builds of libjpeg-turbo would
772crash when built with recent releases of the Clang/LLVM compiler.  This was
773caused by an ABI conformance issue in some of libjpeg-turbo's 64-bit NEON SIMD
774routines.  Those routines were incorrectly using 64-bit instructions to
775transfer a 32-bit JDIMENSION argument, whereas the ABI allows the upper
776(unused) 32 bits of a 32-bit argument's register to be undefined.  The new
777Clang/LLVM optimizer uses load combining to transfer multiple adjacent 32-bit
778structure members into a single 64-bit register, and this exposed the ABI
779conformance issue.
780
7814. Fancy upsampling is now supported when decompressing JPEG images that use
7824:4:0 (h1v2) chroma subsampling.  These images are generated when losslessly
783rotating or transposing JPEG images that use 4:2:2 (h2v1) chroma subsampling.
784The h1v2 fancy upsampling algorithm is not currently SIMD-accelerated.
785
7865. If merged upsampling isn't SIMD-accelerated but YCbCr-to-RGB conversion is,
787then libjpeg-turbo will now disable merged upsampling when decompressing YCbCr
788JPEG images into RGB or extended RGB output images.  This significantly speeds
789up the decompression of 4:2:0 and 4:2:2 JPEGs on ARM platforms if fancy
790upsampling is not used (for example, if the `-nosmooth` option to djpeg is
791specified.)
792
7936. The TurboJPEG API will now decompress 4:2:2 and 4:4:0 JPEG images with
7942x2 luminance sampling factors and 2x1 or 1x2 chrominance sampling factors.
795This is a non-standard way of specifying 2x subsampling (normally 4:2:2 JPEGs
796have 2x1 luminance and 1x1 chrominance sampling factors, and 4:4:0 JPEGs have
7971x2 luminance and 1x1 chrominance sampling factors), but the JPEG format and
798the libjpeg API both allow it.
799
8007. Fixed an unsigned integer overflow in the libjpeg memory manager, detected
801by the Clang undefined behavior sanitizer, that could be triggered by
802attempting to decompress a specially-crafted malformed JPEG image.  This issue
803affected only 32-bit code and did not pose a security threat, but removing the
804warning makes it easier to detect actual security issues, should they arise in
805the future.
806
8078. Fixed additional negative left shifts and other issues reported by the GCC
808and Clang undefined behavior sanitizers when attempting to decompress
809specially-crafted malformed JPEG images.  None of these issues posed a security
810threat, but removing the warnings makes it easier to detect actual security
811issues, should they arise in the future.
812
8139. Fixed an out-of-bounds array reference, introduced by 1.4.90[2] (partial
814image decompression) and detected by the Clang undefined behavior sanitizer,
815that could be triggered by a specially-crafted malformed JPEG image with more
816than four components.  Because the out-of-bounds reference was still within the
817same structure, it was not known to pose a security threat, but removing the
818warning makes it easier to detect actual security issues, should they arise in
819the future.
820
82110. Fixed another ABI conformance issue in the 64-bit ARM (AArch64) NEON SIMD
822code.  Some of the routines were incorrectly reading and storing data below the
823stack pointer, which caused segfaults in certain applications under specific
824circumstances.
825
826
8271.5.0
828=====
829
830### Significant changes relative to 1.5 beta1:
831
8321. Fixed an issue whereby a malformed motion-JPEG frame could cause the "fast
833path" of libjpeg-turbo's Huffman decoder to read from uninitialized memory.
834
8352. Added libjpeg-turbo version and build information to the global string table
836of the libjpeg and TurboJPEG API libraries.  This is a common practice in other
837infrastructure libraries, such as OpenSSL and libpng, because it makes it easy
838to examine an application binary and determine which version of the library the
839application was linked against.
840
8413. Fixed a couple of issues in the PPM reader that would cause buffer overruns
842in cjpeg if one of the values in a binary PPM/PGM input file exceeded the
843maximum value defined in the file's header and that maximum value was greater
844than 255.  libjpeg-turbo 1.4.2 already included a similar fix for ASCII PPM/PGM
845files.  Note that these issues were not security bugs, since they were confined
846to the cjpeg program and did not affect any of the libjpeg-turbo libraries.
847
8484. Fixed an issue whereby attempting to decompress a JPEG file with a corrupt
849header using the `tjDecompressToYUV2()` function would cause the function to
850abort without returning an error and, under certain circumstances, corrupt the
851stack.  This only occurred if `tjDecompressToYUV2()` was called prior to
852calling `tjDecompressHeader3()`, or if the return value from
853`tjDecompressHeader3()` was ignored (both cases represent incorrect usage of
854the TurboJPEG API.)
855
8565. Fixed an issue in the ARM 32-bit SIMD-accelerated Huffman encoder that
857prevented the code from assembling properly with clang.
858
8596. The `jpeg_stdio_src()`, `jpeg_mem_src()`, `jpeg_stdio_dest()`, and
860`jpeg_mem_dest()` functions in the libjpeg API will now throw an error if a
861source/destination manager has already been assigned to the compress or
862decompress object by a different function or by the calling program.  This
863prevents these functions from attempting to reuse a source/destination manager
864structure that was allocated elsewhere, because there is no way to ensure that
865it would be big enough to accommodate the new source/destination manager.
866
867
8681.4.90 (1.5 beta1)
869==================
870
871### Significant changes relative to 1.4.2:
872
8731. Added full SIMD acceleration for PowerPC platforms using AltiVec VMX
874(128-bit SIMD) instructions.  Although the performance of libjpeg-turbo on
875PowerPC was already good, due to the increased number of registers available
876to the compiler vs. x86, it was still possible to speed up compression by about
8773-4x and decompression by about 2-2.5x (relative to libjpeg v6b) through the
878use of AltiVec instructions.
879
8802. Added two new libjpeg API functions (`jpeg_skip_scanlines()` and
881`jpeg_crop_scanline()`) that can be used to partially decode a JPEG image.  See
882[libjpeg.txt](libjpeg.txt) for more details.
883
8843. The TJCompressor and TJDecompressor classes in the TurboJPEG Java API now
885implement the Closeable interface, so those classes can be used with a
886try-with-resources statement.
887
8884. The TurboJPEG Java classes now throw unchecked idiomatic exceptions
889(IllegalArgumentException, IllegalStateException) for unrecoverable errors
890caused by incorrect API usage, and those classes throw a new checked exception
891type (TJException) for errors that are passed through from the C library.
892
8935. Source buffers for the TurboJPEG C API functions, as well as the
894`jpeg_mem_src()` function in the libjpeg API, are now declared as const
895pointers.  This facilitates passing read-only buffers to those functions and
896ensures the caller that the source buffer will not be modified.  This should
897not create any backward API or ABI incompatibilities with prior libjpeg-turbo
898releases.
899
9006. The MIPS DSPr2 SIMD code can now be compiled to support either FR=0 or FR=1
901FPUs.
902
9037. Fixed additional negative left shifts and other issues reported by the GCC
904and Clang undefined behavior sanitizers.  Most of these issues affected only
90532-bit code, and none of them was known to pose a security threat, but removing
906the warnings makes it easier to detect actual security issues, should they
907arise in the future.
908
9098. Removed the unnecessary `.arch` directive from the ARM64 NEON SIMD code.
910This directive was preventing the code from assembling using the clang
911integrated assembler.
912
9139. Fixed a regression caused by 1.4.1[6] that prevented 32-bit and 64-bit
914libjpeg-turbo RPMs from being installed simultaneously on recent Red Hat/Fedora
915distributions.  This was due to the addition of a macro in jconfig.h that
916allows the Huffman codec to determine the word size at compile time.  Since
917that macro differs between 32-bit and 64-bit builds, this caused a conflict
918between the i386 and x86_64 RPMs (any differing files, other than executables,
919are not allowed when 32-bit and 64-bit RPMs are installed simultaneously.)
920Since the macro is used only internally, it has been moved into jconfigint.h.
921
92210. The x86-64 SIMD code can now be disabled at run time by setting the
923`JSIMD_FORCENONE` environment variable to `1` (the other SIMD implementations
924already had this capability.)
925
92611. Added a new command-line argument to TJBench (`-nowrite`) that prevents the
927benchmark from outputting any images.  This removes any potential operating
928system overhead that might be caused by lazy writes to disk and thus improves
929the consistency of the performance measurements.
930
93112. Added SIMD acceleration for Huffman encoding on SSE2-capable x86 and x86-64
932platforms.  This speeds up the compression of full-color JPEGs by about 10-15%
933on average (relative to libjpeg-turbo 1.4.x) when using modern Intel and AMD
934CPUs.  Additionally, this works around an issue in the clang optimizer that
935prevents it (as of this writing) from achieving the same performance as GCC
936when compiling the C version of the Huffman encoder
937(<https://llvm.org/bugs/show_bug.cgi?id=16035>).  For the purposes of
938benchmarking or regression testing, SIMD-accelerated Huffman encoding can be
939disabled by setting the `JSIMD_NOHUFFENC` environment variable to `1`.
940
94113. Added ARM 64-bit (ARMv8) NEON SIMD implementations of the commonly-used
942compression algorithms (including the accurate integer forward DCT and h2v2 &
943h2v1 downsampling algorithms, which are not accelerated in the 32-bit NEON
944implementation.)  This speeds up the compression of full-color JPEGs by about
94575% on average on a Cavium ThunderX processor and by about 2-2.5x on average on
946Cortex-A53 and Cortex-A57 cores.
947
94814. Added SIMD acceleration for Huffman encoding on NEON-capable ARM 32-bit
949and 64-bit platforms.
950
951    For 32-bit code, this speeds up the compression of full-color JPEGs by
952about 30% on average on a typical iOS device (iPhone 4S, Cortex-A9) and by
953about 6-7% on average on a typical Android device (Nexus 5X, Cortex-A53 and
954Cortex-A57), relative to libjpeg-turbo 1.4.x.  Note that the larger speedup
955under iOS is due to the fact that iOS builds use LLVM, which does not optimize
956the C Huffman encoder as well as GCC does.
957
958    For 64-bit code, NEON-accelerated Huffman encoding speeds up the
959compression of full-color JPEGs by about 40% on average on a typical iOS device
960(iPhone 5S, Apple A7) and by about 7-8% on average on a typical Android device
961(Nexus 5X, Cortex-A53 and Cortex-A57), in addition to the speedup described in
962[13] above.
963
964    For the purposes of benchmarking or regression testing, SIMD-accelerated
965Huffman encoding can be disabled by setting the `JSIMD_NOHUFFENC` environment
966variable to `1`.
967
96815. pkg-config (.pc) scripts are now included for both the libjpeg and
969TurboJPEG API libraries on Un*x systems.  Note that if a project's build system
970relies on these scripts, then it will not be possible to build that project
971with libjpeg or with a prior version of libjpeg-turbo.
972
97316. Optimized the ARM 64-bit (ARMv8) NEON SIMD decompression routines to
974improve performance on CPUs with in-order pipelines.  This speeds up the
975decompression of full-color JPEGs by nearly 2x on average on a Cavium ThunderX
976processor and by about 15% on average on a Cortex-A53 core.
977
97817. Fixed an issue in the accelerated Huffman decoder that could have caused
979the decoder to read past the end of the input buffer when a malformed,
980specially-crafted JPEG image was being decompressed.  In prior versions of
981libjpeg-turbo, the accelerated Huffman decoder was invoked (in most cases) only
982if there were > 128 bytes of data in the input buffer.  However, it is possible
983to construct a JPEG image in which a single Huffman block is over 430 bytes
984long, so this version of libjpeg-turbo activates the accelerated Huffman
985decoder only if there are > 512 bytes of data in the input buffer.
986
98718. Fixed a memory leak in tjunittest encountered when running the program
988with the `-yuv` option.
989
990
9911.4.2
992=====
993
994### Significant changes relative to 1.4.1:
995
9961. Fixed an issue whereby cjpeg would segfault if a Windows bitmap with a
997negative width or height was used as an input image (Windows bitmaps can have
998a negative height if they are stored in top-down order, but such files are
999rare and not supported by libjpeg-turbo.)
1000
10012. Fixed an issue whereby, under certain circumstances, libjpeg-turbo would
1002incorrectly encode certain JPEG images when quality=100 and the fast integer
1003forward DCT were used.  This was known to cause `make test` to fail when the
1004library was built with `-march=haswell` on x86 systems.
1005
10063. Fixed an issue whereby libjpeg-turbo would crash when built with the latest
1007& greatest development version of the Clang/LLVM compiler.  This was caused by
1008an x86-64 ABI conformance issue in some of libjpeg-turbo's 64-bit SSE2 SIMD
1009routines.  Those routines were incorrectly using a 64-bit `mov` instruction to
1010transfer a 32-bit JDIMENSION argument, whereas the x86-64 ABI allows the upper
1011(unused) 32 bits of a 32-bit argument's register to be undefined.  The new
1012Clang/LLVM optimizer uses load combining to transfer multiple adjacent 32-bit
1013structure members into a single 64-bit register, and this exposed the ABI
1014conformance issue.
1015
10164. Fixed a bug in the MIPS DSPr2 4:2:0 "plain" (non-fancy and non-merged)
1017upsampling routine that caused a buffer overflow (and subsequent segfault) when
1018decompressing a 4:2:0 JPEG image whose scaled output width was less than 16
1019pixels.  The "plain" upsampling routines are normally only used when
1020decompressing a non-YCbCr JPEG image, but they are also used when decompressing
1021a JPEG image whose scaled output height is 1.
1022
10235. Fixed various negative left shifts and other issues reported by the GCC and
1024Clang undefined behavior sanitizers.  None of these was known to pose a
1025security threat, but removing the warnings makes it easier to detect actual
1026security issues, should they arise in the future.
1027
1028
10291.4.1
1030=====
1031
1032### Significant changes relative to 1.4.0:
1033
10341. tjbench now properly handles CMYK/YCCK JPEG files.  Passing an argument of
1035`-cmyk` (instead of, for instance, `-rgb`) will cause tjbench to internally
1036convert the source bitmap to CMYK prior to compression, to generate YCCK JPEG
1037files, and to internally convert the decompressed CMYK pixels back to RGB after
1038decompression (the latter is done automatically if a CMYK or YCCK JPEG is
1039passed to tjbench as a source image.)  The CMYK<->RGB conversion operation is
1040not benchmarked.  NOTE: The quick & dirty CMYK<->RGB conversions that tjbench
1041uses are suitable for testing only.  Proper conversion between CMYK and RGB
1042requires a color management system.
1043
10442. `make test` now performs additional bitwise regression tests using tjbench,
1045mainly for the purpose of testing compression from/decompression to a subregion
1046of a larger image buffer.
1047
10483. `make test` no longer tests the regression of the floating point DCT/IDCT
1049by default, since the results of those tests can vary if the algorithms in
1050question are not implemented using SIMD instructions on a particular platform.
1051See the comments in [Makefile.am](Makefile.am) for information on how to
1052re-enable the tests and to specify an expected result for them based on the
1053particulars of your platform.
1054
10554. The NULL color conversion routines have been significantly optimized,
1056which speeds up the compression of RGB and CMYK JPEGs by 5-20% when using
105764-bit code and 0-3% when using 32-bit code, and the decompression of those
1058images by 10-30% when using 64-bit code and 3-12% when using 32-bit code.
1059
10605. Fixed an "illegal instruction" error that occurred when djpeg from a
1061SIMD-enabled libjpeg-turbo MIPS build was executed with the `-nosmooth` option
1062on a MIPS machine that lacked DSPr2 support.  The MIPS SIMD routines for h2v1
1063and h2v2 merged upsampling were not properly checking for the existence of
1064DSPr2.
1065
10666. Performance has been improved significantly on 64-bit non-Linux and
1067non-Windows platforms (generally 10-20% faster compression and 5-10% faster
1068decompression.)  Due to an oversight, the 64-bit version of the accelerated
1069Huffman codec was not being compiled in when libjpeg-turbo was built on
1070platforms other than Windows or Linux.  Oops.
1071
10727. Fixed an extremely rare bug in the Huffman encoder that caused 64-bit
1073builds of libjpeg-turbo to incorrectly encode a few specific test images when
1074quality=98, an optimized Huffman table, and the accurate integer forward DCT
1075were used.
1076
10778. The Windows (CMake) build system now supports building only static or only
1078shared libraries.  This is accomplished by adding either `-DENABLE_STATIC=0` or
1079`-DENABLE_SHARED=0` to the CMake command line.
1080
10819. TurboJPEG API functions will now return an error code if a warning is
1082triggered in the underlying libjpeg API.  For instance, if a JPEG file is
1083corrupt, the TurboJPEG decompression functions will attempt to decompress
1084as much of the image as possible, but those functions will now return -1 to
1085indicate that the decompression was not entirely successful.
1086
108710. Fixed a bug in the MIPS DSPr2 4:2:2 fancy upsampling routine that caused a
1088buffer overflow (and subsequent segfault) when decompressing a 4:2:2 JPEG image
1089in which the right-most MCU was 5 or 6 pixels wide.
1090
1091
10921.4.0
1093=====
1094
1095### Significant changes relative to 1.4 beta1:
1096
10971. Fixed a build issue on OS X PowerPC platforms (md5cmp failed to build
1098because OS X does not provide the `le32toh()` and `htole32()` functions.)
1099
11002. The non-SIMD RGB565 color conversion code did not work correctly on big
1101endian machines.  This has been fixed.
1102
11033. Fixed an issue in `tjPlaneSizeYUV()` whereby it would erroneously return 1
1104instead of -1 if `componentID` was > 0 and `subsamp` was `TJSAMP_GRAY`.
1105
11063. Fixed an issue in `tjBufSizeYUV2()` whereby it would erroneously return 0
1107instead of -1 if `width` was < 1.
1108
11095. The Huffman encoder now uses `clz` and `bsr` instructions for bit counting
1110on ARM64 platforms (see 1.4 beta1[5].)
1111
11126. The `close()` method in the TJCompressor and TJDecompressor Java classes is
1113now idempotent.  Previously, that method would call the native `tjDestroy()`
1114function even if the TurboJPEG instance had already been destroyed.  This
1115caused an exception to be thrown during finalization, if the `close()` method
1116had already been called.  The exception was caught, but it was still an
1117expensive operation.
1118
11197. The TurboJPEG API previously generated an error (`Could not determine
1120subsampling type for JPEG image`) when attempting to decompress grayscale JPEG
1121images that were compressed with a sampling factor other than 1 (for instance,
1122with `cjpeg -grayscale -sample 2x2`).  Subsampling technically has no meaning
1123with grayscale JPEGs, and thus the horizontal and vertical sampling factors
1124for such images are ignored by the decompressor.  However, the TurboJPEG API
1125was being too rigid and was expecting the sampling factors to be equal to 1
1126before it treated the image as a grayscale JPEG.
1127
11288. cjpeg, djpeg, and jpegtran now accept an argument of `-version`, which will
1129print the library version and exit.
1130
11319. Referring to 1.4 beta1[15], another extremely rare circumstance was
1132discovered under which the Huffman encoder's local buffer can be overrun
1133when a buffered destination manager is being used and an
1134extremely-high-frequency block (basically junk image data) is being encoded.
1135Even though the Huffman local buffer was increased from 128 bytes to 136 bytes
1136to address the previous issue, the new issue caused even the larger buffer to
1137be overrun.  Further analysis reveals that, in the absolute worst case (such as
1138setting alternating AC coefficients to 32767 and -32768 in the JPEG scanning
1139order), the Huffman encoder can produce encoded blocks that approach double the
1140size of the unencoded blocks.  Thus, the Huffman local buffer was increased to
1141256 bytes, which should prevent any such issue from re-occurring in the future.
1142
114310. The new `tjPlaneSizeYUV()`, `tjPlaneWidth()`, and `tjPlaneHeight()`
1144functions were not actually usable on any platform except OS X and Windows,
1145because those functions were not included in the libturbojpeg mapfile.  This
1146has been fixed.
1147
114811. Restored the `JPP()`, `JMETHOD()`, and `FAR` macros in the libjpeg-turbo
1149header files.  The `JPP()` and `JMETHOD()` macros were originally implemented
1150in libjpeg as a way of supporting non-ANSI compilers that lacked support for
1151prototype parameters.  libjpeg-turbo has never supported such compilers, but
1152some software packages still use the macros to define their own prototypes.
1153Similarly, libjpeg-turbo has never supported MS-DOS and other platforms that
1154have far symbols, but some software packages still use the `FAR` macro.  A
1155pretty good argument can be made that this is a bad practice on the part of the
1156software in question, but since this affects more than one package, it's just
1157easier to fix it here.
1158
115912. Fixed issues that were preventing the ARM 64-bit SIMD code from compiling
1160for iOS, and included an ARMv8 architecture in all of the binaries installed by
1161the "official" libjpeg-turbo SDK for OS X.
1162
1163
11641.3.90 (1.4 beta1)
1165==================
1166
1167### Significant changes relative to 1.3.1:
1168
11691. New features in the TurboJPEG API:
1170
1171     - YUV planar images can now be generated with an arbitrary line padding
1172(previously only 4-byte padding, which was compatible with X Video, was
1173supported.)
1174     - The decompress-to-YUV function has been extended to support image
1175scaling.
1176     - JPEG images can now be compressed from YUV planar source images.
1177     - YUV planar images can now be decoded into RGB or grayscale images.
1178     - 4:1:1 subsampling is now supported.  This is mainly included for
1179compatibility, since 4:1:1 is not fully accelerated in libjpeg-turbo and has no
1180significant advantages relative to 4:2:0.
1181     - CMYK images are now supported.  This feature allows CMYK source images
1182to be compressed to YCCK JPEGs and YCCK or CMYK JPEGs to be decompressed to
1183CMYK destination images.  Conversion between CMYK/YCCK and RGB or YUV images is
1184not supported.  Such conversion requires a color management system and is thus
1185out of scope for a codec library.
1186     - The handling of YUV images in the Java API has been significantly
1187refactored and should now be much more intuitive.
1188     - The Java API now supports encoding a YUV image from an arbitrary
1189position in a large image buffer.
1190     - All of the YUV functions now have a corresponding function that operates
1191on separate image planes instead of a unified image buffer.  This allows for
1192compressing/decoding from or decompressing/encoding to a subregion of a larger
1193YUV image.  It also allows for handling YUV formats that swap the order of the
1194U and V planes.
1195
11962. Added SIMD acceleration for DSPr2-capable MIPS platforms.  This speeds up
1197the compression of full-color JPEGs by 70-80% on such platforms and
1198decompression by 25-35%.
1199
12003. If an application attempts to decompress a Huffman-coded JPEG image whose
1201header does not contain Huffman tables, libjpeg-turbo will now insert the
1202default Huffman tables.  In order to save space, many motion JPEG video frames
1203are encoded without the default Huffman tables, so these frames can now be
1204successfully decompressed by libjpeg-turbo without additional work on the part
1205of the application.  An application can still override the Huffman tables, for
1206instance to re-use tables from a previous frame of the same video.
1207
12084. The Mac packaging system now uses pkgbuild and productbuild rather than
1209PackageMaker (which is obsolete and no longer supported.)  This means that
1210OS X 10.6 "Snow Leopard" or later must be used when packaging libjpeg-turbo,
1211although the packages produced can be installed on OS X 10.5 "Leopard" or
1212later.  OS X 10.4 "Tiger" is no longer supported.
1213
12145. The Huffman encoder now uses `clz` and `bsr` instructions for bit counting
1215on ARM platforms rather than a lookup table.  This reduces the memory footprint
1216by 64k, which may be important for some mobile applications.  Out of four
1217Android devices that were tested, two demonstrated a small overall performance
1218loss (~3-4% on average) with ARMv6 code and a small gain (also ~3-4%) with
1219ARMv7 code when enabling this new feature, but the other two devices
1220demonstrated a significant overall performance gain with both ARMv6 and ARMv7
1221code (~10-20%) when enabling the feature.  Actual mileage may vary.
1222
12236. Worked around an issue with Visual C++ 2010 and later that caused incorrect
1224pixels to be generated when decompressing a JPEG image to a 256-color bitmap,
1225if compiler optimization was enabled when libjpeg-turbo was built.  This caused
1226the regression tests to fail when doing a release build under Visual C++ 2010
1227and later.
1228
12297. Improved the accuracy and performance of the non-SIMD implementation of the
1230floating point inverse DCT (using code borrowed from libjpeg v8a and later.)
1231The accuracy of this implementation now matches the accuracy of the SSE/SSE2
1232implementation.  Note, however, that the floating point DCT/IDCT algorithms are
1233mainly a legacy feature.  They generally do not produce significantly better
1234accuracy than the accurate integer DCT/IDCT algorithms, and they are quite a
1235bit slower.
1236
12378. Added a new output colorspace (`JCS_RGB565`) to the libjpeg API that allows
1238for decompressing JPEG images into RGB565 (16-bit) pixels.  If dithering is not
1239used, then this code path is SIMD-accelerated on ARM platforms.
1240
12419. Numerous obsolete features, such as support for non-ANSI compilers and
1242support for the MS-DOS memory model, were removed from the libjpeg code,
1243greatly improving its readability and making it easier to maintain and extend.
1244
124510. Fixed a segfault that occurred when calling `output_message()` with
1246`msg_code` set to `JMSG_COPYRIGHT`.
1247
124811. Fixed an issue whereby wrjpgcom was allowing comments longer than 65k
1249characters to be passed on the command line, which was causing it to generate
1250incorrect JPEG files.
1251
125212. Fixed a bug in the build system that was causing the Windows version of
1253wrjpgcom to be built using the rdjpgcom source code.
1254
125513. Restored 12-bit-per-component JPEG support.  A 12-bit version of
1256libjpeg-turbo can now be built by passing an argument of `--with-12bit` to
1257configure (Unix) or `-DWITH_12BIT=1` to cmake (Windows.)  12-bit JPEG support
1258is included only for convenience.  Enabling this feature disables all of the
1259performance features in libjpeg-turbo, as well as arithmetic coding and the
1260TurboJPEG API.  The resulting library still contains the other libjpeg-turbo
1261features (such as the colorspace extensions), but in general, it performs no
1262faster than libjpeg v6b.
1263
126414. Added ARM 64-bit SIMD acceleration for the YCC-to-RGB color conversion
1265and IDCT algorithms (both are used during JPEG decompression.)  For unknown
1266reasons (probably related to clang), this code cannot currently be compiled for
1267iOS.
1268
126915. Fixed an extremely rare bug (CVE-2014-9092) that could cause the Huffman
1270encoder's local buffer to overrun when a very high-frequency MCU is compressed
1271using quality 100 and no subsampling, and when the JPEG output buffer is being
1272dynamically resized by the destination manager.  This issue was so rare that,
1273even with a test program specifically designed to make the bug occur (by
1274injecting random high-frequency YUV data into the compressor), it was
1275reproducible only once in about every 25 million iterations.
1276
127716. Fixed an oversight in the TurboJPEG C wrapper:  if any of the JPEG
1278compression functions was called repeatedly with the same
1279automatically-allocated destination buffer, then TurboJPEG would erroneously
1280assume that the `jpegSize` parameter was equal to the size of the buffer, when
1281in fact that parameter was probably equal to the size of the most recently
1282compressed JPEG image.  If the size of the previous JPEG image was not as large
1283as the current JPEG image, then TurboJPEG would unnecessarily reallocate the
1284destination buffer.
1285
1286
12871.3.1
1288=====
1289
1290### Significant changes relative to 1.3.0:
1291
12921. On Un*x systems, `make install` now installs the libjpeg-turbo libraries
1293into /opt/libjpeg-turbo/lib32 by default on any 32-bit system, not just x86,
1294and into /opt/libjpeg-turbo/lib64 by default on any 64-bit system, not just
1295x86-64.  You can override this by overriding either the `prefix` or `libdir`
1296configure variables.
1297
12982. The Windows installer now places a copy of the TurboJPEG DLLs in the same
1299directory as the rest of the libjpeg-turbo binaries.  This was mainly done
1300to support TurboVNC 1.3, which bundles the DLLs in its Windows installation.
1301When using a 32-bit version of CMake on 64-bit Windows, it is impossible to
1302access the c:\WINDOWS\system32 directory, which made it impossible for the
1303TurboVNC build scripts to bundle the 64-bit TurboJPEG DLL.
1304
13053. Fixed a bug whereby attempting to encode a progressive JPEG with arithmetic
1306entropy coding (by passing arguments of `-progressive -arithmetic` to cjpeg or
1307jpegtran, for instance) would result in an error, `Requested feature was
1308omitted at compile time`.
1309
13104. Fixed a couple of issues (CVE-2013-6629 and CVE-2013-6630) whereby malformed
1311JPEG images would cause libjpeg-turbo to use uninitialized memory during
1312decompression.
1313
13145. Fixed an error (`Buffer passed to JPEG library is too small`) that occurred
1315when calling the TurboJPEG YUV encoding function with a very small (< 5x5)
1316source image, and added a unit test to check for this error.
1317
13186. The Java classes should now build properly under Visual Studio 2010 and
1319later.
1320
13217. Fixed an issue that prevented SRPMs generated using the in-tree packaging
1322tools from being rebuilt on certain newer Linux distributions.
1323
13248. Numerous minor fixes to eliminate compilation and build/packaging system
1325warnings, fix cosmetic issues, improve documentation clarity, and other general
1326source cleanup.
1327
1328
13291.3.0
1330=====
1331
1332### Significant changes relative to 1.3 beta1:
1333
13341. `make test` now works properly on FreeBSD, and it no longer requires the
1335md5sum executable to be present on other Un*x platforms.
1336
13372. Overhauled the packaging system:
1338
1339     - To avoid conflict with vendor-supplied libjpeg-turbo packages, the
1340official RPMs and DEBs for libjpeg-turbo have been renamed to
1341"libjpeg-turbo-official".
1342     - The TurboJPEG libraries are now located under /opt/libjpeg-turbo in the
1343official Linux and Mac packages, to avoid conflict with vendor-supplied
1344packages and also to streamline the packaging system.
1345     - Release packages are now created with the directory structure defined
1346by the configure variables `prefix`, `bindir`, `libdir`, etc. (Un\*x) or by the
1347`CMAKE_INSTALL_PREFIX` variable (Windows.)  The exception is that the docs are
1348always located under the system default documentation directory on Un\*x and
1349Mac systems, and on Windows, the TurboJPEG DLL is always located in the Windows
1350system directory.
1351     - To avoid confusion, official libjpeg-turbo packages on Linux/Unix
1352platforms (except for Mac) will always install the 32-bit libraries in
1353/opt/libjpeg-turbo/lib32 and the 64-bit libraries in /opt/libjpeg-turbo/lib64.
1354     - Fixed an issue whereby, in some cases, the libjpeg-turbo executables on
1355Un*x systems were not properly linking with the shared libraries installed by
1356the same package.
1357     - Fixed an issue whereby building the "installer" target on Windows when
1358`WITH_JAVA=1` would fail if the TurboJPEG JAR had not been previously built.
1359     - Building the "install" target on Windows now installs files into the
1360same places that the installer does.
1361
13623. Fixed a Huffman encoder bug that prevented I/O suspension from working
1363properly.
1364
1365
13661.2.90 (1.3 beta1)
1367==================
1368
1369### Significant changes relative to 1.2.1:
1370
13711. Added support for additional scaling factors (3/8, 5/8, 3/4, 7/8, 9/8, 5/4,
137211/8, 3/2, 13/8, 7/4, 15/8, and 2) when decompressing.  Note that the IDCT will
1373not be SIMD-accelerated when using any of these new scaling factors.
1374
13752. The TurboJPEG dynamic library is now versioned.  It was not strictly
1376necessary to do so, because TurboJPEG uses versioned symbols, and if a function
1377changes in an ABI-incompatible way, that function is renamed and a legacy
1378function is provided to maintain backward compatibility.  However, certain
1379Linux distro maintainers have a policy against accepting any library that isn't
1380versioned.
1381
13823. Extended the TurboJPEG Java API so that it can be used to compress a JPEG
1383image from and decompress a JPEG image to an arbitrary position in a large
1384image buffer.
1385
13864. The `tjDecompressToYUV()` function now supports the `TJFLAG_FASTDCT` flag.
1387
13885. The 32-bit supplementary package for amd64 Debian systems now provides
1389symlinks in /usr/lib/i386-linux-gnu for the TurboJPEG libraries in /usr/lib32.
1390This allows those libraries to be used on MultiArch-compatible systems (such as
1391Ubuntu 11 and later) without setting the linker path.
1392
13936. The TurboJPEG Java wrapper should now find the JNI library on Mac systems
1394without having to pass `-Djava.library.path=/usr/lib` to java.
1395
13967. TJBench has been ported to Java to provide a convenient way of validating
1397the performance of the TurboJPEG Java API.  It can be run with
1398`java -cp turbojpeg.jar TJBench`.
1399
14008. cjpeg can now be used to generate JPEG files with the RGB colorspace
1401(feature ported from jpeg-8d.)
1402
14039. The width and height in the `-crop` argument passed to jpegtran can now be
1404suffixed with `f` to indicate that, when the upper left corner of the cropping
1405region is automatically moved to the nearest iMCU boundary, the bottom right
1406corner should be moved by the same amount.  In other words, this feature causes
1407jpegtran to strictly honor the specified width/height rather than the specified
1408bottom right corner (feature ported from jpeg-8d.)
1409
141010. JPEG files using the RGB colorspace can now be decompressed into grayscale
1411images (feature ported from jpeg-8d.)
1412
141311. Fixed a regression caused by 1.2.1[7] whereby the build would fail with
1414multiple "Mismatch in operand sizes" errors when attempting to build the x86
1415SIMD code with NASM 0.98.
1416
141712. The in-memory source/destination managers (`jpeg_mem_src()` and
1418`jpeg_mem_dest()`) are now included by default when building libjpeg-turbo with
1419libjpeg v6b or v7 emulation, so that programs can take advantage of these
1420functions without requiring the use of the backward-incompatible libjpeg v8
1421ABI.  The "age number" of the libjpeg-turbo library on Un*x systems has been
1422incremented by 1 to reflect this.  You can disable this feature with a
1423configure/CMake switch in order to retain strict API/ABI compatibility with the
1424libjpeg v6b or v7 API/ABI (or with previous versions of libjpeg-turbo.)  See
1425[README.md](README.md) for more details.
1426
142713. Added ARMv7s architecture to libjpeg.a and libturbojpeg.a in the official
1428libjpeg-turbo binary package for OS X, so that those libraries can be used to
1429build applications that leverage the faster CPUs in the iPhone 5 and iPad 4.
1430
1431
14321.2.1
1433=====
1434
1435### Significant changes relative to 1.2.0:
1436
14371. Creating or decoding a JPEG file that uses the RGB colorspace should now
1438properly work when the input or output colorspace is one of the libjpeg-turbo
1439colorspace extensions.
1440
14412. When libjpeg-turbo was built without SIMD support and merged (non-fancy)
1442upsampling was used along with an alpha-enabled colorspace during
1443decompression, the unused byte of the decompressed pixels was not being set to
14440xFF.  This has been fixed.  TJUnitTest has also been extended to test for the
1445correct behavior of the colorspace extensions when merged upsampling is used.
1446
14473. Fixed a bug whereby the libjpeg-turbo SSE2 SIMD code would not preserve the
1448upper 64 bits of xmm6 and xmm7 on Win64 platforms, which violated the Win64
1449calling conventions.
1450
14514. Fixed a regression (CVE-2012-2806) caused by 1.2.0[6] whereby decompressing
1452corrupt JPEG images (specifically, images in which the component count was
1453erroneously set to a large value) would cause libjpeg-turbo to segfault.
1454
14555. Worked around a severe performance issue with "Bobcat" (AMD Embedded APU)
1456processors.  The `MASKMOVDQU` instruction, which was used by the libjpeg-turbo
1457SSE2 SIMD code, is apparently implemented in microcode on AMD processors, and
1458it is painfully slow on Bobcat processors in particular.  Eliminating the use
1459of this instruction improved performance by an order of magnitude on Bobcat
1460processors and by a small amount (typically 5%) on AMD desktop processors.
1461
14626. Added SIMD acceleration for performing 4:2:2 upsampling on NEON-capable ARM
1463platforms.  This speeds up the decompression of 4:2:2 JPEGs by 20-25% on such
1464platforms.
1465
14667. Fixed a regression caused by 1.2.0[2] whereby, on Linux/x86 platforms
1467running the 32-bit SSE2 SIMD code in libjpeg-turbo, decompressing a 4:2:0 or
14684:2:2 JPEG image into a 32-bit (RGBX, BGRX, etc.) buffer without using fancy
1469upsampling would produce several incorrect columns of pixels at the right-hand
1470side of the output image if each row in the output image was not evenly
1471divisible by 16 bytes.
1472
14738. Fixed an issue whereby attempting to build the SIMD extensions with Xcode
14744.3 on OS X platforms would cause NASM to return numerous errors of the form
1475"'%define' expects a macro identifier".
1476
14779. Added flags to the TurboJPEG API that allow the caller to force the use of
1478either the fast or the accurate DCT/IDCT algorithms in the underlying codec.
1479
1480
14811.2.0
1482=====
1483
1484### Significant changes relative to 1.2 beta1:
1485
14861. Fixed build issue with YASM on Unix systems (the libjpeg-turbo build system
1487was not adding the current directory to the assembler include path, so YASM
1488was not able to find jsimdcfg.inc.)
1489
14902. Fixed out-of-bounds read in SSE2 SIMD code that occurred when decompressing
1491a JPEG image to a bitmap buffer whose size was not a multiple of 16 bytes.
1492This was more of an annoyance than an actual bug, since it did not cause any
1493actual run-time problems, but the issue showed up when running libjpeg-turbo in
1494valgrind.  See <http://crbug.com/72399> for more information.
1495
14963. Added a compile-time macro (`LIBJPEG_TURBO_VERSION`) that can be used to
1497check the version of libjpeg-turbo against which an application was compiled.
1498
14994. Added new RGBA/BGRA/ABGR/ARGB colorspace extension constants (libjpeg API)
1500and pixel formats (TurboJPEG API), which allow applications to specify that,
1501when decompressing to a 4-component RGB buffer, the unused byte should be set
1502to 0xFF so that it can be interpreted as an opaque alpha channel.
1503
15045. Fixed regression issue whereby DevIL failed to build against libjpeg-turbo
1505because libjpeg-turbo's distributed version of jconfig.h contained an `INLINE`
1506macro, which conflicted with a similar macro in DevIL.  This macro is used only
1507internally when building libjpeg-turbo, so it was moved into config.h.
1508
15096. libjpeg-turbo will now correctly decompress erroneous CMYK/YCCK JPEGs whose
1510K component is assigned a component ID of 1 instead of 4.  Although these files
1511are in violation of the spec, other JPEG implementations handle them
1512correctly.
1513
15147. Added ARMv6 and ARMv7 architectures to libjpeg.a and libturbojpeg.a in
1515the official libjpeg-turbo binary package for OS X, so that those libraries can
1516be used to build both OS X and iOS applications.
1517
1518
15191.1.90 (1.2 beta1)
1520==================
1521
1522### Significant changes relative to 1.1.1:
1523
15241. Added a Java wrapper for the TurboJPEG API.  See [java/README](java/README)
1525for more details.
1526
15272. The TurboJPEG API can now be used to scale down images during
1528decompression.
1529
15303. Added SIMD routines for RGB-to-grayscale color conversion, which
1531significantly improves the performance of grayscale JPEG compression from an
1532RGB source image.
1533
15344. Improved the performance of the C color conversion routines, which are used
1535on platforms for which SIMD acceleration is not available.
1536
15375. Added a function to the TurboJPEG API that performs lossless transforms.
1538This function is implemented using the same back end as jpegtran, but it
1539performs transcoding entirely in memory and allows multiple transforms and/or
1540crop operations to be batched together, so the source coefficients only need to
1541be read once.  This is useful when generating image tiles from a single source
1542JPEG.
1543
15446. Added tests for the new TurboJPEG scaled decompression and lossless
1545transform features to tjbench (the TurboJPEG benchmark, formerly called
1546"jpgtest".)
1547
15487. Added support for 4:4:0 (transposed 4:2:2) subsampling in TurboJPEG, which
1549was necessary in order for it to read 4:2:2 JPEG files that had been losslessly
1550transposed or rotated 90 degrees.
1551
15528. All legacy VirtualGL code has been re-factored, and this has allowed
1553libjpeg-turbo, in its entirety, to be re-licensed under a BSD-style license.
1554
15559. libjpeg-turbo can now be built with YASM.
1556
155710. Added SIMD acceleration for ARM Linux and iOS platforms that support
1558NEON instructions.
1559
156011. Refactored the TurboJPEG C API and documented it using Doxygen.  The
1561TurboJPEG 1.2 API uses pixel formats to define the size and component order of
1562the uncompressed source/destination images, and it includes a more efficient
1563version of `TJBUFSIZE()` that computes a worst-case JPEG size based on the
1564level of chrominance subsampling.  The refactored implementation of the
1565TurboJPEG API now uses the libjpeg memory source and destination managers,
1566which allows the TurboJPEG compressor to grow the JPEG buffer as necessary.
1567
156812. Eliminated errors in the output of jpegtran on Windows that occurred when
1569the application was invoked using I/O redirection
1570(`jpegtran <input.jpg >output.jpg`.)
1571
157213. The inclusion of libjpeg v7 and v8 emulation as well as arithmetic coding
1573support in libjpeg-turbo v1.1.0 introduced several new error constants in
1574jerror.h, and these were mistakenly enabled for all emulation modes, causing
1575the error enum in libjpeg-turbo to sometimes have different values than the
1576same enum in libjpeg.  This represents an ABI incompatibility, and it caused
1577problems with rare applications that took specific action based on a particular
1578error value.  The fix was to include the new error constants conditionally
1579based on whether libjpeg v7 or v8 emulation was enabled.
1580
158114. Fixed an issue whereby Windows applications that used libjpeg-turbo would
1582fail to compile if the Windows system headers were included before jpeglib.h.
1583This issue was caused by a conflict in the definition of the INT32 type.
1584
158515. Fixed 32-bit supplementary package for amd64 Debian systems, which was
1586broken by enhancements to the packaging system in 1.1.
1587
158816. When decompressing a JPEG image using an output colorspace of
1589`JCS_EXT_RGBX`, `JCS_EXT_BGRX`, `JCS_EXT_XBGR`, or `JCS_EXT_XRGB`,
1590libjpeg-turbo will now set the unused byte to 0xFF, which allows applications
1591to interpret that byte as an alpha channel (0xFF = opaque).
1592
1593
15941.1.1
1595=====
1596
1597### Significant changes relative to 1.1.0:
1598
15991. Fixed a 1-pixel error in row 0, column 21 of the luminance plane generated
1600by `tjEncodeYUV()`.
1601
16022. libjpeg-turbo's accelerated Huffman decoder previously ignored unexpected
1603markers found in the middle of the JPEG data stream during decompression.  It
1604will now hand off decoding of a particular block to the unaccelerated Huffman
1605decoder if an unexpected marker is found, so that the unaccelerated Huffman
1606decoder can generate an appropriate warning.
1607
16083. Older versions of MinGW64 prefixed symbol names with underscores by
1609default, which differed from the behavior of 64-bit Visual C++.  MinGW64 1.0
1610has adopted the behavior of 64-bit Visual C++ as the default, so to accommodate
1611this, the libjpeg-turbo SIMD function names are no longer prefixed with an
1612underscore when building with MinGW64.  This means that, when building
1613libjpeg-turbo with older versions of MinGW64, you will now have to add
1614`-fno-leading-underscore` to the `CFLAGS`.
1615
16164. Fixed a regression bug in the NSIS script that caused the Windows installer
1617build to fail when using the Visual Studio IDE.
1618
16195. Fixed a bug in `jpeg_read_coefficients()` whereby it would not initialize
1620`cinfo->image_width` and `cinfo->image_height` if libjpeg v7 or v8 emulation
1621was enabled.  This specifically caused the jpegoptim program to fail if it was
1622linked against a version of libjpeg-turbo that was built with libjpeg v7 or v8
1623emulation.
1624
16256. Eliminated excessive I/O overhead that occurred when reading BMP files in
1626cjpeg.
1627
16287. Eliminated errors in the output of cjpeg on Windows that occurred when the
1629application was invoked using I/O redirection (`cjpeg <inputfile >output.jpg`.)
1630
1631
16321.1.0
1633=====
1634
1635### Significant changes relative to 1.1 beta1:
1636
16371. The algorithm used by the SIMD quantization function cannot produce correct
1638results when the JPEG quality is >= 98 and the fast integer forward DCT is
1639used.  Thus, the non-SIMD quantization function is now used for those cases,
1640and libjpeg-turbo should now produce identical output to libjpeg v6b in all
1641cases.
1642
16432. Despite the above, the fast integer forward DCT still degrades somewhat for
1644JPEG qualities greater than 95, so the TurboJPEG wrapper will now automatically
1645use the accurate integer forward DCT when generating JPEG images of quality 96
1646or greater.  This reduces compression performance by as much as 15% for these
1647high-quality images but is necessary to ensure that the images are perceptually
1648lossless.  It also ensures that the library can avoid the performance pitfall
1649created by [1].
1650
16513. Ported jpgtest.cxx to pure C to avoid the need for a C++ compiler.
1652
16534. Fixed visual artifacts in grayscale JPEG compression caused by a typo in
1654the RGB-to-luminance lookup tables.
1655
16565. The Windows distribution packages now include the libjpeg run-time programs
1657(cjpeg, etc.)
1658
16596. All packages now include jpgtest.
1660
16617. The TurboJPEG dynamic library now uses versioned symbols.
1662
16638. Added two new TurboJPEG API functions, `tjEncodeYUV()` and
1664`tjDecompressToYUV()`, to replace the somewhat hackish `TJ_YUV` flag.
1665
1666
16671.0.90 (1.1 beta1)
1668==================
1669
1670### Significant changes relative to 1.0.1:
1671
16721. Added emulation of the libjpeg v7 and v8 APIs and ABIs.  See
1673[README.md](README.md) for more details.  This feature was sponsored by
1674CamTrace SAS.
1675
16762. Created a new CMake-based build system for the Visual C++ and MinGW builds.
1677
16783. Grayscale bitmaps can now be compressed from/decompressed to using the
1679TurboJPEG API.
1680
16814. jpgtest can now be used to test decompression performance with existing
1682JPEG images.
1683
16845. If the default install prefix (/opt/libjpeg-turbo) is used, then
1685`make install` now creates /opt/libjpeg-turbo/lib32 and
1686/opt/libjpeg-turbo/lib64 sym links to duplicate the behavior of the binary
1687packages.
1688
16896. All symbols in the libjpeg-turbo dynamic library are now versioned, even
1690when the library is built with libjpeg v6b emulation.
1691
16927. Added arithmetic encoding and decoding support (can be disabled with
1693configure or CMake options)
1694
16958. Added a `TJ_YUV` flag to the TurboJPEG API, which causes both the compressor
1696and decompressor to output planar YUV images.
1697
16989. Added an extended version of `tjDecompressHeader()` to the TurboJPEG API,
1699which allows the caller to determine the type of subsampling used in a JPEG
1700image.
1701
170210. Added further protections against invalid Huffman codes.
1703
1704
17051.0.1
1706=====
1707
1708### Significant changes relative to 1.0.0:
1709
17101. The Huffman decoder will now handle erroneous Huffman codes (for instance,
1711from a corrupt JPEG image.)  Previously, these would cause libjpeg-turbo to
1712crash under certain circumstances.
1713
17142. Fixed typo in SIMD dispatch routines that was causing 4:2:2 upsampling to
1715be used instead of 4:2:0 when decompressing JPEG images using SSE2 code.
1716
17173. The configure script will now automatically determine whether the
1718`INCOMPLETE_TYPES_BROKEN` macro should be defined.
1719
1720
17211.0.0
1722=====
1723
1724### Significant changes relative to 0.0.93:
1725
17261. 2983700: Further FreeBSD build tweaks (no longer necessary to specify
1727`--host` when configuring on a 64-bit system)
1728
17292. Created symlinks in the Unix/Linux packages so that the TurboJPEG
1730include file can always be found in /opt/libjpeg-turbo/include, the 32-bit
1731static libraries can always be found in /opt/libjpeg-turbo/lib32, and the
173264-bit static libraries can always be found in /opt/libjpeg-turbo/lib64.
1733
17343. The Unix/Linux distribution packages now include the libjpeg run-time
1735programs (cjpeg, etc.) and man pages.
1736
17374. Created a 32-bit supplementary package for amd64 Debian systems, which
1738contains just the 32-bit libjpeg-turbo libraries.
1739
17405. Moved the libraries from */lib32 to */lib in the i386 Debian package.
1741
17426. Include distribution package for Cygwin
1743
17447. No longer necessary to specify `--without-simd` on non-x86 architectures,
1745and unit tests now work on those architectures.
1746
1747
17480.0.93
1749======
1750
1751### Significant changes since 0.0.91:
1752
17531. 2982659: Fixed x86-64 build on FreeBSD systems
1754
17552. 2988188: Added support for Windows 64-bit systems
1756
1757
17580.0.91
1759======
1760
1761### Significant changes relative to 0.0.90:
1762
17631. Added documentation to .deb packages
1764
17652. 2968313: Fixed data corruption issues when decompressing large JPEG images
1766and/or using buffered I/O with the libjpeg-turbo decompressor
1767
1768
17690.0.90
1770======
1771
1772Initial release
1773