12.1.1 2===== 3 4### Significant changes relative to 2.1.0 5 61. Fixed a regression introduced in 2.1.0 that caused build failures with 7non-GCC-compatible compilers for Un*x/Arm platforms. 8 92. Fixed a regression introduced by 2.1 beta1[13] that prevented the Arm 32-bit 10(AArch32) Neon SIMD extensions from building unless the C compiler flags 11included `-mfloat-abi=softfp` or `-mfloat-abi=hard`. 12 133. Fixed an issue in the AArch32 Neon SIMD Huffman encoder whereby reliance on 14undefined C compiler behavior led to crashes ("SIGBUS: illegal alignment") on 15Android systems when running AArch32/Thumb builds of libjpeg-turbo built with 16recent versions of Clang. 17 18 192.1.0 20===== 21 22### Significant changes relative to 2.1 beta1 23 241. Fixed a regression introduced by 2.1 beta1[6(b)] whereby attempting to 25decompress certain progressive JPEG images with one or more component planes of 26width 8 or less caused a buffer overrun. 27 282. Fixed a regression introduced by 2.1 beta1[6(b)] whereby attempting to 29decompress a specially-crafted malformed progressive JPEG image caused the 30block smoothing algorithm to read from uninitialized memory. 31 323. Fixed an issue in the Arm Neon SIMD Huffman encoders that caused the 33encoders to generate incorrect results when using the Clang compiler with 34Visual Studio. 35 364. Fixed a floating point exception (CVE-2021-20205) that occurred when 37attempting to compress a specially-crafted malformed GIF image with a specified 38image width of 0 using cjpeg. 39 405. Fixed a regression introduced by 2.0 beta1[15] whereby attempting to 41generate a progressive JPEG image on an SSE2-capable CPU using a scan script 42containing one or more scans with lengths divisible by 32 and non-zero 43successive approximation low bit positions would, under certain circumstances, 44result in an error ("Missing Huffman code table entry") and an invalid JPEG 45image. 46 476. Introduced a new flag (`TJFLAG_LIMITSCANS` in the TurboJPEG C API and 48`TJ.FLAG_LIMIT_SCANS` in the TurboJPEG Java API) and a corresponding TJBench 49command-line argument (`-limitscans`) that causes the TurboJPEG decompression 50and transform functions/operations to return/throw an error if a progressive 51JPEG image contains an unreasonably large number of scans. This allows 52applications that use the TurboJPEG API to guard against an exploit of the 53progressive JPEG format described in the report 54["Two Issues with the JPEG Standard"](https://libjpeg-turbo.org/pmwiki/uploads/About/TwoIssueswiththeJPEGStandard.pdf). 55 567. The PPM reader now throws an error, rather than segfaulting (due to a buffer 57overrun) or generating incorrect pixels, if an application attempts to use the 58`tjLoadImage()` function to load a 16-bit binary PPM file (a binary PPM file 59with a maximum value greater than 255) into a grayscale image buffer or to load 60a 16-bit binary PGM file into an RGB image buffer. 61 628. Fixed an issue in the PPM reader that caused incorrect pixels to be 63generated when using the `tjLoadImage()` function to load a 16-bit binary PPM 64file into an extended RGB image buffer. 65 669. Fixed an issue whereby, if a JPEG buffer was automatically re-allocated by 67one of the TurboJPEG compression or transform functions and an error 68subsequently occurred during compression or transformation, the JPEG buffer 69pointer passed by the application was not updated when the function returned. 70 71 722.0.90 (2.1 beta1) 73================== 74 75### Significant changes relative to 2.0.6: 76 771. The build system, x86-64 SIMD extensions, and accelerated Huffman codec now 78support the x32 ABI on Linux, which allows for using x86-64 instructions with 7932-bit pointers. The x32 ABI is generally enabled by adding `-mx32` to the 80compiler flags. 81 82 Caveats: 83 - CMake 3.9.0 or later is required in order for the build system to 84automatically detect an x32 build. 85 - Java does not support the x32 ABI, and thus the TurboJPEG Java API will 86automatically be disabled with x32 builds. 87 882. Added Loongson MMI SIMD implementations of the RGB-to-grayscale, 4:2:2 fancy 89chroma upsampling, 4:2:2 and 4:2:0 merged chroma upsampling/color conversion, 90and fast integer DCT/IDCT algorithms. Relative to libjpeg-turbo 2.0.x, this 91speeds up: 92 93 - the compression of RGB source images into grayscale JPEG images by 94approximately 20% 95 - the decompression of 4:2:2 JPEG images by approximately 40-60% when 96using fancy upsampling 97 - the decompression of 4:2:2 and 4:2:0 JPEG images by approximately 9815-20% when using merged upsampling 99 - the compression of RGB source images by approximately 30-45% when using 100the fast integer DCT 101 - the decompression of JPEG images into RGB destination images by 102approximately 2x when using the fast integer IDCT 103 104 The overall decompression speedup for RGB images is now approximately 1052.3-3.7x (compared to 2-3.5x with libjpeg-turbo 2.0.x.) 106 1073. 32-bit (Armv7 or Armv7s) iOS builds of libjpeg-turbo are no longer 108supported, and the libjpeg-turbo build system can no longer be used to package 109such builds. 32-bit iOS apps cannot run in iOS 11 and later, and the App Store 110no longer allows them. 111 1124. 32-bit (i386) OS X/macOS builds of libjpeg-turbo are no longer supported, 113and the libjpeg-turbo build system can no longer be used to package such 114builds. 32-bit Mac applications cannot run in macOS 10.15 "Catalina" and 115later, and the App Store no longer allows them. 116 1175. The SSE2 (x86 SIMD) and C Huffman encoding algorithms have been 118significantly optimized, resulting in a measured average overall compression 119speedup of 12-28% for 64-bit code and 22-52% for 32-bit code on various Intel 120and AMD CPUs, as well as a measured average overall compression speedup of 1210-23% on platforms that do not have a SIMD-accelerated Huffman encoding 122implementation. 123 1246. The block smoothing algorithm that is applied by default when decompressing 125progressive Huffman-encoded JPEG images has been improved in the following 126ways: 127 128 - The algorithm is now more fault-tolerant. Previously, if a particular 129scan was incomplete, then the smoothing parameters for the incomplete scan 130would be applied to the entire output image, including the parts of the image 131that were generated by the prior (complete) scan. Visually, this had the 132effect of removing block smoothing from lower-frequency scans if they were 133followed by an incomplete higher-frequency scan. libjpeg-turbo now applies 134block smoothing parameters to each iMCU row based on which scan generated the 135pixels in that row, rather than always using the block smoothing parameters for 136the most recent scan. 137 - When applying block smoothing to DC scans, a Gaussian-like kernel with a 1385x5 window is used to reduce the "blocky" appearance. 139 1407. Added SIMD acceleration for progressive Huffman encoding on Arm platforms. 141This speeds up the compression of full-color progressive JPEGs by about 30-40% 142on average (relative to libjpeg-turbo 2.0.x) when using modern Arm CPUs. 143 1448. Added configure-time and run-time auto-detection of Loongson MMI SIMD 145instructions, so that the Loongson MMI SIMD extensions can be included in any 146MIPS64 libjpeg-turbo build. 147 1489. Added fault tolerance features to djpeg and jpegtran, mainly to demonstrate 149methods by which applications can guard against the exploits of the JPEG format 150described in the report 151["Two Issues with the JPEG Standard"](https://libjpeg-turbo.org/pmwiki/uploads/About/TwoIssueswiththeJPEGStandard.pdf). 152 153 - Both programs now accept a `-maxscans` argument, which can be used to 154limit the number of allowable scans in the input file. 155 - Both programs now accept a `-strict` argument, which can be used to 156treat all warnings as fatal. 157 15810. CMake package config files are now included for both the libjpeg and 159TurboJPEG API libraries. This facilitates using libjpeg-turbo with CMake's 160`find_package()` function. For example: 161 162 find_package(libjpeg-turbo CONFIG REQUIRED) 163 164 add_executable(libjpeg_program libjpeg_program.c) 165 target_link_libraries(libjpeg_program PUBLIC libjpeg-turbo::jpeg) 166 167 add_executable(libjpeg_program_static libjpeg_program.c) 168 target_link_libraries(libjpeg_program_static PUBLIC 169 libjpeg-turbo::jpeg-static) 170 171 add_executable(turbojpeg_program turbojpeg_program.c) 172 target_link_libraries(turbojpeg_program PUBLIC 173 libjpeg-turbo::turbojpeg) 174 175 add_executable(turbojpeg_program_static turbojpeg_program.c) 176 target_link_libraries(turbojpeg_program_static PUBLIC 177 libjpeg-turbo::turbojpeg-static) 178 17911. Since the Unisys LZW patent has long expired, cjpeg and djpeg can now 180read/write both LZW-compressed and uncompressed GIF files (feature ported from 181jpeg-6a and jpeg-9d.) 182 18312. jpegtran now includes the `-wipe` and `-drop` options from jpeg-9a and 184jpeg-9d, as well as the ability to expand the image size using the `-crop` 185option. Refer to jpegtran.1 or usage.txt for more details. 186 18713. Added a complete intrinsics implementation of the Arm Neon SIMD extensions, 188thus providing SIMD acceleration on Arm platforms for all of the algorithms 189that are SIMD-accelerated on x86 platforms. This new implementation is 190significantly faster in some cases than the old GAS implementation-- 191depending on the algorithms used, the type of CPU core, and the compiler. GCC, 192as of this writing, does not provide a full or optimal set of Neon intrinsics, 193so for performance reasons, the default when building libjpeg-turbo with GCC is 194to continue using the GAS implementation of the following algorithms: 195 196 - 32-bit RGB-to-YCbCr color conversion 197 - 32-bit fast and accurate inverse DCT 198 - 64-bit RGB-to-YCbCr and YCbCr-to-RGB color conversion 199 - 64-bit accurate forward and inverse DCT 200 - 64-bit Huffman encoding 201 202 A new CMake variable (`NEON_INTRINSICS`) can be used to override this 203default. 204 205 Since the new intrinsics implementation includes SIMD acceleration 206for merged upsampling/color conversion, 1.5.1[5] is no longer necessary and has 207been reverted. 208 20914. The Arm Neon SIMD extensions can now be built using Visual Studio. 210 21115. The build system can now be used to generate a universal x86-64 + Armv8 212libjpeg-turbo SDK package for both iOS and macOS. 213 214 2152.0.6 216===== 217 218### Significant changes relative to 2.0.5: 219 2201. Fixed "using JNI after critical get" errors that occurred on Android 221platforms when using any of the YUV encoding/compression/decompression/decoding 222methods in the TurboJPEG Java API. 223 2242. Fixed or worked around multiple issues with `jpeg_skip_scanlines()`: 225 226 - Fixed segfaults or "Corrupt JPEG data: premature end of data segment" 227errors in `jpeg_skip_scanlines()` that occurred when decompressing 4:2:2 or 2284:2:0 JPEG images using merged (non-fancy) upsampling/color conversion (that 229is, when setting `cinfo.do_fancy_upsampling` to `FALSE`.) 2.0.0[6] was a 230similar fix, but it did not cover all cases. 231 - `jpeg_skip_scanlines()` now throws an error if two-pass color 232quantization is enabled. Two-pass color quantization never worked properly 233with `jpeg_skip_scanlines()`, and the issues could not readily be fixed. 234 - Fixed an issue whereby `jpeg_skip_scanlines()` always returned 0 when 235skipping past the end of an image. 236 2373. The Arm 64-bit (Armv8) Neon SIMD extensions can now be built using MinGW 238toolchains targetting Arm64 (AArch64) Windows binaries. 239 2404. Fixed unexpected visual artifacts that occurred when using 241`jpeg_crop_scanline()` and interblock smoothing while decompressing only the DC 242scan of a progressive JPEG image. 243 2445. Fixed an issue whereby libjpeg-turbo would not build if 12-bit-per-component 245JPEG support (`WITH_12BIT`) was enabled along with libjpeg v7 or libjpeg v8 246API/ABI emulation (`WITH_JPEG7` or `WITH_JPEG8`.) 247 248 2492.0.5 250===== 251 252### Significant changes relative to 2.0.4: 253 2541. Worked around issues in the MIPS DSPr2 SIMD extensions that caused failures 255in the libjpeg-turbo regression tests. Specifically, the 256`jsimd_h2v1_downsample_dspr2()` and `jsimd_h2v2_downsample_dspr2()` functions 257in the MIPS DSPr2 SIMD extensions are now disabled until/unless they can be 258fixed, and other functions that are incompatible with big endian MIPS CPUs are 259disabled when building libjpeg-turbo for such CPUs. 260 2612. Fixed an oversight in the `TJCompressor.compress(int)` method in the 262TurboJPEG Java API that caused an error ("java.lang.IllegalStateException: No 263source image is associated with this instance") when attempting to use that 264method to compress a YUV image. 265 2663. Fixed an issue (CVE-2020-13790) in the PPM reader that caused a buffer 267overrun in cjpeg, TJBench, or the `tjLoadImage()` function if one of the values 268in a binary PPM/PGM input file exceeded the maximum value defined in the file's 269header and that maximum value was less than 255. libjpeg-turbo 1.5.0 already 270included a similar fix for binary PPM/PGM files with maximum values greater 271than 255. 272 2734. The TurboJPEG API library's global error handler, which is used in functions 274such as `tjBufSize()` and `tjLoadImage()` that do not require a TurboJPEG 275instance handle, is now thread-safe on platforms that support thread-local 276storage. 277 278 2792.0.4 280===== 281 282### Significant changes relative to 2.0.3: 283 2841. Fixed a regression in the Windows packaging system (introduced by 2852.0 beta1[2]) whereby, if both the 64-bit libjpeg-turbo SDK for GCC and the 28664-bit libjpeg-turbo SDK for Visual C++ were installed on the same system, only 287one of them could be uninstalled. 288 2892. Fixed a signed integer overflow and subsequent segfault that occurred when 290attempting to decompress images with more than 715827882 pixels using the 29164-bit C version of TJBench. 292 2933. Fixed out-of-bounds write in `tjDecompressToYUV2()` and 294`tjDecompressToYUVPlanes()` (sometimes manifesting as a double free) that 295occurred when attempting to decompress grayscale JPEG images that were 296compressed with a sampling factor other than 1 (for instance, with 297`cjpeg -grayscale -sample 2x2`). 298 2994. Fixed a regression introduced by 2.0.2[5] that caused the TurboJPEG API to 300incorrectly identify some JPEG images with unusual sampling factors as 4:4:4 301JPEG images. This was known to cause a buffer overflow when attempting to 302decompress some such images using `tjDecompressToYUV2()` or 303`tjDecompressToYUVPlanes()`. 304 3055. Fixed an issue (CVE-2020-17541), detected by ASan, whereby attempting to 306losslessly transform a specially-crafted malformed JPEG image containing an 307extremely-high-frequency coefficient block (junk image data that could never be 308generated by a legitimate JPEG compressor) could cause the Huffman encoder's 309local buffer to be overrun. (Refer to 1.4.0[9] and 1.4beta1[15].) Given that 310the buffer overrun was fully contained within the stack and did not cause a 311segfault or other user-visible errant behavior, and given that the lossless 312transformer (unlike the decompressor) is not generally exposed to arbitrary 313data exploits, this issue did not likely pose a security risk. 314 3156. The Arm 64-bit (Armv8) Neon SIMD assembly code now stores constants in a 316separate read-only data section rather than in the text section, to support 317execute-only memory layouts. 318 319 3202.0.3 321===== 322 323### Significant changes relative to 2.0.2: 324 3251. Fixed "using JNI after critical get" errors that occurred on Android 326platforms when passing invalid arguments to certain methods in the TurboJPEG 327Java API. 328 3292. Fixed a regression in the SIMD feature detection code, introduced by 330the AVX2 SIMD extensions (2.0 beta1[1]), that was known to cause an illegal 331instruction exception, in rare cases, on CPUs that lack support for CPUID leaf 33207H (or on which the maximum CPUID leaf has been limited by way of a BIOS 333setting.) 334 3353. The 4:4:0 (h1v2) fancy (smooth) chroma upsampling algorithm in the 336decompressor now uses a similar bias pattern to that of the 4:2:2 (h2v1) fancy 337chroma upsampling algorithm, rounding up or down the upsampled result for 338alternate pixels rather than always rounding down. This ensures that, 339regardless of whether a 4:2:2 JPEG image is rotated or transposed prior to 340decompression (in the frequency domain) or after decompression (in the spatial 341domain), the final image will be similar. 342 3434. Fixed an integer overflow and subsequent segfault that occurred when 344attempting to compress or decompress images with more than 1 billion pixels 345using the TurboJPEG API. 346 3475. Fixed a regression introduced by 2.0 beta1[15] whereby attempting to 348generate a progressive JPEG image on an SSE2-capable CPU using a scan script 349containing one or more scans with lengths divisible by 16 would result in an 350error ("Missing Huffman code table entry") and an invalid JPEG image. 351 3526. Fixed an issue whereby `tjDecodeYUV()` and `tjDecodeYUVPlanes()` would throw 353an error ("Invalid progressive parameters") or a warning ("Inconsistent 354progression sequence") if passed a TurboJPEG instance that was previously used 355to decompress a progressive JPEG image. 356 357 3582.0.2 359===== 360 361### Significant changes relative to 2.0.1: 362 3631. Fixed a regression introduced by 2.0.1[5] that prevented a runtime search 364path (rpath) from being embedded in the libjpeg-turbo shared libraries and 365executables for macOS and iOS. This caused a fatal error of the form 366"dyld: Library not loaded" when attempting to use one of the executables, 367unless `DYLD_LIBRARY_PATH` was explicitly set to the location of the 368libjpeg-turbo shared libraries. 369 3702. Fixed an integer overflow and subsequent segfault (CVE-2018-20330) that 371occurred when attempting to load a BMP file with more than 1 billion pixels 372using the `tjLoadImage()` function. 373 3743. Fixed a buffer overrun (CVE-2018-19664) that occurred when attempting to 375decompress a specially-crafted malformed JPEG image to a 256-color BMP using 376djpeg. 377 3784. Fixed a floating point exception that occurred when attempting to 379decompress a specially-crafted malformed JPEG image with a specified image 380width or height of 0 using the C version of TJBench. 381 3825. The TurboJPEG API will now decompress 4:4:4 JPEG images with 2x1, 1x2, 3x1, 383or 1x3 luminance and chrominance sampling factors. This is a non-standard way 384of specifying 1x subsampling (normally 4:4:4 JPEGs have 1x1 luminance and 385chrominance sampling factors), but the JPEG format and the libjpeg API both 386allow it. 387 3886. Fixed a regression introduced by 2.0 beta1[7] that caused djpeg to generate 389incorrect PPM images when used with the `-colors` option. 390 3917. Fixed an issue whereby a static build of libjpeg-turbo (a build in which 392`ENABLE_SHARED` is `0`) could not be installed using the Visual Studio IDE. 393 3948. Fixed a severe performance issue in the Loongson MMI SIMD extensions that 395occurred when compressing RGB images whose image rows were not 64-bit-aligned. 396 397 3982.0.1 399===== 400 401### Significant changes relative to 2.0.0: 402 4031. Fixed a regression introduced with the new CMake-based Un*x build system, 404whereby jconfig.h could cause compiler warnings of the form 405`"HAVE_*_H" redefined` if it was included by downstream Autotools-based 406projects that used `AC_CHECK_HEADERS()` to check for the existence of locale.h, 407stddef.h, or stdlib.h. 408 4092. The `jsimd_quantize_float_dspr2()` and `jsimd_convsamp_float_dspr2()` 410functions in the MIPS DSPr2 SIMD extensions are now disabled at compile time 411if the soft float ABI is enabled. Those functions use instructions that are 412incompatible with the soft float ABI. 413 4143. Fixed a regression in the SIMD feature detection code, introduced by 415the AVX2 SIMD extensions (2.0 beta1[1]), that caused libjpeg-turbo to crash on 416Windows 7 if Service Pack 1 was not installed. 417 4184. Fixed out-of-bounds read in cjpeg that occurred when attempting to compress 419a specially-crafted malformed color-index (8-bit-per-sample) Targa file in 420which some of the samples (color indices) exceeded the bounds of the Targa 421file's color table. 422 4235. Fixed an issue whereby installing a fully static build of libjpeg-turbo 424(a build in which `CFLAGS` contains `-static` and `ENABLE_SHARED` is `0`) would 425fail with "No valid ELF RPATH or RUNPATH entry exists in the file." 426 427 4282.0.0 429===== 430 431### Significant changes relative to 2.0 beta1: 432 4331. The TurboJPEG API can now decompress CMYK JPEG images that have subsampled M 434and Y components (not to be confused with YCCK JPEG images, in which the C/M/Y 435components have been transformed into luma and chroma.) Previously, an error 436was generated ("Could not determine subsampling type for JPEG image") when such 437an image was passed to `tjDecompressHeader3()`, `tjTransform()`, 438`tjDecompressToYUVPlanes()`, `tjDecompressToYUV2()`, or the equivalent Java 439methods. 440 4412. Fixed an issue (CVE-2018-11813) whereby a specially-crafted malformed input 442file (specifically, a file with a valid Targa header but incomplete pixel data) 443would cause cjpeg to generate a JPEG file that was potentially thousands of 444times larger than the input file. The Targa reader in cjpeg was not properly 445detecting that the end of the input file had been reached prematurely, so after 446all valid pixels had been read from the input, the reader injected dummy pixels 447with values of 255 into the JPEG compressor until the number of pixels 448specified in the Targa header had been compressed. The Targa reader in cjpeg 449now behaves like the PPM reader and aborts compression if the end of the input 450file is reached prematurely. Because this issue only affected cjpeg and not 451the underlying library, and because it did not involve any out-of-bounds reads 452or other exploitable behaviors, it was not believed to represent a security 453threat. 454 4553. Fixed an issue whereby the `tjLoadImage()` and `tjSaveImage()` functions 456would produce a "Bogus message code" error message if the underlying bitmap and 457PPM readers/writers threw an error that was specific to the readers/writers 458(as opposed to a general libjpeg API error.) 459 4604. Fixed an issue (CVE-2018-1152) whereby a specially-crafted malformed BMP 461file, one in which the header specified an image width of 1073741824 pixels, 462would trigger a floating point exception (division by zero) in the 463`tjLoadImage()` function when attempting to load the BMP file into a 4644-component image buffer. 465 4665. Fixed an issue whereby certain combinations of calls to 467`jpeg_skip_scanlines()` and `jpeg_read_scanlines()` could trigger an infinite 468loop when decompressing progressive JPEG images that use vertical chroma 469subsampling (for instance, 4:2:0 or 4:4:0.) 470 4716. Fixed a segfault in `jpeg_skip_scanlines()` that occurred when decompressing 472a 4:2:2 or 4:2:0 JPEG image using the merged (non-fancy) upsampling algorithms 473(that is, when setting `cinfo.do_fancy_upsampling` to `FALSE`.) 474 4757. The new CMake-based build system will now disable the MIPS DSPr2 SIMD 476extensions if it detects that the compiler does not support DSPr2 instructions. 477 4788. Fixed out-of-bounds read in cjpeg (CVE-2018-14498) that occurred when 479attempting to compress a specially-crafted malformed color-index 480(8-bit-per-sample) BMP file in which some of the samples (color indices) 481exceeded the bounds of the BMP file's color table. 482 4839. Fixed a signed integer overflow in the progressive Huffman decoder, detected 484by the Clang and GCC undefined behavior sanitizers, that could be triggered by 485attempting to decompress a specially-crafted malformed JPEG image. This issue 486did not pose a security threat, but removing the warning made it easier to 487detect actual security issues, should they arise in the future. 488 489 4901.5.90 (2.0 beta1) 491================== 492 493### Significant changes relative to 1.5.3: 494 4951. Added AVX2 SIMD implementations of the colorspace conversion, chroma 496downsampling and upsampling, integer quantization and sample conversion, and 497accurate integer DCT/IDCT algorithms. When using the accurate integer DCT/IDCT 498algorithms on AVX2-equipped CPUs, the compression of RGB images is 499approximately 13-36% (avg. 22%) faster (relative to libjpeg-turbo 1.5.x) with 50064-bit code and 11-21% (avg. 17%) faster with 32-bit code, and the 501decompression of RGB images is approximately 9-35% (avg. 17%) faster with 50264-bit code and 7-17% (avg. 12%) faster with 32-bit code. (As tested on a 5033 GHz Intel Core i7. Actual mileage may vary.) 504 5052. Overhauled the build system to use CMake on all platforms, and removed the 506autotools-based build system. This decision resulted from extensive 507discussions within the libjpeg-turbo community. libjpeg-turbo traditionally 508used CMake only for Windows builds, but there was an increasing amount of 509demand to extend CMake support to other platforms. However, because of the 510unique nature of our code base (the need to support different assemblers on 511each platform, the need for Java support, etc.), providing dual build systems 512as other OSS imaging libraries do (including libpng and libtiff) would have 513created a maintenance burden. The use of CMake greatly simplifies some aspects 514of our build system, owing to CMake's built-in support for various assemblers, 515Java, and unit testing, as well as generally fewer quirks that have to be 516worked around in order to implement our packaging system. Eliminating 517autotools puts our project slightly at odds with the traditional practices of 518the OSS community, since most "system libraries" tend to be built with 519autotools, but it is believed that the benefits of this move outweigh the 520risks. In addition to providing a unified build environment, switching to 521CMake allows for the use of various build tools and IDEs that aren't supported 522under autotools, including XCode, Ninja, and Eclipse. It also eliminates the 523need to install autotools via MacPorts/Homebrew on OS X and allows 524libjpeg-turbo to be configured without the use of a terminal/command prompt. 525Extensive testing was conducted to ensure that all features provided by the 526autotools-based build system are provided by the new build system. 527 5283. The libjpeg API in this version of libjpeg-turbo now includes two additional 529functions, `jpeg_read_icc_profile()` and `jpeg_write_icc_profile()`, that can 530be used to extract ICC profile data from a JPEG file while decompressing or to 531embed ICC profile data in a JPEG file while compressing or transforming. This 532eliminates the need for downstream projects, such as color management libraries 533and browsers, to include their own glueware for accomplishing this. 534 5354. Improved error handling in the TurboJPEG API library: 536 537 - Introduced a new function (`tjGetErrorStr2()`) in the TurboJPEG C API 538that allows compression/decompression/transform error messages to be retrieved 539in a thread-safe manner. Retrieving error messages from global functions, such 540as `tjInitCompress()` or `tjBufSize()`, is still thread-unsafe, but since those 541functions will only throw errors if passed an invalid argument or if a memory 542allocation failure occurs, thread safety is not as much of a concern. 543 - Introduced a new function (`tjGetErrorCode()`) in the TurboJPEG C API 544and a new method (`TJException.getErrorCode()`) in the TurboJPEG Java API that 545can be used to determine the severity of the last 546compression/decompression/transform error. This allows applications to 547choose whether to ignore warnings (non-fatal errors) from the underlying 548libjpeg API or to treat them as fatal. 549 - Introduced a new flag (`TJFLAG_STOPONWARNING` in the TurboJPEG C API and 550`TJ.FLAG_STOPONWARNING` in the TurboJPEG Java API) that causes the library to 551immediately halt a compression/decompression/transform operation if it 552encounters a warning from the underlying libjpeg API (the default behavior is 553to allow the operation to complete unless a fatal error is encountered.) 554 5555. Introduced a new flag in the TurboJPEG C and Java APIs (`TJFLAG_PROGRESSIVE` 556and `TJ.FLAG_PROGRESSIVE`, respectively) that causes the library to use 557progressive entropy coding in JPEG images generated by compression and 558transform operations. Additionally, a new transform option 559(`TJXOPT_PROGRESSIVE` in the C API and `TJTransform.OPT_PROGRESSIVE` in the 560Java API) has been introduced, allowing progressive entropy coding to be 561enabled for selected transforms in a multi-transform operation. 562 5636. Introduced a new transform option in the TurboJPEG API (`TJXOPT_COPYNONE` in 564the C API and `TJTransform.OPT_COPYNONE` in the Java API) that allows the 565copying of markers (including EXIF and ICC profile data) to be disabled for a 566particular transform. 567 5687. Added two functions to the TurboJPEG C API (`tjLoadImage()` and 569`tjSaveImage()`) that can be used to load/save a BMP or PPM/PGM image to/from a 570memory buffer with a specified pixel format and layout. These functions 571replace the project-private (and slow) bmp API, which was previously used by 572TJBench, and they also provide a convenient way for first-time users of 573libjpeg-turbo to quickly develop a complete JPEG compression/decompression 574program. 575 5768. The TurboJPEG C API now includes a new convenience array (`tjAlphaOffset[]`) 577that contains the alpha component index for each pixel format (or -1 if the 578pixel format lacks an alpha component.) The TurboJPEG Java API now includes a 579new method (`TJ.getAlphaOffset()`) that returns the same value. In addition, 580the `tjRedOffset[]`, `tjGreenOffset[]`, and `tjBlueOffset[]` arrays-- and the 581corresponding `TJ.getRedOffset()`, `TJ.getGreenOffset()`, and 582`TJ.getBlueOffset()` methods-- now return -1 for `TJPF_GRAY`/`TJ.PF_GRAY` 583rather than 0. This allows programs to easily determine whether a pixel format 584has red, green, blue, and alpha components. 585 5869. Added a new example (tjexample.c) that demonstrates the basic usage of the 587TurboJPEG C API. This example mirrors the functionality of TJExample.java. 588Both files are now included in the libjpeg-turbo documentation. 589 59010. Fixed two signed integer overflows in the arithmetic decoder, detected by 591the Clang undefined behavior sanitizer, that could be triggered by attempting 592to decompress a specially-crafted malformed JPEG image. These issues did not 593pose a security threat, but removing the warnings makes it easier to detect 594actual security issues, should they arise in the future. 595 59611. Fixed a bug in the merged 4:2:0 upsampling/dithered RGB565 color conversion 597algorithm that caused incorrect dithering in the output image. This algorithm 598now produces bitwise-identical results to the unmerged algorithms. 599 60012. The SIMD function symbols for x86[-64]/ELF, MIPS/ELF, macOS/x86[-64] (if 601libjpeg-turbo is built with YASM), and iOS/Arm[64] builds are now private. 602This prevents those symbols from being exposed in applications or shared 603libraries that link statically with libjpeg-turbo. 604 60513. Added Loongson MMI SIMD implementations of the RGB-to-YCbCr and 606YCbCr-to-RGB colorspace conversion, 4:2:0 chroma downsampling, 4:2:0 fancy 607chroma upsampling, integer quantization, and accurate integer DCT/IDCT 608algorithms. When using the accurate integer DCT/IDCT, this speeds up the 609compression of RGB images by approximately 70-100% and the decompression of RGB 610images by approximately 2-3.5x. 611 61214. Fixed a build error when building with older MinGW releases (regression 613caused by 1.5.1[7].) 614 61515. Added SIMD acceleration for progressive Huffman encoding on SSE2-capable 616x86 and x86-64 platforms. This speeds up the compression of full-color 617progressive JPEGs by about 85-90% on average (relative to libjpeg-turbo 1.5.x) 618when using modern Intel and AMD CPUs. 619 620 6211.5.3 622===== 623 624### Significant changes relative to 1.5.2: 625 6261. Fixed a NullPointerException in the TurboJPEG Java wrapper that occurred 627when using the YUVImage constructor that creates an instance backed by separate 628image planes and allocates memory for the image planes. 629 6302. Fixed an issue whereby the Java version of TJUnitTest would fail when 631testing BufferedImage encoding/decoding on big endian systems. 632 6333. Fixed a segfault in djpeg that would occur if an output format other than 634PPM/PGM was selected along with the `-crop` option. The `-crop` option now 635works with the GIF and Targa formats as well (unfortunately, it cannot be made 636to work with the BMP and RLE formats due to the fact that those output engines 637write scanlines in bottom-up order.) djpeg will now exit gracefully if an 638output format other than PPM/PGM, GIF, or Targa is selected along with the 639`-crop` option. 640 6414. Fixed an issue (CVE-2017-15232) whereby `jpeg_skip_scanlines()` would 642segfault if color quantization was enabled. 643 6445. TJBench (both C and Java versions) will now display usage information if any 645command-line argument is unrecognized. This prevents the program from silently 646ignoring typos. 647 6486. Fixed an access violation in tjbench.exe (Windows) that occurred when the 649program was used to decompress an existing JPEG image. 650 6517. Fixed an ArrayIndexOutOfBoundsException in the TJExample Java program that 652occurred when attempting to decompress a JPEG image that had been compressed 653with 4:1:1 chrominance subsampling. 654 6558. Fixed an issue whereby, when using `jpeg_skip_scanlines()` to skip to the 656end of a single-scan (non-progressive) image, subsequent calls to 657`jpeg_consume_input()` would return `JPEG_SUSPENDED` rather than 658`JPEG_REACHED_EOI`. 659 6609. `jpeg_crop_scanline()` now works correctly when decompressing grayscale JPEG 661images that were compressed with a sampling factor other than 1 (for instance, 662with `cjpeg -grayscale -sample 2x2`). 663 664 6651.5.2 666===== 667 668### Significant changes relative to 1.5.1: 669 6701. Fixed a regression introduced by 1.5.1[7] that prevented libjpeg-turbo from 671building with Android NDK platforms prior to android-21 (5.0). 672 6732. Fixed a regression introduced by 1.5.1[1] that prevented the MIPS DSPR2 SIMD 674code in libjpeg-turbo from building. 675 6763. Fixed a regression introduced by 1.5 beta1[11] that prevented the Java 677version of TJBench from outputting any reference images (the `-nowrite` switch 678was accidentally enabled by default.) 679 6804. libjpeg-turbo should now build and run with full AltiVec SIMD acceleration 681on PowerPC-based AmigaOS 4 and OpenBSD systems. 682 6835. Fixed build and runtime errors on Windows that occurred when building 684libjpeg-turbo with libjpeg v7 API/ABI emulation and the in-memory 685source/destination managers. Due to an oversight, the `jpeg_skip_scanlines()` 686and `jpeg_crop_scanline()` functions were not being included in jpeg7.dll when 687libjpeg-turbo was built with `-DWITH_JPEG7=1` and `-DWITH_MEMSRCDST=1`. 688 6896. Fixed "Bogus virtual array access" error that occurred when using the 690lossless crop feature in jpegtran or the TurboJPEG API, if libjpeg-turbo was 691built with libjpeg v7 API/ABI emulation. This was apparently a long-standing 692bug that has existed since the introduction of libjpeg v7/v8 API/ABI emulation 693in libjpeg-turbo v1.1. 694 6957. The lossless transform features in jpegtran and the TurboJPEG API will now 696always attempt to adjust the EXIF image width and height tags if the image size 697changed as a result of the transform. This behavior has always existed when 698using libjpeg v8 API/ABI emulation. It was supposed to be available with 699libjpeg v7 API/ABI emulation as well but did not work properly due to a bug. 700Furthermore, there was never any good reason not to enable it with libjpeg v6b 701API/ABI emulation, since the behavior is entirely internal. Note that 702`-copy all` must be passed to jpegtran in order to transfer the EXIF tags from 703the source image to the destination image. 704 7058. Fixed several memory leaks in the TurboJPEG API library that could occur 706if the library was built with certain compilers and optimization levels 707(known to occur with GCC 4.x and clang with `-O1` and higher but not with 708GCC 5.x or 6.x) and one of the underlying libjpeg API functions threw an error 709after a TurboJPEG API function allocated a local buffer. 710 7119. The libjpeg-turbo memory manager will now honor the `max_memory_to_use` 712structure member in jpeg\_memory\_mgr, which can be set to the maximum amount 713of memory (in bytes) that libjpeg-turbo should use during decompression or 714multi-pass (including progressive) compression. This limit can also be set 715using the `JPEGMEM` environment variable or using the `-maxmemory` switch in 716cjpeg/djpeg/jpegtran (refer to the respective man pages for more details.) 717This has been a documented feature of libjpeg since v5, but the 718`malloc()`/`free()` implementation of the memory manager (jmemnobs.c) never 719implemented the feature. Restricting libjpeg-turbo's memory usage is useful 720for two reasons: it allows testers to more easily work around the 2 GB limit 721in libFuzzer, and it allows developers of security-sensitive applications to 722more easily defend against one of the progressive JPEG exploits (LJT-01-004) 723identified in 724[this report](http://www.libjpeg-turbo.org/pmwiki/uploads/About/TwoIssueswiththeJPEGStandard.pdf). 725 72610. TJBench will now run each benchmark for 1 second prior to starting the 727timer, in order to improve the consistency of the results. Furthermore, the 728`-warmup` option is now used to specify the amount of warmup time rather than 729the number of warmup iterations. 730 73111. Fixed an error (`short jump is out of range`) that occurred when assembling 732the 32-bit x86 SIMD extensions with NASM versions prior to 2.04. This was a 733regression introduced by 1.5 beta1[12]. 734 735 7361.5.1 737===== 738 739### Significant changes relative to 1.5.0: 740 7411. Previously, the undocumented `JSIMD_FORCE*` environment variables could be 742used to force-enable a particular SIMD instruction set if multiple instruction 743sets were available on a particular platform. On x86 platforms, where CPU 744feature detection is bulletproof and multiple SIMD instruction sets are 745available, it makes sense for those environment variables to allow forcing the 746use of an instruction set only if that instruction set is available. However, 747since the ARM implementations of libjpeg-turbo can only use one SIMD 748instruction set, and since their feature detection code is less bulletproof 749(parsing /proc/cpuinfo), it makes sense for the `JSIMD_FORCENEON` environment 750variable to bypass the feature detection code and really force the use of NEON 751instructions. A new environment variable (`JSIMD_FORCEDSPR2`) was introduced 752in the MIPS implementation for the same reasons, and the existing 753`JSIMD_FORCENONE` environment variable was extended to that implementation. 754These environment variables provide a workaround for those attempting to test 755ARM and MIPS builds of libjpeg-turbo in QEMU, which passes through 756/proc/cpuinfo from the host system. 757 7582. libjpeg-turbo previously assumed that AltiVec instructions were always 759available on PowerPC platforms, which led to "illegal instruction" errors when 760running on PowerPC chips that lack AltiVec support (such as the older 7xx/G3 761and newer e5500 series.) libjpeg-turbo now examines /proc/cpuinfo on 762Linux/Android systems and enables AltiVec instructions only if the CPU supports 763them. It also now provides two environment variables, `JSIMD_FORCEALTIVEC` and 764`JSIMD_FORCENONE`, to force-enable and force-disable AltiVec instructions in 765environments where /proc/cpuinfo is an unreliable means of CPU feature 766detection (such as when running in QEMU.) On OS X, libjpeg-turbo continues to 767assume that AltiVec support is always available, which means that libjpeg-turbo 768cannot be used with G3 Macs unless you set the environment variable 769`JSIMD_FORCENONE` to `1`. 770 7713. Fixed an issue whereby 64-bit ARM (AArch64) builds of libjpeg-turbo would 772crash when built with recent releases of the Clang/LLVM compiler. This was 773caused by an ABI conformance issue in some of libjpeg-turbo's 64-bit NEON SIMD 774routines. Those routines were incorrectly using 64-bit instructions to 775transfer a 32-bit JDIMENSION argument, whereas the ABI allows the upper 776(unused) 32 bits of a 32-bit argument's register to be undefined. The new 777Clang/LLVM optimizer uses load combining to transfer multiple adjacent 32-bit 778structure members into a single 64-bit register, and this exposed the ABI 779conformance issue. 780 7814. Fancy upsampling is now supported when decompressing JPEG images that use 7824:4:0 (h1v2) chroma subsampling. These images are generated when losslessly 783rotating or transposing JPEG images that use 4:2:2 (h2v1) chroma subsampling. 784The h1v2 fancy upsampling algorithm is not currently SIMD-accelerated. 785 7865. If merged upsampling isn't SIMD-accelerated but YCbCr-to-RGB conversion is, 787then libjpeg-turbo will now disable merged upsampling when decompressing YCbCr 788JPEG images into RGB or extended RGB output images. This significantly speeds 789up the decompression of 4:2:0 and 4:2:2 JPEGs on ARM platforms if fancy 790upsampling is not used (for example, if the `-nosmooth` option to djpeg is 791specified.) 792 7936. The TurboJPEG API will now decompress 4:2:2 and 4:4:0 JPEG images with 7942x2 luminance sampling factors and 2x1 or 1x2 chrominance sampling factors. 795This is a non-standard way of specifying 2x subsampling (normally 4:2:2 JPEGs 796have 2x1 luminance and 1x1 chrominance sampling factors, and 4:4:0 JPEGs have 7971x2 luminance and 1x1 chrominance sampling factors), but the JPEG format and 798the libjpeg API both allow it. 799 8007. Fixed an unsigned integer overflow in the libjpeg memory manager, detected 801by the Clang undefined behavior sanitizer, that could be triggered by 802attempting to decompress a specially-crafted malformed JPEG image. This issue 803affected only 32-bit code and did not pose a security threat, but removing the 804warning makes it easier to detect actual security issues, should they arise in 805the future. 806 8078. Fixed additional negative left shifts and other issues reported by the GCC 808and Clang undefined behavior sanitizers when attempting to decompress 809specially-crafted malformed JPEG images. None of these issues posed a security 810threat, but removing the warnings makes it easier to detect actual security 811issues, should they arise in the future. 812 8139. Fixed an out-of-bounds array reference, introduced by 1.4.90[2] (partial 814image decompression) and detected by the Clang undefined behavior sanitizer, 815that could be triggered by a specially-crafted malformed JPEG image with more 816than four components. Because the out-of-bounds reference was still within the 817same structure, it was not known to pose a security threat, but removing the 818warning makes it easier to detect actual security issues, should they arise in 819the future. 820 82110. Fixed another ABI conformance issue in the 64-bit ARM (AArch64) NEON SIMD 822code. Some of the routines were incorrectly reading and storing data below the 823stack pointer, which caused segfaults in certain applications under specific 824circumstances. 825 826 8271.5.0 828===== 829 830### Significant changes relative to 1.5 beta1: 831 8321. Fixed an issue whereby a malformed motion-JPEG frame could cause the "fast 833path" of libjpeg-turbo's Huffman decoder to read from uninitialized memory. 834 8352. Added libjpeg-turbo version and build information to the global string table 836of the libjpeg and TurboJPEG API libraries. This is a common practice in other 837infrastructure libraries, such as OpenSSL and libpng, because it makes it easy 838to examine an application binary and determine which version of the library the 839application was linked against. 840 8413. Fixed a couple of issues in the PPM reader that would cause buffer overruns 842in cjpeg if one of the values in a binary PPM/PGM input file exceeded the 843maximum value defined in the file's header and that maximum value was greater 844than 255. libjpeg-turbo 1.4.2 already included a similar fix for ASCII PPM/PGM 845files. Note that these issues were not security bugs, since they were confined 846to the cjpeg program and did not affect any of the libjpeg-turbo libraries. 847 8484. Fixed an issue whereby attempting to decompress a JPEG file with a corrupt 849header using the `tjDecompressToYUV2()` function would cause the function to 850abort without returning an error and, under certain circumstances, corrupt the 851stack. This only occurred if `tjDecompressToYUV2()` was called prior to 852calling `tjDecompressHeader3()`, or if the return value from 853`tjDecompressHeader3()` was ignored (both cases represent incorrect usage of 854the TurboJPEG API.) 855 8565. Fixed an issue in the ARM 32-bit SIMD-accelerated Huffman encoder that 857prevented the code from assembling properly with clang. 858 8596. The `jpeg_stdio_src()`, `jpeg_mem_src()`, `jpeg_stdio_dest()`, and 860`jpeg_mem_dest()` functions in the libjpeg API will now throw an error if a 861source/destination manager has already been assigned to the compress or 862decompress object by a different function or by the calling program. This 863prevents these functions from attempting to reuse a source/destination manager 864structure that was allocated elsewhere, because there is no way to ensure that 865it would be big enough to accommodate the new source/destination manager. 866 867 8681.4.90 (1.5 beta1) 869================== 870 871### Significant changes relative to 1.4.2: 872 8731. Added full SIMD acceleration for PowerPC platforms using AltiVec VMX 874(128-bit SIMD) instructions. Although the performance of libjpeg-turbo on 875PowerPC was already good, due to the increased number of registers available 876to the compiler vs. x86, it was still possible to speed up compression by about 8773-4x and decompression by about 2-2.5x (relative to libjpeg v6b) through the 878use of AltiVec instructions. 879 8802. Added two new libjpeg API functions (`jpeg_skip_scanlines()` and 881`jpeg_crop_scanline()`) that can be used to partially decode a JPEG image. See 882[libjpeg.txt](libjpeg.txt) for more details. 883 8843. The TJCompressor and TJDecompressor classes in the TurboJPEG Java API now 885implement the Closeable interface, so those classes can be used with a 886try-with-resources statement. 887 8884. The TurboJPEG Java classes now throw unchecked idiomatic exceptions 889(IllegalArgumentException, IllegalStateException) for unrecoverable errors 890caused by incorrect API usage, and those classes throw a new checked exception 891type (TJException) for errors that are passed through from the C library. 892 8935. Source buffers for the TurboJPEG C API functions, as well as the 894`jpeg_mem_src()` function in the libjpeg API, are now declared as const 895pointers. This facilitates passing read-only buffers to those functions and 896ensures the caller that the source buffer will not be modified. This should 897not create any backward API or ABI incompatibilities with prior libjpeg-turbo 898releases. 899 9006. The MIPS DSPr2 SIMD code can now be compiled to support either FR=0 or FR=1 901FPUs. 902 9037. Fixed additional negative left shifts and other issues reported by the GCC 904and Clang undefined behavior sanitizers. Most of these issues affected only 90532-bit code, and none of them was known to pose a security threat, but removing 906the warnings makes it easier to detect actual security issues, should they 907arise in the future. 908 9098. Removed the unnecessary `.arch` directive from the ARM64 NEON SIMD code. 910This directive was preventing the code from assembling using the clang 911integrated assembler. 912 9139. Fixed a regression caused by 1.4.1[6] that prevented 32-bit and 64-bit 914libjpeg-turbo RPMs from being installed simultaneously on recent Red Hat/Fedora 915distributions. This was due to the addition of a macro in jconfig.h that 916allows the Huffman codec to determine the word size at compile time. Since 917that macro differs between 32-bit and 64-bit builds, this caused a conflict 918between the i386 and x86_64 RPMs (any differing files, other than executables, 919are not allowed when 32-bit and 64-bit RPMs are installed simultaneously.) 920Since the macro is used only internally, it has been moved into jconfigint.h. 921 92210. The x86-64 SIMD code can now be disabled at run time by setting the 923`JSIMD_FORCENONE` environment variable to `1` (the other SIMD implementations 924already had this capability.) 925 92611. Added a new command-line argument to TJBench (`-nowrite`) that prevents the 927benchmark from outputting any images. This removes any potential operating 928system overhead that might be caused by lazy writes to disk and thus improves 929the consistency of the performance measurements. 930 93112. Added SIMD acceleration for Huffman encoding on SSE2-capable x86 and x86-64 932platforms. This speeds up the compression of full-color JPEGs by about 10-15% 933on average (relative to libjpeg-turbo 1.4.x) when using modern Intel and AMD 934CPUs. Additionally, this works around an issue in the clang optimizer that 935prevents it (as of this writing) from achieving the same performance as GCC 936when compiling the C version of the Huffman encoder 937(<https://llvm.org/bugs/show_bug.cgi?id=16035>). For the purposes of 938benchmarking or regression testing, SIMD-accelerated Huffman encoding can be 939disabled by setting the `JSIMD_NOHUFFENC` environment variable to `1`. 940 94113. Added ARM 64-bit (ARMv8) NEON SIMD implementations of the commonly-used 942compression algorithms (including the accurate integer forward DCT and h2v2 & 943h2v1 downsampling algorithms, which are not accelerated in the 32-bit NEON 944implementation.) This speeds up the compression of full-color JPEGs by about 94575% on average on a Cavium ThunderX processor and by about 2-2.5x on average on 946Cortex-A53 and Cortex-A57 cores. 947 94814. Added SIMD acceleration for Huffman encoding on NEON-capable ARM 32-bit 949and 64-bit platforms. 950 951 For 32-bit code, this speeds up the compression of full-color JPEGs by 952about 30% on average on a typical iOS device (iPhone 4S, Cortex-A9) and by 953about 6-7% on average on a typical Android device (Nexus 5X, Cortex-A53 and 954Cortex-A57), relative to libjpeg-turbo 1.4.x. Note that the larger speedup 955under iOS is due to the fact that iOS builds use LLVM, which does not optimize 956the C Huffman encoder as well as GCC does. 957 958 For 64-bit code, NEON-accelerated Huffman encoding speeds up the 959compression of full-color JPEGs by about 40% on average on a typical iOS device 960(iPhone 5S, Apple A7) and by about 7-8% on average on a typical Android device 961(Nexus 5X, Cortex-A53 and Cortex-A57), in addition to the speedup described in 962[13] above. 963 964 For the purposes of benchmarking or regression testing, SIMD-accelerated 965Huffman encoding can be disabled by setting the `JSIMD_NOHUFFENC` environment 966variable to `1`. 967 96815. pkg-config (.pc) scripts are now included for both the libjpeg and 969TurboJPEG API libraries on Un*x systems. Note that if a project's build system 970relies on these scripts, then it will not be possible to build that project 971with libjpeg or with a prior version of libjpeg-turbo. 972 97316. Optimized the ARM 64-bit (ARMv8) NEON SIMD decompression routines to 974improve performance on CPUs with in-order pipelines. This speeds up the 975decompression of full-color JPEGs by nearly 2x on average on a Cavium ThunderX 976processor and by about 15% on average on a Cortex-A53 core. 977 97817. Fixed an issue in the accelerated Huffman decoder that could have caused 979the decoder to read past the end of the input buffer when a malformed, 980specially-crafted JPEG image was being decompressed. In prior versions of 981libjpeg-turbo, the accelerated Huffman decoder was invoked (in most cases) only 982if there were > 128 bytes of data in the input buffer. However, it is possible 983to construct a JPEG image in which a single Huffman block is over 430 bytes 984long, so this version of libjpeg-turbo activates the accelerated Huffman 985decoder only if there are > 512 bytes of data in the input buffer. 986 98718. Fixed a memory leak in tjunittest encountered when running the program 988with the `-yuv` option. 989 990 9911.4.2 992===== 993 994### Significant changes relative to 1.4.1: 995 9961. Fixed an issue whereby cjpeg would segfault if a Windows bitmap with a 997negative width or height was used as an input image (Windows bitmaps can have 998a negative height if they are stored in top-down order, but such files are 999rare and not supported by libjpeg-turbo.) 1000 10012. Fixed an issue whereby, under certain circumstances, libjpeg-turbo would 1002incorrectly encode certain JPEG images when quality=100 and the fast integer 1003forward DCT were used. This was known to cause `make test` to fail when the 1004library was built with `-march=haswell` on x86 systems. 1005 10063. Fixed an issue whereby libjpeg-turbo would crash when built with the latest 1007& greatest development version of the Clang/LLVM compiler. This was caused by 1008an x86-64 ABI conformance issue in some of libjpeg-turbo's 64-bit SSE2 SIMD 1009routines. Those routines were incorrectly using a 64-bit `mov` instruction to 1010transfer a 32-bit JDIMENSION argument, whereas the x86-64 ABI allows the upper 1011(unused) 32 bits of a 32-bit argument's register to be undefined. The new 1012Clang/LLVM optimizer uses load combining to transfer multiple adjacent 32-bit 1013structure members into a single 64-bit register, and this exposed the ABI 1014conformance issue. 1015 10164. Fixed a bug in the MIPS DSPr2 4:2:0 "plain" (non-fancy and non-merged) 1017upsampling routine that caused a buffer overflow (and subsequent segfault) when 1018decompressing a 4:2:0 JPEG image whose scaled output width was less than 16 1019pixels. The "plain" upsampling routines are normally only used when 1020decompressing a non-YCbCr JPEG image, but they are also used when decompressing 1021a JPEG image whose scaled output height is 1. 1022 10235. Fixed various negative left shifts and other issues reported by the GCC and 1024Clang undefined behavior sanitizers. None of these was known to pose a 1025security threat, but removing the warnings makes it easier to detect actual 1026security issues, should they arise in the future. 1027 1028 10291.4.1 1030===== 1031 1032### Significant changes relative to 1.4.0: 1033 10341. tjbench now properly handles CMYK/YCCK JPEG files. Passing an argument of 1035`-cmyk` (instead of, for instance, `-rgb`) will cause tjbench to internally 1036convert the source bitmap to CMYK prior to compression, to generate YCCK JPEG 1037files, and to internally convert the decompressed CMYK pixels back to RGB after 1038decompression (the latter is done automatically if a CMYK or YCCK JPEG is 1039passed to tjbench as a source image.) The CMYK<->RGB conversion operation is 1040not benchmarked. NOTE: The quick & dirty CMYK<->RGB conversions that tjbench 1041uses are suitable for testing only. Proper conversion between CMYK and RGB 1042requires a color management system. 1043 10442. `make test` now performs additional bitwise regression tests using tjbench, 1045mainly for the purpose of testing compression from/decompression to a subregion 1046of a larger image buffer. 1047 10483. `make test` no longer tests the regression of the floating point DCT/IDCT 1049by default, since the results of those tests can vary if the algorithms in 1050question are not implemented using SIMD instructions on a particular platform. 1051See the comments in [Makefile.am](Makefile.am) for information on how to 1052re-enable the tests and to specify an expected result for them based on the 1053particulars of your platform. 1054 10554. The NULL color conversion routines have been significantly optimized, 1056which speeds up the compression of RGB and CMYK JPEGs by 5-20% when using 105764-bit code and 0-3% when using 32-bit code, and the decompression of those 1058images by 10-30% when using 64-bit code and 3-12% when using 32-bit code. 1059 10605. Fixed an "illegal instruction" error that occurred when djpeg from a 1061SIMD-enabled libjpeg-turbo MIPS build was executed with the `-nosmooth` option 1062on a MIPS machine that lacked DSPr2 support. The MIPS SIMD routines for h2v1 1063and h2v2 merged upsampling were not properly checking for the existence of 1064DSPr2. 1065 10666. Performance has been improved significantly on 64-bit non-Linux and 1067non-Windows platforms (generally 10-20% faster compression and 5-10% faster 1068decompression.) Due to an oversight, the 64-bit version of the accelerated 1069Huffman codec was not being compiled in when libjpeg-turbo was built on 1070platforms other than Windows or Linux. Oops. 1071 10727. Fixed an extremely rare bug in the Huffman encoder that caused 64-bit 1073builds of libjpeg-turbo to incorrectly encode a few specific test images when 1074quality=98, an optimized Huffman table, and the accurate integer forward DCT 1075were used. 1076 10778. The Windows (CMake) build system now supports building only static or only 1078shared libraries. This is accomplished by adding either `-DENABLE_STATIC=0` or 1079`-DENABLE_SHARED=0` to the CMake command line. 1080 10819. TurboJPEG API functions will now return an error code if a warning is 1082triggered in the underlying libjpeg API. For instance, if a JPEG file is 1083corrupt, the TurboJPEG decompression functions will attempt to decompress 1084as much of the image as possible, but those functions will now return -1 to 1085indicate that the decompression was not entirely successful. 1086 108710. Fixed a bug in the MIPS DSPr2 4:2:2 fancy upsampling routine that caused a 1088buffer overflow (and subsequent segfault) when decompressing a 4:2:2 JPEG image 1089in which the right-most MCU was 5 or 6 pixels wide. 1090 1091 10921.4.0 1093===== 1094 1095### Significant changes relative to 1.4 beta1: 1096 10971. Fixed a build issue on OS X PowerPC platforms (md5cmp failed to build 1098because OS X does not provide the `le32toh()` and `htole32()` functions.) 1099 11002. The non-SIMD RGB565 color conversion code did not work correctly on big 1101endian machines. This has been fixed. 1102 11033. Fixed an issue in `tjPlaneSizeYUV()` whereby it would erroneously return 1 1104instead of -1 if `componentID` was > 0 and `subsamp` was `TJSAMP_GRAY`. 1105 11063. Fixed an issue in `tjBufSizeYUV2()` whereby it would erroneously return 0 1107instead of -1 if `width` was < 1. 1108 11095. The Huffman encoder now uses `clz` and `bsr` instructions for bit counting 1110on ARM64 platforms (see 1.4 beta1[5].) 1111 11126. The `close()` method in the TJCompressor and TJDecompressor Java classes is 1113now idempotent. Previously, that method would call the native `tjDestroy()` 1114function even if the TurboJPEG instance had already been destroyed. This 1115caused an exception to be thrown during finalization, if the `close()` method 1116had already been called. The exception was caught, but it was still an 1117expensive operation. 1118 11197. The TurboJPEG API previously generated an error (`Could not determine 1120subsampling type for JPEG image`) when attempting to decompress grayscale JPEG 1121images that were compressed with a sampling factor other than 1 (for instance, 1122with `cjpeg -grayscale -sample 2x2`). Subsampling technically has no meaning 1123with grayscale JPEGs, and thus the horizontal and vertical sampling factors 1124for such images are ignored by the decompressor. However, the TurboJPEG API 1125was being too rigid and was expecting the sampling factors to be equal to 1 1126before it treated the image as a grayscale JPEG. 1127 11288. cjpeg, djpeg, and jpegtran now accept an argument of `-version`, which will 1129print the library version and exit. 1130 11319. Referring to 1.4 beta1[15], another extremely rare circumstance was 1132discovered under which the Huffman encoder's local buffer can be overrun 1133when a buffered destination manager is being used and an 1134extremely-high-frequency block (basically junk image data) is being encoded. 1135Even though the Huffman local buffer was increased from 128 bytes to 136 bytes 1136to address the previous issue, the new issue caused even the larger buffer to 1137be overrun. Further analysis reveals that, in the absolute worst case (such as 1138setting alternating AC coefficients to 32767 and -32768 in the JPEG scanning 1139order), the Huffman encoder can produce encoded blocks that approach double the 1140size of the unencoded blocks. Thus, the Huffman local buffer was increased to 1141256 bytes, which should prevent any such issue from re-occurring in the future. 1142 114310. The new `tjPlaneSizeYUV()`, `tjPlaneWidth()`, and `tjPlaneHeight()` 1144functions were not actually usable on any platform except OS X and Windows, 1145because those functions were not included in the libturbojpeg mapfile. This 1146has been fixed. 1147 114811. Restored the `JPP()`, `JMETHOD()`, and `FAR` macros in the libjpeg-turbo 1149header files. The `JPP()` and `JMETHOD()` macros were originally implemented 1150in libjpeg as a way of supporting non-ANSI compilers that lacked support for 1151prototype parameters. libjpeg-turbo has never supported such compilers, but 1152some software packages still use the macros to define their own prototypes. 1153Similarly, libjpeg-turbo has never supported MS-DOS and other platforms that 1154have far symbols, but some software packages still use the `FAR` macro. A 1155pretty good argument can be made that this is a bad practice on the part of the 1156software in question, but since this affects more than one package, it's just 1157easier to fix it here. 1158 115912. Fixed issues that were preventing the ARM 64-bit SIMD code from compiling 1160for iOS, and included an ARMv8 architecture in all of the binaries installed by 1161the "official" libjpeg-turbo SDK for OS X. 1162 1163 11641.3.90 (1.4 beta1) 1165================== 1166 1167### Significant changes relative to 1.3.1: 1168 11691. New features in the TurboJPEG API: 1170 1171 - YUV planar images can now be generated with an arbitrary line padding 1172(previously only 4-byte padding, which was compatible with X Video, was 1173supported.) 1174 - The decompress-to-YUV function has been extended to support image 1175scaling. 1176 - JPEG images can now be compressed from YUV planar source images. 1177 - YUV planar images can now be decoded into RGB or grayscale images. 1178 - 4:1:1 subsampling is now supported. This is mainly included for 1179compatibility, since 4:1:1 is not fully accelerated in libjpeg-turbo and has no 1180significant advantages relative to 4:2:0. 1181 - CMYK images are now supported. This feature allows CMYK source images 1182to be compressed to YCCK JPEGs and YCCK or CMYK JPEGs to be decompressed to 1183CMYK destination images. Conversion between CMYK/YCCK and RGB or YUV images is 1184not supported. Such conversion requires a color management system and is thus 1185out of scope for a codec library. 1186 - The handling of YUV images in the Java API has been significantly 1187refactored and should now be much more intuitive. 1188 - The Java API now supports encoding a YUV image from an arbitrary 1189position in a large image buffer. 1190 - All of the YUV functions now have a corresponding function that operates 1191on separate image planes instead of a unified image buffer. This allows for 1192compressing/decoding from or decompressing/encoding to a subregion of a larger 1193YUV image. It also allows for handling YUV formats that swap the order of the 1194U and V planes. 1195 11962. Added SIMD acceleration for DSPr2-capable MIPS platforms. This speeds up 1197the compression of full-color JPEGs by 70-80% on such platforms and 1198decompression by 25-35%. 1199 12003. If an application attempts to decompress a Huffman-coded JPEG image whose 1201header does not contain Huffman tables, libjpeg-turbo will now insert the 1202default Huffman tables. In order to save space, many motion JPEG video frames 1203are encoded without the default Huffman tables, so these frames can now be 1204successfully decompressed by libjpeg-turbo without additional work on the part 1205of the application. An application can still override the Huffman tables, for 1206instance to re-use tables from a previous frame of the same video. 1207 12084. The Mac packaging system now uses pkgbuild and productbuild rather than 1209PackageMaker (which is obsolete and no longer supported.) This means that 1210OS X 10.6 "Snow Leopard" or later must be used when packaging libjpeg-turbo, 1211although the packages produced can be installed on OS X 10.5 "Leopard" or 1212later. OS X 10.4 "Tiger" is no longer supported. 1213 12145. The Huffman encoder now uses `clz` and `bsr` instructions for bit counting 1215on ARM platforms rather than a lookup table. This reduces the memory footprint 1216by 64k, which may be important for some mobile applications. Out of four 1217Android devices that were tested, two demonstrated a small overall performance 1218loss (~3-4% on average) with ARMv6 code and a small gain (also ~3-4%) with 1219ARMv7 code when enabling this new feature, but the other two devices 1220demonstrated a significant overall performance gain with both ARMv6 and ARMv7 1221code (~10-20%) when enabling the feature. Actual mileage may vary. 1222 12236. Worked around an issue with Visual C++ 2010 and later that caused incorrect 1224pixels to be generated when decompressing a JPEG image to a 256-color bitmap, 1225if compiler optimization was enabled when libjpeg-turbo was built. This caused 1226the regression tests to fail when doing a release build under Visual C++ 2010 1227and later. 1228 12297. Improved the accuracy and performance of the non-SIMD implementation of the 1230floating point inverse DCT (using code borrowed from libjpeg v8a and later.) 1231The accuracy of this implementation now matches the accuracy of the SSE/SSE2 1232implementation. Note, however, that the floating point DCT/IDCT algorithms are 1233mainly a legacy feature. They generally do not produce significantly better 1234accuracy than the accurate integer DCT/IDCT algorithms, and they are quite a 1235bit slower. 1236 12378. Added a new output colorspace (`JCS_RGB565`) to the libjpeg API that allows 1238for decompressing JPEG images into RGB565 (16-bit) pixels. If dithering is not 1239used, then this code path is SIMD-accelerated on ARM platforms. 1240 12419. Numerous obsolete features, such as support for non-ANSI compilers and 1242support for the MS-DOS memory model, were removed from the libjpeg code, 1243greatly improving its readability and making it easier to maintain and extend. 1244 124510. Fixed a segfault that occurred when calling `output_message()` with 1246`msg_code` set to `JMSG_COPYRIGHT`. 1247 124811. Fixed an issue whereby wrjpgcom was allowing comments longer than 65k 1249characters to be passed on the command line, which was causing it to generate 1250incorrect JPEG files. 1251 125212. Fixed a bug in the build system that was causing the Windows version of 1253wrjpgcom to be built using the rdjpgcom source code. 1254 125513. Restored 12-bit-per-component JPEG support. A 12-bit version of 1256libjpeg-turbo can now be built by passing an argument of `--with-12bit` to 1257configure (Unix) or `-DWITH_12BIT=1` to cmake (Windows.) 12-bit JPEG support 1258is included only for convenience. Enabling this feature disables all of the 1259performance features in libjpeg-turbo, as well as arithmetic coding and the 1260TurboJPEG API. The resulting library still contains the other libjpeg-turbo 1261features (such as the colorspace extensions), but in general, it performs no 1262faster than libjpeg v6b. 1263 126414. Added ARM 64-bit SIMD acceleration for the YCC-to-RGB color conversion 1265and IDCT algorithms (both are used during JPEG decompression.) For unknown 1266reasons (probably related to clang), this code cannot currently be compiled for 1267iOS. 1268 126915. Fixed an extremely rare bug (CVE-2014-9092) that could cause the Huffman 1270encoder's local buffer to overrun when a very high-frequency MCU is compressed 1271using quality 100 and no subsampling, and when the JPEG output buffer is being 1272dynamically resized by the destination manager. This issue was so rare that, 1273even with a test program specifically designed to make the bug occur (by 1274injecting random high-frequency YUV data into the compressor), it was 1275reproducible only once in about every 25 million iterations. 1276 127716. Fixed an oversight in the TurboJPEG C wrapper: if any of the JPEG 1278compression functions was called repeatedly with the same 1279automatically-allocated destination buffer, then TurboJPEG would erroneously 1280assume that the `jpegSize` parameter was equal to the size of the buffer, when 1281in fact that parameter was probably equal to the size of the most recently 1282compressed JPEG image. If the size of the previous JPEG image was not as large 1283as the current JPEG image, then TurboJPEG would unnecessarily reallocate the 1284destination buffer. 1285 1286 12871.3.1 1288===== 1289 1290### Significant changes relative to 1.3.0: 1291 12921. On Un*x systems, `make install` now installs the libjpeg-turbo libraries 1293into /opt/libjpeg-turbo/lib32 by default on any 32-bit system, not just x86, 1294and into /opt/libjpeg-turbo/lib64 by default on any 64-bit system, not just 1295x86-64. You can override this by overriding either the `prefix` or `libdir` 1296configure variables. 1297 12982. The Windows installer now places a copy of the TurboJPEG DLLs in the same 1299directory as the rest of the libjpeg-turbo binaries. This was mainly done 1300to support TurboVNC 1.3, which bundles the DLLs in its Windows installation. 1301When using a 32-bit version of CMake on 64-bit Windows, it is impossible to 1302access the c:\WINDOWS\system32 directory, which made it impossible for the 1303TurboVNC build scripts to bundle the 64-bit TurboJPEG DLL. 1304 13053. Fixed a bug whereby attempting to encode a progressive JPEG with arithmetic 1306entropy coding (by passing arguments of `-progressive -arithmetic` to cjpeg or 1307jpegtran, for instance) would result in an error, `Requested feature was 1308omitted at compile time`. 1309 13104. Fixed a couple of issues (CVE-2013-6629 and CVE-2013-6630) whereby malformed 1311JPEG images would cause libjpeg-turbo to use uninitialized memory during 1312decompression. 1313 13145. Fixed an error (`Buffer passed to JPEG library is too small`) that occurred 1315when calling the TurboJPEG YUV encoding function with a very small (< 5x5) 1316source image, and added a unit test to check for this error. 1317 13186. The Java classes should now build properly under Visual Studio 2010 and 1319later. 1320 13217. Fixed an issue that prevented SRPMs generated using the in-tree packaging 1322tools from being rebuilt on certain newer Linux distributions. 1323 13248. Numerous minor fixes to eliminate compilation and build/packaging system 1325warnings, fix cosmetic issues, improve documentation clarity, and other general 1326source cleanup. 1327 1328 13291.3.0 1330===== 1331 1332### Significant changes relative to 1.3 beta1: 1333 13341. `make test` now works properly on FreeBSD, and it no longer requires the 1335md5sum executable to be present on other Un*x platforms. 1336 13372. Overhauled the packaging system: 1338 1339 - To avoid conflict with vendor-supplied libjpeg-turbo packages, the 1340official RPMs and DEBs for libjpeg-turbo have been renamed to 1341"libjpeg-turbo-official". 1342 - The TurboJPEG libraries are now located under /opt/libjpeg-turbo in the 1343official Linux and Mac packages, to avoid conflict with vendor-supplied 1344packages and also to streamline the packaging system. 1345 - Release packages are now created with the directory structure defined 1346by the configure variables `prefix`, `bindir`, `libdir`, etc. (Un\*x) or by the 1347`CMAKE_INSTALL_PREFIX` variable (Windows.) The exception is that the docs are 1348always located under the system default documentation directory on Un\*x and 1349Mac systems, and on Windows, the TurboJPEG DLL is always located in the Windows 1350system directory. 1351 - To avoid confusion, official libjpeg-turbo packages on Linux/Unix 1352platforms (except for Mac) will always install the 32-bit libraries in 1353/opt/libjpeg-turbo/lib32 and the 64-bit libraries in /opt/libjpeg-turbo/lib64. 1354 - Fixed an issue whereby, in some cases, the libjpeg-turbo executables on 1355Un*x systems were not properly linking with the shared libraries installed by 1356the same package. 1357 - Fixed an issue whereby building the "installer" target on Windows when 1358`WITH_JAVA=1` would fail if the TurboJPEG JAR had not been previously built. 1359 - Building the "install" target on Windows now installs files into the 1360same places that the installer does. 1361 13623. Fixed a Huffman encoder bug that prevented I/O suspension from working 1363properly. 1364 1365 13661.2.90 (1.3 beta1) 1367================== 1368 1369### Significant changes relative to 1.2.1: 1370 13711. Added support for additional scaling factors (3/8, 5/8, 3/4, 7/8, 9/8, 5/4, 137211/8, 3/2, 13/8, 7/4, 15/8, and 2) when decompressing. Note that the IDCT will 1373not be SIMD-accelerated when using any of these new scaling factors. 1374 13752. The TurboJPEG dynamic library is now versioned. It was not strictly 1376necessary to do so, because TurboJPEG uses versioned symbols, and if a function 1377changes in an ABI-incompatible way, that function is renamed and a legacy 1378function is provided to maintain backward compatibility. However, certain 1379Linux distro maintainers have a policy against accepting any library that isn't 1380versioned. 1381 13823. Extended the TurboJPEG Java API so that it can be used to compress a JPEG 1383image from and decompress a JPEG image to an arbitrary position in a large 1384image buffer. 1385 13864. The `tjDecompressToYUV()` function now supports the `TJFLAG_FASTDCT` flag. 1387 13885. The 32-bit supplementary package for amd64 Debian systems now provides 1389symlinks in /usr/lib/i386-linux-gnu for the TurboJPEG libraries in /usr/lib32. 1390This allows those libraries to be used on MultiArch-compatible systems (such as 1391Ubuntu 11 and later) without setting the linker path. 1392 13936. The TurboJPEG Java wrapper should now find the JNI library on Mac systems 1394without having to pass `-Djava.library.path=/usr/lib` to java. 1395 13967. TJBench has been ported to Java to provide a convenient way of validating 1397the performance of the TurboJPEG Java API. It can be run with 1398`java -cp turbojpeg.jar TJBench`. 1399 14008. cjpeg can now be used to generate JPEG files with the RGB colorspace 1401(feature ported from jpeg-8d.) 1402 14039. The width and height in the `-crop` argument passed to jpegtran can now be 1404suffixed with `f` to indicate that, when the upper left corner of the cropping 1405region is automatically moved to the nearest iMCU boundary, the bottom right 1406corner should be moved by the same amount. In other words, this feature causes 1407jpegtran to strictly honor the specified width/height rather than the specified 1408bottom right corner (feature ported from jpeg-8d.) 1409 141010. JPEG files using the RGB colorspace can now be decompressed into grayscale 1411images (feature ported from jpeg-8d.) 1412 141311. Fixed a regression caused by 1.2.1[7] whereby the build would fail with 1414multiple "Mismatch in operand sizes" errors when attempting to build the x86 1415SIMD code with NASM 0.98. 1416 141712. The in-memory source/destination managers (`jpeg_mem_src()` and 1418`jpeg_mem_dest()`) are now included by default when building libjpeg-turbo with 1419libjpeg v6b or v7 emulation, so that programs can take advantage of these 1420functions without requiring the use of the backward-incompatible libjpeg v8 1421ABI. The "age number" of the libjpeg-turbo library on Un*x systems has been 1422incremented by 1 to reflect this. You can disable this feature with a 1423configure/CMake switch in order to retain strict API/ABI compatibility with the 1424libjpeg v6b or v7 API/ABI (or with previous versions of libjpeg-turbo.) See 1425[README.md](README.md) for more details. 1426 142713. Added ARMv7s architecture to libjpeg.a and libturbojpeg.a in the official 1428libjpeg-turbo binary package for OS X, so that those libraries can be used to 1429build applications that leverage the faster CPUs in the iPhone 5 and iPad 4. 1430 1431 14321.2.1 1433===== 1434 1435### Significant changes relative to 1.2.0: 1436 14371. Creating or decoding a JPEG file that uses the RGB colorspace should now 1438properly work when the input or output colorspace is one of the libjpeg-turbo 1439colorspace extensions. 1440 14412. When libjpeg-turbo was built without SIMD support and merged (non-fancy) 1442upsampling was used along with an alpha-enabled colorspace during 1443decompression, the unused byte of the decompressed pixels was not being set to 14440xFF. This has been fixed. TJUnitTest has also been extended to test for the 1445correct behavior of the colorspace extensions when merged upsampling is used. 1446 14473. Fixed a bug whereby the libjpeg-turbo SSE2 SIMD code would not preserve the 1448upper 64 bits of xmm6 and xmm7 on Win64 platforms, which violated the Win64 1449calling conventions. 1450 14514. Fixed a regression (CVE-2012-2806) caused by 1.2.0[6] whereby decompressing 1452corrupt JPEG images (specifically, images in which the component count was 1453erroneously set to a large value) would cause libjpeg-turbo to segfault. 1454 14555. Worked around a severe performance issue with "Bobcat" (AMD Embedded APU) 1456processors. The `MASKMOVDQU` instruction, which was used by the libjpeg-turbo 1457SSE2 SIMD code, is apparently implemented in microcode on AMD processors, and 1458it is painfully slow on Bobcat processors in particular. Eliminating the use 1459of this instruction improved performance by an order of magnitude on Bobcat 1460processors and by a small amount (typically 5%) on AMD desktop processors. 1461 14626. Added SIMD acceleration for performing 4:2:2 upsampling on NEON-capable ARM 1463platforms. This speeds up the decompression of 4:2:2 JPEGs by 20-25% on such 1464platforms. 1465 14667. Fixed a regression caused by 1.2.0[2] whereby, on Linux/x86 platforms 1467running the 32-bit SSE2 SIMD code in libjpeg-turbo, decompressing a 4:2:0 or 14684:2:2 JPEG image into a 32-bit (RGBX, BGRX, etc.) buffer without using fancy 1469upsampling would produce several incorrect columns of pixels at the right-hand 1470side of the output image if each row in the output image was not evenly 1471divisible by 16 bytes. 1472 14738. Fixed an issue whereby attempting to build the SIMD extensions with Xcode 14744.3 on OS X platforms would cause NASM to return numerous errors of the form 1475"'%define' expects a macro identifier". 1476 14779. Added flags to the TurboJPEG API that allow the caller to force the use of 1478either the fast or the accurate DCT/IDCT algorithms in the underlying codec. 1479 1480 14811.2.0 1482===== 1483 1484### Significant changes relative to 1.2 beta1: 1485 14861. Fixed build issue with YASM on Unix systems (the libjpeg-turbo build system 1487was not adding the current directory to the assembler include path, so YASM 1488was not able to find jsimdcfg.inc.) 1489 14902. Fixed out-of-bounds read in SSE2 SIMD code that occurred when decompressing 1491a JPEG image to a bitmap buffer whose size was not a multiple of 16 bytes. 1492This was more of an annoyance than an actual bug, since it did not cause any 1493actual run-time problems, but the issue showed up when running libjpeg-turbo in 1494valgrind. See <http://crbug.com/72399> for more information. 1495 14963. Added a compile-time macro (`LIBJPEG_TURBO_VERSION`) that can be used to 1497check the version of libjpeg-turbo against which an application was compiled. 1498 14994. Added new RGBA/BGRA/ABGR/ARGB colorspace extension constants (libjpeg API) 1500and pixel formats (TurboJPEG API), which allow applications to specify that, 1501when decompressing to a 4-component RGB buffer, the unused byte should be set 1502to 0xFF so that it can be interpreted as an opaque alpha channel. 1503 15045. Fixed regression issue whereby DevIL failed to build against libjpeg-turbo 1505because libjpeg-turbo's distributed version of jconfig.h contained an `INLINE` 1506macro, which conflicted with a similar macro in DevIL. This macro is used only 1507internally when building libjpeg-turbo, so it was moved into config.h. 1508 15096. libjpeg-turbo will now correctly decompress erroneous CMYK/YCCK JPEGs whose 1510K component is assigned a component ID of 1 instead of 4. Although these files 1511are in violation of the spec, other JPEG implementations handle them 1512correctly. 1513 15147. Added ARMv6 and ARMv7 architectures to libjpeg.a and libturbojpeg.a in 1515the official libjpeg-turbo binary package for OS X, so that those libraries can 1516be used to build both OS X and iOS applications. 1517 1518 15191.1.90 (1.2 beta1) 1520================== 1521 1522### Significant changes relative to 1.1.1: 1523 15241. Added a Java wrapper for the TurboJPEG API. See [java/README](java/README) 1525for more details. 1526 15272. The TurboJPEG API can now be used to scale down images during 1528decompression. 1529 15303. Added SIMD routines for RGB-to-grayscale color conversion, which 1531significantly improves the performance of grayscale JPEG compression from an 1532RGB source image. 1533 15344. Improved the performance of the C color conversion routines, which are used 1535on platforms for which SIMD acceleration is not available. 1536 15375. Added a function to the TurboJPEG API that performs lossless transforms. 1538This function is implemented using the same back end as jpegtran, but it 1539performs transcoding entirely in memory and allows multiple transforms and/or 1540crop operations to be batched together, so the source coefficients only need to 1541be read once. This is useful when generating image tiles from a single source 1542JPEG. 1543 15446. Added tests for the new TurboJPEG scaled decompression and lossless 1545transform features to tjbench (the TurboJPEG benchmark, formerly called 1546"jpgtest".) 1547 15487. Added support for 4:4:0 (transposed 4:2:2) subsampling in TurboJPEG, which 1549was necessary in order for it to read 4:2:2 JPEG files that had been losslessly 1550transposed or rotated 90 degrees. 1551 15528. All legacy VirtualGL code has been re-factored, and this has allowed 1553libjpeg-turbo, in its entirety, to be re-licensed under a BSD-style license. 1554 15559. libjpeg-turbo can now be built with YASM. 1556 155710. Added SIMD acceleration for ARM Linux and iOS platforms that support 1558NEON instructions. 1559 156011. Refactored the TurboJPEG C API and documented it using Doxygen. The 1561TurboJPEG 1.2 API uses pixel formats to define the size and component order of 1562the uncompressed source/destination images, and it includes a more efficient 1563version of `TJBUFSIZE()` that computes a worst-case JPEG size based on the 1564level of chrominance subsampling. The refactored implementation of the 1565TurboJPEG API now uses the libjpeg memory source and destination managers, 1566which allows the TurboJPEG compressor to grow the JPEG buffer as necessary. 1567 156812. Eliminated errors in the output of jpegtran on Windows that occurred when 1569the application was invoked using I/O redirection 1570(`jpegtran <input.jpg >output.jpg`.) 1571 157213. The inclusion of libjpeg v7 and v8 emulation as well as arithmetic coding 1573support in libjpeg-turbo v1.1.0 introduced several new error constants in 1574jerror.h, and these were mistakenly enabled for all emulation modes, causing 1575the error enum in libjpeg-turbo to sometimes have different values than the 1576same enum in libjpeg. This represents an ABI incompatibility, and it caused 1577problems with rare applications that took specific action based on a particular 1578error value. The fix was to include the new error constants conditionally 1579based on whether libjpeg v7 or v8 emulation was enabled. 1580 158114. Fixed an issue whereby Windows applications that used libjpeg-turbo would 1582fail to compile if the Windows system headers were included before jpeglib.h. 1583This issue was caused by a conflict in the definition of the INT32 type. 1584 158515. Fixed 32-bit supplementary package for amd64 Debian systems, which was 1586broken by enhancements to the packaging system in 1.1. 1587 158816. When decompressing a JPEG image using an output colorspace of 1589`JCS_EXT_RGBX`, `JCS_EXT_BGRX`, `JCS_EXT_XBGR`, or `JCS_EXT_XRGB`, 1590libjpeg-turbo will now set the unused byte to 0xFF, which allows applications 1591to interpret that byte as an alpha channel (0xFF = opaque). 1592 1593 15941.1.1 1595===== 1596 1597### Significant changes relative to 1.1.0: 1598 15991. Fixed a 1-pixel error in row 0, column 21 of the luminance plane generated 1600by `tjEncodeYUV()`. 1601 16022. libjpeg-turbo's accelerated Huffman decoder previously ignored unexpected 1603markers found in the middle of the JPEG data stream during decompression. It 1604will now hand off decoding of a particular block to the unaccelerated Huffman 1605decoder if an unexpected marker is found, so that the unaccelerated Huffman 1606decoder can generate an appropriate warning. 1607 16083. Older versions of MinGW64 prefixed symbol names with underscores by 1609default, which differed from the behavior of 64-bit Visual C++. MinGW64 1.0 1610has adopted the behavior of 64-bit Visual C++ as the default, so to accommodate 1611this, the libjpeg-turbo SIMD function names are no longer prefixed with an 1612underscore when building with MinGW64. This means that, when building 1613libjpeg-turbo with older versions of MinGW64, you will now have to add 1614`-fno-leading-underscore` to the `CFLAGS`. 1615 16164. Fixed a regression bug in the NSIS script that caused the Windows installer 1617build to fail when using the Visual Studio IDE. 1618 16195. Fixed a bug in `jpeg_read_coefficients()` whereby it would not initialize 1620`cinfo->image_width` and `cinfo->image_height` if libjpeg v7 or v8 emulation 1621was enabled. This specifically caused the jpegoptim program to fail if it was 1622linked against a version of libjpeg-turbo that was built with libjpeg v7 or v8 1623emulation. 1624 16256. Eliminated excessive I/O overhead that occurred when reading BMP files in 1626cjpeg. 1627 16287. Eliminated errors in the output of cjpeg on Windows that occurred when the 1629application was invoked using I/O redirection (`cjpeg <inputfile >output.jpg`.) 1630 1631 16321.1.0 1633===== 1634 1635### Significant changes relative to 1.1 beta1: 1636 16371. The algorithm used by the SIMD quantization function cannot produce correct 1638results when the JPEG quality is >= 98 and the fast integer forward DCT is 1639used. Thus, the non-SIMD quantization function is now used for those cases, 1640and libjpeg-turbo should now produce identical output to libjpeg v6b in all 1641cases. 1642 16432. Despite the above, the fast integer forward DCT still degrades somewhat for 1644JPEG qualities greater than 95, so the TurboJPEG wrapper will now automatically 1645use the accurate integer forward DCT when generating JPEG images of quality 96 1646or greater. This reduces compression performance by as much as 15% for these 1647high-quality images but is necessary to ensure that the images are perceptually 1648lossless. It also ensures that the library can avoid the performance pitfall 1649created by [1]. 1650 16513. Ported jpgtest.cxx to pure C to avoid the need for a C++ compiler. 1652 16534. Fixed visual artifacts in grayscale JPEG compression caused by a typo in 1654the RGB-to-luminance lookup tables. 1655 16565. The Windows distribution packages now include the libjpeg run-time programs 1657(cjpeg, etc.) 1658 16596. All packages now include jpgtest. 1660 16617. The TurboJPEG dynamic library now uses versioned symbols. 1662 16638. Added two new TurboJPEG API functions, `tjEncodeYUV()` and 1664`tjDecompressToYUV()`, to replace the somewhat hackish `TJ_YUV` flag. 1665 1666 16671.0.90 (1.1 beta1) 1668================== 1669 1670### Significant changes relative to 1.0.1: 1671 16721. Added emulation of the libjpeg v7 and v8 APIs and ABIs. See 1673[README.md](README.md) for more details. This feature was sponsored by 1674CamTrace SAS. 1675 16762. Created a new CMake-based build system for the Visual C++ and MinGW builds. 1677 16783. Grayscale bitmaps can now be compressed from/decompressed to using the 1679TurboJPEG API. 1680 16814. jpgtest can now be used to test decompression performance with existing 1682JPEG images. 1683 16845. If the default install prefix (/opt/libjpeg-turbo) is used, then 1685`make install` now creates /opt/libjpeg-turbo/lib32 and 1686/opt/libjpeg-turbo/lib64 sym links to duplicate the behavior of the binary 1687packages. 1688 16896. All symbols in the libjpeg-turbo dynamic library are now versioned, even 1690when the library is built with libjpeg v6b emulation. 1691 16927. Added arithmetic encoding and decoding support (can be disabled with 1693configure or CMake options) 1694 16958. Added a `TJ_YUV` flag to the TurboJPEG API, which causes both the compressor 1696and decompressor to output planar YUV images. 1697 16989. Added an extended version of `tjDecompressHeader()` to the TurboJPEG API, 1699which allows the caller to determine the type of subsampling used in a JPEG 1700image. 1701 170210. Added further protections against invalid Huffman codes. 1703 1704 17051.0.1 1706===== 1707 1708### Significant changes relative to 1.0.0: 1709 17101. The Huffman decoder will now handle erroneous Huffman codes (for instance, 1711from a corrupt JPEG image.) Previously, these would cause libjpeg-turbo to 1712crash under certain circumstances. 1713 17142. Fixed typo in SIMD dispatch routines that was causing 4:2:2 upsampling to 1715be used instead of 4:2:0 when decompressing JPEG images using SSE2 code. 1716 17173. The configure script will now automatically determine whether the 1718`INCOMPLETE_TYPES_BROKEN` macro should be defined. 1719 1720 17211.0.0 1722===== 1723 1724### Significant changes relative to 0.0.93: 1725 17261. 2983700: Further FreeBSD build tweaks (no longer necessary to specify 1727`--host` when configuring on a 64-bit system) 1728 17292. Created symlinks in the Unix/Linux packages so that the TurboJPEG 1730include file can always be found in /opt/libjpeg-turbo/include, the 32-bit 1731static libraries can always be found in /opt/libjpeg-turbo/lib32, and the 173264-bit static libraries can always be found in /opt/libjpeg-turbo/lib64. 1733 17343. The Unix/Linux distribution packages now include the libjpeg run-time 1735programs (cjpeg, etc.) and man pages. 1736 17374. Created a 32-bit supplementary package for amd64 Debian systems, which 1738contains just the 32-bit libjpeg-turbo libraries. 1739 17405. Moved the libraries from */lib32 to */lib in the i386 Debian package. 1741 17426. Include distribution package for Cygwin 1743 17447. No longer necessary to specify `--without-simd` on non-x86 architectures, 1745and unit tests now work on those architectures. 1746 1747 17480.0.93 1749====== 1750 1751### Significant changes since 0.0.91: 1752 17531. 2982659: Fixed x86-64 build on FreeBSD systems 1754 17552. 2988188: Added support for Windows 64-bit systems 1756 1757 17580.0.91 1759====== 1760 1761### Significant changes relative to 0.0.90: 1762 17631. Added documentation to .deb packages 1764 17652. 2968313: Fixed data corruption issues when decompressing large JPEG images 1766and/or using buffered I/O with the libjpeg-turbo decompressor 1767 1768 17690.0.90 1770====== 1771 1772Initial release 1773