• Home
  • Raw
  • Download

Lines Matching +full:macos +full:- +full:cmake +full:- +full:shared

6 All performance data on this page is measured on an Intel Core i5-9600K
9 <!-- ---------------------------------------------------------------------- -->
18 Reminder - the codec library API is not designed to be binary compatible across
19 versions. We always recommend rebuilding your client-side code using the updated
24 method to create the 16-bit RGB endpoint colors, and removes the previous
27 * **Bug fix:** Decompressing to an 8-bit per component output image now matches
38 * **Feature:** Command line tool supports a new option, `-decode_unorm8`.
41 (`-d*`) and trial (`-t*`) tool operation if the decompressed output image
42 is stored to an 8-bit per component file format. This option must be set
43 manually for compression (`-c*`) tool operation, as the desired decode mode
49 `-silent` is used.
51 <!-- ---------------------------------------------------------------------- -->
62 command line performance for `-exhaustive` compression when testing on an
67 <!-- ---------------------------------------------------------------------- -->
76 Reminder - the codec library API is not designed to be binary compatible across
77 versions. We always recommend rebuilding your client-side code using the updated
81 * **Bug-fix:** Fixed context allocation for contexts allocated with the
83 * **Bug-fix:** Reduced use of `reinterpret_cast` in the core codec to
85 * **Optimization:** `-medium` search quality no longer tests 4 partition
88 * **Optimization:** `-thorough` and higher search qualities no longer test the
93 drop in image quality for the `-verythorough` and `-exhaustive` presets.
95 <!-- ---------------------------------------------------------------------- -->
104 * **Bug-fix:** Improved handling compiler arguments in CMake, including
105 consistent use of MSVC-style command line arguments for ClangCL.
106 * **Bug-fix:** Invariant Clang builds now use `-ffp-model=precise` with
107 `-ffp-contract=off` which is needed to restore invariance due to recent
109 * **Change:** macOS binary releases are now distributed as a single universal
115 * **Change:** Non-invariant Clang builds now use `-ffp-model=precise` with
116 `-ffp-contract=on`. This should improve performance on older Clang
118 * **Change:** Non-invariant MSVC builds for VS2022 now use `/fp:precise`
120 * **Change:** CMake config variables now use an `ASTCENC_` prefix to add a
122 * **Change:** CMake config `ASTCENC_UNIVERSAL_BUILD` for building macOS
124 AVX2 builds. Universal builds are now on by default for macOS, and always
126 * **Change:** CMake config `ASTCENC_NO_INVARIANCE` has been inverted to
138 <!-- ---------------------------------------------------------------------- -->
144 performance boost, and a few new quality-of-life features.
151 * **Change:** Core library can be built as a shared object by setting the
152 `-DSHAREDLIB=ON` CMake option, resulting in e.g. `libastcenc-avx2-shared.so`.
158 in floating-point rounding if a distance tie is found when using the
163 * **Bug-fix:** Fixed missing symbol error in decompressor-only builds.
164 * **Bug-fix:** Fixed infinity handling in debug trace JSON files.
175 ![Relative scores 4.4 vs 4.3](./ChangeLogImg/relative-4.3-to-4.4.png)
177 <!-- ---------------------------------------------------------------------- -->
186 * **Bug-fix:** Fixed typo in `-2/3/4partitioncandidatelimit` CLI options.
187 * **Bug-fix:** Fixed handling for `-3/4partitionindexlimit` CLI options.
188 * **Bug-fix:** Updated to `stb_image.h` v2.28, which includes multiple fixes
191 <!-- ---------------------------------------------------------------------- -->
199 Reminder - the codec library API is not designed to be binary compatible across
200 versions. We always recommend rebuilding your client-side code using the updated
204 * **Bug-fix:** Use lower case `windows.h` include for MinGW compatibility.
205 * **Change:** The `-mask` command line option, `ASTCENC_FLG_MAP_MASK` in the
207 * **Optimization:** Always skip blue-contraction for `QUANT_256` encodings.
226 ![Relative scores 4.3 vs 4.2](./ChangeLogImg/relative-4.2-to-4.3.png)
229 <!-- ---------------------------------------------------------------------- -->
238 Reminder - the codec library API is not designed to be binary compatible across
239 versions. We always recommend rebuilding your client-side code using the updated
243 * **Bug-fix:** Compression for RGB and RGBA base+offset encodings no
244 longer generate endpoints with the incorrect blue-contract behavior.
245 * **Bug-fix:** Lowest channel correlation calculation now correctly ignores
248 * **Bug-fix:** ISA compatibility now checked in `config_init()` as well as
250 * **Change:** Removed the low-weight count optimization, as more recent
253 * **Feature:** The `-exhaustive` mode now runs full trials on more
255 by 0.1 to 0.25 dB, but slows down compression by 3x. The `-verythorough`
256 and `-thorough` modes also test more candidates.
257 * **Feature:** A new preset, `-verythorough`, has been introduced to provide
258 a standard performance point between `-thorough` and the re-tuned
259 `-exhaustive` mode. This new mode is faster and higher quality than the
260 `-exhaustive` preset in the 4.1 release.
269 * **Optimization:** Introduce early-out threshold for 2/3/4 partition
271 improves performance for `-medium` and `-thorough` searches, for a minor
273 * **Optimization:** Reduce early-out threshold for 3/4 partition searches
275 especially for `-thorough` searches, for a minor loss in image quality.
292 ![Relative scores 4.2 vs 4.0](./ChangeLogImg/relative-4.0-to-4.2.png)
295 <!-- ---------------------------------------------------------------------- -->
308 * **Change:** Command line tool gains a new `-dimage` option to generate
312 * **Bug-fix:** Library decompressor builds for SSE no longer use masked store
314 * **Bug-fix:** Command line decompressor now correctly uses sized type enums
316 * **Bug-fix:** Command line compressor now correctly loads 16 and 32-bit per
318 * **Bug-fix:** Fixed GCC9 compiler warnings on Arm aarch64.
320 <!-- ---------------------------------------------------------------------- -->
330 * **Change:** The `-array` option for specifying the number of image planes
331 for ASTC 3D volumetric block compression been renamed to `-zdim`.
333 `astcenc`, allowing the CMake install step to write binaries into
335 * **Feature:** A new `-ssw` option for specifying the shader sampling swizzle
336 has been added as convenience alternative to the `-cw` option. This is
340 sampling through .ra in the shader, use `-esw ggga -ssw ra`. In this
341 example `-ssw ra` is equivalent to the alternative `-cw 1 0 0 1` encoding.
342 * **Feature:** The `-a` alpha weighting option has been re-enabled in the
346 * **Feature:** The command line tool now has `-repeats <count>` for testing,
354 * **Feature:** `NO_INVARIANCE` builds will enable the `-ffp-contract=fast`
356 will also set the `-mfma` option. This reduces image quality by up to 0.2dB
357 (normally much less), but improves performance by up to 5-20%.
360 0-1 range, which is less accurate but much faster.
365 * **Optimization:** Small data tables now use in-register SIMD permutes
374 the rest of the codec now uses unpacked weights in the 0-64 range.
377 * **Optimization:** Remove redundant per-decimation mode copies of endpoint
378 and weight structures, which were really read-only duplicates.
379 * **Optimization:** Early-out the same endpoint mode color calculation if it
393 ![Relative scores 4.0 vs 3.7](./ChangeLogImg/relative-3.7-to-4.0.png)
396 - - -
398 _Copyright © 2022-2024, Arm Limited and contributors. All rights reserved._