Lines Matching +full:flat +full:- +full:cache
8 D.u = abs(S0.i - S1.i) + S2.u.
15 ABS_DIFF (A,B) = (A>B) ? (A-B) : (B-A)
21 `v_sad_u32(-5, 0, 0)` would return `4294967291` (`-5` interpreted as unsigned),
78 > and sent to the texture cache. Any texture or buffer resources and samplers
79 > are also sent immediately. However, write-data is not immediately sent to the
80 > texture cache.
102 ## FLAT, Scratch, Global instructions
118 GFX7-8 ISA manuals are mistaken about the available LDS size.
138 ## RDNA L0, L1 cache and DLC, GLC bits
140 The old L1 cache was renamed to L0, and a new L1 cache was added to RDNA. The
141 L1 cache is 1 cache per shader array. Some instruction encodings have DLC and
142 GLC bits that interact with the cache.
144 * DLC ("device level coherent") bit: controls the L1 cache
145 * GLC ("globally coherent") bit: controls the L0 cache
151 Stores and atomics always bypass the L1 cache, so they don't support the DLC bit,
192 ## POPS collision wave ID argument (GFX9-10.3)
221 …tps://github.com/llvm/llvm-project/blob/acb089e12ae48b82c0b05c42326196a030df9b82/llvm/lib/Target/A…
230 …VM source.](https://github.com/llvm/llvm-project/blob/main/llvm/lib/Target/AMDGPU/Utils/AMDGPUBase…
261 VMEM/FLAT/GLOBAL/SCRATCH/DS instruction reads an SGPR (or EXEC, or M0).
273 Any non-SOPP SALU instruction (except `s_setvskip`, `s_version`, and any non-lgkmcnt `s_waitcnt`).
287 When there is a misaligned multi-dword FLAT load/store instruction in WGP mode,
288 it needs to be split into multiple single-dword FLAT instructions.
290 ACO doesn't use FLAT load/store on GFX10, so is unaffected.
294 The 12-bit immediate OFFSET field of FLAT instructions must always be 0.
297 ACO doesn't use FLAT load/store on GFX10, so is unaffected.
309 Any non-VALU instruction reads the EXEC mask. Then, any VALU instruction writes the EXEC mask.
327 "MIMG-NSA in a hard clause has unpredictable results on GFX10.1"
361 transcendental instructions in-between.
382 A VALU instruction reading a non-exec SGPR before the SALU write, or a sa_sdst=0 wait after the