• Home
  • Line#
  • Scopes#
  • Navigate#
  • Raw
  • Download
1Mesa 22.0.0 Release Notes / 2022-03-09
2======================================
3
4Mesa 22.0.0 is a new development release. People who are concerned
5with stability and reliability should stick with a previous release or
6wait for Mesa 22.0.1.
7
8Mesa 22.0.0 implements the OpenGL 4.6 API, but the version reported by
9glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) /
10glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used.
11Some drivers don't support all the features required in OpenGL 4.6. OpenGL
124.6 is **only** available if requested at context creation.
13Compatibility contexts may report a lower version depending on each driver.
14
15Mesa 22.0.0 implements the Vulkan 1.2 API, but the version reported by
16the apiVersion property of the VkPhysicalDeviceProperties struct
17depends on the particular driver being used.
18
19SHA256 checksum
20---------------
21
22::
23
24   e6c41928b5b9917485bd67cec22d15e62cad7a358bf4c711a647979987601250  mesa-22.0.0.tar.xz
25
26
27New features
28------------
29
30- lavapipe,radv,anv KHR_dynamic_rendering
31- radv EXT_image_view_min_lod
32- VK_KHR_synchronization2 on RADV.
33- OpenSWR has been moved to the Amber branch
34- radeonsi, zink ARB_sparse_texture
35- d3d12 GLES3.1 (shader storage buffers, images, compute, indirect draw, draw params, ARB_framebuffer_no_attachments, ARB_sample_shading, and GLSL400)
36- radeonsi, zink ARB_sparse_texture2
37- zink EXT_memory_object, EXT_memory_object_fd, EXT_semaphore, EXT_semaphore_fd
38- anv VK_VALVE_mutable_descriptor_type
39- Vulkan 1.3 on RADV,Anv.
40- radeonsi, zink ARB_sparse_texture_clamp
41
42
43Bug fixes
44---------
45
46- anv: Unable to import I915_FORMAT_MOD_Y_TILED surfaces produced by iHD vaapi driver
47- anv doesn't always resolve aux buffers with private bindings on transition to external queue
48- Redraw freeze after upgrade to Xwayland 21.1.3
49- [ANV] Rendering corruption in DOOM Eternal
50- lavapipe: dEQP-VK.spirv_assembly.instruction.compute.float16.arithmetic_3.step fails
51- radv: When using VS+PS only, primitive ID is mixed up after NGG culling
52- [22.0.0_rc{1,2}] radeonsi: broken support for HD7850 (radeon 0000:0b:00.0: ring 0 stalled for more than...)
53- RADV: Texture seams in The Evil Within 2 (regression)
54- ANV: Bad output from TransformFeedback . Regression from Mesa 21. Something to do with VB+XFB -> VB+XFB dependency?
55- radv: CullDistance fail
56- Changing the buffer modifer via Wayland dmabuf-feedback apparently does not reliably reset the buffer age to 0
57- radv: causes hang untl gpu reset on navy_flounder
58- Crocus related crashes when running QT apps
59- Please add missing ray tracing Vulkan extensions to features.txt
60- [anv][bisected][regression][tgl] dEQP-VK.dynamic_rendering.suballocation.multisample_resolve.* failures
61- radv/aco: DXVKs strict float emulation causes black screen/graphical corruption on RDNA1
62- intel: env vars doc out of date
63- [REGRESSION][BISECTED] iris: Qutebrowser/QtWebEngine sporadically flashes the window in white
64- Flickering Intel Uhd 620 Graphics
65- Vulkan Wayland WSI returns empty surface formats
66- [radeon, rs690] piglit spec@glsl-1.10@execution@built-in-functions@fs-atan-float + 10 other 'atan' tests crash (mesa-22dev)
67- [bisected] artifacts in vaapi h.264 playback
68- [bisected] artifacts in vaapi h.264 playback
69- radv: msaa regression
70- [DG2] dEQP-GLES3.functional.shaders.derivate.fwidth* failures
71- FIFO present mode doesn't wait
72- Dota2 lockup wsi/x11: Avoid a class of deadlocks in the WSI queue thread
73- [vulkan][intel] Implement VK_VALVE_mutable_descriptor_type
74- Broken Terraria & Glitches in Forza Horizon 4
75- radv: Q2RTX with cswave32 crashes the GPU
76- CopelliaSim crashes on adding vision sensor to a scene on Ubuntu 20+, runs fine on Ubuntu 18.04
77- radv: renderpass regression
78- Dirt Rally: Flickering glitches on certain foliage going from Mesa 21.2.5 to 21.3.0
79- FrontFacing input is broken on Intel/Vulkan
80- glGetActiveUniform failing with GL_INVALID_VALUE for no reason
81- llvmpipe: Unimplemented get_driver_uuid/get_device_uuid causes segfaults in e.g. Wine
82- All vulkan apps terminate with VK_ERROR_DEVICE_LOST
83- Texture rendering broken on Vega 56, probably due to MR 14171 (tested on Battlefield 1 / vkd3d-proton)
84- Major graphic distortions with RADV in DX11/DX12 with Mesa-git as of January 12th 2022
85- Setting mesa_glthread to true crashes Minecraft when Sodium optimisation mod is enabled.
86- ANV: deviceUUID assumes a single device per system
87- MESA Overlay Vulkan layer version
88- [radeonsi, regression, bisected]: Rendering issues with Factorio
89- Distorted graphics when running Battle.net app under Wine with Radeon hardware
90- [r300g] Battle.net app window content rotated and stretched
91- Freedreno reporting 10 MB of Video memory that's causing issues with WineD3D on Poco F1
92- Panfrost - RK3399 - FACTORIO - glitches everywhere
93- Baldurs Gate 3 (Patch 6) - ribbon-like artifacts on textures
94- mesa >= 21.x (with llvmpipe) will break the game JVGS
95- Game Starsector crashes under certain circumstances with mesa 21.3.+
96- FreeSpace models incorrectly rendered on Polaris cards, causing system freeze
97- ci: Test with Skqp
98- ci: Test with Skqp
99- Metro Exodus (native) fails to launch
100- Poor performance on RX 580 (Polaris20)
101- VAAPI H.264 encoding is slow on Polaris vs. HEVC and Windows AMF
102- v3dv: Missing gem handle reference tracking logic
103- Incomplete evaluation of nested DEFINE macros
104- [r300g, bisected] piglit glsl-fs-discard-04 fails
105- Panfrost G52 Firefox terrible glitches on youtube playback
106- prim restart broken on crocus by prim restart fix for d3d12
107- RADV: Invalid fast clear triggered
108- Assassin's Creed Syndicate crashes with Mesa 21.3.0+ ACO
109- anv: dEQP-VK.graphicsfuzz.spv-stable-pillars-volatile-nontemporal-store fails
110- seg-fault with soft fp64 on iris after register allocation change
111- Taking RGP traces on VANGOGH results in hangs
112- Flickering and blackscreen on Mpv and Clapper (and also low performance on glxgears)
113- intel/vec4: Rework texture handling to not use \`ir_texture_opcode`
114- lavapipe leaks introduced in eb7eccc76f0a02b3a3291adaa39788b157979dc5
115- DXVK SIGBUS with Turnip on Poco F1 at loading to open world.
116- RADV: IsHelperInvocationEXT query is not considered volatile in ACO
117- [GraphicsFuzz] dEQP-VK.graphicsfuzz.stable-binarysearch-tree-nested-if-and-conditional
118- [bisected] Mesa 21.3.x breaks GBM with NVIDIA closed source driver 495.44
119- [DG2] dEQP-GL[45|ES31].functional.shaders.builtin_functions.pack_unpack.packhalf2x16_compute fail
120- Steam reports fatal error with missing OpenGL GLX extension not supported by display since December 6th
121- SNK HEROINES Tag Team Frenzy stuck on white screen
122- Run-time dependency dri found: NO
123- Rise of the Tomb Raider segfault
124- glsl/compiler: suspcious snippet of code (Unused variable)
125- GPU Crash in Yuzu 6600xt 5.15
126- anv: gfxbench 5.0 vulkan fails on android
127- [spirv-fuzz]  lower_trivial_continues_block: Assertion \`!first_instr || instr_is_continue(first_instr)' failed.
128- [RADV] Crash in Metro Exodus in Caspain chapter and Sam's Story
129- [nir][radv] more undefined shifts caught by gcc's sanitizer
130- NIR validation failed after nir_copy_prop
131- lima: Corrupted Android-12 UI on Allwinner A64
132- RADV/ACO: Rendering glitches in Forza Horizon 5 windshields
133- NV_pixel_buffer_object extension can be available in a GLES2 context but the associated glGet constants don't work
134- ThreadSanitizer: data races inside libvulkan_radeon.so
135- [radeonsi] Commit b1b491cdbbac1ccb785f5eb1802006f76ec253f5 leads to memory leak
136- dEQP-GLES31.*imulextended* compiling fp64 glsl 4.00 shader
137- anv: vkMapMemory fails "Memory object already mapped" with VA-API (mpv --gpu-api=vulkan --hwdec=vaapi)
138- RADV: crash in radv_GetPhysicalDeviceImageFormatProperties2
139- RADV/ACO: Monster Hunter Rise Demo renders wrong results
140- radv: Odd lack of implicit host memory invalidation
141- Regression/Bisected: Crash in Godot games when steam overlay enabled
142- RADV: IsHelperInvocationEXT query is not considered volatile in ACO
143- ANV: error submitting with the same semaphore for wait and signal - regression?
144- Vulkan does not work when 10-bit colors ( DefaultDepth 30) are enabled under Xorg on AMD/RADV
145- [TGL+] anv: some dEQP-VK.drm_format_modifiers.* fails
146- radv: SQTT generates trace with no data
147- Mesa 21.3rc3 - compile failure
148- iris: subslice assertion failure on some hardware during initialization
149- Final Fantasy V (Old version): Random text characters are not displayed
150- Diagonal rendering artifacts in Tomb Raider
151- vkCmdSetViewport sometimes ignored on Ivy Bridge, Haswell
152- dota2 black squares corruption
153- [hsw][bisected][regression] dEQP-VK.reconvergence.*nesting* failures
154- Piglit.spec.arb_shader_image_load_store.compiler.expression-allowed-1d_vert failed.
155- anv: dEQP-VK.wsi.wayland.<various> failures
156- freedreno: Crash in fd_bo_cache_alloc()
157- radv_android.c: build errors due to commit 49c3a88
158- dEQP-EGL.functional.sharing.gles2.multithread.* regression with Iris
159- [Turnip] Forsaken Remastered rendering issue
160- [radeonsi] Euro Truck Simulator 2: broken mimimap
161- Old Linux Rocket League version failing to start on Poco F1.
162- [regression][bisected] Launching Valheim OpenGL game leads to GPU Hang
163- Android Meson build regression: hardware/system information apps crash on Raspberry Pi 4
164- radv: format properties are broken with modifiers
165- anv: dEQP-VK.graphicsfuzz.cov-multiple-one-iteration-loops-global-counter-write-matrices fails
166
167
168Changes
169-------
170
171Aaron Watry (2):
172
173- clover: implement CL_IMAGE_BUFFER
174- clover/image: add dimension property
175
176Adam Jackson (37):
177
178- mesa/x11: Remove the swrast-classic-based fake libGL
179- mesa: Remove unused _mesa_all_buffers_are_unmapped
180- mesa: Remove unused _mesa_get_render_format
181- mesa: Remove unused _mesa_DrawTexx{,v}
182- mesa: Remove unused _check_*TexGen*OES
183- mesa: Remove unused _mesa_apply_ci_transfer_ops
184- mesa: Remove unused _mesa_compressed_image_address
185- mesa: Remove unused _es_color4ub
186- mesa: Remove unused _es_RenderbufferStorageEXT
187- mesa: Remove unused _es_{,Get}TexGenfv
188- mesa: Remove unused _vbo_current_binding
189- mesa/vbo: Always use buffer objects for storage
190- mesa/program: Dead code cleanup
191- mesa: Remove unused execmem code
192- mesa: Make _mesa_generate_mipmap_level static
193- glapi: Remove remnants of EXT_paletted_texture and the imaging subset
194- docs: Remove no-longer-accurate text about the xlib driver
195- mesa: Remove unused _mesa_initialize_visual
196- dri: Remove unused driContextSetFlags
197- dri: Remove unused driUpdateFramebufferSize
198- dri: Remove unused driGetRendererString
199- glx: Fix GLX_NV_float_buffer fbconfig handling
200- mesa: Remove unused _mesa_allow_light_in_model
201- mesa: Remove unused _mesa_AllocTextureStorage_sw
202- mesa: Remove unused _mesa_bind_texture
203- mesa: Remove unused _mesa_format_fallback_rgbx_to_rgba
204- mesa: Remove unused _mesa_get_linear_format_srgb
205- mesa: Remove unused _mesa_convert_colors
206- mesa: Remove unused _mesa_all_varyings_in_vbos
207- mesa: Remove unused _mesa_delete_nameless_texture
208- mesa/math: Remove unused m_translate.c
209- mesa: Remove unused _mesa_is_alpha_to_coverage_enabled
210- mesa: Remove unused _mesa_is_front_buffer_{draw,read}ing
211- mesa: Remove unused _mesa_set_sampler_{filters,srgb_decode,wrap}
212- mesa: Remove unused src/mesa/x86-64
213- wsi/x11: Avoid a class of deadlocks in the WSI queue thread
214- dri_interface: Remove the remaining DRI1 API definitions
215
216Alejandro Piñeiro (20):
217
218- v3dv/pipeline: don't clone the nir shader at pipeline_state_create_binning
219- v3d/clif: add support for dumping GS shader state
220- broadcom/compiler: remove unused macro and function definition
221- v3d: remove unused include
222- v3d: remove static v3d_start_binning
223- gallium/u_blitter: clean up texcoords ZW when filling up just XY
224- v3d,v3dv: move TFU register definition to a common header
225- vulkan: move common format helpers to vk_format
226- broadcom/compiler: make shaderdb debug output compatible with shaderdb's report tool
227- meson: bump meson requirement to 0.53.0
228- v3d: support for texture buffer objects
229- v3d: add support for no buffer object bound
230- v3d: restrict formats supported for PIPE_BIND_SHADER_IMAGE
231- v3d: enable ARB_texture_buffer_object and ARB_texture_buffer_range
232- broadcom/compiler: avoid unneeded sint/unorm clamping when lowering stores
233- v3dv: simplify v3dv_debug_ignored_stype
234- vulkan: return default string for undefined enum
235- v3dv: remove unused v3dv_descriptor_map_get_texture_format
236- v3dv: check correct format when load/storing on a depth/stencil buffer
237- vc4/nir_lower_blend: update write mask when we update num components
238
239Alex Xu (Hello71) (5):
240
241- Use initial-exec TLS for glibc only, enable TLS elsewhere
242- Auto-enable TLSDESC support
243- meson: check for lld split TLSDESC bug (fixes #5665)
244- Fix TSD stubs for non-initial-exec case (fixes #5667).
245- meson: tlsdesc: minor reformatting, add comments
246
247Alyssa Rosenzweig (179):
248
249- panfrost: Don't allow rendering/texturing 48-bit
250- panfrost: Detect implementations support AFBC
251- panfrost,panvk: Use dev->has_afbc instead of quirks
252- panfrost: Remove unused MIDGARD_NO_AFBC quirk
253- panfrost: Fix gl_FragColor lowering
254- panfrost: Workaround ISSUE_TSIX_2033
255- panfrost: Add internal afbc_formats
256- panfrost: Decompress for incompatible AFBC formats
257- panfrost: Enable AFBC on v7
258- panfrost: Remove ancient TODO
259- panfrost: Remove duplicated #if
260- panfrost: Rename depth bias fields
261- panfrost: Add alpha_zero_nop/one_store predicate
262- panfrost: Test alpha_zero_nop/one_store predicates
263- panfrost: Pass through alpha_zero_nop/one_store
264- panvk: Pass through alpha_zero_nop/one_store flags
265- nir/lower_blend: Use correct clamp for SNORM
266- mesa: Require MRT support for GL3/ES3
267- nir/lower_pntc_ytransform: Support PointCoordIsSysval
268- pan/bi: Fix typo in helper invocation analysis
269- pan/bi: Make bi_index padding explicit
270- pan/bi: Add secondary staging count
271- pan/bi: Add second destination to TEXC
272- pan/bi: Add bifrost_texture_operation_mode enum
273- pan/bi: Add bifrost_dual_texture_operation struct
274- pan/bi: Use BIFROST_TEXTURE_OPERATION_SINGLE enum
275- pan/bi: Support dual texture scheduling
276- pan/bi: Add bi_dual_tex_as_u32 helper
277- pan/bi: Fix up dual texturing registers
278- pan/bi: Add dual texture fusing pass
279- pan/bi: Test dual texture fusing
280- pan/bi: Enable dual texture fusing pass
281- pan/mdg: Remove duplicate compiler option
282- pan/bi: Annotate Valhall instructions with units
283- pan/bi: Add full form of Valhall MUX instruction
284- pan/bi: Add sqrt form of Valhall FREXPM
285- pan/bi: Add Valhall's special FMA_RSCALE instructions
286- pan/bi: Add XML for assembling Valhall image stores
287- pan/bi: Forbid unaligned staging registers on Valhall
288- pan/bi: Confirm IDP unit on Valhall
289- pan/bi: Suppress uniform validation for LD_BUFFER
290- pan/bi: Add XML for LD_BUFFER
291- asahi: Rename PANDECODE->AGXDECODE
292- asahi: Deflake addresses
293- asahi: Allocate special scratch buffers
294- asahi: Fix agx_map_* structures
295- asahi: Remove silly magic numbers
296- asahi: Remove obnoxious workaround
297- asahi: Fix BIND_PIPELINE sizing and alignment
298- panfrost: Collapse 0 parameters in drm-shim
299- panfrost: Handle AFBC_FEATURES in drm-shim
300- panfrost: Add empty tile flags to GenXML
301- panfrost: Only build GPU indirect kernels for v7
302- pan/bi: Link with Valhall disassembler
303- pan/va: Only hex dump when verbosely disassembling
304- panfrost: Add "hex" type to GenXML
305- panfrost: Add XML for Valhall data structures
306- panfrost: Zero initialize disassembler stats
307- panfrost: Don't shadow Mesa's fui()
308- panfrost: Add Valhall support to pandecode
309- pan/va: Improve assembler unit test output
310- pan/va: Disambiguate sign of CSEL instructions
311- pan/va: Add more assembler tests
312- pan/va: Make LD_VAR index more fine-grained
313- pan/va: Add sample/update modes to LD_VAR
314- pan/va: Add table parameter to LD_ATTR_IMM
315- pan/mdg: Fix definition of UBO unpack
316- docs/macos: Update for recent Mesa changes
317- pan/indirect_draw: Don't upload garbage UBO
318- pan/indirect_draw: Split out update_dcd
319- pan/indirect_draw: Support IDVS jobs
320- panfrost: Track preloaded registers
321- panfrost: Set preload descriptor more accurately
322- panfrost: Remove unused shader info bits
323- panfrost: Fix Secondary Shader field
324- panfrost: Treat IDVS jobs as tiler for scoreboarding
325- panfrost: Add IDVS fields to shader_info
326- panfrost: Split out regalloc/preload helpers
327- panfrost: Remove regalloc from v6.xml
328- panfrost: Set secondary_* fields for IDVS
329- panfrost: Extract panfrost_draw_emit_vertex_section
330- panfrost: Emit IDVS jobs
331- panfrost: Extract panfrost_batch_skip_rasterization
332- panfrost: Skip rasterizer discard draws without side effects
333- panfrost: Align instance size for IDVS
334- panfrost: Add panfrost_compile_inputs->no_idvs option
335- pan/bi: Extract bi_finalize_nir
336- pan/bi: Allow UBO pushing to run multiple times
337- pan/bi: Add IDVS mode to bi_context
338- pan/bi: Remove the "wrong" stores in IDVS variants
339- pan/bi: Split out varying store paths
340- pan/bi: Use position shader ST_CVT path
341- pan/bi: Add helper to decide if IDVS should be used
342- pan/bi: Specialize shaders for IDVS
343- pan/va: Remove extra LD_VAR_IMM_F32 source
344- pan/va: Rename LEA_ATTR to LEA_VARY
345- pan/va: Add .signed bit to right shift instructions
346- pan/va: Generalize LD_VAR_IMM_* to support flat varyings
347- pan/bi: Don't call useless NIR passes
348- panfrost: Make pan_merge macro more robust
349- pan/bi: Don't read base for combined stores
350- pan/mdg: Don't read base for combined stores
351- nir: Eliminate store_combined_output_pan BASE
352- panfrost: Simplify blend lowering pass
353- nir: Extend store_combined_output_pan
354- panfrost: Combine dual source blends
355- panfrost: Remove pan_nir_reorder_writeout
356- pan/bi: Use is_staging_src helper
357- pan/bi: Allow an extra staging source
358- pan/bi: Use fused dual source blending
359- pan/bi: Fix load_const of 1-bit booleans
360- gallium/util: Add pixel->blocks box helper
361- lima,panfrost: Correct pixel vs block mismatches
362- v3d: Use u_box_pixels_to_blocks helper
363- vc4: Use u_box_pixels_to_blocks helper
364- pan/decode: Remove hierarchy mask check
365- pan/decode: Include addresses for jobs
366- pan/decode: Track mmaps with a red-black tree
367- pan/decode: Add hexdump helper
368- pan/decode: Add pandecode_dump_mappings
369- pan/decode: Decode Valhall surface descriptor
370- panfrost: Remove FBD pointer on Bifrost XML
371- pan/decode: Don't print Preload twice
372- pan/decode: Disassemble Bifrost quietly
373- pan/bi: Schedule around blend shader register clobbering
374- asahi: Fake more CAPs with dEQP hacks mode
375- asahi: Warn when hacks mode is enabled
376- asahi: Add XML for unknown 0x4a packet
377- asahi: Break out Fragment Parameters word
378- asahi: Rectify confusing XML comment
379- asahi: Route sample mask from shader
380- agx: Add sample_mask instruction
381- agx: Handle discard intrinsics
382- agx: Lower UBO loads to use per-element indexing
383- panfrost: Use u_reduced_prim for primitive checks
384- panfrost: Only cull polygons
385- pan/bi: Pull BLEND precolouring out of per-dest loop
386- pan/decode: Fix missing newlines in error messages
387- panfrost: Remove HAS_SWIZZLES quirk
388- panfrost: Remove MIDGARD_SFBD quirk
389- panfrost: Remove MIDGARD_BROKEN_FP16 quirk
390- panfrost: Remove NO_TILE_ENABLE_MAP quirk
391- panfrost: Remove MIDGARD_{NO_TYPED_BLEND_STORES,MISSING_LOADS}
392- panfrost: Don't set NO_BLEND_PACKS on Bifrost
393- panfrost: Simplify format class selection
394- panfrost: Remove NO_BLEND_PACKS quirk
395- panfrost: Make primary_shader boolean
396- panfrost: Fix v9 "Stencil from shader" bit
397- panfrost: Don't pass quirks to pan_lower_framebuffer
398- panfrost: Centralize our model list
399- panfrost: Replace panfrost_model_name with model->name
400- panfrost: Make the GPU allowlist implicit
401- panfrost: Get performance counters from table
402- pan/bi: Clean up quirks
403- pan/bi: Assume future Valhall is 16-wide warps
404- panfrost: Add Mali-G51 support
405- docs/panfrost: Add new Midgard/Bifrost chips
406- pan/va: Add .absolute bit to BRANCHZI
407- pan/va: Add missing <clamp/> to V2F32_TO_V2F16
408- pan/va: Add missing fields to LD_TILE
409- pan/va: Test LD_TILE assembly
410- pan/va: Fix MKVEC.v2i16 lane select
411- pan/va: Fix lane select for [US]_TO_[USF]32
412- pan/va: Vectorize 8->16-bit conversions
413- pan/va: Add 2-channel 8-bit swizzles for conversions
414- pan/va: Add lots of swizzle assembler tests
415- pan/va: Handle shift lanes in assembler
416- pan/va: Add ARM_shader_framebuffer_fetch asm test
417- pan/bi: Avoid \*FADD.v2f16 hazard in optimizer
418- pan/bi: Avoid \*FADD.v2f16 hazard in scheduler
419- pan/bi: Lower swizzles on CSEL.i32/MUX.i32
420- panvk: Use more reliable assert for UBO pushing
421- pan/bi: Specialize IDVS in NIR
422- panfrost: Fix FD resource_get_handle
423- panfrost: Fix set_sampler_views for big GL
424- panfrost: Handle NULL sampler views
425- panfrost: Handle NULL samplers
426- panfrost: Flush resources when shadowing
427- panfrost: Push twice as many uniforms
428
429Andreas Baierl (1):
430
431- lima: Fix drawing wide lines
432
433Antonio Caggiano (2):
434
435- gallium: add a link shader hook
436- virgl: Link shader program
437
438Anuj Phogat (7):
439
440- genxml/gen125: Update 3DSTATE_TE fields
441- iris: Enable geometry distribution
442- anv: Enable geometry distribution
443- iris: Enable tessellation redistribution
444- anv: Enable tessellation redistribution
445- anv, iris: Add Wa_16011773973 for DG2
446- anv, iris: Add Wa_22011440098 for DG2
447
448Bas Nieuwenhuizen (73):
449
450- radv: Fix modifier property query.
451- radv: Add bufferDeviceAddressMultiDevice support.
452- amd/rgp: Use VGH clocks for RGP workaround.
453- radv: Disable coherent L2 optimization on cards with noncoherent L2.
454- ci: Bump libdrm for the android image.
455- ci: Add libelf to the Android image.
456- amd/addrlib: Use alternative ADDR_C_ASSERT definition.
457- radv: Remove android build warning.
458- radv: Always use linker script when possible.
459- aco: Remove useless sub-expr.
460- meson: Check arguments before adding.
461- amd/addrlib: Ignore self-assign warnings.
462- util: Add support for clang::fallthrough.
463- ci: Add RADV to Android CI.
464- radv: Fix memory corruption loading RT pipeline cache entries.
465- radv: Add more checking of cache sizes.
466- radv: Don't crash if VkExternalImageFormatProperties isn't provided.
467- radv: Avoid using a new subpass for ds clears.
468- radv: Stop using a subpass for color clears.
469- radv: Remove the skipping of framebuffer emission if we don't have a framebuffer.
470- radv: Set RB+ registers correctly without framebuffer.
471- radv: Don't emit framebuffer state if there is no renderpass active.
472- radv: Add named constants for max framebuffer width/height.
473- radv: Do not use VK_FORMAT_UNDEFINED in meta passes.
474- radv: Support VK_KHR_dynamic_rendering for pipeline creation.
475- radv: Support Begin/EndRendering.
476- radv: Support dynamic rendering inheritance info.
477- radv: Expose the VK_KHR_dynamic_rendering extension.
478- radv/amdgpu: Add a syncobj per queue.
479- radv/winsys: Add queue family param to submit.
480- radv/amdgpu: Add support for submitting 0 commandbuffers.
481- radv: Use the winsys 0 cmdbuffer submission support.
482- radv: Set up ETC2 emulation wiring.
483- radv: Use the correct base format for reintepretation.
484- radv: Add extra plane for decoding ETC images with emulation.
485- radv: Add ETC2 decode shader.
486- radv: Deal with border colors with emulated ETC2.
487- radv: Expose the ETC2 emulation.
488- radv/amdgpu: Only wait on queue_syncobj when needed.
489- radv: Add safety check for RGP traces on VanGogh.
490- radv: Use correct buffer size for query pool result copies.
491- radv: Skip wait timeline ioctl with 0 handles.
492- radv/amdgpu: Use VkResult for wait_timeline_syncobj.
493- vulkan/runtime: Refactor queue submit to take an argument struct.
494- vulkan/runtime: Add sparse bind support.
495- radv: Use dispatch table for QueueWaitIdle in the SQTT layer.
496- radv: Use dispatch table for wsi_display.c
497- radv: Use vulkan runtime for device lost.
498- meson: Bump libdrm_amdgpu version req to 2.4.109.
499- radv: Initialize vk device drm fd.
500- radv: Add function to allow WSI signalling fences/semaphores.
501- radv: Set horizontal sync types.
502- radv: Use vk_command_buffer for preambles.
503- radv: Add new cs_submit2 winsys call.
504- radv: Add new submission path for use by the common sync framework.
505- radv: Delete lots of sync code.
506- radv: Remove syncobj reset mechanism.
507- radv: Rename submit2->submit.
508- radv: Remove dependencies on vk_common entrypoints.
509- radv: Set optimal copy alignment to 1.
510- radv: Add common entrypoint dependency.
511- radv: Use MAX_PUSH_CONSTANTS_SIZE for saved push constants.
512- radv: Use 16-bits to store push constant indices.
513- radv: 256 byte push constants.
514- Revert "nir/algebraic: distribute fmul(fadd(a, b), c) when b and c are constants"
515- util/fossilize_db: Fix double free in error handling.
516- vulkan/wsi/display: Add common implementation of VK_EXT_display_control.
517- anv: Remove VK_EXT_display_control support in favor of common impl.
518- radv: Remove VK_EXT_display_control support in favor of common impl.
519- radv/amdgpu: Fix handling of IB alignment > 4 words.
520- radv/amdgpu: Use aligned sizing for IB buffers.
521- radv: Handle SDMA for padding.
522- radv: Fix preamble argument order.
523
524Biju Das (1):
525
526- kmsro: Add 'rcar-du' driver support
527
528Boris Brezillon (16):
529
530- vulkan: Fix weak symbol emulation when compiling with MSVC
531- vulkan: Set unused entrypoints to vk_entrypoint_stub when compiling with MSVC
532- vulkan: Fix entrypoint generation when compiling for x86 with MSVC
533- microsoft/compiler: Fix dxil_nir_create_bare_samplers()
534- gallium/d3d12: Don't use designated initializers
535- d3d12: Fix "use of designated initializers requires at least '/std:c++20'" error
536- microsoft/compiler: Fix sampler/texture array emission
537- microsoft/compiler: Skip images in redirect_texture_derefs()
538- microsoft/compiler: textureLoad() doesn't take a LOD on MS textures
539- microsoft/spirv_to_dxil: Remove dead variables after the struct split pass
540- microsoft/spirv_to_dxil: Allow dumping NIR
541- microsoft/spirv_to_dxil: Support [un]conditional YZ-flip
542- microsoft/spirv_to_dxil: Discard PSIZ accesses
543- microsoft/spirv_to_dxil: Lower atomics to their dxil variants
544- microsoft/spirv_to_dxil: Make sure the SampleMask is a uint
545- microsoft/spirv_to_dxil: Define idep_libspirv_to_dxil
546
547Boyuan Zhang (5):
548
549- radeon/vcn: update vcn1 enc interface
550- radeon/vcn: update vcn2 enc interface
551- radeon/vcn: remove redundancy for vcn2 enc
552- radeon/vcn: combine encode params func
553- radeon/vcn: combine session init func
554
555Caio Oliveira (89):
556
557- nir/schedule: Handle nir_intrisic_scoped_barrier
558- nir: Add nir_var_mem_image
559- intel/fs: Consider nir_var_mem_image for TGM fences
560- anv: Use nir_foreach_image_variable
561- spirv: Use nir_var_mem_image
562- intel/compiler: Rename vec4 test fixtures
563- intel/compiler: Build all tests in a single binary
564- spirv: Use a single binary for gtests
565- nir: Use a single binary for gtests
566- util: Consolidate existing gtests in a single binary
567- util: Move tests in single file directories to tests/
568- util: Convert u_atomic_test to use gtest
569- util: Convert blob_test to use gtest
570- util: Convert mesa-sha1_test to use gtest
571- util: Convert rb_tree_test to gtest
572- util: Convert roundeven_test to use gtest
573- util: Convert sparse array multithread test to use gtest
574- util: Move test sources to tests/ directory
575- intel/compiler: Add helpers to select SIMD for compute shaders
576- intel/compiler: Use SIMD selection helpers for CS
577- intel/compiler: Use SIMD selection helpers for variable workgroup size
578- intel/compiler: Don't use SIMD larger than needed for workgroup
579- anv: Get rid of "may be used initialized" warning in anv_QueueSubmit2KHR
580- anv: Make shaders array in anv_graphics_pipeline fit Task/Mesh
581- anv: Process FS last when compiling graphics pipeline
582- intel/compiler: Make brw_nir_populate_wm_prog_data() static
583- intel/compiler: Use gl_shader_stage_uses_workgroup() helpers
584- .mailmap: Simplify my name
585- intel/genxml: Add Mesh Shading structures
586- intel/genxml: Inline the BODY structs into the instructions
587- intel/dev: Add an intel_device_info::has_mesh_shading bit
588- intel/blorp: Add option to emit packets that disable Mesh
589- gtest: Add mesa-gtest-extras.h with array ASSERT/EXPECT macros
590- util: Change blob_test to use macro from mesa-gtest-extras.h
591- pan/bi: Make some headers compilable with C++
592- pan/bi: Use gtest for test-scheduler-predicates
593- pan/bi: Use gtest for test-packing
594- pan/bi: Use gtest for test-pack-formats
595- pan/bi: Use gtest for test-optimizer
596- pan/bi: Use gtest for test-constant-fold
597- pan/bi: Drop unused test helpers
598- util/ra: Add simple test for register set serialization
599- util/ra: Fix deserialization of register sets
600- gtest: Fix output of array ASSERT/EXPECT macros
601- intel: Add INTEL_DEBUG=task,mesh
602- intel/compiler: Properly lower WorkgroupId for Task/Mesh
603- intel/compiler: Handle per-primitive inputs in FS
604- intel/compiler: Don't stage Task/Mesh outputs in registers
605- intel/compiler: Don't lower Mesh/Task I/O to temporaries
606- intel/compiler: Add structs to hold TUE/MUE
607- intel/compiler: Make MUE available when setting up FS URB access
608- intel/compiler: Export brw_nir_lower_simd
609- intel/compiler: Add backend compiler basics for Task/Mesh
610- intel/compiler: Lower Task/Mesh local_invocation_{id,index}
611- intel/compiler: Implement Task Output and Mesh Input
612- intel/compiler: Implement Mesh Output
613- util: Convert cache test to use gtest
614- util: Use ralloc for strings in cache test
615- intel/compiler: Use a struct for brw_compile_tcs parameters
616- intel/compiler: Use a struct for brw_compile_tes parameters
617- intel/compiler: Use a struct for brw_compile_gs parameters
618- intel/compiler: Use a struct for brw_compile_bs parameters
619- nir: Initialize nir_register::divergent
620- meson: Bump version required for gtest protocol
621- Revert "nir: disable a NIR test due to undebuggable & locally unreproducible CI failures"
622- ci/windows: Remove line numbers of SPIR-V errors in spirv2dxil tests
623- spirv: Update headers and metadata to SPIR-V 1.6, revision 1
624- nir: Handle volatile semantics for loading HelperInvocation builtin
625- spirv: Identify non-temporal image operand added in SPIR-V 1.6
626- spirv: Use the incorporated names
627- anv/blorp: Split blorp_exec into a render and compute
628- anv: Refactor dirty masking in cmd_buffer_flush_state
629- anv: Simplify assertions related to graphics stages
630- anv: Add another case to INTEL_DEBUG=pc output
631- anv: Use pending pipe control mechanism in flush_pipeline_select()
632- anv/blorp: Apply pending pipe flushes after PIPELINE_SELECT
633- intel/fs/xehp: Add unit test for handling of RaR deps across multiple pipelines.
634- anv: SPIR-V 1.6 shaders imply ALLOW_VARYING_SUBGROUP_SIZE
635- anv: Fix subgroupSupportedStages physical property
636- intel: Only reserve space for Compute Engine out of URB in Gfx12LP
637- intel/compiler: Have specific mesh handling in calculate_urb_setup()
638- intel/compiler: Merge Per-Primitive attribute handling in Mesh case
639- compiler, intel: Add gl_shader_stage_is_mesh()
640- intel: Add INTEL_URB_DEREF_BLOCK_SIZE_MESH
641- intel/common: Add helper for URB allocation in Mesh pipeline
642- anv: Add boilerplate for VK_NV_mesh_shader
643- anv: Implement Mesh Shading pipeline
644- intel/dev: Enable Mesh Shading for DG2
645- anv: Add experimental support for VK_NV_mesh_shader
646
647Caleb Callaway (2):
648
649- vulkan/overlay: support Vulkan 1.2
650- vulkan/overlay: revise and reformat README
651
652Carsten Haitzler (3):
653
654- panfrost: Add GPU G76 to the set of known ids
655- kmsro: Add komeda DPU
656- panfrost: Don't double-free when handling error for unsupported GPU
657
658Charles Baker (5):
659
660- zink: Enable VK_KHR_image_format_list for VK_KHR_imageless_framebuffer
661- zink: Output PackHalf2x16 to uint not float
662- zink: Avoid redundant cast to uint on PackHalf2x16 result
663- zink: Set vertex binding stride without dynamic state extensions
664- Revert "zink: handle vertex buffer offset overflows"
665
666Charles Giessen (10):
667
668- radv: Update description of vk_icdNegotiateLoaderICDInterfaceVersion
669- v3dv: Update LoaderICDInterfaceVersion to v4
670- panvk: Export vk_icdGetPhysicalDeviceProcAddr
671- freedreno, tu: Export vk_icdGetPhysicalDeviceProcAddr
672- radv: Update LoaderICDInterfaceVersion to v5
673- panvk: Update LoaderICDInterfaceVersion to v5
674- anv: Update LoaderICDInterfaceVersion to v5
675- lavapipe: Update LoaderICDInterfaceVersion to v5
676- freedreno, tu: Update LoaderICDInterfaceVersion to v5
677- v3dv: Update LoaderICDInterfaceVersion to v5
678
679Charmaine Lee (1):
680
681- mesa: fix misaligned pointer returned by dlist_alloc
682
683Cherser-s (1):
684
685- radv: handle VK_DESCRIPTOR_TYPE_SAMPLER in VK_VALVE_mutable_descriptor_type extension
686
687Chia-I Wu (23):
688
689- venus: fix vn_instance_wait_roundtrip when seqno wraps
690- venus: prefer VIRTGPU_BLOB_MEM_HOST3D for shmems
691- venus: fix vn_buffer_get_max_buffer_size
692- venus: add vn_renderer_util.[ch]
693- venus: add vn_renderer_shmem_pool
694- venus: use vn_renderer_shmem_pool for reply shmems
695- venus: add vn_cs_encoder_storage_type
696- venus: add VN_CS_ENCODER_STORAGE_SHMEM_POOL for VkCommandBuffer
697- venus: cache shmems
698- venus: add some trace points
699- virgl: disable texture uploads with copy transfers
700- vulkan/wsi: add wsi_common_get_image
701- anv,lavapipe,v3dv: use wsi_common_get_image
702- freedreno/drm, turnip: set DRM_RDWR for exported dma-bufs
703- venus: fix VK_KHR_driver_properties
704- venus: format with clang-format
705- venus: remember the memory bound to a swapchain image
706- venus: handle VkBindImageMemorySwapchainInfoKHR
707- vulkan/wsi/x11: fix x11_image_init return value on errors
708- venus: updates to the doc
709- turnip: respect buf->bo_offset in transform feedback
710- glthread: call _mesa_glthread_BindBuffer unconditionally
711- venus: update venus-protocol to 1.3.204
712
713Christian Gmeiner (25):
714
715- ci/etnaviv: add manual piglit testing
716- ci/bare-metal: armhf: move BM_ROOTFS to generic place
717- ci/etnaviv: armhf: switch to .baremetal-test-armhf
718- ci/etnaviv: no need to force nir anymore
719- ci/bare-metal: add .baremetal-test-arm64
720- ci/bare-metal: switch to common .baremetal-test-arm64
721- panfrost/ci: update piglit fails
722- etnaviv/ci: update piglit fails
723- i915g/ci: update piglit fails
724- iris/ci: update piglit fails
725- ci: Uprev piglit to af1785f31
726- mesa: always support occlusion queries
727- broadcom/ci: use .test-manual-mr
728- vc4: remove not needed lie about PIPE_CAP_OCCLUSION_QUERY
729- i915: remove not needed lie about PIPE_CAP_OCCLUSION_QUERY
730- lima: remove not needed lie about PIPE_CAP_OCCLUSION_QUERY
731- isaspec: Add support for special {:align=} field
732- nir: make lower_sample_tex_compare a common pass
733- nir/nir_lower_tex_shadow: support tex_instr without deref src
734- etnaviv: make use of nir_lower_tex_shadow
735- etnaviv: fix FRONT_AND_BACK culling
736- etnaviv: add multiply_with_8 flag
737- etnaviv: use bytes for read TX data
738- etnaviv: add two new HI related perfmon counter
739- etnaviv: add support for INTEL_blackhole_render
740
741Clayton Craft (1):
742
743- anv: don't advertise vk conformance on GPUs that aren't conformant
744
745Connor Abbott (65):
746
747- vk/format, v3dv: Add a vulkan -> pipe swizzle helper
748- freedreno/fdl: Constify fdl6_get_ubwc_blockwidth()
749- freedreno/fdl: Add mip_level to fdl_layout
750- freedreno/fdl: Add fdl6_view
751- tu: Use fdl6_view in tu_image_view and cross-check
752- tu: Switch clear/blit to fdl6_view and cross-check
753- tu: Remove cross-check scaffolding
754- tu/clear_blit: Stop creating a franken-image for staging blits
755- ir3/cse: Support mov instructions
756- ir3: Use stp/ldp base offset for {load,store}_scratch
757- tu/clear_blit: Move around copy_format()/tu6_plane_format()
758- freedreno/a6xx: Rename GRAS_2D_BLIT_INFO
759- tu: Emit GRAS_LRZ_MRT_BUF_INFO_0
760- tu: Always write GRAS_LRZ_MRT_BUF_INFO_0
761- freedreno/a6xx: Emit GRAS_LRZ_MRT_BUF_INFO_0
762- ir3: Emit barriers for images again
763- ir3: Don't emit barriers for make_available/make_visible
764- ir3/spill: Mark root as non-spillable after inserting
765- ir3/spill: Initial implementation of rematerialization
766- tu: Enable subgroupBroadcastDynamicId
767- tu: Add VK_KHR_buffer_device_address stubs
768- tu: Expose Vulkan 1.2
769- util/dag: Make edge data a uintptr_t
770- util/dag: Add dag_add_edge_max_data
771- ir3/sched: Rewrite delay handling
772- ir3/postsched: Fix copy-paste mistake
773- ir3/postsched: Handle sync dependencies better
774- ir3/delay: Ignore earlier definitions to the same register
775- ir3/postsched: Rewrite delay handling
776- ir3/postsched: Only prefer tex/sfu if they are soft-ready
777- ir3: Stop inserting nops during scheduling
778- ir3/ra: Consider reg file size when swapping killed sources
779- ir3/ra: Add missing asserts to ra_push_interval()
780- ir3/spill: Support larger spill slot offset
781- ir3/lower_pcopy: Fix shr.b illegal copy lowering
782- ir3/lower_pcopy: Fix bug with "illegal" copies and swaps
783- ir3/lower_subgroups: Fix potential infinite loop
784- ir3/ra: Fix logic bug in compress_regs_left
785- ir3: Bump type mismatch penalty to 3
786- ir3: Introduce systall metric and new helper functions
787- ir3: Use new (sy)/(ss) stall helpers in the compiler
788- ir3/sched: Rename tex/sfu to sy/ss
789- ir3/postsched: Rename tex/sfu to sy/ss
790- ir3: Use (ss) for instructions writing shared regs
791- ir3, freedreno: Add options struct for ir3_shader_from_nir()
792- ir3: Pass shader to ir3_nir_post_finalize()
793- ir3: Add wavesize control
794- tu, ir3: Support runtime gl_SubgroupSize in FS
795- tu: Implement VK_EXT_subgroup_size_control
796- freedreno/fdl: Fix reinterpreting "size-compatible" formats
797- nir: Reorder ffma and fsub combining
798- freedreno/a6xx: Name texture descriptor bit
799- tu/blit: Don't set CLAMPENABLE in sampler for 3d path
800- tu: Report code size in pipeline statistics
801- tu: Initial link-time optimizations
802- nir/lower_subgroups: Rename lower_shuffle to lower_relative_shuffle
803- nir: Add support for lowering shuffle to a waterfall loop
804- ir3: Fix copy-paste mistakes in ir3_block_remove_physical_predecessor()
805- ir3: Rewrite (jp) insertion
806- ir3/cp: ir3: Prevent propagating shared regs out of loops harder
807- ir3,tu: Enable subgroup shuffles and relative shuffles
808- ir3/spill: Fix simplify_phi_nodes with multiple loop nesting
809- ir3: Use CAN_REORDER instead of NON_WRITEABLE
810- ir3: Don't always set bindless_tex with readonly images
811- ir3/nir: Fix 1d array readonly images
812
813Corentin Noël (1):
814
815- virgl: Disable cache for VIRGL_BIND_SAMPLER_VIEW
816
817Cristian Ciocaltea (12):
818
819- ci: Uprev deqp-runner to 0.11.0
820- ci: Support building and installing deqp-runner from source
821- ci: Do not remove cmake
822- ci: Create results folder before starting virgl_test_server
823- virgl/ci: Force crosvm error when exit code file is missing
824- virgl/ci: Prevent static link of virglrenderer inside crosvm
825- virgl/ci: Do not hide crosvm output messages
826- virgl/ci: Fix identification of dEQP binary paths
827- iris/ci: Fix whl dEQP expectations
828- iris/ci: Fix piglit tests expectations on amly
829- panfrost/ci: Fix piglit tests expectations on G52
830- freedreno/ci: Fix dEQP tests expectations on A530
831
832Daniel Schürmann (22):
833
834- nir/fold_16bit_sampler_conversions: skip sparse residency tex instructions
835- aco: add more D16 load/store instructions to RA and validator
836- aco: workaround GFX9 hardware bug for D16 image instructions
837- aco: implement D16 texture loads
838- radv: use nir_fold_16bit_sampler_conversions()
839- aco/optimizer: fix fneg modifier propagation on VOP3P
840- aco: change fneg for VOP3P to use fmul with +1.0
841- aco/optimizer: propagate and fold inline constants on VOP3P instructions
842- nir/opt_algebraic: lower fneg_hi/lo to fmul
843- aco/ra: fix get_reg_for_operand() in case of stride mismatches
844- aco: don't allow SDWA on VOP3P instructions
845- aco/optimizer: keep instr_mod_labels after applying extract
846- aco/optimizer: apply extract from p_extract_vector
847- aco/optimizer: optimize extract(extract())
848- aco/optimizer: apply extract from subdword p_split_vector
849- aco: use explicit zero-padding for 64bit image loads in expand_vector()
850- aco: use p_create_vector(v2b,v2b) in get_alu_src_vop3p()
851- aco: don't split VOP3P definitions
852- aco: validate VOP3P opsel correctly
853- nir: refactor nir_opt_move
854- nir/opt_if: merge two break statements from both branch legs
855- nir/opt_if: also merge break statements with ones after the branch
856
857Daniel Stone (10):
858
859- CI: Disable Windows jobs
860- Revert "CI: Disable Windows jobs"
861- ci: Use common build script for libwayland
862- ci: Consistently build Wayland and protocols
863- ci: Upgrade to libdrm 2.4.109
864- zink/ci: Add GL4.6 tessellation flake
865- CI: Don't stream wget directly into bash
866- Revert "gitlab-ci: disable radv-fossils"
867- Revert "ci: disable vs2019 windows build"
868- egl/wayland: Reset buffer age when destroying buffers
869
870Danylo Piliaiev (54):
871
872- ir3/freedreno: account for component in build_tessfactor_base
873- turnip: add support for dirconf
874- driconf: add vk_dont_care_as_load workaround option
875- turnip: implement vk_dont_care_as_load workaround
876- drirc: Apply vk_dont_care_as_load workaround to Forsaken Remastered
877- nir/lower_amul: do not lower 64bit amul to imul24
878- ir3/freedreno: add 64b undef lowering
879- freedreno/ir3: disallow immediate addr/offset for ldg/ldg.a
880- freedreno/ir3: set proper dst size for {store,load}_{global,shared}_ir3
881- freedreno/ir3: use stg.a/ldg.a only if offset is reg or doesn't fit
882- isaspec: inherite parent's bitset gpu gen requirements
883- nir/serialize: Make more space for intrinsic_op allowing 1024 ops
884- ir3/ra: Check register file upper bound when updating preferred_reg
885- tu: fix rast state allocation size on a6xx gen4
886- freedreno/computerator: Support A660 gpu
887- vulkan/util: Handle depth-only formats in vk_att_ref_stencil_layout
888- ir3: print half-dst/src for ldib.b/stib.b
889- freedreno/ir3: add a6xx global atomics and separate atomic opcodes
890- freedreno/ir3: handle global atomics
891- turnip: implement VK_KHR_buffer_device_address
892- ir3/cp: Prevent setting an address on subgroup macros
893- freedreno,tu: Limit the amount of instructions preloaded into icache
894- ir3: Add gen4 new subgroup instructions
895- ir3: Use getfiberid for SubgroupInvocationID on gen4
896- ir3,turnip: Enable subgroup ops support in all stages on gen4
897- ir3,turnip: Add support for GL_KHR_shader_subgroup_quad
898- turnip: Fix operator precedence in address calculation macros for queries
899- ir3: Be able to reduce register limit for RA when CS has barriers
900- ir3: Assert that we cannot have enough concurrent waves for CS with barrier
901- tu: fix workaround for depth bounds test without depth test
902- ir3: New cat3 instructions
903- nir/algebraic: Separate has_dot_4x8 into has_sdot_4x8 and has_udot_4x8
904- ir3: Make nir compiler options a part of ir3_compiler
905- tu,ir3: Implement VK_KHR_shader_integer_dot_product
906- vulkan/wsi: create a common function to compare drm devices
907- tu: implement wsi hook to decide if we can present directly on device
908- tu: support VK_EXT_primitive_topology_list_restart
909- tu: expose VK_KHR_copy_commands2
910- tu: add reference counting for descriptor set layouts
911- turnip/perfetto: Optimize timestamp synchronization
912- turnip/trace: refactor creation and usage of trace flush data
913- turnip: rename tu_drm_get_timestamp into tu_device_get_gpu_timestamp
914- turnip/trace: process u_trace chunks on queue submission
915- turnip/perfetto: handle gpu timestamps being non-monotonic
916- turnip/trace: Delete unused start/end_resolve tracepoints
917- turnip: Drop references to layout of all sets on pool reset/destruction
918- tu: implement sysmem vs gmem autotuner
919- freedreno: Update gmem/sysmem debug options to be in line with turnip
920- tu: add debug option to force gmem
921- ci/freedreno: properly test sysmem and gmem paths
922- ir3: opt_deref in opt loop to remove unnecessary tex casts
923- turnip: Unconditionaly remove descriptor set from pool's list on free
924- ir3: Limit the maximum imm offset in nir_opt_offset for shared vars
925- turnip: Use LATE_Z when there might be depth/stencil feedback loop
926
927Dave Airlie (249):
928
929- brw/nir: remove unused function prototypes.
930- crocus: Delete the MI_COPY_MEM_MEM resource_copy_region implementation.
931- llvmpipe: fix userptr for texture resources.
932- clover: use max shader sampler view/images queries for clover.
933- llvmpipe: swizzle image stores for CL BGRA
934- radv: fence->user_ptr and ctx->fence_map are now totally unused.
935- lavapipe: drop EXT_acquire_xlib_display
936- vulkan/wsi: set correct bits for host allocations/exports for images.
937- vulkan/include: import the video codec headers.
938- vulkan: add new image types undef beta define to switch statements.
939- meson: allow building with vulkan beta extensions enabled.
940- llvmpipe: disable 64-bit integer textures.
941- llvmpipe: fix compressed image sizes.
942- intel/genxml: cleanup video xml collisions.
943- intel/genxml: fix some missing address from the 75 xml
944- intel/genxml: align QM field names across gens.
945- intel/genxml: fix Picure->Picture typo
946- intel/genxml: fix gen6 LD->VLD typo.
947- intel/genxml: generate video headers
948- util/vl: move gallium vl_vlc.h and vl_rbsp.h to shared code.
949- mtypes: drop some context pointers that are unused now
950- mesa: drop unused sw extensions init
951- mesa/dd: burn a bunch of legacy driver interfaces down
952- mesa/dd: remove some fbo driver hooks.
953- mesa/dd: remove NewSamplerObject
954- mesa/light: make _mesa_light static do_light.
955- iris/ci: comment out iris-cml-traces-performance due to hw unavailable
956- mesa/query: remove all the mesa queryobj code.
957- mesa/syncobj: drop unused syncobj code.
958- mesa: remove unused buffer object code.
959- mesa/transformfeedback: remove unused transform feedback code
960- mesa/barrier: remove unused barrier functions
961- mesa/externalobject: delete unused functions
962- intel/compiler: drop glsl options from brw_compiler
963- meson: make mesa/tests/glx depend on gallium
964- mesa/dd/st: move get strings pointer out of dd.h
965- mesa/dd/st: direct wire queries/timestamp/condrender.
966- mesa/st: direct call sync object functions
967- mesa/st: move barriers to direct call
968- mesa/st: move transformfeedback to direct calls
969- mesa/st: move external objects to direct calls
970- mesa/crocus/iris/blorp: drop minify macro in favour of u_minify
971- mesa: move _mesa_varying_slot_in_fs to shader_enums
972- anv: include futex.h explicitly in allocator.
973- brw/compiler: drop mtypes.h from compiler
974- intel/compiler: drop shader_info.h from compiler header
975- intel/crocus: push main/macros.h out to the users
976- mesa/st: move rendermode to direct call
977- mesa/st: move Enable to direct call
978- mesa/st: move query memory info to direct call
979- mesa/st: move perf query to direct call
980- mesa/st: move perfomance monitor to direct call
981- mesa/dd: drop purgeable interface
982- mesa/st: move fbo code to direct calling
983- mesa/st: move texture APIs to direct st calls
984- mesa/st: move Clear to new direct call
985- mesa/st: move pixel/bitmap functions to direct call
986- mesa/st: move clear/flush/finish to direct call
987- mesa/st: move some context functions to direct calls
988- mesa/st: move viewport to direct call
989- mesa/st: move copy image sub data to direct call
990- mesa/st: move program calls to direct call
991- mesa/st: replace most of buffer funcs with direct calls.
992- mesa/st: move blit function to direct call
993- mesa/st: convert the non-optional egl image to direct calls
994- mesa/st: convert DrawTex to direct call
995- mesa/st: move msaa functions to direct call
996- mesa/st: move compute to direct call
997- mesa/st: move draw indirect and xfb to direct calls.
998- mesa/st: drop Draw from dd function table.
999- treewide: drop mtypes/macros includes from main
1000- crocus: cleanup bo exports for external objects
1001- mesa/st: merge st buffer object into GL
1002- mesa/st: start moving bufferobject alloc/free/reference to main.
1003- mesa: add a pipe_context pointer to gl context
1004- mesa: add a pointer to st_config_options to gl_context
1005- mesa: add pointer to cso_context to gl_context
1006- mesa/st: migrate most of state tracker buffer objects into mesa
1007- mesa/st: make static the buffer object funcs that can be
1008- mesa/bufferobj: rename bufferobj functions to be more consistent.
1009- bufferobj: make sw clear buffer static, move it and rename it
1010- bufferobj: inline buffer clearing
1011- bufferobj: inline page commitment
1012- bufferobj: cleanup subdata copies
1013- mesa/st: rename access flag to transfer flag function
1014- mesa/bufferobj: move invalidate buffer to optional feature
1015- mesa/st: remove st_cb_bufferobjects*
1016- mesa: inline mesa_initialize_buffer_object.
1017- mesa/st: refactor compute dispatch to fill grid info earlier.
1018- mesa/st: migrate compute dispatch to mesa
1019- mesa/compute: refactor compute launch to look more like draw
1020- mesa/st: move get sample position code to static in mesa
1021- mesa/st: move msaa functionality into multisample.c
1022- mesa/st: migrate barrier code into mesa
1023- mesa/st: move st strings handling into mesa
1024- mesa: drop texformat code this isn't used.
1025- mesa/st: use has_stencil_export instead of querying screen cap.
1026- mesa: drop unused new renderbuffer code.
1027- mesa: drop unused _mesa_new_program.
1028- mesa/dd: drop unused InvalidateBufferSubData entry.
1029- intel/compiler: remove gfx6 gather wa from backend.
1030- intel/compiler: don't lower swizzles in backend.
1031- intel/compiler: drop unused decleration
1032- mesa: remove StripTextureBorder option.
1033- mesa/draw: drop the multi draw with indices fallback.
1034- mesa/st: move default enabled extensions into mesa.
1035- mesa: drop optional tex/tnl maintains mode.
1036- mesa/st: merge NewDepthClamp state flag
1037- mesa/st: drop the new array driver state bit
1038- mesa/st: drop the rasterizer driver flags
1039- mesa/st: remove the viewport driver state flags
1040- mesa/st: drop NewBlend driver state flags
1041- mesa/st: drop new depth/stencil state bits
1042- mesa/st: drop poly stipple driver state bit
1043- mesa/st: drop new tess state driver bit
1044- mesa/st: drop new uniform driver state bit
1045- mesa: drop unused transform feedback state driver flags
1046- mesa/st: drop ssbo, image and sampler driver state flags bits
1047- mesa/st: drop scissor/window rect driver state bits
1048- mesa/st: drop clip plane driver state bits
1049- mesa/st: drop new framebuffer srgb driver state bit
1050- mesa/st: drop multisample mask/locations state drivers bits
1051- mesa/st: remove conditionals for driver state bits that are always set.
1052- intel/genxml/gen4-5: fix more Raster Operation in BLT to be a uint
1053- crocus: fail resource allocation properly.
1054- crocus: don't create staging resources > half aperture
1055- gallium/mesa: enhance PIPE_CAP_CLIP_PLANES to support override number
1056- crocus: set max clip planes to 6 for gen4.
1057- mapi: generate correct dispatch for EXT_draw_instanced
1058- glsl/linker: cleanup passing gl_context unnecessarily
1059- glsl: don't pass gl_context to lower shared references.
1060- glsl/linker: don't pass gl_context just for constants in xfb code
1061- glsl/linker: move more ctx->Consts to consts.
1062- glsl/linker: remove direct gl_context usage in favour of consts/exts/api
1063- glsl/linker: get rid of gl_context from link varyings
1064- glsl/linker: get rid of gl_context from uniform assign paths
1065- glsl/linker: get rid of gl_context from atomic counters paths
1066- glsl/linker: remove gl_context from check image resources
1067- glsl/linker: remove gl_context usage from more places.
1068- glsl/nir/linker: avoid passing gl_context inside gl_nir linker
1069- glsl/linker/uniform_blocks: don't pass gl_context around.
1070- glsl/linker: drop unused gl_context.
1071- glsl/linker: remove a bunch more gl_context references.
1072- glsl/nir: don't pass gl_context to the convertor routine.
1073- mesa/program: don't use CALLOC_STRUCT for instructions.
1074- mesa: rebalance the CALLOC_STRUCT/FREE force.
1075- mesa/st/perfmon: rebalance CALLOC_STRUCT/FREE
1076- mesa/st: get rid of ST_CALLOC_STRUCT use CALLOC_STRUCT
1077- intel/compiler: add clamp_pointside to vs/tcs/tes keys.
1078- crocus: only clamp point size on last stage.
1079- amd: move vcn decoding regs + structs to a common file.
1080- amd: move uvd decode definitions to common place
1081- mesa/\*: use an internal enum for tessellation primitive types.
1082- mesa/\*: add a shader primitive type to get away from GL types.
1083- includes: add windows lean and mean guard.
1084- nir: remove gl.h include from nir headers.
1085- intel/brw: drop gl header from the brw backend.
1086- glsl/parser: extract consts/exts/api out of context at start.
1087- mesa: drop unused context parameter to shader program data reference.
1088- mtypes: split gl extensions and consts out into a separate header
1089- mesa/mtypes: move matrix enums to shader_enums.h
1090- glsl: remove some deps on mtypes.h.
1091- mesa: move some gl shader types to shader_types.h.
1092- glsl: avoid rebuilding builtin functions on mtypes.h changes.
1093- glsl: move ast_function.cpp off mtypes.h
1094- mesa: move ast_to_hir.cpp off mtypes.h
1095- mtypes: move gl_linked_shader and gl_shader_program to new shader_types.h
1096- mtypes: move gl_program to shader_types.h
1097- mtypes: more gl_active_atomic_buffer to shader_types.h
1098- mtypes: move transform feedback internal structs to shader_types.h
1099- mtypes: move uniform shader types to shader_types.h
1100- mtypes: move bindless image/sampler objects to shader_types.h
1101- mtypes: move gl_shader_variable to shader_types.h
1102- glsl: move off mtypes.h in lots of places.
1103- glsl/fp64: move context.h dependent checks into main.
1104- glsl: drop some more context.h/mtypes.h interactions
1105- vbo: drop unused mtypes.h
1106- docs: update docs for new extension header.
1107- mesa: more mtypes.h cleanups
1108- mesa: split struct gl_config into it's own header.
1109- glsl: drop glheader.h include.
1110- mesa/st: merge memoryobjects code from st into mesa
1111- mesa/st: merge semaphore objects from st into mesa
1112- mesa/st: merge the syncobj code from st into mesa
1113- mesa/st: merge queryobj code from st into mesa.
1114- mesa/st: merge condrender code from st into mesa.
1115- mesa/st: merge st transform feedback object into gl one.
1116- mesa/st: merge transform feedback code from st into mesa
1117- mesa/st: merge perfmon object from st into mesa
1118- mesa/st: merge perfmon counters/groups objects from st into mesa
1119- mesa/st: merge perfmon groups init/cleanup from st into mesa
1120- mesa/st: move perfmon code from st into mesa
1121- mesa/st: cleanup last bits of st perfmon code.
1122- mesa/st: merge texture object/image structs into mesa
1123- mesa/st: merge texture obj/image alloc/free into mesa
1124- intel/perf: use a function to do common allocations
1125- meson: start building intel earlier.
1126- mesa/st: move intel blackhole noop enable to frontend
1127- mesa/st: remove st_context from debug callback
1128- mesa/st: migrate debug callback code into mesa
1129- mesa/st: drop last user of st_Enable.
1130- mesa/st: directly call the uuid get funcs.
1131- mesa/st: drop emit string marker device table entry.
1132- mesa/st: move pin l3 cache to direct check/call.
1133- mesa/dd: drop GetProgramBinaryDriverSHA1
1134- mesa/st: drop useless tex parameter calls.
1135- mesa/st: move st_TexParameter into mesa
1136- mesa/st: drop release all sampler views wrapper
1137- mesa/st/vdpau: direct call the vdpau functions.
1138- mesa/st: move evaluate depth values into mesa
1139- mesa/ctx: store screen pointer in ctx as well
1140- mesa/st: inline st_max_shader_compiler_threads
1141- mesa/st: move shader completion into mesa
1142- mesa/st: move memory query into mesa.
1143- mesa/st: drop some bindless wrappers
1144- mesa/st: drop lots of perfquery wrappers
1145- mesa/st: move perf query test to st_context, drop files.
1146- mesa/st: merge st_renderbuffer into gl_renderbuffer.
1147- mesa/st: move some renderbuffer code into mesa
1148- mesa/st: merge framebuffer objects from st to mesa
1149- mesa/st: move render/finish_render texture in to mesa.
1150- mesa/st: move validate/discard framebuffer into mesa
1151- mesa/st: move st_ReadBuffer functionality into mesa
1152- mesa/st: move DrawBufferAllocate into mesa.
1153- mesa/st: move st renderbuffer code into mesa renderbuffer
1154- mesa/st: move map/unmap renderbuffer code into mesa
1155- mesa/st: move st_new_renderbuffer_fb to manager
1156- mesa/st: move some fbo helpers around.
1157- mesa/st: fixup viewport drawable invalidation
1158- mesa/st: migrate blit code into mesa
1159- mesa/st: move last of renderbuffer functionality into mesa
1160- mesa/st: move renderbuffer format choosing wrapper into mesa.
1161- mesa/st: move manager colorbuffer interface to gl_context.
1162- mesa/st: move invalidate_on_gl_viewport to ctx
1163- mesa/st: move st_fb_orientation into a mesa function
1164- mesa: consolidate setting no error state and checking suid.
1165- mesa/st: collapse st_program into gl_program object.
1166- mesa/st: move new ati fragment shader to mesa
1167- mesa/st: move st_vertex_program to gl_vertex_program in mesa
1168- mesa/st: move program new/delete into mesa
1169- mesa/st: refactor program translation into one file.
1170- crocus: find correct relocation target for the bo.
1171- crocus: fix leak on gen4/5 stencil fallback blit path.
1172- draw/so: don't use pre clip pos if we have a tes either.
1173- lavapipe: fix sampler + sampler view leaks.
1174- lavapipe: reference gallium fences correctly.
1175- vulkan/wsi: handle queue families properly for non-concurrent sharing mode.
1176- crocus: change the line width workaround for gfx4/5
1177- gallivm/nir: extract a valid texture index according to exec_mask.
1178
1179Derek Foreman (3):
1180
1181- egl/wayland: Properly clear stale buffers on resize
1182- panfrost: Support planar formats for scanout
1183- panfrost: support PIPE_RESOURCE_PARAM_NPLANES query
1184
1185Dylan Baker (54):
1186
1187- docs: add release notes for 21.2.4
1188- docs: add sha256 sum for 21.2.4 release
1189- docs: update calendar and link releases notes for 21.2.4
1190- meson: use gtest protocol for gtest based tests when possible
1191- docs: add release notes for 21.2.5
1192- docs: add sha256 sums for 21.2.5
1193- docs: update calendar and link releases notes for 21.2.5
1194- turnip: don't use mesa/macros.h to get utils/rounding.h
1195- docs: add release notes for 21.2.6
1196- docs: add sha256 sums for 21.2.6 relnotes
1197- docs: update calendar and link releases notes for 21.2.6
1198- docs/release-calendar: remove additional 21.2 releases
1199- docs: Add calendar entries for 22.0 release candidates.
1200- mesa/tests: ensure that util_cpu_detect has been called
1201- classic/r100: Delete driver
1202- classic/r200: Delete driver
1203- classic/nouveau: Remove driver
1204- classic/i915: Remove driver
1205- classic/i965: Remove driver
1206- mesa/dri: remove mega driver stub
1207- include/pci_ids: Move PCI ids supported by both i965 and iris to iris
1208- mesa/main/tests: remove dispatch sanity
1209- mesa: Delete libmesa_classic
1210- mesa: Merge libmesa_gallium and libmesa_common
1211- mesa: move common/dri into gallium
1212- meson: restore dri.pc file
1213- CODEOWNERS: remove OpenSWR
1214- new_features: Add OpenSWR removal
1215- fixup! gallium/swr: Remove driver source
1216- docs: move the release for 22.0 out
1217- VERSION: bump for 22.0.0-rc1 release
1218- .pick_status.json: Update to 0447a2303fb06d6ad1f64e5f079a74bf2cf540da
1219- .pick_status.json: Update to 8335fdfeafbe1fd14cb65f9088bbba15d9eb00dc
1220- .pick_status.json: Update to 5e9df85b1a4504c5b4162e77e139056dc80accc6
1221- VERSION: bump version for 22.0.0-rc2
1222- .pick_status.json: Update to 7955df28a6660d8dff77c79c345aa28aa7fa859c
1223- .pick_status.json: Update to 22fc53493092a7507c1e2db47b0c8763158d7b2d
1224- .pick_status.json: Update to 108e6eaa83eed3eb356f3cce835c5f5e3a836b8e
1225- .pick_status.json: Mark b07372312d7053f2ef5c858ceb1fbf9ade5e7c52 as denominated
1226- .pick_status.json: Update to 3759a16d8a883355effa435f46867951ce712dbe
1227- .pick_status.json: Mark a8418abd748e8e761dda9c3594e29e560833d9ff as denominated
1228- .pick_status.json: Update to 22efeec399fb55dcb364dabd65afab137d6f3fc3
1229- .pick_status.json: Update to f030b75b7d2c359b90c18ee4ed83fa05265c12e0
1230- .pick_status.json: Update to fd3451babd6cded6794561d74c8919576ba1f97d
1231- llvmpipe: remove test that passes on 22.0 for some reason from the fail list
1232- .pick_status.json: Update to b21e7e1ef7ba2c2cf97f073a8a01d6bc43835622
1233- d3d12: mark incorrectly passing tests as expected failures
1234- VERSION: bump for 22.0.0-rc3
1235- .pick_status.json: Update to 48b3ef625e1909c1be31fbe10adb53734af38eb4
1236- .pick_status.json: Update to 48b3ef625e1909c1be31fbe10adb53734af38eb4
1237- ci/zink: mark tests fixed by previous commit as fixed
1238- ci/zink: mark more expected passes as such
1239- Revert "panfrost: Fix set_sampler_views for big GL"
1240- .pick_status.json: Update to 342e6f83321a91816358dee82178809b2a8aeeaa
1241
1242Edward O'Callaghan (4):
1243
1244- clover/images: Add array_size to implement CL_IMAGE_ARRAY_SIZE
1245- clover: Implement CL_MEM_OBJECT_IMAGE2D_ARRAY
1246- clover: Implement CL_MEM_OBJECT_IMAGE1D_BUFFER
1247- clover: Implement CL_MEM_OBJECT_IMAGE1D_ARRAY
1248
1249Eleni Maria Stea (1):
1250
1251- dri_drawable: missing header
1252
1253Ella Stanforth (2):
1254
1255- v3dv: Implement VK_KHR_create_renderpass2
1256- vulkan: Allow RegisterDisplayEventEXT before first page flip
1257
1258Ella-0 (2):
1259
1260- v3dv: implement VK_EXT_host_query_reset
1261- v3dv: enable VK_KHR_swapchain_mutable_format
1262
1263Emma Anholt (204):
1264
1265- ci/lvp: Skip some slow tests under ASan.
1266- loader: Avoid enumerating drm devices just to get an fd's PCI ID.
1267- freedreno/afuc: Disable the disassembler on 32-bit builds.
1268- ci/deqp-runner: Drop SUMMARY_LIMIT env var.
1269- ci/deqp-runner: Simplify the --jobs argument setup.
1270- ci/deqp-runner: Use new deqp-runner's built-in renderer/version checks.
1271- ci/deqp-runner: Drop silly CSV env vars.
1272- ci/deqp-runner: Move remaining asan runs to --env LD_PRELOAD=
1273- ci/deqp-runner: Drop LD_LIBRARY_PATH=/usr/local for libkms workaround.
1274- ci/deqp-runner: Don't start GPU hang detection for making junit results.
1275- ci/deqp-runner: Move more non-suite logic under the non-suite 'if'.
1276- ci/piglit-runner: Fix funny indentation of the piglit-runner command.
1277- turnip: Switch tu6_format_texture() to a pipe_format.
1278- turnip: Switch tu6_format_color() to a pipe_format.
1279- turnip: Switch format_to_ifmt() to take a pipe_format.
1280- util: Move freedreno's snorm-to-unorm to util/, adding remaining cases.
1281- turnip: Make copy_format() and tu6_plane_format() return pipe_format
1282- gallium/u_blitter: Read MSAA z/s from sampler's .x instead of .y or .z.
1283- turnip: Drop the assertion about the temporary bit in sync fd imports.
1284- ci/radeonsi: Use a deqp-runner suite suite for stoney.
1285- ci/deqp-runner: Rename the deqp-drivername-\*.txt files to drivername-\*.txt
1286- ci/piglit-runner: Merge piglit-driver-\*.txt files into driver-\*.txt.
1287- ci: Enable testing radeonsi's libva using libva-util unit tests.
1288- ci/etnaviv: Fix the dependency for the build artifacts.
1289- ci/etnaviv: Add some more deqp flakes I've seen in recent runs.
1290- etnaviv: Switch to the NIR compiler by default.
1291- i915g: Check for negate/swizzle on TGSI_OPCODE_KILL_IF's src.yzw.
1292- i915g: Make sure we consider negates/swizzles on bias/shadow coords.
1293- i915g: Improve the explanation for the 1D Y swizzle.
1294- ci: Uprev VK-GL-CTS to 1.2.7.2, and pull in piglit while I'm here.
1295- freedreno: Fix gmem invalidating the depth or stencil of packed d/s.
1296- freedreno/a6xx: Emit a null descriptor for unoccupied IBO slots.
1297- freedreno/fdl6: Add an interface for setting up buffer descriptors.
1298- turnip: Use the new shared buffer-view descriptor creation function.
1299- turnip: Remove buffer-view cross-check code.
1300- freedreno/fdl6: Add support for texture swizzles of A/L/I/LA/RGBx.
1301- freedreno/a6xx: Use the fdl buffer view setup for img/ssbo descriptors.
1302- freedreno: Fix layered rendering to just Z/S and not color.
1303- freedreno/a6xx: Fix partial z/s clears with sysmem.
1304- freedreno/a5xx: Use the defined names for 2D_BLIT_CNTL regs.
1305- freedreno/a618: Mark a flaky test that triggers hangcheck.
1306- freedreno/a6xx: Don't try to generate mipmaps for SNORM with our blitter.
1307- freedreno/a5xx+: Fix missing LA formats.
1308- freedreno/a5xx: Diff reduction in fd5_layout to fd6_layout.
1309- freedreno/a6xx: Try to fix drawing to z/s miplevel/layer offsets.
1310- freedreno/a5xx: Remove bogus assertion about BO size.
1311- freedreno/a5xx: Try to fix drawing to z/s miplevel/layer offsets.
1312- freedreno/a5xx: Clean up a little bit of blitter array pitch setup.
1313- freedreno/ir3: Fix off-by-one in prefetch safety assert.
1314- turnip: Claim 1.2.7.1 CTS conformance.
1315- freedreno: Fix the uniform/nonuniform handling for cat5 bindful modes.
1316- freedreno: Fix constant-index assumptions in IBO loads.
1317- ci/etnaviv: Mark the rest of uniform_api.random as flaky.
1318- ci/etnaviv: Add more texturing flakes.
1319- ci/bare-metal: Close serial and join serial threads before exit.
1320- util: Rename PIPE_FORMAT_G8_B8_R8_420_UNORM.
1321- util/format: Add G8_B8R8_420_UNORM to match Vulkan.
1322- freedreno/fdl6: Skip redundant setting of TILE_ALL for NV12.
1323- freedreno: Set layer_first on (2D) resource imports.
1324- freedreno/a6xx: Create a fd6_view at sampler view update time.
1325- freedreno/a6xx: Switch to relying on fd6_view for our texture descriptors.
1326- freedreno/a6xx: Use fd6_view for non-buffer image descriptors, too.
1327- freedreno/a6xx: Clean up sysmem fb read patching using fd6_view.
1328- freedreno/a6xx: Drop an unused tile_mode arg.
1329- freedreno/a6xx: Inline remaining fd6_tex_const_0() call.
1330- mesa/st: Disable NV_copy_depth_to_color on non-doubles-capable HW.
1331- freedreno/a6xx: Add some notes about piglit failures.
1332- freedreno/ir3: Fix an off-by-one in so->outputs_count safety assert.
1333- ci/freedreno: Enable the tes-input/tcs-input tests.
1334- freedreno/a6xx: Do sparse setup of the TFB program.
1335- ci/freedreno: Add notes explaining the KHR-GL* failures.
1336- ci/freedreno: Add a link to the issue for color_depth_attachments.
1337- freedreno/fdl: Dump the generated layout when a layout test fails.
1338- freedreno: Fix the texturator unit test script.
1339- freedreno/cffdump: Fix 64-bit reg decode in script mode.
1340- freedreno/fdl: Add support for unit testing 3D texture array strides.
1341- freedreno/cffdump: Fix up formatting of texturator unit test script output.
1342- freedreno/cffdump: Handle the TILE_ALL flag in unit test generation.
1343- freedreno/a6xx: Fix a bunch of 3D texture layout to match blob behavior.
1344- freedreno: Stop exposing MSAA image load/store on desktop GL.
1345- freedreno/a6xx: Disable sample averaging on z/s or integer blits.
1346- freedreno/a6xx: Disable sample averaging on non-ubwc z24s8 MSAA blits.
1347- freedreno/a5xx: Define a5xx_2d_surf_info like a6xx has.
1348- freedreno/a5xx: Document the sRGB bit on RB_2D_SRC/DST info.
1349- freedreno/a5xx: Emit MSAA state for sysmem rendering, too.
1350- nir: Un-inline nir_builder_alu_instr_finish_and_insert()
1351- nir: Un-inline more of nir_builder.h.
1352- freedreno/ir3: Make a shared helper for the tess factor stride.
1353- turnip: Move CP_SET_SUBDRAW_SIZE to vkCmdBindPipeline() time.
1354- nir: Uninline a bunch of nir.h functions.
1355- nir: Make nir_build_alu() variants per 1-4 arg count.
1356- freedreno/a6xx: Allocate a fixed-size tess factor BO.
1357- freedreno/a6xx: Skip emitting tess BO pointers past the shader's constlen.
1358- freedreno/a6xx: Set the tess BO ptrs in the program stateobj.
1359- mesa/st: Remove GL_ARB_depth_clamp emulation support.
1360- r300: Add deqp expectations for RV515.
1361- r300: Turn a comment about presub into an assert.
1362- r300: Fix mis-optimization turning -1 - x into 1 - x.
1363- nouveau/nir: Use the address reg for indirect scratch access.
1364- nir/algebraic: Move some generated-code algebraic opt args into a struct.
1365- nir/algebraic: Drop the check for cache == None.
1366- nir/algebraic: Replace relocations for nir_search values with a table.
1367- nir/algebraic: Remove array-of-cond code
1368- nir/algebraic: Move relocations for expression conds to a table.
1369- nir/algebraic: Move relocations for variable conds to a table.
1370- nir/algebraic: Pack various bitfields in the nir_search_value_union.
1371- nir/algebraic: Mark the automaton's filter tables as const.
1372- nir/algebraic: Move all the individual transforms to a common table.
1373- r300: Move the instruction filter for r500_transform_IF() to the top.
1374- r300: Cache the var list in the peephole_mul_omod() loop.
1375- r300: Ensure that immediates have matching negate flags too.
1376- r300: Also consider ALU condition modifiers for loop DCE.
1377- r300: Remove the non_normalized_coords from the shader key.
1378- r300: Precompile the FS at shader creation time.
1379- r300: Route shader stats output to ARB_debug_output.
1380- r300/ci: Update loop expectations
1381- loader: Restore i915g support.
1382- r300/ci: Add some piglit expectations.
1383- nir/nir_to_tgsi: Add support for "if" statements with !native_integers
1384- nir_to_tgsi: Make !native_integers front face input match glsl_to_tgsi.
1385- r300: Disable loop unrolling on r500.
1386- r300: Request NIR shaders from mesa/st and use NIR-to-TGSI.
1387- nir_to_tgsi: Enable nir_opt_move.
1388- nir/nir_opt_move,sink: Include load_ubo_vec4 as a load_ubo instr.
1389- nir_to_tgsi: Set the TGSI Precise flag for exact ALU instructions.
1390- r300: Remove some dead compiler code.
1391- r300: Remove support for SCS.
1392- r300: Remove unused RC_OPCODE_ABS.
1393- r300: Remove unused RC_OPCODE_XPD.
1394- r300: Remove unused RC_OPCODE_SWZ.
1395- r300: Remove unused RC_OPCODE_CLAMP.
1396- r300: Remove unused RC_OPCODE_SFL
1397- r300: Remove unused RC_OPCODE_DPH
1398- ci/freedreno: Add known flakes from the last month.
1399- ci/crocus: Add support for manual CI runs on my G41.
1400- crocus: Clamp VS point sizes to the HW limits as required.
1401- glsl: Delete the vectorization opt pass.
1402- glsl: Delete the optimize_redundant_jumps pass.
1403- glsl: Remove dead prototype for old do_discard_simplification().
1404- glsl: Remove comment about non-existing DFREXP_TO_ARITH
1405- glsl: Retire unused modes for lower_64bit_integer_instructions.
1406- ci/r300: Add another xfail on the main branch.
1407- r300/vs: Allocate temps we see a use as a source, too.
1408- r300/vs: Reuse rc_match_bgnloop().
1409- r300/vs: Fix flow control processing just after an endloop.
1410- ci: Enable a build with MSan.
1411- glcpp: Disable the valgrind tests.
1412- softpipe: Drop the quad pstipple stage.
1413- softpipe: Use the draw module's poly stipple handling, like llvmpipe.
1414- softpipe: Drop duplicate decl of softpipe_find_fs_variant
1415- ci/crocus: Add manual CI for the new HSW box I have at home.
1416- ci: Enable reporting to the flakes IRC channel for i915g and crocus.
1417- ci/i915g: Add a couple more recent regressions.
1418- nir_to_tgsi: Use the same address reg mappings as GLSL-to-TGSI did.
1419- gallium: Delete PIPE_CAP_TGSI_ANY_REG_AS_ADDRESS
1420- tgsi/exec: Simplify indirects now that they always use the ADDR file.
1421- i915g: Turn off FP16 in the vertex shaders.
1422- r300: Use uif() instead of pointer aliasing in program printing.
1423- ci/r300: Update xfails from a full dEQP run.
1424- r300: Fix omod failing to increase the number of channels stored.
1425- nir_to_tgsi: Enable fdot_replicates flag.
1426- ci: Shrink container/rootfs sizes.
1427- nir_to_tgsi: Fix a bug in TXP detection after backend lowering.
1428- ci/llvmpipe: Sort the list of traces.
1429- ci/llvmpipe: Add a trace for the game JVGS, which got regressed recently.
1430- ci: Add paraview traces to several drivers.
1431- freedreno/ir3: Use nir_opt_offset for removing constant adds for shared vars.
1432- nir: Apply nir_opt_offsets to nir_intrinsic_load_uniform as well.
1433- r300: Drop unused r300_get_stats() call.
1434- r300: Add consts (uniforms) count to the shader-db output.
1435- nir/lower_dynamic_bo_access: Use copy_inst_indices for our cloned instrs.
1436- nir: Add a .base field to nir_load_ubo_vec4.
1437- nir/opt_offsets: Disable unsigned wrap checks on non-native-integers HW.
1438- nir/opt_offsets: Also apply the max offset to top-level constant folding.
1439- nir_to_tgsi: Use nir_opt_offsets for load_ubo_vec4.
1440- nir/opt_offsets: Use nir_ssa_scalar to chase offset additions.
1441- softpipe: respect !independent_blend_enable for color masks.
1442- softpipe: Request that st fix up DST_ALPHA blending for RGB render targets.
1443- ci/softpipe: Drop the GS sampling known-flakes.
1444- nir/lower_locals_to_regs: Do an ad-hoc copy propagate on our generated MOV.
1445- tgsi_exec: Fix shared var stores for >1 real invocation, and overflow checks.
1446- softpipe: Improve some local var naming in compute shaders.
1447- softpipe: Initialize the CS dispatch mask at machine setup time.
1448- softpipe: Dispatch 4 CS invocations per tgsi_exec thread.
1449- Revert "ci: freedreno: Update a530 dEQP fail expectation list"
1450- ci/llvmpipe: Drop the skip of piglit edgeflag test.
1451- ci/llvmpipe,softpipe: Switch piglit testing to piglit-runner.
1452- ci/traces: Rename the piglit/run.sh script to piglit-traces.sh.
1453- ci/traces: Drop the PIGLIT_PROFILES setting for traces replay.
1454- ci/traces: Clean up the failure report message.
1455- ci/traces: Drop the baseline file creation for trace results.
1456- ci/traces: Drop PIGLIT_REPLAY_UPLOAD_TO_MINIO.
1457- ci/traces: Always generate the junit XML.
1458- ci/freedreno: Update hashes for closed traces.
1459- vulkan: Fix leak of error messages
1460- ci: Use a dlclose-disabling preload library for leak checking in Vulkan.
1461- ci/freedreno: Switch 2 default a630 VK jobs to being GLES and VK ASan jobs.
1462- turnip: Store the computed iova in the tu_buffer.
1463- turnip: Store the computed iova in the tu_image.
1464- r300: Disable fp16 and int16 in swtcl vertex shaders.
1465- r300/r600: Add drm-shim support.
1466- freedreno/isaspec: Add missing dep of encode.py/decode.py calls on isa.py
1467- intel: Add missing dep of gen_*_header.py on utils.py.
1468- i915g: Initialize the rest of the "from_nir" temporary VS struct.
1469
1470Enrico Galli (3):
1471
1472- microsoft/compiler: Shadow tex instructions always use shadow samplers
1473- microsoft/spirv_to_dxil: Add DXIL validation to spirv2dxil
1474- ci/windows: Add validation tests for spriv_to_dxil
1475
1476Eric Engestrom (33):
1477
1478- VERSION: bump to 21.4
1479- docs: reset new_features.txt
1480- VERSION: bump to 22.0
1481- docs: update calendar for 21.3.0-rc1
1482- docs: update calendar for 21.3.0-rc2
1483- docs: update calendar for 21.3.0-rc3
1484- docs: update calendar for 21.3.0-rc4
1485- release-calendar: fix date for next 21.3 rc
1486- meson: drop duplicate addition of surfaceless & drm to the list of platforms
1487- meson: move \`egl_native_platform` definition inside the \`with_egl` block
1488- meson: drop impossible \`if no platform` branch
1489- meson: always define \`HAVE_{X11,XCB}_PLATFORM` when it's enabled
1490- meson: automatically define \`HAVE_{some}_PLATFORM`
1491- docs: update calendar for 21.3.0-rc5
1492- docs/submittingpatches: add link to section describing how to make a backport MR
1493- docs/submittingpatches: add formatting around the release branches names
1494- docs/submittingpatches: mention use of the \`-x` flag of \`git cherry-pick` when backporting a commit
1495- docs: update branchpoint instructions
1496- docs: add release notes for 21.3.0
1497- docs: update calendar and link releases notes for 21.3.0
1498- docs: add 21.3.x release schedule
1499- docs: add 22.0 branchpoint date for perspective
1500- docs: add release notes for 21.3.1
1501- docs: update calendar and link releases notes for 21.3.1
1502- docs: add release notes for 21.3.2
1503- docs: update calendar and link releases notes for 21.3.2
1504- docs: add release notes for 21.3.3
1505- docs: update calendar and link releases notes for 21.3.3
1506- docs: add release notes for 21.3.4
1507- docs: update calendar and link releases notes for 21.3.4
1508- docs: add release notes for 21.3.5
1509- docs: update calendar and link releases notes for 21.3.5
1510- docs/release-calendar: add another 21.3.x since 22.0 has been delayed a bit
1511
1512Erico Nunes (5):
1513
1514- ci: temporarily disable lima CI
1515- mesa: fix GL_MAX_SAMPLES with GLES2
1516- lima/gpir: avoid invalid write in regalloc
1517- lima/ppir: initialize slots array for dummy/undef
1518- lima/ppir: refactor bitcopy to use unsigned char
1519
1520Erik Faye-Lund (9):
1521
1522- docs: update trademark disclaimer
1523- CODEOWNERS: remove ownership of deleted code
1524- ci: remove testing of deleted code
1525- docs: remove mentions of deleted code
1526- docs: remove stale notice about deleted dir
1527- ensure csv-files are crlf on disk
1528- bin/gen_calendar_entries: fix newlines on windows
1529- docs: use http-redirect when possible
1530- docs: remove incorrect drivers from extension
1531
1532Felix DeGrood (2):
1533
1534- anv: increase binding table pool size to 64KB
1535- pps: increase intel.cfg buffer size
1536
1537Filip Gawin (8):
1538
1539- r300: improve precission of linear interpolation
1540- r300: stub derivatives on r300 and r400 hardware
1541- nir: assert that variables in optimize_atomic are initialized
1542- glsl: fix trivial strict aliasing warning
1543- radv: dont call calloc when BVH is empty
1544- iris: fix mapping compressed textures
1545- r300: fix handling swizzle in transform_source_conflicts
1546- r300: replace recursive calls with loops
1547
1548Francisco Jerez (28):
1549
1550- intel/fs/xehp: Teach SWSB pass about the exec pipeline of FS_OPCODE_PACK_HALF_2x16_SPLIT.
1551- intel/fs: Add physical fall-through CFG edge for unconditional BREAK instruction.
1552- intel/dev: Fix size of device info num_subslices array.
1553- intel/dev: Add support for pixel pipe subslice accounting on multi-slice GPUs.
1554- intel/dev: Implement DG2 restrictions requiring additional DSSes to be disabled.
1555- intel/xehp: Implement XeHP workaround Wa_14013910100.
1556- intel/xehp: Implement XeHP workaround Wa_14014148106.
1557- intel/xehp: Update 3DSTATE_PS maximum number of threads per PSD.
1558- intel/fs: Don't assume packed dispatch for fragment shaders on XeHP.
1559- intel/blorp/gfx12+: Drop unnecessary state cache invalidation from binding table setup.
1560- intel/genxml: Fix SLICE_HASH_TABLE struct on XeHP.
1561- iris: Merge gfx11\_ and gfx12_upload_pixel_hashing_tables() into the same function.
1562- intel: Move pixel hashing table computation into common header file.
1563- intel: Minimal calculation of pixel hash table for arbitrary number of pixel pipes.
1564- intel: Rename intel_compute_pixel_hash_table() to intel_compute_pixel_hash_table_3way().
1565- iris: Program pixel hashing tables on XeHP.
1566- anv: Program pixel hashing tables on XeHP.
1567- intel/xehp: Switch to coarser cross-slice pixel hashing with table permutation.
1568- iris/xehp: Implement workaround for 3D texturing+anisotropic filtering.
1569- intel/fs/xehp: Merge repeated in-order read dependencies instead of replacement.
1570- intel/fs: Move legal exec type calculation into helper function in lower_regioning pass.
1571- intel/fs: Teach the lower_regioning pass how to split instructions of unsuported exec type.
1572- intel/fs: Take into account region strides during SIMD lowering decision of SHUFFLE.
1573- intel/fs: Fix destination suboffset calculations for non-trivial strides in SHUFFLE codegen.
1574- intel/fs: Perform 64-bit SHUFFLE lowering in the lower_regioning pass.
1575- intel/fs: Perform 64-bit SEL_EXEC lowering in the lower_regioning pass.
1576- intel/fs: Honor strided source regions specified by the IR for CLUSTER_BROADCAST.
1577- intel/fs: Perform 64-bit CLUSTER_BROADCAST lowering in the lower_regioning pass.
1578
1579Georg Lehmann (8):
1580
1581- meson: Use get_supported_arguments more often.
1582- meson: Remove some unnecessary loops.
1583- amd/addrlib: Use get_supported_arguments to get compiler args.
1584- radv: Increase maxFragmentCombinedOutputResources.
1585- vulkan/wsi/wayland: Fix add_wl_shm_format alpha/opaqueness.
1586- vulkan/wsi/wayland: Convert  missing vulkan formats to shm formats.
1587- vulkan/wsi/wayland: Add modifiers for RGB formats.
1588- vulkan/wsi/wayland: Fix add_drm_format_modifier aplha/opaqueness.
1589
1590Gert Wollny (7):
1591
1592- virgl: Add driconf tweak to force-enable reading back R8_SRGB textures
1593- virgl: obtain supported number of shader sampler views from host
1594- ci: pin virglrenderer version
1595- virgl: Enable higher compatibility profiles if host supports it
1596- util/primconvert: map only index buffer part that is needed
1597- nir_lower_io: propagate the "invariant" flag to outputs
1598- ntt: Set the output invariant flag according to the semantics
1599
1600Greg V (2):
1601
1602- util: __getProgramName: remove check for ancient FreeBSD versions, simplify ifdefs
1603- util: make util_get_process_exec_path work on FreeBSD w/o procfs
1604
1605Guido Günther (6):
1606
1607- etnaviv/drm: Use etna_mesa_debug for debugging messages
1608- etnaviv/drm: Add some bo debug output
1609- etnaviv/drm: Print gpu model at debug verbosity
1610- etnaviv/drm: Use mesa_log* for debugging
1611- entaviv/drm: Use same log format as gallium bits
1612- etnaviv: Use mesa_log*
1613
1614Guilherme Gallo (14):
1615
1616- ci: Update linux kernel to v5.15
1617- iris/ci: Fix traces for amly and deqp list for whl
1618- ci/freedreno: Add maxcpus=2 to the kernel cmdline on a530
1619- panfrost/ci: update piglit tests expectations on G52
1620- ci: Update ci-fairy to version with --token-file support
1621- ci: Uprev piglit
1622- ci: Use ci-fairy minio login via token file
1623- ci: Build skqp on ARM64 images
1624- ci: Add a630_skqp jobs
1625- ci: skqp: Add documentation on how to maintain skqp jobs
1626- ci: Uprev Kernel to v5.16
1627- ci: freedreno: Update a530 dEQP fail expectation list
1628- ci: Add docs for Linux Kernel uprevs
1629- virgl/ci: make crosvm-runner pass variables in a secure way
1630
1631Hamish Arblaster (1):
1632
1633- zink: Fix building on macOS
1634
1635Henry Goffin (1):
1636
1637- intel/compiler/test: Fix build with GCC 7
1638
1639Hoe Hao Cheng (3):
1640
1641- zink/codegen: support platform tags
1642- zink/codegen: remove core_since in constructor
1643- zink/codegen: remove bogus print statement
1644
1645Hyunjun Ko (10):
1646
1647- turnip: expose VK_KHR_driver_properties
1648- anv: Fix to honor the spec to get stencil layout.
1649- radv: Fix to honor the spec to get stencil layout.
1650- vulkan/util: Move helper functions for depth/stencil images to vk_iamge
1651- turnip: Enable VK_KHR_separate_depth_stencil_layouts
1652- turnip: Use the new common device lost tracking
1653- vulkan: fix typo
1654- turnip: Porting to common vulkan implementation for synchronization.
1655- turnip: Porting to common implementation for timeline semaphore
1656- turnip: fix leaks of submit requests.
1657
1658Iago Toral Quiroga (63):
1659
1660- v3dv: fix TLB buffer to image copy path for 3D images
1661- v3dv: enable Vulkan 1.1
1662- broadcom/compiler: disallow tsy barrier in thrsw delay slots
1663- broadcom/compiler: fix assert that current instruction must be in current block
1664- v3dv: refactor TFU jobs
1665- broadcom/compiler: rework simultaneous peripheral access checks
1666- broadcom/compiler: fix condition encoding bug
1667- broadcom/compiler: padding fixes to QPU assembly dumps
1668- broadcom/compiler: make opt passes set current block
1669- broadcom/compiler: check that sig packing is valid when pipelining ldvary
1670- broadcom/compiler: copy packing when converting add to mul
1671- v3dv,v3d: don't store swizzle pointer in shader/pipeline keys
1672- v3d: use V3D_MAX_DRAW_BUFFERS instead of hardcoded constant
1673- v3dv: account for multisampling when computing subpass granularity
1674- v3dv: don't use a global constant for default pipeline dynamic state
1675- v3d,v3dv: move tile size calculation to a common helper
1676- v3dv: fix internal bpp of D/S formats
1677- broadcom/compiler: fix early fragment tests setup
1678- broadcom/compiler: don't allow RF writes from signals after thrend
1679- broadcom/compiler: fix scoreboard locking checks
1680- broadcom/compiler: don't move ldvary earlier if current instruction has ldunif
1681- broadcom/compiler: allow color TLB writes in last instruction
1682- broadcom/compiler: relax restriction on VPM inst in last thread end slot
1683- broadcom/compiler: emit passthrough Z write if shader reads Z
1684- broadcom/compiler: track passthrough Z writes
1685- v3d,v3dv: don't disable EZ for passthrough Z writes
1686- broadcom/compiler: improve documentation for Z writes
1687- broadcom/compiler: improve thrsw merge
1688- v3dv: add a refcount mechanism to BOs
1689- v3dv: add swizzle helpers to identify formats wit R/B swap and reverse flags
1690- v3dv: handle formats with reverse flag
1691- v3dv: implement VK_EXT_4444_formats
1692- v3dv: implement double-buffer mode
1693- v3d: implement double-buffer mode
1694- docs/features: flag VK_KHR_create_renderpass2 as implemented for v3dv
1695- broadcom/simulator: handle DRM_V3D_PARAM_SUPPORTS_MULTISYNC_EXT
1696- v3dv: implement VK_KHR_driver_properties
1697- broadcom/compiler: add lowering pass to scalarize non 32-bit general load/store
1698- broadcom/compiler: better document vectorization implications
1699- broadcom/compiler: implement TMU general 16-bit load/store
1700- broadcom/compiler: lower packing after vectorization
1701- broadcom/compiler: support ldunifa with some 16-bit loads
1702- broadcom/compiler: use ldunifa with unaligned constant offset
1703- broadcom/compiler: implement 32-bit/16-bit conversion opcodes
1704- broadcom/compiler: support f32 to f16 RTZ and RTE rounding modes
1705- v3dv: support VK_KHR_16_bit_storage
1706- broadcom/compiler: support 16-bit uniforms
1707- v3dv: expose storagePushConstant16 feature from VK_KHR_16bit_storage
1708- broadcom/compiler: support 8-bit general store access
1709- broadcom/compiler: handle to/from 8-bit integer conversions
1710- broadcom/compiler: support 8-bit loads via ldunifa
1711- broadcom/compiler: allow vectorization to larger scalar type
1712- broadcom/compiler: update comment on load_uniform fast-path
1713- v3dv: support VK_KHR_8bit_storage
1714- v3dv: drop signature of undefined function
1715- v3dv: implement VK_KHR_imageless_framebuffer
1716- v3dv: rework Vulkan 1.2 feature queries
1717- v3dv: document why we don't expose VK_EXT_scalar_block_layout
1718- v3dv: support resolving depth/stencil attachments
1719- v3dv: fallback to blit resolve if render area is not aligned to tile boundaries
1720- v3dv: expose VK_KHR_depth_stencil_resolve
1721- v3dv: don't submit noop job if there is nothing to wait on or signal
1722- broadcom/compiler: fix offset alignment for ldunifa when skipping
1723
1724Ian Romanick (35):
1725
1726- nir/loop_unroll: Always unroll loops that iterate at most once
1727- glsl/nir: Don't build soft float64 when it cannot be used
1728- nir/constant_folding: Optimize txb with bias of constant zero to tex
1729- intel/compiler: Don't predicate a WHILE if there is a CONT
1730- intel/compiler: Don't store "scalar stage" bits on Gfx8 or Gfx9
1731- intel/stub: Suppress warnings about DRM_I915_QUERY_PERF_CONFIG
1732- intel/stub: Implement DRM_I915_QUERY_ENGINE_INFO
1733- intel/stub: Implement DRM_I915_QUERY_MEMORY_REGIONS
1734- intel/stub: Implement I915_PARAM_HAS_USERPTR_PROBE
1735- intel/fs: Use HF as destination type for F32TOF16 in fquantize2f16
1736- mesa: OpenGL 1.3 feature GL_ARB_texture_border_clamp is not optional
1737- mesa: OpenGL 1.3 feature GL_ARB_texture_cube_map is not optional
1738- mesa: OpenGL 1.3 feature GL_ARB_texture_env_combine is not optional
1739- mesa: OpenGL 1.3 feature GL_ARB_texture_env_dot3 is not optional
1740- mesa: OpenGL 1.3 and OpenGL ES 1.0 are not optional
1741- intel/stub: Silence "initialized field overwritten" warning
1742- intel/stub: Implement shell versions of DRM_I915_GEM_GET_TILING and DRM_I915_SEM_GET_TILING
1743- intel/fs: Fix gl_FrontFacing optimization on Gfx12+
1744- ntt: Extend ntt_compile::addr_declared and ntt_compile::addr_reg
1745- mesa: OpenGL 1.4 feature GL_ARB_depth_texture is not optional
1746- mesa: OpenGL 1.4 feature GL_ARB_texture_env_crossbar is not optional
1747- mesa: OpenGL 1.4 feature GL_EXT_blend_color is not optional
1748- mesa: OpenGL 1.4 feature GL_EXT_blend_func_separate is not optional
1749- mesa: OpenGL 1.4 feature GL_EXT_blend_minmax is not optional
1750- mesa: OpenGL 1.4 feature GL_EXT_point_parameters is not optional
1751- mesa: OpenGL ES 1.1 is not optional
1752- gallivm/nir: Call nir_lower_bool_to_int32 after nir_opt_algebraic_late
1753- nir: All set-on-comparison opcodes can take all float types
1754- intel/fs: Don't optimize out 1.0*x and -1.0*x
1755- spriv: Produce correct result for GLSLstd450Step with NaN
1756- spirv: Produce correct result for GLSLstd450Modf with Inf
1757- spirv: Produce correct result for GLSLstd450Tanh with NaN
1758- nir: Properly handle various exceptional values in frexp
1759- nir: Produce correct results for atan with NaN
1760- nir: Add missing dependency on nir_opcodes.py
1761
1762Icecream95 (9):
1763
1764- panfrost: Set PIPE_CAP_QUADS_FOLLOW_PROVOKING_VERTEX_CONVENTION
1765- pan/bi: Add interference between destinations
1766- pan/bi: Check dependencies of both destinations of instructions
1767- panfrost: Set dirty state in set_shader_buffers
1768- panfrost: Re-emit descriptors after resource shadowing
1769- pan/bi: Make disassembler build reproducibly
1770- panfrost: Fix ubo_mask calculation
1771- pan/bi: Skip psuedo sources in ISA.xml
1772- pan/bi: Don't assign slots for the blend second source
1773
1774Ilia Mirkin (50):
1775
1776- freedreno: support lua54
1777- meson: build freedreno tools when other parts of freedreno not enabled
1778- freedreno: check batch size after the fallback blitter clear
1779- freedreno/ir3: fix setting the max tf vertex when there are no outputs
1780- freedreno/a4xx: fix min/max/bias lod sampler settings
1781- mesa: add missing state to state string computation
1782- nv50,nvc0: expose R8/RG8_SRGB formats for texturing
1783- mesa: add just a tiny bit of debug info to some _mesa_problem calls
1784- freedreno: prefer float immediates when float values are involved
1785- freedreno/a[345]xx: add R8/RG8 SRGB formats
1786- a5xx: remove astc srgb workaround logic
1787- freedreno/ir3: fix image-to-tex flags, remove 3d -> array hack
1788- nir: remove double-validation of src component counts
1789- freedreno/a4xx: add proper buffer texture support
1790- freedreno/a5xx: re-express buffer textures more logically
1791- freedreno/a6xx: re-express buffer textures more logically
1792- freedreno/a4xx: fix stencil-textured border colors
1793- freedreno/a4xx: indicate whether outputs are uint/sint
1794- freedreno/a4xx: include guesses from a3xx for some of the constid's
1795- freedreno/a4xx: hook up sample mask/id, used to determine helper invocs
1796- freedreno/ir3: adjust condition for when to use ldib
1797- mesa: check target/format for Tex(ture)StorageMem*
1798- mesa: move around current texture object fetching
1799- freedreno/a4xx: add missing SNORM formats to help tests pass
1800- freedreno/a4xx: add some missing legacy formats to help TBOs
1801- freedreno/a5xx: add missing L8A8_UNORM format to support TBOs
1802- freedreno/a[345]xx: fix unorm/snorm blend factors when they're "over"
1803- nir: always keep the clip distance array size updated
1804- nir: apply interpolated input intrinsics setting when lowering clipdist
1805- freedreno/ir3: remove unused actual_in counting
1806- nir/lower_clip: replace bogus comment about gl_ClipDistance reading in GL
1807- nir/lower_clip: location offset goes into offset, not base
1808- nir/lower_clip: increment num_inputs/outputs by appropriate amount
1809- nir/lower_clip: support clipdist array + no vars
1810- freedreno/ir3: indicate that clipdist arrays are in use
1811- freedreno/ir3: get the post-lowering clip/cull mask
1812- ci: move windowoverlap exclusion to all-skips
1813- freedreno/ci: add piglit runs for a306
1814- freedreno/ci/a306: add additional skip which hangchecks
1815- freedreno/a3xx: add some legacy formats
1816- freedreno/ci/a306: separate msaa fails
1817- freedreno/ci/a306: split off the f32 blend / texturing failures
1818- freedreno/ci/a306: split off snorm blending failures
1819- freedreno/a5xx: enable OES_gpu_shader5
1820- gitlab-ci: serial close can leave an active read
1821- gitlab-ci: detect a3xx gpu hang recovery failure
1822- nv50,nvc0: add new caps to list
1823- nv50: don't claim support for format-less stores
1824- freedreno/ci/a306: add more skips
1825- freedreno/ci/a306: increase concurrency
1826
1827Italo Nicola (4):
1828
1829- virgl/vtest: use correct resource stride in flush_frontbuffer
1830- virgl/vtest: implement resource_create_front
1831- virgl: flush cmd buffer when flushing frontbuffer
1832- drisw: do an MSAA resolve when copying the backbuffer
1833
1834Iván Briano (15):
1835
1836- intel/nir: also allow unknown format for getting the size of a storage image
1837- anv: Remove unused struct member
1838- anv: add functions to set up fake render passes
1839- anv: allocate fake render pass on pipeline creation
1840- anv: Split out state attachments allocation
1841- anv: allocate fake render pass for continuation command buffers
1842- anv: Split attachment clearing code into their own functions
1843- anv: split end_subpass into more discrete components
1844- anv: implement the meat of VK_KHR_dynamic_rendering
1845- anv: Enable VK_KHR_dynamic_rendering
1846- anv: Handle resolveImageLayout on dynamic rendering
1847- anv: Report the right conformance version
1848- anv: make the pointer valid before we assign stuff into it
1849- anv: only advertise 64b atomic floats if 64b floats are supported
1850- intel/compiler: make CLUSTER_BROADCAST always deal with integers
1851
1852Jakob Bornecrantz (1):
1853
1854- vulkan-device-select: Don't leak drmDevicePtr
1855
1856James Jones (1):
1857
1858- gbm: Don't pass default usage flags on ABIs < 1
1859
1860James Park (5):
1861
1862- vulkan, radv: Support backslash in ICD paths
1863- aco: Work around MSVC restrict in c99_compat.h
1864- ac: Align ADDR_FASTCALL with addrlib
1865- radv: Match function definitions to declarations
1866- meson: Update libelf wrap for Windows
1867
1868Jan Beich (1):
1869
1870- meson: disable -Werror=thread-safety on FreeBSD
1871
1872Jan Zielinski (3):
1873
1874- gallium/swr: Remove driver source
1875- gallium/swr: Remove common code and build options
1876- gallium/swr: clean up the documentation after SWR removal from main
1877
1878Jason Ekstrand (158):
1879
1880- vulkan/log: Tweak our handling of a couple error enums
1881- vulkan/log: Don't assert on non-client-visible objects
1882- vulkan/wsi/win32: Delete the wrapper entrypoints
1883- i965: Emit a NULL surface for buffer textures with no buffer
1884- nir: Add a nir_foreach_image_variable() iterator
1885- i965/uniforms: Handle images as a separate pass
1886- amd/llvm/nir: Add support for nir_var_mem_image
1887- aco: Add support for nir_var_mem_image
1888- glsl/nir_linker: nir_var_mem_image is also a GL uniform
1889- nir/gl_nir_lower_samplers_as_deref: Support nir_var_mem_image
1890- nir/gl_nir_lower_images: Support nir_var_mem_image
1891- st/nir: Assign uniform locations to nir_var_mem_image vars
1892- ntt: Separate image and sampler handling
1893- llvmpipe: Support image variables living in nir_var_mem_image
1894- nir/lower_readonly_images_to_tex: Also rewrite variable modes
1895- clover: Insert dummy uniform variables for images
1896- clover: Use nir_foreach_image_variable for images
1897- zink: Images can live in nir_var_mem_image now
1898- ir3: Check for nir_var_mem_image in shared_barrier handling
1899- spirv: Assert that OpTypeForwardPointer only points to structs
1900- glsl: Use nir_var_mem_image for images
1901- ttn: Use nir_var_mem_image
1902- st/pbo: Use nir_var_mem_image for images
1903- nir: Validate image variable modes
1904- aco: Split var_mem_image barrier handling from global/ssbo
1905- intel/fs: Stop emitting TGM fences for nir_var_mem_ssbo
1906- nir/gl_nir_lower_images: Require nir_var_mem_image
1907- ir3: Images are always nir_var_mem_image
1908- nir: Drop our attempt at typed-based image mode validation
1909- nir: s/nir_var_mem_image/nir_var_image/g
1910- nir: Re-arrange the variable modes
1911- nir/serialize: Pack deref modes better
1912- compiler/types: Combine image and sampler type serialization
1913- compiler/types: Unify the guts of get_sampler/image_count
1914- compiler/types: Add a texture type
1915- nir: Allow texture types
1916- nir/deref: Also optimize samplerND -> textureND casts
1917- lavapipe: Allow for texture types
1918- clover/nir: Don't remove texture variables
1919- spirv: Use texture types for sampled images
1920- compiler/types: Add a wrap_in_arrays helper
1921- anv: Implement VK_EXT_global_priority_query
1922- glsl/nir/linker: Also remove image variables
1923- vulkan/vk_extensions_gen: Drop support for extra includes
1924- vulkan/vk_extensions_gen: Stop including vk_object.h
1925- v3dv: Use vk_instance_get_proc_addr_unchecked for WSI
1926- lavapipe: Use vk_instance_get_proc_addr_unchecked for WSI
1927- vulkan: Rework mako error handling in python generators
1928- vulkan: Drop unnecessary [en]coding comments from python generators
1929- vulkan/dispatch_table: EntrypointBase doesn't need to derive from object
1930- vulkan: Break entrypoint parsing into its own file
1931- vulkan: Move trampoline code-gen to its own file
1932- vulkan: Move all the common object code to runtime/
1933- lavapipe: Don't wrap errors returned from vk_device_init in vk_error
1934- intel: Add has_bit6_swizzle to devinfo
1935- anv,iris: Advertise a max 3D workgroup size of 1024^3
1936- anv/allocator: Add a couple of helpers
1937- anv: Pull aperture size from devinfo
1938- anv/allocator: Use anv_device_release_bo in anv_block_pool_finish
1939- anv: Rename anv_bo::index to exec_obj_index
1940- anv: Add get/set_tiling helpers
1941- anv: Add a couple more checks in MapMemory
1942- anv: Fix FlushMappedMemoryRanges for odd mmap offsets
1943- anv: Add helpers in anv_allocator for mapping BOs
1944- anv: Always set bindless surface base on SKL+
1945- anv: Int64 atomics don't need to depend on softpin
1946- anv: Add a anv_use_relocations helper and use it
1947- anv: Add a use_relocations physical device bit
1948- anv: Stop checking for HAS_EXEC_FENCE
1949- anv: Add an anv_bo_is_pinned helper
1950- anv: Also disallow CCS_E for multi-LOD images
1951- anv: Move device memory maps back to anv_device_memory
1952- vulkan: Pull the device lost framework from ANV
1953- anv: Use the new common device lost tracking
1954- vulkan/device: Add a check_status hook
1955- anv: Wire up the new status check
1956- vulkan: Add a vk_sync base class
1957- vulkan/meson: Re-arrange libvulkan_util deps a bit
1958- vulkan/util: Include stdlib.h
1959- vulkan/vk_device: Add a drm_fd field
1960- vulkan: Add a common vk_drm_syncobj struct
1961- c11/threads: Re-align return values for timed waits
1962- vulkan: Add an emulated timeline sync type
1963- vulkan: Add a common implementation of VkFence
1964- vulkan: Add a common implementation of VkSemaphore
1965- vulkan: Add common implementations of vkQueueSubmit and vkQueueWaitIdle
1966- vulkan/device: Log the timeline mode when lost
1967- vulkan/wsi: Drop wsi_common_get_current_time()
1968- wsi/display: Rework wsi_fence a bit
1969- vulkan/wsi/display: Wrap wsi_display_fence in a vk_sync
1970- anv: Delete ANV_SEMAPHORE_TYPE_DUMMY
1971- anv: Add a BO sync type
1972- anv: Remove the last remnants of in/out fences
1973- anv: Use helpers in util/os_time.h in the query code
1974- anv: Convert to the common sync and submit framework
1975- vulkan: Add an emulated binary vk_sync type
1976- anv: Remove unnecessary syncobj wrappers
1977- anv: Simplify submit_simple_batch()
1978- vulkan,anv: Auto-detect syncobj features
1979- vulkan: Add a dummy sync type
1980- vulkan/sync: Rework asserts a bit
1981- crocus: wm_prog_key::key_alpha_test uses GL enums
1982- intel/fs,vec4: Drop uniform compaction and pull constant support
1983- intel/fs,vec4: Drop support for shader time
1984- intel/blorp: Stop depending on prog_data binding tables
1985- intel/fs,vec4: Drop prog_data binding tables
1986- intel/compiler: Get rid of wm_prog_key::frag_coord_adds_sample_pos
1987- intel/fs: Drop high_quality_derivatives
1988- anv: Stop doing too much per-sample shading
1989- intel/dev: Add gtt_size to devinfo
1990- Revert "anv: Stop doing too much per-sample shading"
1991- vulkan/queue: Handle WSI memory signal information
1992- anv: Implement vk_device::create_sync_for_memory
1993- anv: Drop anv_sync_create_for_bo
1994- radeonsi/nir: Check for VARYING_SLOT_PRIMITIVE_ID not SYSTEM_VALUE
1995- nir: Add a new sample_pos_or_center system value
1996- intel/fs: Return fs_reg directly from builtin setup helpers
1997- intel/fs: Rework emit_samplepos_setup()
1998- intel/fs: Implement the sample_pos_or_center system value
1999- anv,nir: Use sample_pos_or_center in lower_wpos_center
2000- anv/pipeline: Get rid of sample_shading_enable
2001- intel/fs: Be more conservative in split_virtual_grfs
2002- intel/fs: Use OPT() for split_virtual_grfs
2003- intel/eu: Don't double-loop as often in brw_set_uip_jip
2004- Revert "intel/fs: Do cmod prop again after scheduling"
2005- intel/fs: Reset instruction order before re-scheduling
2006- intel/fs: Add a NONE scheduling mode
2007- vulkan/runtime: Validate instance version on 1.0 implementations
2008- anv,radv,v3dv: Move AcquireImageANDROID to common code
2009- radv: Move QueueSignalReleaseImageANDROID to common code
2010- anv: Use the common QueueSignalReleaseImageANDROID from RADV
2011- v3dv: Use the common QueueSignalReleaseImageANDROID from RADV
2012- turnip: Use vk_common_AcquireImageANDROID
2013- turnip: Use vk_common_QueueSignalReleaseImageANDROID for DRM
2014- intel/compiler: Stop using GLuint in brw_compiler.h
2015- intel/fs: Use compare_func for wm_prog_key::alpha_test_func
2016- spirv,radv: Fix some GL enum comments
2017- vulkan/runtime: Implement 1.3 features/properties
2018- anv/pass: Don't set first_subpass_layout for stencil-only attachments
2019- .mailmap: Switch Jason Ekstrand to @collabora.com
2020- anv: Implement 1.3 features/properties
2021- anv: Advertise Vulkan 1.3
2022- vulkan/wsi: Set MUTABLE_FORMAT_BIT in the prime path
2023- vulkan/wsi/drm: Break create_native_image in pieces
2024- vulkan/wsi: Add a helper for the configure/create/bind pattern
2025- vulkan/wsi/drm: Break create_prime_image in pieces
2026- vulkan/wsi/x11: Split image creation
2027- vulkan/wsi/wayland: Split image creation
2028- vulkan/wsi/display: Split image creation
2029- vulkan/wsi/win32: Delete unnecessary copy+paste from DRM
2030- vulkan/wsi/win32: Break create_win32_image in pieces
2031- vulkan/wsi/drm: Drop wsi_create_native/prime_image
2032- wsi/common: Set VK_IMAGE_CREATE_ALIAS_BIT
2033- vulkan/wsi: Add image create and bind helpers
2034- anv/image: Add some asserts when binding swapchain images
2035- anv/image: Call into WSI to create swapchain images
2036- anv: Call vk_command_buffer_finish if create fails
2037- anv: Don't assume depth/stencil attachments have depth
2038
2039Jason2013 (1):
2040
2041- Fix typo
2042
2043Jesse Natalie (236):
2044
2045- microsoft/clc: Images use nir_var_mem_image
2046- util/hash_table: Clear special 0/1 entries for u64 hash table too
2047- microsoft/compiler: Use textures for SRVs
2048- u_prim_restart: Fix index scanning with start offset
2049- d3d12: Don't accumulate timestamp queries
2050- u_threaded_context: Support including from C++
2051- d3d12: Inherit from threaded_query
2052- d3d12: Resources inherit from threaded_resource
2053- d3d12: Inherit from threaded_transfer
2054- d3d12: Use thread safe slab allocators in transfer_map handling
2055- d3d12: Pass explicit context to pre/post draw surface blits
2056- d3d12: Hook up threaded context
2057- d3d12: Fully init primconvert config
2058- d3d12: Fix Linux fence wait return value
2059- d3d12: Handle non-infinite wait timeouts > 49.7 days as infinite
2060- util/libsync: Fix timeout handling if poll() wakes up early
2061- d3d12: Don't wait for \*all* batches when synchronizing a resource
2062- d3d12: Don't wait for GPU reads to do CPU reads
2063- mesa/main: Fix use of alloca() without #include "c99_alloca.h"
2064- clc: Use kernel_arg_type_qual string to add const type qualifier to arg metadata
2065- microsoft/clc: Add a test for arg metadata
2066- d3d12: Fix incorrect hash table usage
2067- meson: Don't override built-in cpp_rtti option, error if it's invalid
2068- meson: Allow mismatching RTTI for MSVC
2069- android: Add a BOARD CFlags option so build can be customized
2070- mesa/main, android: Log errors to logcat
2071- android,d3d12: Support using DirectX-Headers dependency from AOSP
2072- android: Allow forcing softpipe
2073- d3d12: Support BGRA 555 and 565 formats
2074- d3d12: Support PIPE_CAP_MIXED_COLOR_DEPTH_BITS
2075- d3d12: Support RGBX formats mapped to RGBA
2076- microsoft/compiler: Handle GLES external textures
2077- gallium, windows: Use HANDLE instead of FD for external objects
2078- winsys/d3d12: Populate winsys handle format
2079- d3d12: Handle import/export of fd shared handles
2080- d3d12: Make format list all use macros
2081- d3d12: Generate format table using a macro list
2082- d3d12: Generate a pipe format -> typeless mapping table too
2083- d3d12: Validate opened D3D12 resource matches pipe template
2084- CI/windows: Uprev piglit
2085- CI/windows: Upload result.txt as an artifact
2086- CI/windows: Move D3D12 test YML to D3D12 driver folder
2087- CI/windows: Move SPIRV-to-DXIL test YML to microsoft folder
2088- CI/windows: Move reference files to relevant ci subdirectories
2089- CI/d3d12: Add a quick_shader run
2090- d3d12: Support compat level 330
2091- windows: Use TLS context/dispatch with shared-glapi
2092- d3d12: Handle depth readback on drivers that require full-resource copies for depth
2093- nir: Add an 'external' texture type for parity with samplers
2094- d3d12: Force emulation of all YUV formats using per-plane formats
2095- d3d12: Handle opening planar resources
2096- d3d12: Allow creating planar resources
2097- d3d12: Use overall resource format + plane format to get format info
2098- microsoft/compiler: Implement inot
2099- microsoft/compiler: Remove algebaric pass for inot
2100- ci/windows: Remove line numbers from assertions in spirv2dxil tests
2101- glapi: Never use dllimport/dllexport for TLS vars on Windows
2102- microsoft/compiler: Support lowered io (nir_intrinsic_load_input/store_output)
2103- microsoft/compiler: Lower io
2104- microsoft/compiler: Delete non-sysval deref load/store code
2105- microsoft/compiler: Load synthesized sysvals via lowered io
2106- d3d12: Fix NV12 resource importing
2107- softpipe: Add a dummy field to sp_fragment_shader_variant_key
2108- CI: Trigger Windows build on softpipe changes
2109- microsoft/compiler: Emit SSBOs from 0 -> count for GL (non-kernel, non-Vulkan) shaders
2110- microsoft/compiler: Hook up uavs-at-every-stage flag
2111- microsoft/compiler: Handle write masks in SSBO lowering pass
2112- d3d12: Support SSBOs in root signatures
2113- d3d12: Always create buffers as UAV-capable
2114- d3d12: Support setting SSBOs on the context and turning them into descriptors
2115- d3d12: Use DXIL load/store lowering pass
2116- d3d12: Set SSBO support caps
2117- d3d12: Avoid a debug warning trying to unmap a not-mapped resource
2118- d3d12: Replace pipe cap literals with D3D12 defines when available
2119- d3d12: Enable cubemap arrays
2120- microsoft/compiler: Position should always be no-perspective
2121- d3d12: Handle cubemap gather on int cubemaps
2122- d3d12: Enable texture gather
2123- microsoft/compiler: Fix LOD instruction to return 2 values
2124- gallium/aux: Move index offsetting from prim restart to primconvert
2125- microsoft/compiler: Change vulkan_environment bool to an enum
2126- microsoft/compiler: Put SSBO and image handles in separate arrays
2127- microsoft/compiler: Emit GL images in descriptor space 1 with driver_location instead of binding
2128- microsoft/compiler: Unify handle retrieval between images and UBO/SSBO
2129- microsoft/compiler: Emit SRVs/UAVs as arrays
2130- microsoft/compiler: Fix array-of-array handling for derefs of textures/images
2131- microsoft/compiler: Handle images as derefs for GL
2132- microsoft/compiler: Implement atomic image ops
2133- microsoft/compiler: Handle forced early depth
2134- microsoft/compiler: Hook up memory/control barriers
2135- microsoft/compiler: Fix handling of fp16-in-32bit-val ops to handle high bits
2136- d3d12: Shrink 2D array size so that max-layer cube arrays can be created
2137- d3d12: Fix format table typeless-ness for A8 and RGBA1010102
2138- d3d12: Rename UAV -> SSBO to disambiguate with image UAVs
2139- d3d12: Add missed SSBO binding enum value
2140- d3d12: Figure out if we can support GL shader images
2141- d3d12: Handle format support queries for shader images
2142- d3d12: Init null UAVs
2143- d3d12: Retrieve shader image dimensions during shader compiles
2144- d3d12: Handle images in the root signature
2145- d3d12: Handle set_shader_images
2146- d3d12: Create textures as UAV-capable when appropriate
2147- d3d12: Fill out shader image descriptor tables
2148- d3d12: Lower cube images to 2D arrays via existing int cubemap lowering pass
2149- d3d12: Handle memory barriers
2150- d3d12: Handle bitcasting of shader images
2151- d3d12: Set appropriate caps for shader images
2152- nir_opt_dead_cf: Remove dead ifs
2153- shader_info: tess.spacing needs to be unsigned
2154- microsoft/compiler: Move workgroup_size lowering from clc
2155- microsoft/compiler: Handle more GL memory barriers
2156- d3d12: Limit sampler view count to 32
2157- d3d12: Keep state vars last in the per-stage root parameters
2158- d3d12: Remove draw_info from selection_context
2159- d3d12: Stop trying to set D3D12_DIRTY_SHADER during bindings
2160- d3d12: Compile, bind, and cache compute PSOs
2161- d3d12: Support compute root signatures
2162- d3d12: Hook up compute shader variations
2163- d3d12: Implement launch_grid
2164- d3d12: Implement num workgroups as a state var
2165- d3d12: Handle indirect dispatch
2166- d3d12: Run DXIL shared atomic lowering pass
2167- d3d12: Enable compute
2168- docs: Update d3d12 extension list and new_features.txt
2169- d3d12/ci: Skip flaky tex-miplevel-selection and timestamp tests
2170- mesa/st: Assert that NIR drivers that support tess use tess levels as inputs
2171- d3d12: Export d3d12_get_state_var from d3d12_nir_passes.c
2172- d3d12: Fix re-enabling predication after temporary disablement
2173- d3d12: Predication fix: re-enable after restarting a batch if needed
2174- d3d12: Predication fix: For boolean queries used for predication, D3D12 uses uint64, so clear at least a uint64 in the result
2175- d3d12: Declare support for inverted conditional render
2176- d3d12: Upgrade first vertex state var into all vertex draw params
2177- d3d12: Enable base instance and draw params extensions
2178- d3d12: Add a command signature cache for indirect draws
2179- d3d12: Handle draw indirect and multi-draw indirect
2180- d3d12: Handle indirect twoface draws
2181- d3d12: Add a compute transformation to handle indirect draws that need draw params
2182- d3d12: Enable draw and multi-draw indirect
2183- docs: Update d3d12 features
2184- d3d12: When no framebuffer attachments are present, use ForcedSampleCount instead of SampleDesc.Count for MSAA
2185- d3d12: When no framebuffer attachments are present, the viewport must be clamped to framebuffer size
2186- d3d12: Support ARB_framebuffer_no_attachments
2187- docs: Update d3d12 feature list
2188- ci/windows: Use 2 container stages
2189- microsoft/compiler: Handle variables declared per-sample
2190- microsoft/compiler: Handle load_sample_pos_at_id
2191- microsoft/compiler: Always have at least one GS active stream
2192- microsoft/compiler: Handle 'pull model' explicit interpolation intrinsics
2193- microsoft/compiler: Handle textureGatherCmp
2194- microsoft/compiler: Handle input coverage
2195- microsoft/compiler: Handle tex texture/sampler offset srcs
2196- microsoft/compiler: Handle load_invocation_id for GS and HS
2197- microsoft/compiler: Emit samplers as array types
2198- microsoft/compiler: Handle bitfield_insert
2199- microsoft/compiler: Use ibfe/ubfe for bitfield extract instead of lowering to shifts
2200- microsoft/compiler: Handle msb/lsb/bfrev
2201- microsoft/compiler: Lower helper invocations
2202- d3d12: Sample mask output needs to be uint-typed
2203- d3d12: Lower load_sample_pos to load_sample_pos_at_id
2204- d3d12: Report sample positions
2205- d3d12: Modify shaders when MSAA is disabled
2206- d3d12: Relax multisampling direct copy requirements
2207- d3d12: Temp resources for same-resource copies can be MSAA too
2208- d3d12: Report number of GS streams
2209- d3d12: Apply GS point sprite lowering to fixed-function point size too
2210- d3d12: Run point sprite lowering pass on multi-stream GS when safe
2211- d3d12: Support dynamic UBO/SSBO indexing
2212- d3d12: When mapping a non-directly-mappable resource for write, readback first
2213- d3d12: Set sample-rate shading and GLSL 400 caps
2214- docs: Update d3d12 features
2215- mesa/st: Lower user clip planes for tess eval too
2216- microsoft/compiler: Force integer I/O vars to use flat/constant interpolation
2217- microsoft/compiler: Use driver_location instead of location for inter-stage varying index in GL
2218- microsoft/compiler: Semantic table should be de-duped for multi-row semantics too
2219- microsoft/compiler: Multi-row output semantics need to write multiple never_writes_masks
2220- microsoft/compiler: Getting a builtin function with an undeclared signature should be unreachable
2221- microsoft/compiler: Add mapping from MESA_SHADER_* to DXIL_*_SHADER for tessellation
2222- microsoft/compiler: Fix typo in enum entry
2223- microsoft/compiler: Emit statically-indexed resource handles and scratch later
2224- microsoft/compiler: Support emitting multiple functions into a DXIL module
2225- microsoft/compiler: Emit functions with actual function names
2226- microsoft/compiler: Emit all NIR functions into the DXIL module
2227- microsoft/compiler: Handle store_per_vertex_output for HS outputs
2228- microsoft/compiler: Split hull (tess ctrl) shaders into main and patch constant funcs
2229- microsoft/compiler: Delete misleading TODO comments about semantic table
2230- microsoft/compiler: Emit HS PSV validation and entrypoint metadata
2231- microsoft/compiler: Fix I/O signatures for tess shaders
2232- microsoft/compiler: Overlap patch and non-patch varyings so both are separately 0-indexed
2233- microsoft/compiler: When sorting patch varyings, adjust location to be in normal varying range
2234- microsoft/compiler: Gather patch const signature and handle tess factor in it
2235- microsoft/compiler: Add patch constant signature into PSV and as container blob
2236- microsoft/compiler: Add a pass for hull and domain shaders to shrink tess level vars
2237- microsoft/compiler: For store_output from HS, use storePatchConstant
2238- microsoft/compiler: For load_input from DS, use loadPatchConstant
2239- microsoft/compiler: Handle load_per_vertex_output as LoadOutputControlPoint
2240- microsoft/compiler: Handle load_output in the HS stage as reading a previously written patch constant
2241- microsoft/compiler: Handle domain location intrinsic
2242- microsoft/compiler: Emit DS PSV validation and entrypoint metadata
2243- microsoft/compiler: Primitive ID should only be added as a sysval in geometry shaders
2244- microsoft/compiler: Location_frac needs to be included in sort order
2245- microsoft/compiler: Handle clip/cull distance as an input to tess shaders
2246- d3d12: Enable PIPE_CAP_TGSI_TEXCOORD
2247- d3d12: Initial plumbing for tesselation
2248- d3d12: Link tesselation control and eval shaders
2249- d3d12: Handle patch_vertices and patch topology
2250- d3d12: Handle passthrough TCS in the case where eval is bound
2251- d3d12: Add a state variable for patch_vertices_in
2252- d3d12: Update varying creation logic to handle location_frac
2253- d3d12: Handle input clip array size in the shader key
2254- d3d12: Set caps for tesselation
2255- microsoft/compiler: Fix UAV resource ID counting for static indexed handles
2256- d3d12: Fix compute transform for multi-draw indirect with dynamic count + state vars
2257- d3d12: Add UAV barriers for UAVs that are being used by compute transforms
2258- d3d12: Include SO buffer count as a PSO dirty bit
2259- d3d12: Support transform feedback pause/resume
2260- d3d12: Move indirect compute to real indirect dispatches
2261- d3d12: SO buffer filled size is only 32-bit
2262- d3d12: Add a comment for what the existing compute transform does
2263- d3d12: Add a couple compute transforms for "fake" SO buffers
2264- d3d12: Add a compute transform for draw auto
2265- d3d12: Move compute transform state save/restore to compute_transforms.cpp
2266- d3d12: Move "fake" SO buffer handling to compute transforms instead of CPU readback
2267- d3d12: Implement DrawAuto aka DrawTransformFeedback
2268- d3d12: Compute transform UBO0 is actually binding 1
2269- d3d12: Rewrite subquery logic
2270- d3d12: Switch primitives-generated query to use XFB, GS, and IA data
2271- d3d12: ARB_transform_feedback2
2272- microsoft/compiler: Correctly support I/O on variables with location_frac
2273- microsoft/compiler: Support multiple GS output streams
2274- d3d12: Unpack multi-stream varyings
2275- d3d12: Fix xfb varying matching for vars with location_frac
2276- d3d12: Handle indexed queries
2277- d3d12: ARB_transform_feedback3
2278- microsoft/compiler: Only prep phis for the current function
2279- microsoft/compiler: Only treat tess level location as special if it's a patch constant
2280- tc: CPU storage needs to be freed with align_free
2281
2282Jianxun Zhang (7):
2283
2284- intel: provide pci bus and dev info in base device struct
2285- intel: use PCI info to compute device uuid
2286- anv: remove private pci fields
2287- intel: dump PCI info in intel_dev_info
2288- intel: remove chipset_id
2289- intel: add swizzle flag into driver uuid
2290- anv: refactor queue chain
2291
2292Jonathan Gray (6):
2293
2294- util: unbreak non-linux mips64 build
2295- util: fix util_cpu_detect_once() build on OpenBSD
2296- radv: use MAJOR_IN_SYSMACROS for sysmacros.h include
2297- util/u_atomic: fix build on clang archs without 64-bit atomics
2298- util: fix build with clang 10 on mips64
2299- util: use correct type in sysctl argument
2300
2301Jonathan Marek (3):
2302
2303- freedreno/layout: Fix the UBWC block size for the Y plane
2304- turnip: enable UBWC for NV12
2305- turnip: use SUBDRAW_SIZE and constant sized tess bos
2306
2307Jordan Crouse (1):
2308
2309- turnip: Update the msm_kgsl.h header with the sanitized 4.19 version
2310
2311Jordan Justen (50):
2312
2313- intel/genxml: Update genxml to support tessellation/geometry distribution
2314- intel/dev/test: Assert (verx10 / 10) == ver
2315- Revert "iris: Disable I915_FORMAT_MOD_Y_TILED_GEN12* on adl-p/display 13"
2316- iris: Use mi_builder in iris_load_indirect_location()
2317- intel/genxml/125: Update COMPUTE_WALKER POSTSYNC_DATA struct
2318- anv,blorp,iris: Set MOCS for COMPUTE_WALKER post-sync operation
2319- intel/dev: Add platform enum with DG2 G10 & G11
2320- intel: Add intel_gem_count_engines
2321- intel: Add intel_gem_create_context_engines
2322- iris: Add iris_init_batches
2323- iris/batch: Move kernel context init to iris_init_non_engine_contexts
2324- iris/batch: Add exec_flags field
2325- iris: Move away from "hw" for some context terminology
2326- iris: Destroy all batches with a new iris_destroy_batches() function
2327- iris: Make iris_kernel_context_get_priority() public
2328- iris/batch: Add support for engines contexts
2329- intel/l3: Make DG1 urb-size exception more generic
2330- iris: Not all gfx12+ have aux_map_ctx
2331- anv: Align buffer VMA to 2MiB for XeHP
2332- iris: Align buffer VMA to 2MiB for XeHP
2333- nir/lower_tex: Add filter for tex offset lowering
2334- intel/compiler: Use nir_lower_tex_options::lower_offset_filter for tg4 on XeHP
2335- intel/genxml/12.5: Remove bt-pool enable from 3DSTATE_BINDING_TABLE_POOL_ALLOC
2336- anv: Add BINDING_TABLE_POOL_BLOCK_SIZE
2337- intel/compiler: Adjust TCS instance-id for dg2+
2338- isl: Don't enable HDC:L1 caches on DG2
2339- intel: Add device info for DG2
2340- intel: Add \*disabled* device ids for DG2
2341- intel/devinfo: Adjust L3 banks for DG2
2342- iris: Use mi_builder to set 3DPRIM registers for draws
2343- iris: Use mi_builder for load/store reg/mem/imm functions
2344- intel/dev: Add max_threads_per_psd field to devinfo for gfx8+
2345- anv,blorp,crocus,i965,iris: Use devinfo->max_threads_per_psd for gfx8+
2346- intel/dev: Add intel_hwconfig_types.h from random post on the internet
2347- intel/dev: Add intel_print_hwconfig_table()
2348- intel/dev: Print urb size with intel_dev_info
2349- intel/dev: Add intel_device_info::apply_hwconfig
2350- intel/dev: Set intel_device_info::apply_hwconfig for DG2
2351- intel/dev: Apply settings from hwconfig if devinfo::apply_hwconfig is set
2352- intel/dev: Recalculate max_cs_threads after applying hwconfig changes
2353- intel/gem: Return length from intel_i915_query_alloc
2354- intel/dev: Add DG1 PCI id 0x4909
2355- intel/dev: Add device ids for ADL-N
2356- intel/dev: Add device info for RPL
2357- intel/genxml: Extend length of 3DSTATE_WM_HZ_OP for gfx12.5
2358- intel/genxml: Extend length of 3DSTATE_DEPTH_BUFFER for gfx12.5
2359- isl: Enable compression with Tile4
2360- intel/fs: Assert that old pull-const code is not used if devinfo->has_lsc
2361- anv: Align GENERAL_STATE_POOL_MIN_ADDRESS to 2MiB
2362- anv: Align state pools to 2MiB on XeHP
2363
2364Joshua Ashton (8):
2365
2366- radv: Always inline descriptor writes
2367- radv: Split off cmd_buffer variant of descriptor set updates
2368- radv: Split off cmd_buffer variant of descriptor set template updates
2369- nvc0: Fix uninitialized width/height/depth warning.
2370- radv: Refactor S_FIXED to radv_float_to_{s,u}fixed
2371- radv: Expose min_lod in \*_make_texture_descriptor
2372- vulkan: Update the XML and headers to 1.2.199
2373- radv: Implement VK_EXT_image_view_min_lod
2374
2375José Fonseca (4):
2376
2377- d3d10umd: Rename Dxgi.h to DxgiFns.h.
2378- d3d10umd: Update for transfer interface changes.
2379- d3d10umd: Fix MSVC build.
2380- d3d10umd: Update for set_sampler_views take_ownership parameter.
2381
2382Juan A. Suarez Romero (22):
2383
2384- vc4/ci: update expected results
2385- v3dv/ci: update expected results
2386- broadcom/compiler: handle array of structs in GS/FS inputs
2387- broadcom/compiler: apply constant folding on early GS lowering
2388- nir: add NIR_DEBUG envvar
2389- mesa: allow TEXTURE_BUFFER target for ARB_texture_buffer_range
2390- st/pbo: do not use GS for NIR preferred shaders
2391- gallium/util: add helper to clamp colors to valid range
2392- v3d: clamp clear color
2393- gallium: add new PIPE_CAP_IMAGE_STORE_FORMATTED
2394- st/pbo: add the image format in the download FS
2395- st/pbo: set layer coord for array textures
2396- v3d: enable ARB_texture_view
2397- tgsi-to-nir: initialize NIR_DEBUG envvar
2398- nir: use call_once() to init debug variable
2399- broadcom/ci: restructure expected results
2400- softpipe: enable PIPE_CAP_IMAGE_STORE_FORMATTED
2401- d3d12: enable PIPE_CAP_IMAGE_STORE_FORMATTED
2402- mesa/st: do not expose ARB_shader_image_load_store if not fully implemented
2403- mesa: fix MAX_GEOMETRY_IMAGE_UNIFORMS check support
2404- v3d/doc: do not expose ARB_shader_image_load_store
2405- v3d: keep clear color untouched
2406
2407Karol Herbst (5):
2408
2409- spirv: Don't add 0.5 to array indicies for OpImageSampleExplicitLod
2410- clover/image: add templated basic_image class to simplify image subclassing
2411- clover/format: Full rework on how we declare supported images.
2412- clover/formats: pass in cl_mem_flags for better format checking
2413- clover/api: fix clGetMemObjectInfo for images
2414
2415Kenneth Graunke (82):
2416
2417- intel: Drop Tigerlake revision 0 workarounds
2418- crocus: Replace devinfo->ver[x10] checks with GFX_VER[x10]
2419- intel/genxml: Fix Indirect Object Access Upper Bound on Gfx4
2420- intel/genxml: Add an "mbz" data type
2421- intel/genxml: Drop "Hierarchical Depth Buffer MOCS" field
2422- intel/genxml: Change 3DSTATE_CONSTANT_XS::MOCS to be MBZ on Gfx8.
2423- isl: Fill in MOCS even for SURFTYPE_NULL surfaces.
2424- isl: Fill in MOCS for NULL depth, stencil, and HiZ buffers.
2425- blorp: Fill in MOCS even for SURFTYPE_NULL surfaces.
2426- blorp: Fill in MOCS for null depth/stencil/HiZ buffers.
2427- blorp: Use a non-zero MOCS for disabled constant buffers
2428- iris: Drop unnecessary parenthesis
2429- iris: Set Bindless Sampler State MOCS
2430- iris: Set default MOCS for NULL depth/stencil/HiZ buffers
2431- iris: Set MOCS on 3DSTATE_CONSTANT_XS on Gfx9+
2432- iris: Set MOCS on 3DSTATE_CONSTANT_ALL packets that disable all buffers
2433- iris: Set MOCS on NULL vertex buffers
2434- iris: Set MOCS on NULL stream output buffers
2435- iris: Fix MOCS for buffer copies
2436- anv: Set default MOCS for NULL depth/stencil/HiZ buffers
2437- anv: Set MOCS for 3DSTATE_CONSTANT_XS on Gfx7.x as well
2438- anv: Set MOCS in 3DSTATE_CONSTANT_XS even if there isn't a buffer.
2439- anv: Set MOCS on NULL vertex buffers
2440- anv: Set MOCS on NULL stream output buffers
2441- crocus: Set MOCS for most state base addresses on pre-Gen8
2442- crocus: Tidy the ifdefs for emitting STATE_BASE_ADDRESS
2443- crocus: Set MOCS for index buffers on Gen6+
2444- crocus: Set MOCS on NULL stream output buffers
2445- crocus: Set default MOCS for NULL depth/stencil/HiZ buffers
2446- crocus: Set MOCS for push constant buffers where possible
2447- crocus: Set MOCS for 3DSTATE_SO_BUFFERS on Gfx7.x too
2448- crocus: Fix MOCS for buffer copies.
2449- i965: Use ISL for MOCS rather than open coding it everywhere
2450- i965: Set default MOCS for NULL depth/stencil/HiZ buffers
2451- i965: Set MOCS for push constant buffers on Haswell and Gfx9+
2452- i965: Set MOCS on NULL stream output buffers
2453- i965: Set MOCS for 3DSTATE_SO_BUFFERS on Gfx7.x too
2454- i965: Set MOCS for 3DSTATE_INDEX_BUFFER on Gfx6/7 as well.
2455- i965: Fix MOCS for BLORP buffer copies
2456- i965: Port STATE_BASE_ADDRESS to genxml and fix bugs
2457- i965: Set MOCS for Bindless Surface/Sampler State base addresses
2458- intel/genxml: Add an field option for nonzero="true"
2459- intel/genxml: Assert that all MOCS fields are non-zero on Gfx7+
2460- intel/genxml: Include blitter commands in gen*_pack.h
2461- intel/genxml: Allow MI_FLUSH_DW on the blitter
2462- intel/genxml: Add XY_BLOCK_COPY_BLT on Tigerlake and later.
2463- iris: Fix parameters to iris_copy_region in reallocate_resource_inplace
2464- intel/genxml: Simplify prefix handling for field value lists
2465- intel/genxml: Collapse leading underscores on prefixed value defines
2466- intel/genxml: Fix MI_FLUSH_DW to actually specify the length properly
2467- intel/genxml: Fix XY_BLOCK_COPY_BLT destination tiling field type
2468- intel/genxml: Decode VALIGN/HALIGN values in XY_BLOCK_COPY_BLT
2469- iris: Make a helper function for cross-batch dependency flushing
2470- iris: Check for cross-batch flushing whenever a buffer is newly written.
2471- iris: Tidy code in iris_use_pinned_bo a bit
2472- blorp: Fix compute-blits for rectangles not aligned to the workgroup
2473- blorp: Don't try to use the 3D stencil write hardware for compute
2474- blorp: Assert that BLORP_BATCH_PREDICATE_ENABLE isn't set for compute
2475- blorp: Disallow multisampling for BLORP compute blits and copies.
2476- iris: Rename is_render_target to is_dest in a few blit functions
2477- isl: Move some genxml surface state helpers into an include file
2478- intel/vec4: Use ir_texture_opcode less in emit_texture()
2479- intel/vec4: Use nir_texop in emit_texture instead of translating
2480- intel/vec4: Inline emit_texture and move helpers to brw_vec4_nir.cpp
2481- intel/compiler: Use uppercase enum values in brw_ir_performance.cpp
2482- intel/fs: Reuse the same FS input slot for VUE header fields.
2483- iris: Use prog_data->inputs rather than shader info in SBE code.
2484- iris: Do primitive ID overrides in 3DSTATE_SBE not SBE_SWIZ
2485- iris: Directly access BOs rather than using iris_resource_bo(...)
2486- intel: Allow copy engine class in intel_gem_create_context_engines()
2487- intel/genxml: Add XY_BLOCK_COPY_BLT Color Depth enum values
2488- intel/dev: Add a has_flat_ccs flag
2489- blorp: Add a blorp_address::local_hint flag
2490- isl: Add isl_dev->mocs.blitter_{src,dst} fields
2491- blorp: Add support for blorp_copy via XY_BLOCK_COPY_BLT
2492- iris: Create an IRIS_BATCH_BLITTER for using the BLT command streamer
2493- iris: Only have one blorp_batch_init/finish in iris_copy_region()
2494- iris: Set BLORP_BATCH_USE_{COMPUTE,BLITTER} flags for the target batch
2495- iris: Implement iris_blorp_exec() for the blitter engine
2496- iris: Fix and refactor check for clear color being fully zero
2497- iris: Make an iris_foreach_batch macro that skips unsupported batches
2498- blorp: Add blorp_measure hooks to the blitter codepaths
2499
2500Khem Raj (1):
2501
2502- v3dv: account for 64bit time_t on 32bit arches
2503
2504Konstantin Seurer (4):
2505
2506- lavapipe: Fixed maxFragmentCombinedOutputResources
2507- anv: Fixed maxFragmentCombinedOutputResources
2508- turnip: Fixed maxFragmentCombinedOutputResources
2509- panvk: Fixed maxFragmentCombinedOutputResources
2510
2511Kostiantyn Lazukin (3):
2512
2513- util/u_trace: Replace Flag with IntEnum to support python3.5
2514- util/ra: use adjacency matrix for undirected graph
2515- util/ra: Fix numeric overflow during bitset allocation
2516
2517Krunal Patel (2):
2518
2519- frontends/va: use un-padded width/height in ExportSurfaceHandle
2520- frontend/va: Setting the size of VADRMPRIMESurfaceDescriptor
2521
2522Leandro Ribeiro (12):
2523
2524- egl: remove unnecessary spaces after types
2525- egl/wayland: replace EGL_DRI2_MAX_FORMATS by EGL_DRI2_NUM_FORMATS
2526- egl/wayland: deprecate drm_handle_format() and drm_handle_capabilities()
2527- egl/wayland: do not try to access memory if allocation failed
2528- egl/wayland: move formats and modifiers to a separate struct
2529- egl/wayland: remove unused constant EGL_DRI2_NUM_FORMATS
2530- loader: add function to get render node from dev_t
2531- egl/wayland: add initial dma-buf feedback support
2532- egl/wayland: move loader_dri_create_image() calls to separate functions
2533- egl/wayland: use surface dma-buf feedback to allocate surface buffers
2534- egl/wayland: do not try to bind to wl_drm if not advertised
2535- egl/wayland: fix surface dma-buf feedback error exits
2536
2537Lepton Wu (1):
2538
2539- driconf: Fix unhandled tags in static conf
2540
2541Lionel Landwerlin (119):
2542
2543- vulkan/wsi/wayland: don't expose surface formats not fully supported
2544- anv: fix push constant lowering with bindless shaders
2545- pps: remove counter_ids fields
2546- pps: add an intel config file
2547- docs: put a list of commands to setup perfetto
2548- intel/dev: printout timestamp period
2549- intel/pps: provide accurate min sampling period
2550- intel/pps: reuse timestamp_frequency from intel_device_info
2551- intel/dev: fix HSW GT3 number of subslices in slice1
2552- intel/dev: don't forget to set max_eu_per_subslice in generated topology
2553- intel/dev: reuse internal functions to set mask
2554- intel/dev: fix subslice/eu total computations with some fused configurations
2555- intel/perf: fix perf equation subslice mask generation for gfx12+
2556- intel/devinfo: use compatible type for ARRAY_SIZE
2557- intel/devinfo: fix wrong offset computation
2558- intel: remove 2 preproduction pci-id for ADLS
2559- intel: move away from booleans to identify platforms
2560- intel/dev: also test crocus & i915 pci-ids
2561- anv: don't forget to add scratch buffer to BO list
2562- anv: fix multiple wait/signal on same binary semaphore
2563- anv: don't try to close fd = -1
2564- anv: initialize anv_bo_sync base fields
2565- intel/fs: fix shader call lowering pass
2566- pps: allow drivers to report timestamps in their own time domain
2567- intel/perf: add a helper to read timestamp from reports
2568- pps: fixup sporadic missing counters
2569- intel/ds: drop timestamp correlation code
2570- intel/perf: track end timestamp of queries
2571- intel/ds: drop unused constructors
2572- intel/ds: isolate intel/perf from the pps-producer
2573- intel/pps: tweak intel config some more
2574- intel/ds: remove verbose messages
2575- intel: move timestamp scaling helper to intel/perf
2576- anv: fix execbuf syncobjs/syncobj_values array leak
2577- util/u_trace: refcount payloads
2578- nir/opt_deref: don't try to cast empty structures
2579- util/u_trace: add end_of_pipe property to tracepoints
2580- util/u_trace/perfetto: add new env variable to enable perfetto
2581- intel/nir: preserve access value when duping intrinsic
2582- nir/lower_io: include the variable access in the lowered intrinsic
2583- nir/print: printout ACCESS_STREAM_CACHE_POLICY
2584- nir: add a new access flag to allow access in helper invocations
2585- nir: add intrinsics for ray queries
2586- nir: track variables representing ray queries
2587- nir: add a ray query optimization pass
2588- spirv: handle ray query intrinsics
2589- intel/debug: reclaim 7 unused bits from classic driver
2590- genxml: fix compilation with P/I defines
2591- genxml: protect _length defines in genX_bits.h
2592- docs/envvars: update after INTEL_DEBUG cleanup
2593- util/u_vector: prevent C++ warning on cast from void* to something else
2594- vulkan: fix missing handling of WSI memory signal
2595- anv,wsi: simplify WSI synchronization
2596- gitlab-ci: disable radv-fossils
2597- intel/devinfo: adjust subslice array size
2598- util/u_trace: protect against reentrant calls
2599- anv: don't leave anv_batch fields undefined
2600- anv: limit compiler valid color outputs using NIR variables
2601- intel/dev: fixup chv workaround
2602- ci: disable vs2019 windows build
2603- intel/devinfo: printout pixel pipes in info printout
2604- intel/devinfo: printout devinfo struct size
2605- intel/devinfo: add a helper to check for slice availability
2606- intel/devinfo: drop num_eus_per_subslice field
2607- anv: fix perf queries
2608- intel/dev: extract slice/subslice total computation
2609- intel/devinfo: split out l3/pixelpipes counting
2610- intel/devinfo: deal with i915 topology query change
2611- intel/fs: disable VRS when omask is written
2612- blorp: add description & helpers to printout ops
2613- isl: add helpers to printout ops
2614- anv: expose a couple of emit helper to build utrace buffer copies
2615- intel/dev,perf: Use a single timescale function
2616- intel/blorp: add measure_end entry point
2617- anv: implement u_trace support
2618- intel/ds: reuse intel_ioctl()
2619- intel/ds: allow user to select metric set at start time
2620- intel/ds: don't forget to reset upper dword timestamp read
2621- intel/ds: use the right i915_drm.h include location
2622- intel/ds: use a per GPU clock ID
2623- util/u_process: protect entrypoints for c++
2624- anv: add perfetto source
2625- pps: enable anv source in example config file
2626- tools/pps: limit intel cfg to 250ms of sampling
2627- iris: utrace/perfetto support
2628- intel/dev: fix ppipe_mask computation
2629- anv: fix missing descriptor copy of bufferview/surfacestate content
2630- genxml: reduce amount of generated code
2631- anv: verify that the format supports multisampling
2632- anv: switch a bunch of struct/enum to 1.3 versions
2633- relnotes/features: updates for Vulkan 1.3
2634- intel/ci: expected failure for 1.3 with older CTS
2635- docs: start some documentation on Anv
2636- docs/anv: list environment variables
2637- docs/anv: add descriptor memory layout
2638- anv: tidy long lines in descriptor code
2639- docs: update INTEL_DEBUG environment variable documentation
2640- intel/tracepoint: simplify tracepoint descriptions
2641- util/u_trace: make mako conditional code easier to read
2642- util/utrace: make generated code a tiny bit nicer to look at
2643- compiler: add VARYING bit for primitive shading rate
2644- genxml: gen12.5 changes for CPS
2645- genxml: add new 3DSTATE_PS_EXTRA bit
2646- intel/dev: details CPS feature support
2647- isl: add support for coarse pixel control surfaces
2648- isl: disable CPB surface compression
2649- nir/builder: add ishl_imm helper
2650- intel/compiler: add a new pass to lower shading rate into HW format
2651- intel/compiler: add primitive rate output support
2652- anv: force primitive shading rate write in last geometry stage
2653- anv/pass: rely on precomputed dynamic rendering pass/subpass more
2654- anv: Update VK_KHR_fragment_shading_rate for newer HW
2655- intel/fs: don't set allow_sample_mask for CS intrinsics
2656- intel/nir: fix shader call lowering
2657- anv: fix conditional render for vkCmdDrawIndirectByteCountEXT
2658- nir: fix lower_memcpy
2659- anv/genxml/intel/fs: fix binding shader record entry
2660- anv: fix fast clear type value with external images
2661- intel/fs: fix total_scratch computation
2662
2663Lorenz Brun (1):
2664
2665- frontends/va: Return error in vaRenderPicture if decoder is NULL
2666
2667Louis-Francis Ratté-Boulianne (7):
2668
2669- microsoft/compiler: add support for load_layer_id
2670- microsoft/compiler: Add subpass input types
2671- microsoft/compiler: Use SRVs for read-only images
2672- microsoft/spirv_to_dxil: check for variables r/w access
2673- microsoft/spirv_to_dxil: lower input attachments
2674- microsoft/spirv_to_dxil: Lower push constant loads to UBO loads
2675- microsoft/spirv_to_dxil: Allow passing a vulkan -> d3d12 binding mapping table
2676
2677Lucas Stach (11):
2678
2679- etnaviv: fix alpha blend with dither on older GPUs
2680- etnaviv: initialize vertex attributes on context reset
2681- egl/wayland: break double/tripple buffering feedback loops
2682- etnaviv: drm: fix size limit in etna_cmd_stream_realloc
2683- etnaviv: drm: properly handle reviving BOs via a lookup
2684- etnaviv: drm: export BO idle check function
2685- etnaviv: drm: rename _etna_bo_del
2686- etnaviv: drm: defer destruction of softpin BOs
2687- egl/dri2: remove superfluous flush when changing the context
2688- egl/dri2: short-circuit dri2_make_current when possible
2689- etnaviv: draw: only mark resources as read/written when the state changed
2690
2691M Henning (2):
2692
2693- nouveau/nir: Use natural alignment for scalars
2694- nouveau/nir: Lower 64-bit phis
2695
2696Manas Chaudhary (2):
2697
2698- panvk: Add check for null fence
2699- panvk: Fix pointer corruption in panvk_add_wait_event_syncobjs
2700
2701Maniraj D (1):
2702
2703- egl: set TSD as NULL after deinit
2704
2705Manuel Stoeckl (2):
2706
2707- gbm: add missing R16 case in gbm_bo_get_bpp
2708- gbm: add GBM_FORMAT_GR1616 and RG1616
2709
2710Marcin Ślusarz (41):
2711
2712- iris: fix scratch address patching for TESS_EVAL stage
2713- intel: fix INTEL_DEBUG environment variable on 32-bit systems
2714- intel/decoder: Dump Task/Mesh shaders
2715- spirv: handle SpvOpMemberName
2716- intel/compiler: extract brw_nir_load_global_const out of rt code
2717- intel/compiler: Get mesh_global_addr from the Inline Parameter for Task/Mesh
2718- intel/compiler: Load draw_id from XP0 in Task/Mesh shaders
2719- nir: limit lower_clip_cull_distance_arrays input to traditional stages
2720- nir/print: simplify printing of IO semantics
2721- nir/print: expand printing of io semantics.gs_streams
2722- nir/print: compact printing of intrinsic indices
2723- nir/print: move print_load_const_instr up
2724- nir/print: group hex and float vectors together
2725- nir/print: print const value near each use of const ssa variable
2726- intel/compiler: disable workaround not applicable to gfx >= 11
2727- spirv: handle ViewportMaskNV builtin/cap from SPV_NV_mesh_shader
2728- compiler: add new MESH_VIEW_COUNT/MESH_VIEW_INDICES system values
2729- spirv: add MeshViewCountNV/MeshViewIndidcesNV builtins from SPV_NV_mesh_shader
2730- nir: add load_mesh_view_count and load_mesh_view_indices intrinsics
2731- spirv: handle multiview bits of SPV_NV_mesh_shader
2732- nir: remove invalid assert affecting per-view variables
2733- spirv: mark [Clip|Cull]DistancePerViewNV variables as compact
2734- nir: handle per-view clip/cull distances
2735- freedreno/rnn: normalize line endings in rules-ng.xsd
2736- microsoft/compiler: normalize line endings
2737- ci/windows: normalize line endings
2738- radv/ci: add line endings exception for files generated with wine
2739- Add new rules to .gitattributes
2740- intel/compiler: handle gl_[Clip|Cull]Distance in mesh shaders
2741- intel/compiler: handle gl_[Clip|Cull]Distance from mesh in fragment shaders
2742- intel/compiler: Use Task/Mesh InlineData for the first few push constants
2743- anv: Enable conditional rendering in vkCmdDrawMeshTasksNV
2744- anv: Add support for non-zero firstTask in vkCmdDrawMeshTasksNV
2745- anv: Add support for UBOs, SSBOs and push constants in Mesh pipeline
2746- anv: Implement indirect dispatch for Mesh pipeline
2747- anv: tell the hardware about gl_[Clip|Cull]Distance in mesh shaders
2748- anv: include ClipDistance array in mesh shader per-vertex output
2749- anv: Put first few push constants directly into Task/Mesh InlineData
2750- intel/compiler: fix array & struct IO lowering in mesh shaders
2751- anv: don't set color state when input state was requested
2752- intel/compiler: ignore per-primitive attrs when calculating flat input mask
2753
2754Marek Olšák (240):
2755
2756- gallium/util: add some extern "C" guards
2757- radeonsi: si_state_shaders.c -> cpp
2758- radeonsi: split si_shader_key into ps and ge parts to minimize memcmp overhead
2759- radeonsi: don't memcmp inlined uniform values if uniform inlining is disabled
2760- radeonsi: don't pass NULL into si_get_nir_shader
2761- radeonsi: replace the GS prolog with a monolithic shader variant
2762- radeonsi: enable shader culling on Navi1x consumer SKUs as well
2763- ac,radeonsi: print a lowercase codename in the renderer string
2764- radeonsi: reorder and don't print patch level DRM version in the renderer string
2765- ac/llvm: use fmac instead of mul+sub in face culling
2766- ac/llvm: add helper ac_build_is_inf_or_nan
2767- ac/llvm: accept primitives whose face culling determinant is Inf or NaN
2768- gallium,vbo: add PIPE_BIND_VERTEX_STATE for display lists
2769- gallium/u_threaded: implement pipelined partial buffer uploads using CPU storage
2770- mesa: fix crashes in the no_error path of glUniform
2771- radeonsi: add SI_MAX_VRAM_MAP_SIZE definition
2772- radeonsi: add an option to use CPU storage uploads for threaded context
2773- radeonsi: change bind_history to track usage in each shader stage
2774- radeonsi: rebind a buffer only in shader stages where it's been bound
2775- radeonsi: don't sync PS or CS before (clear|copy)_buffer based on bind history
2776- radeonsi: don't update bind_history for internal buffer clears and copies
2777- radeonsi: don't sync before clear_buffer and copy_buffer if the buffer is idle
2778- radeonsi: properly destroy buffers on failure
2779- winsys/amdgpu: remove force_chaining parameter from cs_check_space
2780- winsys/amdgpu: set max_ib_size and max_check_space_size later in cs_check_space
2781- radeonsi: don't set inline_uniforms for viewperf because it's enabled by default
2782- amd/addrlib: change how the license is formatted to match internal tree
2783- amd/addrlib: cosmetic addrlib update
2784- mesa: discard draws with count=0 to decrease overhead
2785- st/mesa: don't crash when draw indirect buffer has no storage
2786- mesa: remove USAGE_ELEMENT_ARRAY_BUFFER because it's unused and adding overhead
2787- mesa: move setting USAGE_PIXEL_PACK_BUFFER out of BindBuffer to reduce overhead
2788- mesa: remove redundant flagging USAGE_ARRAY_BUFFER
2789- mesa: add a no_error path to _mesa_handle_bind_buffer_gen
2790- glthread: don't execute display lists if they have no effect
2791- glthread: don't sync for glIsEnabled with a few enums
2792- glthread: add an option to make glCheckFramebufferStatus a no-op
2793- glthread: add a trivial thread-safe way to skip display list execution
2794- radeonsi: enable shader culling for indirect draws
2795- radeonsi: remove unused parameters in si_emit_draw_packets
2796- gallium/radeon: change the BO priority definitions to bits
2797- gallium/radeon: remove/merge some BO priorities and remove holes
2798- gallium/radeon: remove unused RADEON_DEPENDENCY_START_FENCE
2799- gallium/radeon: merge BO read/write usage flags with priority flags
2800- winsys/amdgpu: simplify parameter passing and derefs in cs_add_buffer
2801- winsys/amdgpu: remove an amdgpu_cs dereference from amdgpu_cs_add_buffer
2802- winsys/amdgpu: don't clear RADEON_USAGE_SYNCHRONIZED for last_added_bo_usage
2803- winsys/amdgpu: increase the BO hash list size
2804- winsys/amdgpu: don't use ip_instance and ring fields of fence and IB structures
2805- winsys/amdgpu: move BO fence array updates to the CS thread
2806- winsys/amdgpu: optimize looping inefficiencies in add_bo_fence_dependencies
2807- radeonsi: don't invoke si_decompress_depth if textures are not dirty at binding
2808- mesa: fix locking when destroying/overwriting/adding display lists
2809- mesa: remove display list OPCODE_NOP
2810- mesa: remove PADDING_64BIT by adding the dlist header into vbo_save_vertex_list
2811- vbo: return a GL error earlier in vbo_save_playback_vertex_list_gallium
2812- vbo: use int16_t for vbo_save_vertex_list::gallium::private_refcount
2813- vbo: restructure vbo_save_vertex_list to get more cache hits
2814- mesa: use alloca in search_resource_hash
2815- glsl: add gl_resource_name to precompute "name" properties later
2816- mesa: don't compute the same strlen up to 3x in _mesa_program_resource_find_name
2817- mesa: precompute strlen in gl_resource_name::length and use it
2818- mesa: rename locals in _mesa_program_resource_find_name for clarity
2819- mesa: preparse [ and [0] in gl_resource_name and use it in shader_query.cpp
2820- mesa: handle hash collisions in program resource lookups (e.g. uniforms)
2821- mesa: add separate hash tables for each GLSL resource type
2822- mesa: skip strlen when hashing strings for ProgramResourceHash
2823- radeonsi: print the border color error message only once
2824- util: add a util_bitcount variant that selects POPCNT through C++ template arg
2825- st/mesa: change st_atom_array.c to cpp
2826- st/mesa: use POPCNT in st_update_array if the CPU supports it
2827- mesa: change gl_vertex_array_object::NewArrays to bool
2828- mesa: add NewVertexBuffers/NewVertexElements flags to indicate state changes
2829- cso: add missing parameters into cso_set_vertex_buffers
2830- st/mesa: don't update vertex elements when GL doesn't change them
2831- driconf: set vblank_mode=0 for viewperf2020
2832- gallium/util: fix util_can_blit_via_copy_region with unbound render condition
2833- gallium/u_blitter: disable sample shading for all blits
2834- gallium/u_blitter: do MSAA copies in 1 pass using sample shading
2835- gallium/u_blitter: work around broken sample shading in llvmpipe and zink
2836- radeonsi: fix 2 issues with depth_cleared_level_mask
2837- radeonsi: add a faster clear path for glClearTexImage
2838- radeonsi: rename stencil_cleared_level_mask -> stencil_cleared_level_mask_once
2839- radeonsi: allow and finish TC-compatible MSAA HTILE
2840- radeonsi: fix a typo preventing a fast depth-stencil clear
2841- radeonsi: increase tc_max_cpu_storage_size
2842- vbo: utilize structure padding to optimize indirection cold->prims[0].begin
2843- driconf: disallow 10-bit pbuffers for viewperf2020/maya due to X errors
2844- gallium: rename PIPE_CAPF_MAX_POINT_WIDTH -> MAX_POINT_SIZE
2845- gallium: add missing point and line CAPs
2846- radeonsi: set correct point and line limits
2847- st/mesa: use new point and line CAPs
2848- nir: add new SSA instruction scheduler grouping loads into indirection groups
2849- radeonsi: enable nir_group_loads for better performance
2850- radeonsi: fix shader culling with integer pixel centers
2851- radeonsi: fix view culling for wide lines
2852- radeonsi: use ac_build_load_to_sgpr in gfx10_emit_ngg_culling_epilogue
2853- radeonsi: make si_get_small_prim_cull_info static
2854- radeonsi: set PERPENDICULAR_ENDCAP_ENA for wide AA lines
2855- radeonsi: set EXTRA_DX_DY_PRECISION for lines where it's supported
2856- radeonsi: add si_state_rasterizer::ngg_cull_flags_lines and rename the others
2857- ac,radeonsi: cull small lines in the shader using the diamond exit rule
2858- radeonsi: unify GFX9_VSGS_NUM_USER_SGPR and GFX9_TESGS_NUM_USER_SGPR
2859- radeonsi: add dcc_msaa option to enable DCC for MSAA
2860- radeonsi: improve memory instruction tracking
2861- radeonsi: remove an incorrect comment at lds_byte0_accept_flag
2862- radeonsi: make si_llvm_emit_clipvertex non-static
2863- radeonsi: unify how ngg_cull_flags are set
2864- radeonsi: cull against clip planes, clipvertex, clip/cull distances in shader
2865- radeonsi: inline declare_vs_specific_input_sgprs
2866- radeonsi: don't use GS SGPR6 for the small prim cull info
2867- glx: add a workaround to glXDestroyWindow for Viewperf2020/Sw
2868- radeonsi: separate culling code from VS/TES (to be reused by GS)
2869- radeonsi: restructure code that declares merged VS-GS and TES-GS SGPRs
2870- radeonsi: add is_gs parameter into si_vs_needs_prolog
2871- radeonsi: simplify si_get_vs_key_outputs for GS
2872- radeonsi: don't use ctx.stage outside of si_llvm_translate_nir
2873- radeonsi: implement shader culling in GS
2874- radeonsi: deduplicate min_esverts code in gfx10_ngg_calculate_subgroup_info
2875- mesa: don't add attenuation constants if ffvp doesn't use them
2876- mesa: add allow_glsl_compat_shaders for shader-db
2877- ac/gpu_info: don't fail on amdgpu_query_video_caps_info failures
2878- ac/surface: allow gfx6-8 to enter the gfx9 DCC codepath for SI_FORCE_FAMILY
2879- mesa: add a more straightforward callback for replacing shaders
2880- driconf: enable glthread for all Unigine benchmarks
2881- driconf: enable glthread for Minecraft
2882- driconf: enable glthread for Basemark GPU
2883- radeonsi: don't print uninitialized inlined_uniform_values
2884- radeonsi: add wave32 flag into prolog/epilog keys
2885- radeonsi: add si_shader::wave_size because it will vary
2886- radeonsi: use si_shader::wave_size
2887- radeonsi: don't use si_get_wave_size in si_get_ir_cache_key
2888- radeonsi: clean up compute_wave_size use in si_compute_blit.c
2889- radeonsi: propagate si_shader::wave_size to VGT_SHADER_STAGES
2890- radeonsi: don't use compute_wave_size directly
2891- radeonsi: centralize wave size computation in si_get_shader_wave_size
2892- radeonsi: rename si_get_shader_wave_size and make it non-inline
2893- mesa: remove SourceChecksum from shader structures
2894- mesa: rename gl_shader::sha1 to disk_cache_sha1
2895- mesa: add shader source SHA1s that are propagated up to glCompileShader
2896- mesa: add gl_linked_shader::linked_source_sha1
2897- mesa: don't compute the same SHA1 twice in glShaderSource
2898- util: add SHA1 printing and comparison functions
2899- nir: add shader_info::source_sha1, its initialization and printing
2900- radeonsi: print source_sha1 as part of shader dumps
2901- radeonsi: print the shader stage for shader-db dumps
2902- glthread: don't sync for more glGetIntegerv enums for glretrace
2903- mesa: use simple_mtx_t for TexMutex (v2)
2904- nir: handle more intrinsics in divergence analysis
2905- nir: disable a NIR test due to undebuggable & locally unreproducible CI failures
2906- nir: serialize divergent fields
2907- nir: add nir_has_divergent_loop function
2908- glsl: fix setting compiled_source_sha1 without a shader cache
2909- radeonsi: add Wave32 heuristics and shader profiles
2910- radeonsi: print more stats for shader-db
2911- radeonsi: add shader profiles that disable binning
2912- radeonsi: unroll loops of up to 128 iterations
2913- mesa: include less stuff in dlist.c
2914- mesa: inline vbo_initialize_save_dispatch and rename the functions
2915- mesa: inline _mesa_install_arrayelt_vtxfmt
2916- mesa: inline _mesa_install_eval_vtxfmt
2917- mesa: inline _mesa_install_dlist_vtxfmt
2918- mesa: move _mesa_initialize_vbo_vtxfmt calls to a common place and inline
2919- mesa: inline _vbo_install_exec_vtxfmt
2920- mesa: move the ES2 check from vbo_init_tmp.h to install_vtxfmt
2921- glapi: rename exec="dynamic" to exec "vtxfmt" to make it self-explanatory
2922- mesa: don't set CallList* redundantly in _mesa_initialize_save_table
2923- mesa: rename dlist functions to match dispatch function names
2924- glapi: rename gl_genexec.py to api_exec_init.py, api_exec.c to api_exec_init.c
2925- glapi: move apiexec API condition determination to common code
2926- glapi: replace dispatch.h inline functions with macros for faster compilation
2927- mesa: add EXT suffix to VertexAttribI*EXT to match glapi name
2928- vbo: rename vertex functions to match GL dispatch names
2929- vbo: rename ES vertex functions to match GL dispatch names
2930- glapi: move reusable glapi printing code to apiexec.py
2931- glapi: autogenerate _mesa_initialize_save_table with python
2932- glapi: autogenerate api_save.h with save_* function declarations
2933- mesa: remove api_exec.h and move its contents into context.h
2934- glapi: autogenerate all _mesa_* forward declarations in api_exec_decl.h
2935- glapi: autogenerate install_vtxfmt with python
2936- mesa: remove GLvertexformat
2937- vbo: expose all exec entrypoints for glthread and match api_exec_decl.h names
2938- glthread: add nop dispatch
2939- glthread: set marshal functions in dispatch only if they exist in the API
2940- glthread: inline _mesa_glthread_restore_dispatch and merge disable & destroy
2941- glthread: fix restoring the dispatch in destroy when the context is not current
2942- glthread: disable glthread if the context is lost
2943- mesa: use ctx->GLThread.enabled now that it's correct
2944- mesa: rename _ae_ArrayElement -> _mesa_ArrayElement to match glapi
2945- mesa: remove COPY_DISPATCH code that doesn't do anything
2946- glapi: add missing no_error settings for implemented functions
2947- mesa,vbo: make ES wrapper functions static
2948- mesa: remove all GL func forward declarations because they are autogenerated
2949- mesa: use nop dispatch for ColorTable/Convolution/Histogram
2950- radeonsi: add a debug option that disables DCC for all exported buffers
2951- driconf: enable glthread for Minecraft-FTB, Stellaris, Battletech
2952- ac/gpu_info: set cu_mask correctly for Arcturus
2953- ac/gpu_info: add AMD_CU_MASK environment variable to set CU_EN
2954- radeonsi: set COMPUTE_DESTINATION_EN_SEn to spi_cu_en
2955- radeonsi: program COMPUTE_STATIC_THREAD_MGMT_SE4..7 on Arcturus
2956- radeonsi: apply spi_cu_en to CU_EN
2957- radv: set COMPUTE_DESTINATION_EN_SEn to spi_cu_en
2958- radv: apply spi_cu_en to CU_EN
2959- radeonsi: pack si_pm4_state
2960- radeonsi: replace SI_PM4_MAX_DW with a max_dw field
2961- radeonsi: decrease the size of si_pm4_state::pm4 except for cs_preamble_state
2962- amd/registers: work around an assertion in parse_kernel_headers.py
2963- ac/llvm: add vindex into ac_build_buffer_store_dword
2964- ac/llvm: remove the num_channels parameter from ac_build_buffer_store_dword
2965- radeonsi: modifiers can't disable DCC
2966- radeonsi: make get_thread_id_in_tg non-static
2967- radeonsi: clean up si_export_mrt_color
2968- radeonsi: set done=1 for PS exports at the end of si_llvm_build_ps_epilog
2969- radeonsi: remove unnecessary code that was used to find the last export
2970- radeonsi: export mrtz before color exports
2971- radeonsi: simplify compacted_mrt_index in si_export_mrt_color
2972- radeonsi: reorder slots for internal buffers, reuse a slot for GS_QUERY_BUF
2973- radeonsi: don't bind the ESGS ring twice, handle the difference in the shader
2974- radeonsi: remove unused si_shader::prolog2
2975- radeonsi: pass sample_coverage VGPR index to the PS prolog instead of guessing
2976- radeonsi: move smoothing to the main shader part to remove 1 live VGPR
2977- radeonsi: do opt_large_constants & lower_indirect_derefs after uniform inlining
2978- radeonsi: use nir->scratch_size instead of ac_count_scratch_private_memory
2979- radeonsi: change si_shader_output_values::vertex_stream to a bitmask
2980- radeonsi: move si_nir_scan_shader into si_shader_info.c
2981- radeonsi: add into the disk cache key whether cached shaders contain LLVM IR
2982- radeonsi: move the GS copy shader into shader variants
2983- ac: move ac_exp_param.h to ac_nir.h
2984- ac/nir: move ac_are_tessfactors_def_in_all_invocs into radeonsi
2985- radeonsi: print all streamout info
2986- radeonsi: print the number of param exports for shader-db
2987- ac/surface: allow displayable DCC with any resolution (e.g. 8K)
2988- radeonsi: rename uses_vmem_* flags
2989- radeonsi: apply fbfetch/indirect_descriptor to uses_vmem_load_other earlier
2990- radeonsi: determine MEM_ORDERED after generating a shader variant
2991- winsys/radeon: fix a hang due to introducing spi_cu_en
2992- amd: add a workaround for an SQ perf counter bug
2993- ac/surface: add more elements to meta equations because HTILE can use them
2994- radeonsi: fix register shadowing after the pm4 state size was decreased
2995- radeonsi: fix an assertion failure with register shadowing
2996
2997Marius Hillenbrand (3):
2998
2999- util/cpu_detect: Add flag for IBM Z (s390x)
3000- llvmpipe: Use lp_build_round_arch on IBM Z (s390x)
3001- util/cpu_detect, gallium: use cpu_family CPU_S390X instead of separate flag
3002
3003Markus_included (1):
3004
3005- Fixed you're to your
3006
3007Martin Roukala (né Peres) (1):
3008
3009- radv/ci: mark the dEQP fails related to a missing VKCTS 1.3 as expected
3010
3011Matt Turner (7):
3012
3013- util/format: Add PIPE_FORMAT_Y8_UNORM as an "other" layout format
3014- tu: Expose required VK_FORMAT_FEATURE bits for planar YUV formats
3015- ir3: Add support for (dis)assembling flat.b
3016- freedreno/ir3: Add infrastructure for flat.b
3017- freedreno/ir3: Use flat.b to load flat varyings on a6xx
3018- freedreno/ir3: Use immediate for flat.b's src1
3019- intel/genxml: capitalize decoder mode select properly
3020
3021Matti Hamalainen (3):
3022
3023- aux/trace: print enum names instead of integer values in gallium traces
3024- aux/trace: implement missing trace calls
3025- aux/trace: cosmetic cleanup
3026
3027Mauro Rossi (3):
3028
3029- android: define cpp_rtti=false because libLLVM is built w/o RTTI
3030- Revert "android: define cpp_rtti=false because libLLVM is built w/o RTTI"
3031- android: define cpp_rtti=false because libLLVM is built w/o RTTI (v2)
3032
3033Melissa Wen (14):
3034
3035- v3dv: drop unused variable on handle_set_event_cpu_job
3036- v3dv: wrap wait semaphores info in v3dv_submit_info_semaphores
3037- v3dv: store wait semaphores in event_wait_cpu_job_info
3038- drm-uapi/v3d: extend interface for multiple semaphores support
3039- v3dv: check multiple semaphores capability
3040- v3dv: enable multiple semaphores on cl submission
3041- v3dv: enable multiple semaphores for tfu job
3042- v3dv: enable multiple semaphores for csd job
3043- v3dv: enable GPU jobs to signal multiple semaphores
3044- v3dv: track submitted jobs by GPU queue type
3045- v3dv: handle wait semaphores in the first job by queue
3046- v3dv: process signal semaphores in the very last job
3047- v3dv: signal fence when all submitted jobs complete execution
3048- v3dv: move sems_info from event_wait job to wait_thread info
3049
3050Michael Tang (2):
3051
3052- microsoft/spirv_to_dxil: turn sysvals into input varyings
3053- microsoft/compiler: Use memcpy instead of a union to write dxil_features
3054
3055Michel Dänzer (1):
3056
3057- Revert "wsi/x11: Avoid a class of deadlocks in the WSI queue thread"
3058
3059Michel Zou (8):
3060
3061- vulkan: fix uninitialized variables
3062- lavapipe: fix unused variable
3063- llvmpipe: Fix Wpointer-to-int-cast
3064- meson: check -mtls if has_exe_wrapper
3065- meson: correctly detect linker arguments
3066- zink: fix -Warray-bounds warning
3067- zink: fix unused variable warning
3068- vulkan/wsi: drop unused wsi_create_win32_image
3069
3070Mike Blumenkrantz (278):
3071
3072- nir/print: print bindless info as applicable
3073- aux/pb: add a tolerance for reclaim failure
3074- virgl: remove unused pipebuffer include
3075- aux/pb: more correctly check number of reclaims
3076- zink: use static array for detecting VK_TIME_DOMAIN_DEVICE_EXT
3077- zink: add a read barrier for indirect dispatch
3078- zink: fully zero surface creation struct
3079- zink: clear descriptor refs on buffer replacement
3080- zink: assert compute descriptor key is valid before hashing it
3081- zink: don't update lazy descriptor states in hybrid mode
3082- zink: move push descriptor updating into lazy-only codepath
3083- zink: add an early return for zink_descriptors_update_lazy_masked()
3084- zink: move last of lazy descriptor state updating back to lazy-only code
3085- lavapipe: stop reading renderpass during pipeline creation
3086- lavapipe: remove some unused struct members
3087- lavapipe: use framebuffer attachment_count member instead of renderpass
3088- lavapipe: remove lvp_subpass_attachment::in_render_loop
3089- lavapipe: remove lvp_subpass_attachment::layout
3090- lavapipe: add attachment index to lvp_render_pass_attachment
3091- lavapipe: remove lvp_subpass::max_sample_count
3092- lavapipe: remove lvp_subpass_attachment and use lvp_render_pass_attachment refs
3093- lavapipe: remove last VK_ATTACHMENT_UNUSED check
3094- lavapipe: store subpass directly to rendering_state
3095- lavapipe: simplify some attachment derefs
3096- lavapipe: remove lvp_subpass::has_color_att
3097- lavapipe: pull layer count from render state during resolve
3098- lavapipe: clamp attachment clear rect sizes
3099- zink: don't ralloc zink_resource structs
3100- zink: align pipe_resource and sampler_view allocations to cachelines
3101- zink: use ctx params for program ref/destroy functions
3102- zink: fix descriptor interface param for program_deinit
3103- zink: remove descriptor pools from hash table on deletion
3104- zink: unref descriptor pools in hybrid mode when they explode
3105- zink: always invalidate descriptor sets on pool free
3106- zink: don't clear descriptor pool cache on context destroy
3107- zink: stop leaking descriptor pool references
3108- zink: rescue surfaces/bufferviews for cache hits during deletion
3109- zink: rename zink_descriptor_layout_key::num_descriptors -> num_bindings
3110- zink: eliminate a hole in zink_descriptor_layout_key
3111- zink: reduce hashed region of zink_descriptor_layout_key
3112- zink: split out descriptor pool sizing into separate struct
3113- zink: detect prim type more accurately for tess/gs lines
3114- zink: don't break early when applying fb clears
3115- zink: only reset zink_resource::so_valid on buffer rebind
3116- zink: don't check rebind count outside of buffer/image rebind function
3117- zink: add notes about binding points which aren't counted in util funcs
3118- zink: stop exporting PIPE_SHADER_CAP_FP16_DERIVATIVES
3119- zink: don't add dynamic vertex pipeline states if no attribs are used
3120- zink: fix gl_SampleMaskIn spirv generation
3121- zink: more accurately update samplemask for fs shader keys
3122- nir/lower_samplers_as_deref: rewrite more image intrinsics
3123- nir/lower_io_to_scalar: add support for bo and shared io
3124- zink: run optimize_nir() only once during compile
3125- zink: move bo load offset adjustment to compiler passes
3126- zink: run lower_io_to_scalar before rewriting bo access
3127- zink: move all 64-32bit shader load rewriting to nir pass
3128- zink: move ssbo store offset adjustment to compiler passes
3129- zink: move shared intrinsic offset adjustments to compiler passes
3130- zink: move all 64-32bit shader store rewriting to nir pass
3131- zink: add more glsl base types to get_glsl_basetype()
3132- zink: move all shader bo/sharedmem access to compiler passes
3133- zink: add better handling for CUBE_COMPATIBLE bit
3134- zink: cache bo SpvId array types
3135- zink: use align64 for allocation sizes
3136- zink: set aspectMask for renderpass2 VkAttachmentReference2 structs
3137- zink: always use explicit lod for texture() when legal in non-fragment stages
3138- zink: be more permissive for injecting LOD into texture() instructions
3139- zink: inject LOD for sampler version of OpImageQuerySize
3140- zink: flag renderpass change when toggling fbfetch
3141- zink: be more spec-compliant for unnormalizedCoordinates samplers
3142- zink: don't clamp cube array surfacess to cubes
3143- zink: don't clamp 2D_ARRAY surfaces to 2D
3144- zink: error when trying to allocate a bo larger than heap size
3145- zink: clamp max buffer sizes to smallest buffer heap size
3146- zink: explicitly enable VK_EXT_shader_subgroup_ballot
3147- zink: add more int/float types to cast switching in ntv
3148- zink: force float dest types on some alu results
3149- zink: stop double printing validation messages
3150- zink: add SpvCapabilityStorageImageMultisample for multisampled storage images
3151- zink: reject all storage multisampling if the feature is unsupported
3152- gallium: add PIPE_CAP_VERTEX_ATTRIB_ELEMENT_ALIGNED_ONLY
3153- zink: set PIPE_CAP_VERTEX_ATTRIB_ELEMENT_ALIGNED_ONLY
3154- zink: add queue locking
3155- build: add sha1_h to llvmpipe build
3156- mesa/st: rework psiz lowering
3157- mesa/st: lower psiz for shader precompile
3158- anv: disable debug logging spam
3159- zink: set fbfetch state on lazy batch data when enabling it
3160- zink: always use lazy (non-push) updating for fbfetch descriptors
3161- vk: update headers for 1.2.197
3162- lavapipe: fix cmd queuing for dynamic render
3163- lavapipe: VK_KHR_dynamic_rendering
3164- features: add dynamic render for lavapipe
3165- zink: do a better job conserving locations for packed xfb outputs
3166- zink: clamp PIPE_SHADER_CAP_MAX_INPUTS for xfb
3167- zink: remove lazy ci job
3168- zink: add khr46 to ci
3169- aux/primconvert: handle singular incomplete restarts
3170- zink: rework cached fbfetch descriptor fallback
3171- lavapipe: add some asserts for descriptor dynamic offsets
3172- gallivm: handle TGSI SampleId sysval
3173- Revert "gallium/u_blitter: work around broken sample shading in llvmpipe and zink"
3174- aux/trace: fix PIPE_QUERY_PIPELINE_STATISTICS_SINGLE tracing
3175- aux/trace: support pipe_context::get_query_result_resource
3176- aux/trace: trace pipe_screen::is_format_supported better
3177- aux/trace: fix vertex state tracing
3178- aux/trace: add pipe_context::render_condition_mem
3179- zink: set new point/line caps
3180- zink: update radv ci passes
3181- zink: flatten out draw templates a bit
3182- zink: declare int/float size caps inline with type usage
3183- zink: simplify 64bit vertex attrib lowering
3184- zink: add another compiler pass to convert 64bit vertex attribs
3185- zink: add 1DShadow sampler handling for drivers (radv) that don't support it
3186- zink: radv ci updates for 1dshadow stuff
3187- zink: implement cs uniform inlining
3188- zink: always inline uniforms when running on a cpu driver
3189- zink: ci updates
3190- zink: be more consistent about applying module hash for gfx pipeline
3191- zink: update gfx pipeline shader module pointer even if the program is unchanged
3192- aux/primconvert: break out primconvert internals into util function
3193- aux/primconvert: support pipe_context::draw_vertex_state
3194- zink: stop running discard_if in generated tcs
3195- zink: always add VK_IMAGE_CREATE_2D_ARRAY_COMPATIBLE_BIT for 3D images
3196- move util/indices to core util
3197- mesa: convert unsupported primtypes during display list compilation
3198- zink: block suballocator caching for swapchain/dmabuf images
3199- zink: set suballocator bo size to aligned allocation size
3200- zink: implement pipe_context::draw_vertex_state
3201- zink: stop using VK_IMAGE_LAYOUT_PREINITIALIZED for dmabuf
3202- zink: always set matching resource export type for dmabuf creation
3203- dri2: set dimensions on dmabuf import planes
3204- zink: implement multiplanar modifier handling
3205- zink: add error for bo allocation failure
3206- zink: always use slab allocation placement for domains
3207- zink: set zink_resource_object::host_visible based on actual bo placement
3208- zink: demote BAR allocations to device-local on oom
3209- zink: use IMMUTABLE for dummy xfb buffer
3210- zink: fix memory availability reporting
3211- zink: fail context creation more gracefully
3212- zink: clamp to 500 max batch states on nvidia
3213- gallium: add pipe_screen::is_compute_copy_faster hook
3214- gallium: rename PIPE_CAP_PREFER_BLIT_BASED_TEXTURE_TRANSFER
3215- mesa/st: make sampler_type_for_target public
3216- mesa/st: make some pbo functions public
3217- gallium: implement compute pbo download
3218- aux/pb: add a new slab alloc function for reclaiming all bo objects
3219- zink: use pb_slab_alloc_reclaimed(reclaim_all) for BAR heap sometimes
3220- zink: enable PIPE_TEXTURE_TRANSFER_COMPUTE on non-cpu drivers
3221- zink: explicitly init glsl
3222- zink: add a compiler pass to scan for shader image use
3223- zink: set batch state queue on creation
3224- zink: be consistent about waiting on context queue on context destroy
3225- mesa/vbo: be more comprehensive for degenerate primitive conversion in dlists
3226- radv: fix xfb query copy param ordering
3227- zink: add missing assert for 8bit vertex decompose
3228- zink: add some wsi instance extensions
3229- zink: always unset vertex shader variant key data when changing last vertex stage
3230- zink: add extra synchronization for buffer descriptor binds
3231- zink: use device-local heap for sparse backing allocations
3232- zink: add a better threshold for clamping query pool resets on suspend
3233- zink: always set number of timestamp results to 1 for internal qbo
3234- zink: fix availability buffer sizing/copying for xfb queries
3235- zink: skip readback of qbos with no results
3236- Revert "zink: when performing an implicit reset, sync qbos"
3237- zink: use even more accurate stride values for query result copies
3238- zink: ci updates
3239- zink: add flake
3240- aux/trace: copy over stream_output_target_offset method from context
3241- util/vbuf: fix buffer translation sizing
3242- zink: remove SpvMemorySemanticsMakeVisibleMask from nir_intrinsic_memory_barrier
3243- zink: add some nv ci results
3244- aux/trace: add pipe_context::fence_server_signal tracing
3245- zink: update gfx_pipeline_state.vertex_strides when necessary
3246- zink: check EXT_image_drm_format_modifier for dmabuf support
3247- Revert "zink: update gfx_pipeline_state.vertex_strides when necessary"
3248- zink: add VK_KHR_external_semaphore_fd to device exts
3249- zink: add VK_KHR_external_memory_capabilities to instance exts
3250- zink: add driver/device uuid screen hooks
3251- zink: implement GL semaphores
3252- zink: implement external memory object resource handling
3253- zink: enable EXT_external_objects pipe caps
3254- util: add ptralloc
3255- lavapipe: replace hard pointer calcs in dynamic render with ptralloc
3256- lavapipe: replace hard pointer calcs in push descriptors with ptralloc
3257- zink: update nv ci baseline
3258- zink: add deqp ci baseline for nv
3259- lavapipe: fix ptralloc typo
3260- zink: add get_sparse_texture_virtual_page_size hook
3261- zink: set up image create bits for sparse textures
3262- zink: support sparse texture range commits
3263- zink: allow sparse buffers to be suballocated
3264- zink: stop allocating such massive staging regions for buffer maps
3265- zink: fake sparse R9G9B9E5 support as needed
3266- zink: implement sparse shader instructions in ntv
3267- zink: always set actual_dest_type for ntv tex instruction emission
3268- zink: handle residency return value from sparse texture instructions
3269- zink: rename zink_so_info -> zink_shader_info
3270- zink: always pass shader info to ntv
3271- zink: add nir_intrinsic_image_deref_sparse_load to image scanning in compiler
3272- zink: only allocate ntv residency info if it will be used
3273- zink: emit sparse residency cap in ntv
3274- zink: handle min_lod texture operands
3275- zink: batch sparse texture binds
3276- zink: handle sparse texture miptail commits
3277- zink: enable ARB_sparse_texture pipe caps
3278- zink: ARB_sparse_texture2
3279- docs: add features/relnotes for zink sparse texture support
3280- zink: fix non-modifer dmabuf usage
3281- zink: link with vulkan utils
3282- zink: hook up planar image format creation
3283- vulkan/wsi: add VK_IMAGE_USAGE_INPUT_ATTACHMENT_BIT for swapchain image caps
3284- lavapipe: remove unused struct member
3285- zink: update nv fails
3286- zink: update radv fails list
3287- zink: never use SpvOpImageQuerySizeLod for texel buffers
3288- zink: add anv (icl) fails
3289- zink: handle bogus xfb draws
3290- zink: reorder fbfetch flag-setting to avoid null deref
3291- zink: return 256 for PIPE_CAP_MIN_MAP_BUFFER_ALIGNMENT
3292- anv: silence wsi debug logging
3293- zink: unify some context casts in zink_create_sampler_view
3294- zink: emit same number of timeline signals as semaphore signals
3295- zink: flag has_work when a GL semaphore is signalled
3296- zink: allow resource creation without VK_FORMAT_FEATURE_COLOR_ATTACHMENT_BIT
3297- zink: use VkImageViewUsageCreateInfo to remove attachment bits
3298- zink: always create a separate VkBuffer for storage use
3299- zink: use the storage buffer for bufferview creation when format allows
3300- zink: flag all buffer resources with PIPE_BIND_SHADER_IMAGE
3301- zink: simplify buffer case for zink_resource_object_init_storage()
3302- zink: remove tmp buffer rebinds
3303- aux/tc: add tc_buffer_write to replace pipe_buffer_write usage
3304- zink: fix vertex buffer mask computation for null buffers
3305- zink: replace qbo pipe_buffer_write usage with tc_buffer_write
3306- zink: replace other pipe_buffer_write usage with pipe_buffer_write_nooverlap
3307- zink: make pipe_buffer_write usage trigger compiler errors
3308- zink: ci updates
3309- zink: clamp tbo creation to maxTexelBufferElements
3310- zink: add vertex shader pipeline bit for generated barrier construction
3311- zink: print an error when the device is lost
3312- zink: fix waiting on current batch id
3313- zink: handle swizzled offset/count values for shader bitfield ops
3314- zink: enable VK_EXT_shader_atomic_float
3315- zink: add warning printf for drivers missing VK_EXT_shader_atomic_float
3316- zink: cast image atomic op params/results based on image type
3317- zink: use SpvScopeDevice over SpvScopeWorkgroup for atomic shader ops
3318- aux/vbuf: use local var for modifying unaligned_vb_mask during update
3319- aux/vbuf: move mask-clearing for vbuf updates after buffer scanning
3320- aux/vbuf: add fastpath for skipping identical vbuf updates
3321- mesa: stop truncating MESA_GLSL=dump
3322- zink: ARB_sparse_texture_clamp
3323- docs: update features/relnotes for zink sparse texture clamp
3324- zink: disable PIPE_SHADER_CAP_FP16_CONST_BUFFERS
3325- llvmpipe: disable PIPE_SHADER_CAP_FP16_CONST_BUFFERS
3326- zink: add VK_BUFFER_USAGE_CONDITIONAL_RENDERING_BIT_EXT for query binds
3327- zink: use scanout obj when returning resource param info
3328- zink: fix PIPE_CAP_TGSI_BALLOT export conditional
3329- zink: reject invalid draws
3330- zink: min/max blit region in coverage functions
3331- aux/draw: fix llvm tcs lane vec generation
3332- lavapipe: use util_pack_color_union() for generating clear colors
3333- zink: ci updates
3334- lavapipe: ci updates
3335- zink: always set VkPipelineMultisampleStateCreateInfo::pSampleMask
3336- zink: always invalidate streamout counter buffer if not resuming
3337- zink: ci updates
3338- zink: fix cached descriptor set invalidation for array bindings
3339- zink: free push descriptor pools on deinit
3340- zink: don't free non-fbfetch dsl structs when switching to fbfetch
3341- gallivm: avoid division by zero when computing cube face
3342- zink: always update shader variants when rebinding a gfx program
3343- zink: use a fence for pipeline cache update jobs
3344- zink: wait on program cache fences before destroying programs
3345- zink: fix descriptor cache pointer array allocation
3346- zink: mark fbfetch push sets as non-cached
3347- zink: stop leaking descriptor sets
3348- zink: invalidate non-punted recycled descriptor sets that are not valid
3349- zink: set shader key size to 0 for non-generated tcs
3350
3351Mike Lothian (1):
3352
3353- meson: Fix dri.pc dridriverdir
3354
3355Mykhailo Skorokhodov (3):
3356
3357- iris: Add missed tile flush flag
3358- Revert "iris: add tile cache flush to iris_copy_region"
3359- nir: Fix read depth for predecessors
3360
3361Nanley Chery (82):
3362
3363- iris: Convert some mod_info checks to asserts
3364- iris: Disable the MC_CCS modifier with norbc
3365- iris: Set DISABLE_AUX_BIT for AUX_USAGE_NONE modifiers
3366- iris: Refactor the assignment to possible_usages
3367- intel/isl: Allow creating non-Y-tiled ASTC surfaces
3368- iris: Allow GPU-based uploads of ASTC textures
3369- anv: Require transfer features for transfer usages
3370- anv: Allow transfer-only linear ASTC images
3371- anv: Drop code from get_blorp_surf_for_anv_buffer
3372- intel/isl: Unify fmt checks in isl_surf_supports_ccs
3373- iris: Enable CCS_E on 32-bpc float formats on TGL+
3374- iris: Drop redundant iris_resource_disable_aux call
3375- intel/isl: Restore CCS_E support for YUYV and UYVY
3376- iris: Drop the YCRCB cases in finish_aux_import
3377- intel/isl: Drop extra devinfo checks for CCS support
3378- intel/isl: Require aux map for some 64K alignment
3379- intel/blorp: Modify the SKL+ CCS resolve rectangle
3380- intel/blorp: Modify get_fast_clear_rect for XeHP
3381- iris: Change a param of iris_resource_init_aux_buf
3382- iris: Use the aux BO and surf less during init
3383- iris: Move some BO setup to iris_resource_init_aux_buf
3384- iris: Simplify iris_get_aux_clear_color_state_size
3385- iris: Don't allocate a clear color BO for some Z/S
3386- iris: Drop row pitch param from iris_get_ccs_surf
3387- iris: Don't allocate and initialize CCS on XeHP
3388- iris: Don't assert a NULL aux BO during aux config
3389- iris: Modify the comment about zeroing CCS
3390- iris: Update the initial CCS state on XeHP
3391- iris: Free the local cache bucket in bufmgr_destroy
3392- iris: Replace bo->real.local with bo->real.heap
3393- iris: Add and use flags_to_heap
3394- iris: Use a num_buckets pointer in add_bucket
3395- iris: Replace "local" with "heap" in bufmgr fn params
3396- iris: Add and use BUCKET_ARRAY_SIZE
3397- iris: Add and use bucket_info_for_heap
3398- iris: Rework the DEVICE_LOCAL heap
3399- iris: Disable the SMEM fallback for CCS on XeHP
3400- iris: Use util packing fns in convert_clear_color
3401- blorp: Drop multisampled code in blorp_can_hiz_clear_depth
3402- intel/isl: Rework HiZ image align calculations
3403- intel/isl: Update comment for the XeHP HiZ block
3404- intel/isl: Use a new HiZ format on XeHP+
3405- intel/isl: Require Y-tiling for depth on gfx4-5
3406- intel/isl: Allow HiZ with Tile4/64 surfaces
3407- intel/isl: Return false more in isl_surf_get_hiz_surf
3408- anv,iris: Flush tile cache after color fast clears
3409- anv,iris: Depth stall around color fast clears
3410- intel: Rename the PSD bit in PIPE_CONTROL for XeHP
3411- anv,iris: PSS Stall Sync around color fast clears
3412- anv,iris: Flush HDC before color fast clears
3413- anv: Don't fill lowered_storage_image_param on SKL+
3414- intel/isl: Don't check pitch in isl_surf_get_mcs_surf
3415- intel/isl: Strengthen MCS SINT format restriction
3416- Revert "intel/isl: Don't reconfigure aux surfaces for MCS"
3417- intel/gen125.xml: Increase Auxiliary Surface Pitch
3418- intel/isl: Allow creating MCS in Tile4 memory
3419- anv: Drop assert against modifier with aux on gfx12
3420- anv: Disable CCS_E for some 8/16bpp copies on TGL+
3421- anv: Use ANV_FAST_CLEAR_DEFAULT_VALUE for CCS on TGL+
3422- anv: Re-enable CCS_E on TGL+
3423- anv: Drop redundant disabling of non-renderable CCS
3424- anv: Disable the SMEM fallback for local memory
3425- anv: Require the local heap for CCS on XeHP
3426- anv: Don't allocate VMA for CCS on XeHP
3427- isl: Enable compression with multisampled Tile64
3428- iris: Explicitly rely on gallium fallbacks for YUV
3429- intel/isl: Support YUV pipe-to-isl format mapping
3430- iris: Drop stale media compression import code
3431- iris: Use iris_format_for_usage in map_aux_addresses
3432- intel: Rename a RenderCompressionFormat field
3433- intel/isl: Support the XeHP media compression format
3434- iris: Support the XeHP media compression format
3435- iris: Refactor a ternary in iris_resource_get_param
3436- iris: Pick the right BO in iris_resource_get_param
3437- iris: Return non-zero stride for clear color plane
3438- intel/isl: Simplify Z-buffer tiling config during emit
3439- intel/isl: Fix depth buffer TiledSurface programming
3440- intel/isl: Add more PRM text for HiZ/STC requirement
3441- iris: Don't fast clear with the view format
3442- Revert "anv: Require the local heap for CCS on XeHP"
3443- anv: Refactor anv_image_init_from_create_info
3444- anv: Disable aux if the explicit modifier lacks it
3445
3446Neha Bhende (11):
3447
3448- st: Fix 64-bit vertex attrib index for TGSI path
3449- st: Fix comments in commit be6d584de43966e
3450- svga: Add GL43 commands support
3451- svga: Add utility to check for GL43 support
3452- tgsi: Add hw_atomic_declared in tgsi_info
3453- svga: Add support for compute shader, shader buffers and image views
3454- svga: shader translation for compute, image views and shader buffers
3455- svga: add GL43 resource validation at draw time
3456- svga: enable GL43 on SVGA GL43 capable device
3457- svga: enable PIPE_CAP_IMAGE_STORE_FORMATTED on gl43 capable device
3458- svga: store shared_mem_size in svga_compute_shader instead of svga_context
3459
3460Neil Roberts (1):
3461
3462- v3d: Update prim_counts when prims generated query in flight without TF
3463
3464Nicholas Bishop (1):
3465
3466- mesa/get: allow NV_pixel_buffer_object constants in GLES2
3467
3468Niklas Haas (1):
3469
3470- wsi/x11: support depth 30 visuals
3471
3472Omar Akkila (1):
3473
3474- llvmpipe: page-align memory allocations
3475
3476Paulo Zanoni (12):
3477
3478- iris: also dump bo's imported and exported flags
3479- iris: destroy our mutexes a little later
3480- iris: fix off-by-one error when clearing stale syncobjs
3481- iris: call brw_process_intel_debug_variable() earlier
3482- iris: extract iris_hw_context_set_unrecoverable()
3483- intel/fs: Assert the GPU supports 64bit ops if present at lower_scoreboard time.
3484- iris: improve error checking in functions that call vma_alloc()
3485- iris: sprinkle some assertions for bufmgr->lock
3486- iris: save some iris_syncobj_reference() calls at update_bo_syncobjs()
3487- iris: implement inter-context busy-tracking
3488- iris: handle IRIS_MEMZONE_BINDER with a real vma_heap like the others
3489- iris: fix register spilling on compute shaders on XeHP
3490
3491Pavel Asyutchenko (6):
3492
3493- llvmpipe: fix wrong assumption on FB fetch shader opacity
3494- llvmpipe: fix gl_FragColor and gl_LastFragData[0] combination
3495- llvmpipe: protect from doing FB fetch of missing buffers
3496- llvmpipe: fix FB fetch with non 32-bit render target formats
3497- llvmpipe: remove dead args from load_unswizzled_block
3498- llvmpipe: enable PIPE_CAP_FBFETCH_COHERENT
3499
3500Pavel Ondračka (9):
3501
3502- r300: Document the RADEON_DEBUG options
3503- r300: Replace RADEON_NO_TCL with RADEON_DEBUG=notcl
3504- r300: Remove broken optimization in rc_transform_KILL
3505- r300: use point sprite coordinates only when drawing points (v5)
3506- r300: fix translate_LRP
3507- r300: properly initialize new_vs in r300_draw_init_vertex_shader
3508- r300: Disable integers and indirect temporary addressing with swctl
3509- r300: Set consistent PIPE_SHADER_CAP_PREFERRED_IR
3510- r300: fix transformation of abs modifiers with negate
3511
3512Philipp Zabel (1):
3513
3514- etnaviv: fix emit_if in case the else block ends in a jump
3515
3516Pierre Moreau (1):
3517
3518- nv50/nir: Switch to the common NIR options
3519
3520Pierre-Eric Pelloux-Prayer (57):
3521
3522- radeonsi: use viewport offset in quant_mode determination
3523- gallium/dri: replace bool with flag parameter
3524- gallium/dri: add createImageFromFds2
3525- gallium/dri: let the driver know if the imported image is DRI_PRIME buffer
3526- radeon_winsys.h: add a parameter to buffer_from_handle
3527- winsys/amdgpu: add uncached flag to the imported DRI_PRIME buffer
3528- radeonsi/gfx10.3: enable SDMA for DRI_PRIME copies
3529- radeonsi: use PIPE_BIND_DRI_PRIME instead of is_dri_blit_image
3530- Revert "gallium: add a is_dri_blit_image bool to pipe_blit_info"
3531- glsl/drirc: add an option for gl_ClipVertex / gl_CullDistance checks
3532- drirc: enable do_dce_before_clip_cull_analysis for ANSA
3533- mesa: don't reset SamplersValidated if nothing changed
3534- glsl/nir: mark samplers inside a block as bindless
3535- radeonsi: treat nir_intrinsic_load_constant as a VMEM operation
3536- ac/surface: use a less strict condition in is_dcc_supported_by_L2
3537- radeonsi/sdma: fix bogus assert
3538- radeonsi: fix ps_uses_fbfetch value
3539- ac/surface: don't validate DCC settings if DCC isn't possible
3540- vbo/dlist: free copied.buffer if no vertices were copied
3541- glx/drirc: add a force_direct_glx_context option
3542- mesa: enable force_direct_glx_context for DiscoveryStudio2020
3543- llvmpipe: add missing NIR alu-op handling
3544- mesa: print a warning when an extension can't be disabled
3545- mesa: don't use dummy_true for some MESA extensions
3546- mesa/init: replace call_once with manual implementation
3547- drirc: add mesa_extension_override option
3548- drirc: add options for BETA CAE Ansa application.
3549- mesa: always call _mesa_update_pixel
3550- mesa: remove NEW_COPY_TEX_STATE
3551- radeonsi/sqtt: fix shader stage values
3552- radeonsi/sqtt: fix FINISH_DONE / BUSY usage
3553- radeonsi/sqtt: reserve a vmid when sqtt is enabled
3554- radeonsi: unreference framebuffer state after use
3555- radeonsi/sqtt: increase the default buffer size to 32MB
3556- radeonsi: fix fast clear / depth decompression corruption
3557- radeonsi: silence a warning
3558- radeonsi: use max_zplanes after the last write
3559- glx: fix querying GLX_FBCONFIG_ID for Window
3560- radeonsi: don't use perp. end caps when line smoothing is on
3561- glthread: only log glthread destroy reason when it's not NULL
3562- radeonsi/gfx8: use the proper dcc clear size
3563- vbo/dlist: fix loopback crash
3564- vbo/dlist: add vertices to incomplete primitives
3565- amd: add SDMA_NOP_PAD
3566- radv: partial sdma support
3567- radv: allocate the prime buffer as uncached
3568- vulkan/wsi: add use_prime_blit param to wsi_swapchain_init
3569- vulkan/wsi: add a private transfer pool to exec the DRI_PRIME blit
3570- radv: implement wsi's private transfer queue using SDMA
3571- radeonsi/gfx10: fix si_texture_get_offset for mipmapped tex
3572- radeonsi,radv: fix usages of surf_pitch
3573- radeonsi/tests: update expected results
3574- radeonsi/tests: add expected results for vega20
3575- driconf: enable vs_position_always_invariant for Dirt Rally
3576- mesa: use less temporaries in build_lighting
3577- radeonsi: limit loop unrolling for LLVM < 13
3578- radeonsi: change rounding mode to round to even
3579
3580Qiang Yu (76):
3581
3582- driconf: add executable_regexp application attribute
3583- drirc: add Mari application workaround
3584- xmlconfig_test: add unit test for executable_regexp
3585- radeonsi: add radeonsi_force_use_fma32 driconf option
3586- driconf: support META application
3587- glx/dri3: fix glXQueryContext does not return GLX_RENDER_TYPE value
3588- loader/dri3: fix piglit egl-copy-buffer test
3589- ci: remove egl-copy-buffers from fail list
3590- glx: no need to create extra pixmap for pbuffer
3591- glx: check drawable type before create drawble
3592- glx: add drawable type argument when create drawable
3593- loader/dri3: add drawable type set by GLX and EGL
3594- loader/dri3: remove unused present capability query
3595- loader/dri3: pack window present event setup into a function
3596- loader/dri3: setup present event with drawable type info
3597- loader/dri3: replace is_pixmap with drawable type
3598- loader/dri3: rename dri3_fake_front_buffer
3599- loader/dri3: stop doing anything in swap buffer for some drawable
3600- loader/dri3: support glx pbuffer swap
3601- glx: fix regression for drawable type detection
3602- glapi: should not add alias function to static_data.py
3603- gallium: add caps for sparse texture support
3604- mesa: add ARB_sparse_texture extension
3605- mesa: add ARB_sparse_texture constants
3606- gallium: add screen get_sparse_texture_virtual_page_size callback
3607- gallium: add get_sparse_texture_virtual_page_size for noop/rbug/trace
3608- mesa: add ARB_sparse_texture query in glGetInternalformativ
3609- mesa: add ARB_sparse_texture texture param set/get
3610- mesa/st: add st_GetSparseTextureVirtualPageSize interface
3611- mesa: glTexStorage* support sparse texture allocation
3612- mesa/st: add st_TexturePageCommitment interface
3613- mesa: implement glTexPageCommitmentARB/glTexturePageCommitmentEXT
3614- mesa/st: update NumSparseLevels from pipe_resource
3615- winsys/radeon: change surface_init flags to 64bit
3616- ac/surface: fix prt_first_mip_tail calculation for gfx9+
3617- ac/surface: add prt_tile_depth
3618- radeonsi: support alloc a sparse texture
3619- radeonsi: use staging buffer for sparse texture when transfer map
3620- radeonsi: implement get_sparse_texture_virtual_page_size
3621- radeonsi: support texture resource commit
3622- radeonsi: enable ARB_sparse_texture
3623- nir: fix nir_tex_instr hash not count is_sparse field
3624- gallium/dd_debug: add get_sparse_texture_virtual_page_size
3625- gallium: add PIPE_CAP_QUERY_SPARSE_TEXTURE_RESIDENCY
3626- mesa: add ARB_sparse_texture2 extension
3627- mesa/main: relax alignment check when ARB_sparse_texture2 available
3628- glsl: add ARB_sparse_texture2 extension
3629- glsl: ir_texture support sprase texture
3630- glsl: add _texture related sparse texture builtin functions
3631- glsl: add _textureCubeArrayShadow related sparse texture builtin func
3632- glsl: add _texelFetch related sparse texture builtin function
3633- glsl: add sparse texture image load builtin functions
3634- glsl: add vec5 glsl types
3635- glsl/nir: convert sparse ir_texture to nir
3636- glsl/nir: convert sparse image load to nir
3637- glsl/nir: adjust sparse texture nir_variable
3638- glsl: add sparseTexelsResidentARB builtin function
3639- glsl/nir: convert is_sparse_texels_resident to nir
3640- radeonsi: lower nir_intrinsic_is_sparse_texels_resident
3641- mesa/main: allow multi sample sparse texture
3642- mesa/main: export _is_multisample_target for external usage
3643- gallium: add multi_sample parameter to get_sparse_texture_virtual_page_size
3644- radeonsi: enable multi sample sparse texture support
3645- radeonsi: enable ARB_sparse_texture2
3646- gallium: add PIPE_CAP_CLAMP_SPARSE_TEXTURE_LOD
3647- mesa: add ARB_sparse_texture_clamp extension
3648- glsl: add ARB_sparse_texture_clamp extension
3649- glsl: ir_texture add clamp field
3650- glsl: _texture support clamp parameter
3651- glsl: _textureCubeArrayShadow support clamp
3652- glsl: add ARB_sparse_texture_clamp builtin functions
3653- glsl/nir: convert ir_texture->clamp to nir
3654- radeonsi: enable ARB_sparse_texture_clamp
3655- radeonsi: workaround Specviewperf13 Catia hang on GFX9
3656- radeonsi: fix depth stencil multi sample texture blit
3657- glx: fix pbuffer refcount init
3658
3659Rafael Antognolli (2):
3660
3661- intel/compiler: Assert that unsupported tg4 offsets were lowered for XeHP
3662- intel: Emit 3DSTATE_BINDING_TABLE_POOL_ALLOC for XeHP
3663
3664Renato Pereyra (2):
3665
3666- anv: Enable implicit CCS for external images
3667- anv: add helper methods related to enabling CCS for external images
3668
3669Rhys Perry (83):
3670
3671- radv: Use nir_var_mem_image in meta shaders
3672- aco: disable mul(cndmask(0, 1, b), a) optimization sometimes
3673- aco: simplify emit_stream_output()
3674- radv: don't use a separate cache entry for GS copy shaders
3675- aco: use std::vector and IDSet in RA validator
3676- docs: update radv extensions in features.txt
3677- nir/algebraic: optimize Cyberpunk 2077's open-coded bitfieldReverse()
3678- nir/lower_system_values: replace local_invocation_id components with zero
3679- radv: lower load_local_invocation_index with 1D workgroups
3680- aco: optimize load_local_invocation_index with single-wave workgroups
3681- aco: consider pseudo-instructions reading exec in needs_exec_mask()
3682- aco/spill: use spills_entry instead of spills_exit to kill linear VGPRs
3683- radv: make RADV_FORCE_FAMILY case-insensitive
3684- radv: fix max_render_backends for Sienna Cichlid null winsys
3685- spirv: run nir_copy_prop before nir_rematerialize_derefs_in_use_blocks_impl
3686- radv: stop running copy-propagation before nir_opt_deref
3687- nir/lower_tex: don't calculate texture_mask for texture_index>=32
3688- nir/lower_subgroups: fix left shift of -1
3689- aco: properly update use counts if a extract is still used
3690- nir/dce: fix DCE of loops with a halt or return instruction in the pre-header
3691- nir/cf: fix insertion of loops/ifs after jumps
3692- nir/tests: add DCE test for loops following a jump
3693- aco: don't create DPP instructions with SGPR operands
3694- nir/opt_offsets: fix try_extract_const_addition recursion
3695- nir/opt_offsets: remove need to loop try_extract_const_addition
3696- nir/algebraic: optimize a*#b & -4
3697- radv: add radv_meta_init_shader
3698- radv: include RT shaders in RADV_DEBUG=shaders,shaderstats
3699- radv: clone shader in radv_shader_compile_to_nir
3700- radv: initialize workgroup_size in radv_meta_init_shader
3701- aco: don't create unnecessary addition in indirect get_sampler_desc()
3702- aco: improve clrx disassembly
3703- aco: use more predictable tiebreaker when forming MADs
3704- aco: create v_fmamk_f32/v_fmaak_f32 from nir_op_ffma
3705- aco: swap operands if necessary to create v_madak/v_fmaak
3706- aco: swap multiplication operands if needed to create v_fmac_f32/etc
3707- radv,aco: implement nir_op_ffma
3708- radv,aco: don't lower some ffma instructions
3709- radv: have the null winsys set more fields
3710- aco: fix check_vop3_operands() for f16vec2 ffma fneg combine
3711- nir/opt_load_store_vectorize: improve ssbo/global alias analysis
3712- nir/algebraic: optimize more 64-bit imul with constant source
3713- nir/opt_access: infer CAN_REORDER for global access
3714- aco: disallow SMEM offsets that are not multiples of 4
3715- aco: skip &-4 before SMEM
3716- aco: remove SMEM constant/addition combining out of the loop
3717- radv: increase maxTaskOutputCount to 65535
3718- nir/lower_shader_calls: fix store_scratch write_mask
3719- ac/nir: fix store_buffer_amd write_masks
3720- nir/uniform_atomics: fix is_atomic_already_optimized without workgroups
3721- aco: remove pack_half_2x16(a, 0) optimization
3722- radv: add RADV_DEBUG=splitfma
3723- radv: set radv_split_fma=true for Proton SotTR
3724- nir/unsigned_upper_bound: don't follow 64-bit f2u32()
3725- aco: use p_extract for SGPR nir_op_unpack_half_2x16_split_y
3726- nir: add nir_op_fmulz and nir_op_ffmaz
3727- radv,aco,ac/llvm: implement fmulz and ffmaz
3728- aco: consider legacy multiplications in optimizer
3729- aco: create v_mac_legacy_f32/v_fmac_legacy_f32
3730- nir/algebraic: add ignore_exact() wrapper
3731- nir/algebraic: optimize open-coded fmulz/ffmaz
3732- nir: add some missing nir_alu_type_get_base_type
3733- nir/algebraic: optimize expressions using fmulz/ffmaz
3734- nir/builder: set write mask if not provided
3735- nir/builder: assume scalar alignment if not provided
3736- radv: avoid providing a write_mask to intrinsic builders
3737- radv: avoid providing an align_mul to intrinsic builders
3738- radv: avoid providing an align_offset to intrinsic builders
3739- ac/nir: avoid providing a write_mask to intrinsic builders
3740- ac/nir: avoid providing an align_mul to intrinsic builders
3741- ac/nir: use shorter builder names
3742- radv: fix raytracing with wave32
3743- radv: use wave32 for raytracing
3744- radv: use 8x4 workgroups for wave32 RT
3745- radv: fix optimized MSAA copies with suballocated images
3746- aco: fix neg(abs(mul(a, b))) if the mul is not VOP3
3747- aco: optimize abs(mul(a, b))
3748- aco: preserve pass_flags during format conversions
3749- aco: fix v_mac_legacy_f32
3750- aco: don't encode src2 for v_writelane_b32_e64
3751- radv: fix R_02881C_PA_CL_VS_OUT_CNTL with mixed cull/clip distances
3752- anv: Enable nir_opt_access
3753- radv: include disable_aniso_single_level and adjust_frag_coord_z in key
3754
3755Rob Clark (63):
3756
3757- freedreno: Fix for large epilogues
3758- freedreno/ir3+isa: Cleanup bindless cat5 samp/tex encoding
3759- freedreno/isa: Fixes for validation
3760- freedreno/ir3/tests: Fix indentation
3761- freedreno/ir3/tests: Add additional disasm test vectors
3762- freedreno/ir3: Don't lower s2en if samp/tex is too large
3763- isaspec: Fix derived field width
3764- isaspec: Do not emit duplicate field encodes
3765- isaspec: Add bitfield size assertions
3766- freedreno/isa: Add immed reg accessors
3767- freedreno/ir3/tests: Don't skip encode test if decode fails
3768- freedreno/ir3/tests: Add some 8/16b ldg/stg tests
3769- freedreno/isa: Fix ldg/stg "halfness"
3770- freedreno/computerator: Fix mergedregs
3771- freedreno/ir3: 64b intrinsic lowering
3772- freedreno/ir3: Lower 64b phis
3773- freedreno/ir3: Add load/store_global lowering
3774- isaspec: Fix gpu_id for default_options
3775- freedreno: Skip built-in shaders for clover
3776- freedreno/ir3: Handle MESA_SHADER_KERNEL
3777- freedreno: implement set_compute_state()
3778- freedreno/ir3: vec8+vec16 support
3779- freedreno/ir3: implement load_work_dim intrinsic
3780- freedreno/ir3: Add support for load_kernel_input
3781- freedreno/ir3: Move lower_idiv_options
3782- freedreno: Fix set_global_binding
3783- freedreno/ir3: Deal with zero-source instructions
3784- freedreno/ir3: 16b bools
3785- freedreno/ir3: 8bit fixes
3786- freedreno/ir3: Fix load/store_global_ir3 type
3787- freedreno/ir3: Fix reg size validation
3788- freedreno/ir3: Add wide load/store lowering
3789- freedreno/ir3: Add ihadd/uhadd
3790- freedreno/ir3: Get req_local_mem from pipe_compute_state
3791- freedreno/ir3: Fix validation of subgroup macros
3792- freedreno/drm: Add some asserts
3793- freedreno/drm: Move suballoc_bo to device
3794- freedreno/ir3/print: Show end's outidxs
3795- freedreno/ir3: xfb fix for duplicate outputs
3796- freedreno: Add PIPE_SHADER_IR_NIR_SERIALIZED support
3797- freedreno: caps for clover
3798- isaspec: Add prototypes for expr evaluators
3799- freedreno/crashdec: Split out mempool decoding
3800- freedreno/crashdec: HFI queue decoding
3801- freedreno/crashdec: Fallback to chip_id for GPU id
3802- freedreno/crashdec: Basing GMU log decoding
3803- freedreno/ir3: Handle instr->address when cloning
3804- freedreno/computerator: Fix @buf header
3805- freedreno/computerator: Mark shader bo for dumping
3806- freedreno/ir3: Dump const state with shader disasm
3807- clover: Move min image support check
3808- freedreno: Report system memory as video memory
3809- freedreno: Small dev_id_compare() cleanup
3810- freedreno: Rearrange dev_id_compare() logic
3811- freedreno: Handle wildcard fuse-id in device matching
3812- freedreno: Update chip-ids
3813- freedreno/decode: Handle chip-id
3814- mesa/st: Lowered ucp should still mark rast state dirty
3815- freedreno: Pass shader cache key instead of shader key
3816- freedreno: Add FD_DIRTY_RASTERIZER_CLIP_PLANE_ENABLE
3817- freedreno/a6xx: Fix clip_mask
3818- freedreno/a5xx: Fix clip_mask
3819- freedreno: Add missing generated header dependency
3820
3821Rohan Garg (7):
3822
3823- intel/fs: OpImageQueryLod does not support arrayed images as an operand
3824- ci: Do not remove libgbm-dev
3825- ci: Move common variables out into a separate file
3826- ci: Do not remove wget
3827- ci/piglit: Start vtest server if driver is set to virpipe
3828- anv: Enable VK_VALVE_mutable_descriptor_type
3829- docs: Update features and new_features for anv
3830
3831Roland Scheidegger (1):
3832
3833- llvmpipe: adjust rounding for viewport scissoring
3834
3835Roman Gilg (1):
3836
3837- vulkan/wsi/x11: document implementation
3838
3839Roman Stratiienko (6):
3840
3841- android.mk: Add missing variables to the make target
3842- v3dv: Fix dEQP-VK.info#instance_extensions test
3843- v3dv: Fix V3DV_HAS_SURFACE preprocessor condition
3844- v3dv: Hotfix: Rename remaining V3DV_HAS_SURFACE->V3DV_USE_WSI_PLATFORM
3845- v3d: Don't force SCANOUT for PIPE_BIND_SHARED requests
3846- v3dv: add Android support
3847
3848Ruijing Dong (6):
3849
3850- frontends/omx: preserve omx to keep current mode for avc decoding
3851- frontends/va: preparing to disable h264 extension flag in vaapi dec path
3852- radeon/vcn: enable dynamic dpb Tier2 support for h264 dec vaapi path
3853- radeon/vcn: enable dynamic dpb Tier2 for hevc dec vaapi path
3854- frontend/va: Keep surface buf addr before reallocation
3855- radeon/vcn: Updating render_pic_list for correction
3856
3857Ryan Houdek (1):
3858
3859- util/xmlconfig: Allow DT_UNKNOWN files
3860
3861Ryan Neph (1):
3862
3863- venus: ignore framebuffer for VkCommandBuffer executed outside of render pass
3864
3865Sagar Ghuge (16):
3866
3867- intel/compiler: Set correct cache policy for A64 byte scattered read
3868- intel/genxml: Add new Primitive ID Not Required bit field to 3DSTATE_DS
3869- intel/compiler: Track primitive id in domain/evaluation shader
3870- anv: Drop hint if primitive id is required or not
3871- iris: Drop hint if primitive id is required or not
3872- anv: Pass correct aux usage while filling out surface state
3873- anv: Enable CCS for storage image formats
3874- intel/compiler: Set correct return format for brw_SAMPLE
3875- intel/compiler: Don't hardcode padding source type to 32bit
3876- intel/compiler: Fix instruction size written calculation
3877- intel/compiler: Add helper to support half float payload with padding
3878- intel/fs: Define and set correct sampler simd mode
3879- intel/compiler: Prepare disasm for 16-bit sampler params
3880- anv, iris: Implement Wa_14014890652 for DG2
3881- intel/genxml: Add L1 Cache Control bit field
3882- intel/genxml: Add Un-Typed Data-Port Cache Flush field to pipe control
3883
3884Sajeesh Sidharthan (1):
3885
3886- frontends/va/av1: handle multiple slice params
3887
3888Samuel Pitoiset (158):
3889
3890- aco: do not return an empty string when disassembly is not supported
3891- radv: fix removing PSIZ when it's not emitted by the last VGT stage
3892- radv: fix OpImageQuerySamples with non-zero descriptor set
3893- radv: do not remove PSIZ for streamout shaders
3894- aco: fix invalid IR generated for b2f64 when the dest is a VGPR
3895- aco: fix emitting stream outputs when the first component isn't zero
3896- radv: add an assertion to prevent GPU hangs when VRS isn't supported
3897- ac/rgp: remove useless code related to GFX6-7
3898- aco: fix loading 64-bit inputs with fragment shaders
3899- radv,aco: decouple shader_info/options from radv_shader_args
3900- radv: re-emit prolog inputs when the nontrivial divisors state changed
3901- radv: remove old RADV_TRACE_FILE warning
3902- radv: fix build errors with Android
3903- radv: report error messages when the driver can't be initialized
3904- aco: only load streamout buffers if streamout is enabled
3905- radv: lower the viewport index to zero when the VGT stage doesn't export it
3906- radv: fix invalid wait_dst_stage_mask type
3907- ac/nir: remove bogus assertion about the position for culling
3908- zink: add CI lists and deqp-suite configuration for RADV
3909- radv: move freeing the trigger SQTT file at a better place
3910- radv: remove useless checks about GFX7 for SQTT
3911- radv: enable SQTT instruction timing by default
3912- radv: stop reporting SQTT/RGP support as experimental
3913- docs: document RADV_THREAD_TRACE_* envvars
3914- radv: rename radv_shader_variant to radv_shader
3915- radv: only enable VK_EXT_display_control for vrcompositor (SteamVR)
3916- radv: optimize subpass barrier flushes for imageless framebuffers
3917- Revert "radv: only enable VK_EXT_display_control for vrcompositor (SteamVR)"
3918- radv: remove unused parameter in radv_emit_subpass_barrier()
3919- radv/sqtt: reserve a VMID for better profiling
3920- radv/sqtt: stop calling radv_cs_add_buffer() for the thread trace BO
3921- radv: do not expose buffer features for depth/stencil formats
3922- ac/rgp: fix alignment of code object records to follow the RGP spec
3923- radv: only emit PGM_LO for the vertex prolog
3924- ac/rgp: add support for queue event timings
3925- ac/rgp: add support for clock calibration
3926- radv/winsys: use same IBs padding as the kernel
3927- zink: update the CI lists for RADV
3928- radv/sqtt: fix GPU hangs when capturing from the compute queue
3929- ac: add initial SPM support
3930- ac/rgp: add support for dumping SPM data
3931- radv: fix a sync issue on GFX9+ by clearing the upload BO fence
3932- ac/spm: fix determing the SPM wire
3933- ac/spm: fix determining the counter slot
3934- zink: update the CI lists for RADV
3935- zink: skip one GLES31 subset to avoid GPU hangs on Navi10
3936- nir: fix constant expression of ibitfield_extract
3937- radv: simplify re-using cache entries in radv_pipeline_cache_insert_shaders()
3938- radv: disable HTILE for D32S8 format and mipmaps on GFX10
3939- util/queue: fix a data race detected by TSAN when finishing the queue
3940- radv: allow TC-compat CMASK with storage images on GFX10+
3941- radv: ignore the descriptor set layout when creating descriptor template
3942- radv: convert remaining enums/structs to 1.2 versions
3943- ac: change family names to uppercase in ac_get_family_name()
3944- radv/winsys: report the real family name instead of OVERRIDDEN
3945- radv: fix emitting VBO when vertex input dynamic state is used
3946- radv: make radv_break_on_count() a non-static function
3947- radv: make radv_copy_buffer() a non-static function
3948- radv: implement optimized MSAA copies using FMASK
3949- ac/nir: fix left shift of 1 by 31 places detected by UBSAN
3950- radv: fix OOB access for inline push constants detected by UBSAN
3951- aco: fix right shift of exponent 32 detected by UBSAN
3952- radv: implement vkCmdWriteTimestamp2KHR()
3953- radv: implement vkCmd{Reset,Set}Event2KHR()
3954- radv: implement vkCmdWriteBufferMarker2AMD()
3955- radv: implement vkCmdWaitEvents2KHR()/vkCmdPipelineBarrier2KHR()
3956- radv: implement vkQueueSubmit2KHR()
3957- radv: add support for VkMemoryBarrier2KHR
3958- radv: add support for creating device-only events
3959- radv: add support for new pipeline stages and access masks
3960- radv: add support for VK_IMAGE_LAYOUT_ATTACHMENT_OPTIMAL_KHR
3961- radv: switch the remaining stages/access to VK_PIPELINE_STAGE_2/VK_ACCESS_2
3962- radv: advertise VK_KHR_synchronization2
3963- radv: fix accessing NULL pointers when destroy the VRS image
3964- radv: add a workaround to fix a segfault with Metro Exodus (Linux native)
3965- radv: configure the number of SGPRs/VGPRs directly from the arguments
3966- radv: copy the user SGPRs locations outside of radv_declare_shader_args()
3967- radv: constify radv_shader_info in radv_declare_shader_args()
3968- radv/llvm: stop trying to eliminate VS outputs
3969- radv/llvm: constify radv_shader_info
3970- radv: fix resetting the entire vertex input dynamic state
3971- radv: remove never reached free() when compiling shaders
3972- radv: add a helper function to upload a shader binary
3973- radv: upload shader binaries after they are all compiled
3974- radv: pass a pointer to a pipeline for the create/insert cache functions
3975- radv: upload shader binaries of a pipeline contiguously in memory
3976- radv: move a comment at the right place in CmdBindVertexBuffers2EXT()
3977- radv: constify radv_vertex_binding in CmdSetVertexInputEXT()
3978- radv: constify radv_vs_input_state() in more places
3979- radv: fix dynamic rendering with VRS
3980- radv: avoid prefixing few VkXXX structures by struct
3981- radv/sqtt: always dump pipelines and shaders ISA
3982- radv: add few helpers for configuring performance counters
3983- radv: add initial SPM support on GFX10+
3984- radv: fix resume/suspend render pass with depth/stencil attachment
3985- radv: mark GFX10.3 (aka RDNA2) as conformant products with CTS 1.2.7.1
3986- Revert "radv: Add bufferDeviceAddressMultiDevice support."
3987- radv: do not perform depth/stencil resolves for suspended render pass
3988- nir/lower_tex: add lower_lod_zero_width
3989- radv: enable lower_lod_zero_width
3990- radv: fix dynamic rendering inheritance if the subpass index isn't 0
3991- radv: ignore dynamic inheritance if the render pass isn't NULL
3992- radv/winsys: remove useless has_sparse_vm_mappings checks
3993- radv/winsys: clear the PRT VA range when destroying a virtual BO
3994- radv/winsys: stop using reference counting for virtual BOs
3995- radv/winsys: update sparse mappings with OP_REPLACE instead of OP_MAP/OP_UNMAP
3996- ac/nir: add an option to disable anisotropic filtering for single level images
3997- radv,aco: do not disable anisotropy filtering for non-mipmap images
3998- radv: fix dynamic rendering global scissor
3999- radv: remove unnecessary NULL checks in vkMapMemory()/vkUnmapMemory()
4000- radv/winsys: stop zeroing few structs in buffer_from_fd()
4001- radv/winsys: remove radv_amdgpu_winsys_bo::is_shared
4002- radv: fix restoring subpass during hw/fs color resolves
4003- radv: re-enable fast clears for images that support comp-to-single
4004- radv: re-apply "Do not access set layout during vkCmdBindDescriptorSets."
4005- radv: fix clears with value of "1" and different DCC signedness
4006- radv: stop checking buffer size in vkCreateBuffer()
4007- radv: print number of levels with RADV_DEBUG=img
4008- radv: fix copying mutable descriptors to sampler descriptors
4009- radv: add drirc radv_disable_htile_layers and enable it for F1 2021
4010- aco: fix a dynamic-stack-buffer-overflow when printing instructions
4011- aco: do not print prologs disassembly if no disassembler
4012- radv,aco: keep track of the prolog disassembly if necessary
4013- radv: save the vertex prolog to the trace BO for debugging
4014- radv: dump the VS prolog disassembly to the hang report
4015- radv: add UMR markers for the vertex prolog
4016- radv: do not use the common entrypoint for the Metro Exodus layer
4017- radv: remove remaining dead code related to the old sync code
4018- radv/winsys: remove unused syncobj functions
4019- radv: stop checking if dynamic states changed
4020- radv: reset VRS if the current subpass doesn't have a VRS attachment
4021- radv: fix computing the fb size in presence of dynamic VRS attachment
4022- radv: only clear VRS_HTILE_ENCODING on GFX10.3+
4023- radv: allow to disable anisotropic filtering for single level image with drirc
4024- radv: enable radv_disable_aniso_single_level for Battlefield 1 & V
4025- radv/winsys: fix zero submit if no timeline semaphore support
4026- vulkan/runtime: fix accessing NULL pointers detected by UBSAN
4027- radv: fix missing destroy for the overallocation mutex
4028- radv: fix computing the number of color samples if no attachments
4029- radv: optimize CPU overhead of si_cp_dma_prefetch() slightly
4030- radv: disable attachmentFragmentShadingRate for RADV_DEBUG=nohiz
4031- radv: fix copying VRS rates to HTILE if the depth/stencil is cleared
4032- radv: fix copying VRS rates if the ds attachment uses mips
4033- radv/winsys: set GTT_WC flag for CS IBs on GFX6
4034- radv: do not restore NULL compute pipelines after meta operations
4035- radv: stop checking if pipelines are NULL during draws/dispatches
4036- vulkan: Update the XML and headers to 1.3.204
4037- radv: implement 1.3 features/properties
4038- radv: report textureCompressionASTC_HDR as not supported
4039- radv: add a no-op version of vkGetPhysicalDeviceToolPropertiesEXT()
4040- radv: switch a bunch of struct/enum to 1.3 versions
4041- radv: bump conformance version to 1.3.0.0 for RDNA2
4042- radv: advertise Vulkan 1.3
4043- radv/ci: mark dEQP-VK.api.version_check.version as expected failure on Stoney
4044- radv/winsys: fix missing buffer_make_resident() for the null winsys
4045- Revert "radv: re-apply "Do not access set layout during vkCmdBindDescriptorSets.""
4046- radv/winsys: fix initializing debug/perftest options if multiple instances
4047- radv: enable radv_disable_aniso_single_level for The Evil Within 1&2
4048
4049Scott Anderson (1):
4050
4051- egl/wayland: Remove unused wayland enum
4052
4053Sergii Melikhov (1):
4054
4055- vulkan: Unlock before return.
4056
4057Shmerl (1):
4058
4059- docs/features: Add VK_KHR_acceleration_structure, VK_KHR_pipeline_library, VK_KHR_ray_query, VK_KHR_ray_tracing_pipeline.
4060
4061Silvestrs Timofejevs (2):
4062
4063- egl: introduce a log level getter function
4064- egl: add config debug printout
4065
4066Simon McVittie (1):
4067
4068- meson: Try to link all-targets module if Gallium OpenCL is enabled
4069
4070Simon Ser (4):
4071
4072- renderonly: write down usage rules
4073- vulkan/wsi/wayland: use enum wl_shm_format
4074- vulkan/wsi/wayland: use DRM_FORMAT_INVALID
4075- vulkan/wsi/wayland: remove format switch from wl_shm_format_for_vk_format
4076
4077Stefan Brüns (1):
4078
4079- llvmpipe: Add get_{driver,device}_uuid implementations
4080
4081Tapani Pälli (18):
4082
4083- iris: clear bos_written when resetting a batch
4084- vulkan/wsi: provide api for drivers to setup syncobj fd
4085- radv: setup syncobj fd via wsi_device_setup_syncobj_fd
4086- anv: setup syncobj fd via wsi_device_setup_syncobj_fd
4087- vulkan/wsi: implement missing wsi_register_device_event
4088- anv: allow VK_IMAGE_LAYOUT_UNDEFINED as final layout
4089- glsl: fix invariant qualifer usage and matching rule for GLSL 4.20
4090- iris: unref syncobjs and free r/w dependencies array for slab entries
4091- mesa: free idalloc storage for display lists
4092- mesa: free vbo_save_vertex_list store prims
4093- mesa: refactor GetProgramiv to use program resource list
4094- mesa: move GetProgramInterfaceiv as a shader_query function
4095- mesa: change GetProgramiv name length queries to use program resources
4096- intel/genxml: add PIPE_CONTROL field for L3 read only cache invalidation
4097- anv: invalidate L3 read only cache when VF cache is invalidated
4098- iris: invalidate L3 read only cache when VF cache is invalidated
4099- iris: fix a leak on surface states
4100- mesa/st: always use DXT5 when transcoding ASTC format
4101
4102Tatsuyuki Ishi (3):
4103
4104- aco: support DPP8
4105- aco: lower masked swizzle to DPP8
4106- radv/sqtt: Add and enable basic EXT_debug_utils support.
4107
4108Thierry Reding (2):
4109
4110- tegra: Use private reference count for sampler views
4111- tegra: Use private reference count for resources
4112
4113Thomas H.P. Andersen (49):
4114
4115- nine: remove dead code
4116- svga: fix bitwise/logical and mixup
4117- meson: drop compatability with < 0.48
4118- meson: drop a comment relating to old meson version
4119- docs: update the required meson version
4120- meson: drop a temp formatting variable
4121- ci: clean up debian-clang no-error list
4122- gallivm: avoid a self-assign warning
4123- ci: debian-clang: drop -Wno-error for self-assign
4124- lavapipe: fix implicit-fallthrough warning
4125- i915g: fix implicit-fallthrough warning
4126- r300: remove a set but not used variable
4127- r600: remove a set but not used variable
4128- glx: remove a set but not used variable
4129- gallium/u_threaded: drop unused function
4130- gallium/tgsi_exec: drop unused function
4131- draw: drop unused function
4132- microsoft/compiler: dxil_nir_opt_alu_deref_srcs: return progress
4133- lavapipe: fix string-plus-int warning
4134- i915g: avoid left shifting a negative number
4135- panvk: use FALLTHROUGH to stop a warning
4136- panvk: cast negative value to unint8_t
4137- ci: debian-clang: build more drivers
4138- tu/clear_blit: use || when working with bools
4139- r600/sb: silence a sometimes-uninitialized warning
4140- ci: debian-clang: -Wno-error for sometimes-uninitialized
4141- zink: malloc/sizeof mismatch
4142- xa: fix compile warning for -Wabsolute-value
4143- broadcom/compiler: fix compile warning -Wabsolute-value
4144- ci: debian-clang: drop -Wno-error=absolute-value
4145- replace 0 with NULL for NULL pointers
4146- freedreno: silence sometimes-uninitialized warning
4147- freedreno: drop dead assignment
4148- meson: add check kwarg to run_command
4149- ci: clean up debian-android no-error list
4150- anv: drop unused label
4151- ci: debian-android: drop -Wno-error=unused-label
4152- vulkan/vk_extensions_gen: fix -Wextern-initializer warning
4153- ci: debian-android: drop -Wno-error=extern-initializer
4154- microsoft/compiler: fix -Wbitwise-instead-of-logical warning
4155- anv: avoid warning about unused function
4156- vc4: drop unused function
4157- v3d: avoid warning about unused function
4158- v3d: avoid warning about unused function
4159- broadcom: drop unused functions
4160- panfrost: mark two variables as unused
4161- anv: drop a set but unused variable
4162- anv: drop a set but unused variable
4163- svga: silence -Wsometimes-uninitialized
4164
4165Thomas Wagner (1):
4166
4167- util: use anonymous file for memory fd creation
4168
4169Thong Thai (8):
4170
4171- frontends/va: disable packed header support for h264 encoder
4172- frontends/va/enc: hardcode h265 encoder ref pic list size
4173- radeon/vcn: increase encoder dpb size
4174- frontends/va/enc: allow for frames to be marked as (not) referenced
4175- radeon: hardcode uvd/vce encoder not_referenced value to false
4176- radeon/vcn: implement encoder dpb management
4177- frontends/va/enc: default motion estimation parameters for performance
4178- radeonsi: add check for graphics to si_try_normal_clear
4179
4180Timothy Arceri (16):
4181
4182- mesa: fix buffer overrun in SavedObj texture obj array
4183- mesa: remove old tnl device driver header files
4184- mesa: remove _mesa_ir_link_shader()
4185- mesa: make _mesa_associate_uniform_storage() static
4186- mesa: remove GLSL IR to Mesa IR code
4187- mesa/st: move _mesa_generate_parameters_list_for_uniforms() code to st
4188- mesa: move _mesa_ensure_and_associate_uniform_storage() to uniform_query.cpp
4189- mesa: tidy up ir_to_mesa.{cpp,h} includes, comments, etc
4190- mesa: rename ir_to_mesa.{cpp,h} -> link_program.{cpp,h}
4191- mesa: update or remove out of date references to ir_to_mesa
4192- doc: update source tree doc to reflect recent classic/swrast deletions
4193- util: add dri config option force_compat_shaders
4194- util: add workaround for SNK HEROINES Tag Team Frenzy
4195- mesa: make struct in gl_program a union and remove FIXME
4196- glsl/glcpp: make sure to expand new token after concatenation
4197- glsl/st: move st_nir_opts() into gl compiler common code
4198
4199Timur Kristóf (44):
4200
4201- radv: Use MESA_VULKAN_SHADER_STAGES to make room for mesh/task.
4202- aco: Fix how p_is_helper interacts with optimizations.
4203- nir: Rename nir_get_io_vertex_index_src and include per-primitive I/O.
4204- nir: Print Mesh Shader specific info.
4205- nir: Fix nir_lower_io with per primitive outputs.
4206- nir, spirv: Don't mark NV_mesh_shader primitive indices as per-primitive.
4207- nir: Add new option to lower invocation ID from invocation index.
4208- nir: Lower cull and clip distance arrays for mesh shaders.
4209- nir: Don't compact per-vertex and per-primitive outputs together.
4210- nir: Group per-primitive outputs at the end for driver location assign.
4211- nir: Fix sorting per-primitive outputs.
4212- util: Add util_widen_mask function.
4213- aco: Use util_widen_mask.
4214- radv: Use util_widen_mask.
4215- nir: Print task and mesh shader I/O variable names.
4216- aco: Clean up and fix quad group instructions with WQM.
4217- aco/optimizer_postRA: Fix combining DPP into VALU.
4218- aco/optimizer_postRA: Fix applying VCC to branches.
4219- spirv: Allow VRS with mesh shaders.
4220- gitlab-ci: Disable radv-fossils again.
4221- ac/nir/ngg: Lower NV mesh shaders to NGG semantics.
4222- radv: Add radv_pipeline_has_mesh helper.
4223- radv: Add mesh shader specific info.
4224- radv: Cleanup PS input generation.
4225- radv: Cleanup VS output param assignment.
4226- radv: Add support for per-primitive mesh shader outputs.
4227- radv: Setup shader arguments for mesh shaders.
4228- radv: Compile mesh shaders and apply the necessary NIR lowerings.
4229- radv: Set output driver locations for mesh shaders.
4230- radv: Create mesh shading pipelines.
4231- radv: Add support for mesh shading pipelines in the command buffer.
4232- aco: Add Mesh and Task shader stages.
4233- aco: Update README about NGG and mesh shaders.
4234- aco: Add 1D workgroup_id support for mesh shaders.
4235- aco: Use the correct outinfo for mesh shaders.
4236- aco: Export per-primitive mesh shader output attributes.
4237- radv: Implement NV_mesh_shader draw calls.
4238- radv: Enable NV_mesh_shader with a perftest flag.
4239- radv: Note when a mesh shader writes the primitive shading rate.
4240- radv: Lower primitive shading rate for mesh shaders.
4241- aco: Emit VRS rate when it's per-primitive.
4242- radv: Support VRS for mesh shaders.
4243- radv: Disable IB2 on compute queues.
4244- ac/nir/ngg: Fix mixed up primitive ID after culling.
4245
4246Tomeu Vizoso (13):
4247
4248- ci: Add support for lazor Chromebooks
4249- ci: Let manual LAVA jobs have a longer timeout than others
4250- freedreno/ci: Test Turnip on Adreno 618
4251- ci: Create symlink to /install early
4252- ci: Don't set GALLIVM_PERF in the scripts
4253- virgl/ci: Set GALLIVM_PERF=nopt,no_quad_lod
4254- ci: Remove syslogd
4255- virgl/ci: Run each dEQP instance in its own VM
4256- ci: Uprev Crosvm
4257- lvp: Free the driver_data pointer for all commands
4258- anv/ci: Test with deqp-vk on Tiger Lake
4259- anv/tests: Free BO cache and device mutex
4260- ci: Rebalance Iris jobs
4261
4262Topi Pohjolainen (6):
4263
4264- intel/compiler: Handle new sampler descriptor fields for 16bit sampler
4265- intel/compiler/fs: Add support for 16-bit sampler msg payload
4266- intel/compiler: Demote sampler params to 16-bit for CMS/UMS/MCS
4267- intel/compiler: Prepare ld2dms_w for 4 mcs components
4268- intel/compiler: Add new variant for TXF_CMS_W
4269- intel/compiler: Deprecate ld2dms and use ld2dms_w instead
4270
4271Uday Kiran Pichika (2):
4272
4273- iris: enable adaptive sync for IRIS
4274- anv: enable adaptive sync for ANV
4275
4276Vadym Shovkoplias (3):
4277
4278- intel/fs: Fix a cmod prop bug when cmod is set to inst that doesn't support it
4279- anv: Include viewport size in scissor rectangle
4280- glthread: Check out of bounds for MultiDrawElementsBaseVertex cmd
4281
4282Vasily Khoruzhick (14):
4283
4284- gallium: add PIPE_CAP_PREFER_POT_ALIGNED_VARYINGS
4285- lima: enable PIPE_CAP_PREFER_POT_ALIGNED_VARYINGS
4286- lima: add native txp support
4287- lima: add support for 3D textures
4288- lima: handle 1D samplers
4289- lima: add more wrap modes
4290- lima/ppir: check if mul node is a source of add node before inserting
4291- lima/ppir: implement gl_FragDepth support
4292- lima: fix crash with sparse samplers
4293- lima: disasm: use last argument as a filename
4294- lima: disasm: call util_cpu_detect() to init CPU caps
4295- lima: implement dual source blend
4296- lima: use 1 as blend factor for dst_alpha for SRC_ALPHA_SATURATE
4297- lima: fix blending with min/max ops
4298
4299Viktoriia Palianytsia (1):
4300
4301- glsl: fix for unused variable in glsl_types.cpp
4302
4303Vinson Lee (17):
4304
4305- anv: Fix assertion.
4306- radv: Fix memory leak on error path.
4307- clover: Add constructor for image_rd_argument.
4308- zink: Remove duplicate variable unsized.
4309- intel/compiler: Change selected_simd return type to int.
4310- intel/compiler: Initialize SIMDSelectionTest member error.
4311- vulkan/wsi: Unlock before return on error path.
4312- microsoft/spirv_to_dxil: Fix non-Windows build.
4313- virgl: Allocate qdws after virgl_init_context to avoid leak.
4314- ac/rgp: Initialize clock_calibration with memset.
4315- ir3: Make shift operand 64-bit.
4316- panfrost: Avoid double unlock.
4317- isaspec: Sort field names to generate deterministic output.
4318- radv: Fix memory leak on error path.
4319- r600/sfn: Remove unused AluInstruction members.
4320- isaspec: Remove duplicate return statement.
4321- v3dv: Add missing unlocks on errors.
4322
4323Witold Baryluk (2):
4324
4325- zink: Do not access just freed zink_batch_state
4326- zink: Fully initialize VkBufferViewCreateInfo for hashing
4327
4328Xiaohui Gu (1):
4329
4330- iris: Mark a dirty update when vs_needs_sgvs_element value changed
4331
4332Yiwei Zhang (27):
4333
4334- dri_interface: remove obsolete interfaces
4335- dri_interface: remove gl header
4336- venus: refactor private descriptor_set helpers to be private
4337- venus: assign valid memoryTypeIndex of exportable ahb memory for image
4338- venus: release queues on device creation failure
4339- venus: refactor to add vn_device_init
4340- venus: refactor to add vn_buffer_init
4341- venus: refactor the ahb buffer mem_type_bits query api
4342- venus: add struct vn_buffer_memory_requirements
4343- venus: add struct vn_image_memory_requirements
4344- venus: add buffer cache init and usage flows
4345- venus: implement vn_buffer_get_max_buffer_size
4346- venus: implement vn_buffer_cache_get_memory_requirements
4347- venus: implement vn_buffer_cache_entries_create
4348- venus: refactor vn_device_memory_simple_alloc
4349- venus: simplify device memory pool alloc and refcount
4350- venus: defer roundtrip waiting to vkFreeMemory time
4351- venus: track memory type property flags in vn_device_memory
4352- venus: move bo allocation for mappable memory to vn_MapMemory
4353- venus: subtract appended header size in vn_CreatePipelineCache
4354- venus: VkExternalImageFormatProperties is optional
4355- tu: VkExternalImageFormatProperties is optional
4356- venus: update some obsolete assumptions described
4357- venus: track whether a fence is external
4358- Revert "venus: remove vn_ring_wait_all"
4359- venus: make vn_QueueSubmit async for native submissions
4360- venus: properly destroy deferred ahb image before real image creation
4361
4362Yonggang Luo (2):
4363
4364- vulkan: Open registry XML files as UTF-8
4365- win32: Fixes thread local on win32 with clang/mingw (!14062)
4366
4367mwezdeck (1):
4368
4369- virgl/drm: New optimization for uploading textures
4370
4371orbea (1):
4372
4373- build: add sha1_h for lp_texture.c
4374
4375satmandu (1):
4376
4377- Fix compilation on armv7l with gcc 11.2.0
4378
4379shanshengwang (1):
4380
4381- radeon/vce: Limiting max supported refernce frames to 1  for h264 encoding
4382