aa1f6934f83027c9f70d733781f3ebebf3c65721 to f7e89b2f48c6fbbe5b3cedb1a9fcc1b4d47e64b0 · X Strike Force / lib / mesa

Commits on Source 39

docs: Add release notes for 18.1.2 · dd355ee9
Dylan Baker authored Jun 15, 2018

dd355ee9

radv: Fix output for sparse MRTs. · c41a7462

Bas Nieuwenhuizen authored Jun 13, 2018

We need to init the cb_shader_format correctly with the changed
col_format, so this moves the col_format adjustment to before the
adjustment to before the cb_shader_mask gets generated.

Fixes: 06d3c650 "radv: fix a GPU hang when MRTs are sparse"
Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=106903


CC: 18.1 <mesa-stable@lists.freedesktop.org>
Reviewed-by: Dave Airlie <airlied@redhat.com>
Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
(cherry picked from commit 41dabdc4)

c41a7462

radv: don't fast clear HTILE for 16-bit depth surfaces on GFX8 · 888b7fca

Samuel Pitoiset authored Jun 13, 2018

This causes rendering issues in Shadow Warrior 2 with DXVK.

Cc: mesa-stable@lists.freedesktop.org
Fixes: ccc64f31 ("radv: enable TC-compat HTILE for 16-bit depth surfaces on GFX8")
Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=106912


Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>
(cherry picked from commit 51e23d34)

888b7fca

radv: update the ZRANGE_PRECISION value for the TC-compat bug · b2740629

Samuel Pitoiset authored Jun 13, 2018

On GFX8+, there is a bug that affects TC-compatible depth surfaces
when the ZRange is not reset after LateZ kills pixels.

The workaround is to always set DB_Z_INFO.ZRANGE_PRECISION to match
the last fast clear value. Because the value is set to 1 by default,
we only need to update it when clearing Z to 0.0.

We also need to set the depth clear regs and to update
ZRANGE_PRECISION when initializing a TC-compat depth image to 0.

Original patch from James Legg.

This fixes random CTS fails with
dEQP-VK.renderpass.suballocation.formats.d32_sfloat_s8_uint.input.*

Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=105396


CC: <mesa-stable@lists.freedesktop.org>
Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>
(cherry picked from commit 68dead11)

b2740629

radeonsi/gfx9: fix si_get_buffer_from_descriptors for 48-bit pointers · 66bc41a3

Marek Olšák authored Jun 01, 2018



This fixes:
GL45-CTS.pipeline_statistics_query_tests_ARB.functional_compute_shader_invocations

Cc: 18.0 18.1 <mesa-stable@lists.freedesktop.org>
Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
(cherry picked from commit 6d671078)

66bc41a3

ac/gpu_info: report real total memory sizes · 49f43bdc

Marek Olšák authored Jun 12, 2018



The change from MIN2 to MAX2 is intentional.

Cc: 18.1 <mesa-stable@lists.freedesktop.org>
Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>
(cherry picked from commit 95ecde42)

49f43bdc

glsl: allow standalone semicolons outside main() · 156d0230

Dave Airlie authored Jun 14, 2018



GLSL 4.60 offically added this but games and older CTS suites actually
had shaders that did this, we may as well enable it everywhere.

Adding stable because it appears apps in the wild do this.

Acked-by: Timothy Arceri <tarceri@itsqueeze.com>
Reviewed-by: Matt Turner <mattst88@gmail.com>
Cc: <mesa-stable@lists.freedesktop.org>
(cherry picked from commit babd1d52)

156d0230

meson: only build vl_winsys_dri.c when x11 platform is used · 87453e9f

Lukas Rusak authored Jun 01, 2018



This seems to have been missed in the move from autotools

This fixes the following build issue:

../src/gallium/auxiliary/vl/vl_winsys_dri.c:34:10: fatal error: X11/Xlib-xcb.h: No such file or directory
 #include <X11/Xlib-xcb.h>
          ^~~~~~~~~~~~~~~~

Fixes: b1b65397
       ("meson: Build gallium auxiliary")
Reviewed-by: Dylan Baker <dylan@pnwbakers.com>
(cherry picked from commit 1d92d648)

87453e9f

meson: fix private libs when building without glx · 2008ca24

Lukas Rusak authored Jun 04, 2018



I noticed that the generated pkg-config files will include
glx and x11 dependencies even when x11 isn't a selected platform.

This fixes the private libs and was tested by building kmscube

V2:
  - check if gallium-xlib is being used for glx

Fixes: 108d257a "meson: build libEGL"
Reviewed-by: Dylan Baker <dylan@pnwbakers.com>
Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>
(cherry picked from commit 4cfc4cef)

2008ca24

util/bitset: include util/macro.h · ae394968

Christian Gmeiner authored Jun 15, 2018



BITSET_FFS(x) macro makes use of ARRAY_SIZE(x) macro which is
defined in util/macro.h. Include it directy to make usage more
straightforward.

Fixes: 692bd4a1 ("util: replace Elements() with ARRAY_SIZE()")
Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com>
Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>
(cherry picked from commit efae1279)

ae394968

glsl: Don't copy propagate from SSBO or shared variables either · f3ec346a

Ian Romanick authored Jun 05, 2018



Since SSBOs can be written by other GPU threads, copy propagating a read
can cause the value to magically change.  SSBO reads are also very
expensive, so doing it twice will be slower.

Haswell, Broadwell, and Skylake had similar results. (Skylake shown)
total instructions in shared programs: 14399120 -> 14399119 (<.01%)
instructions in affected programs: 684 -> 683 (-0.15%)
helped: 1
HURT: 0

total cycles in shared programs: 532978931 -> 532973113 (<.01%)
cycles in affected programs: 530484 -> 524666 (-1.10%)
helped: 1
HURT: 0

Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>
Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>
Cc: mesa-stable@lists.freedesktop.org
Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=106774
(cherry picked from commit 461a5c89)

f3ec346a

glsl: Don't copy propagate elements from SSBO or shared variables either · a4609fe8

Ian Romanick authored Jun 05, 2018



Since SSBOs can be written by a different GPU thread, copy propagating a
read can cause the value to magically change.  SSBO reads are also very
expensive, so doing it twice will be slower.

The same shader was helped by this patch and the previous.

Haswell, Broadwell, and Skylake had similar results. (Skylake shown)
total instructions in shared programs: 14399119 -> 14399113 (<.01%)
instructions in affected programs: 683 -> 677 (-0.88%)
helped: 1
HURT: 0

total cycles in shared programs: 532973113 -> 532971865 (<.01%)
cycles in affected programs: 524666 -> 523418 (-0.24%)
helped: 1
HURT: 0

Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>
Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>
Cc: mesa-stable@lists.freedesktop.org
Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=106774
(cherry picked from commit 37bd9ccd)

a4609fe8

radv: fix emitting the TCS regs on GFX9 · c86a99ad

Samuel Pitoiset authored Jun 15, 2018

The primitive ID is NULL and this generates an invalid
select instruction which crashes because one operand is NULL.

This fixes crashes in The Long Journey Home, Quantum Break
and Just Cause 3 with DXVK.

Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=106756


CC: <mesa-stable@lists.freedesktop.org>
Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>
(cherry picked from commit 5917761e)

c86a99ad

radv: fix reported number of available VGPRs · 9b8c90fc

Eric Engestrom authored Jun 15, 2018



It's a bit late to round up after an integer division.

Fixes: de889794 "radv: Implement VK_AMD_shader_info"
Signed-off-by: Eric Engestrom <eric.engestrom@intel.com>
Reviewed-by: Alex Smith <asmith@feralinteractive.com>
(cherry picked from commit d85fef1e)

9b8c90fc

radv: fix bitwise check · 6bc8fcbc

Eric Engestrom authored Jun 15, 2018



Fixes: 922cd381 "radv: implement out-of-order rasterization when it's safe on VI+"
Signed-off-by: Eric Engestrom <eric.engestrom@intel.com>
Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
(cherry picked from commit 4d08c1e7)

6bc8fcbc

meson: fix i965/anv/isl genX static lib names · 7540acb1

Eric Engestrom authored Jun 15, 2018



Shouldn't make any functional difference, just that `liblibanv_gen90.a`
will now be called `libanv_gen90.a`.

Fixes: 3218056e "meson: Build i965 and dri stack"
Fixes: d1992255 "meson: Add build Intel "anv" vulkan driver"
Signed-off-by: Eric Engestrom <eric.engestrom@intel.com>
Reviewed-by: Tapani Pälli <tapani.palli@intel.com>
Reviewed-by: Dylan Baker <dylan@pnwbakers.com>
(cherry picked from commit e8eb8482)

Trivial merge conflicts resolved by Dylan.

7540acb1

virgl: Remove debugging left-overs · 4c07e44a

Tomeu Vizoso authored Jun 18, 2018



Some fprintfs were probably left unintentionally a few years ago and are
a bit of a nuisance.

Fixes: 2d3301e4 ("virgl: fix reference counting of prime handles")
       Cc: Rob Herring <robh@kernel.org>

Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com>
Reviewed-by: Dave Airlie <airlied@redhat.com>
(cherry picked from commit 9b1cb50b)

4c07e44a

ac/surface: Set compressZ for stencil-only surfaces. · 19655023

Bas Nieuwenhuizen authored Jun 18, 2018



We HTILE compress stencil-only surfaces too.

CC: 18.1 <mesa-stable@lists.freedesktop.org>
Reviewed-by: Marek Olšák <marek.olsak@amd.com>
(cherry picked from commit 1a8501a9)

19655023

ac/gpu_info: add kernel_flushes_hdp_before_ib · e979b79c

Marek Olšák authored May 02, 2018



Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>
(cherry picked from commit b81149e2)

Conflicts:
	src/amd/common/ac_gpu_info.c

Conflicts resolved by Dylan

e979b79c

radeonsi: always put persistent buffers into GTT on radeon · decb031b

Marek Olšák authored Jun 08, 2018



This improves performance for certain games.

Cc: 18.1 <mesa-stable@lists.freedesktop.org>
Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>
(cherry picked from commit 9322974e)

decb031b

freedreno/ir3: fix base_vertex · 564c8820

Rob Clark authored Jun 16, 2018



Fixes: c366f422 nir: Offset vertex_id by first_vertex instead of base_vertex
Signed-off-by: Rob Clark <robdclark@gmail.com>
(cherry picked from commit e1e40935)

564c8820

configure: use compliant grep regex checks · 3c1eaa59

Emil Velikov authored Jun 14, 2018



The current `grep "foo\|bar"' trips on some grep implementations, like
the FreeBSD one. Instead use `egrep "foo|bar"' as suggested by Stefan.

Cc: Stefan Esser <se@FreeBSD.org>
Reported-by: Stefan Esser <se@FreeBSD.org>
Bugzilla: https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=228673


Fixes: 1914c814 ("configure: error out if building OMX w/o supported platform")
Fixes: 63e11ac2 ("configure: error out if building VA w/o supported platform")
Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com>
Reviewed-by: Matt Turner <mattst88@gmail.com>
(cherry picked from commit dfb1f275)

3c1eaa59

glsl/tests/glcpp: reinstate "error out if no tests found" · 2a0015de

Emil Velikov authored Apr 24, 2018



With the recent rework of converting the shell script to a python one
the check for actual tests was dropped.

Bring that back, since it was explicitly added considering we had a ~2
year period, during which the tests were not run.

v2: use raise Exception() over  print() & return false (Dylan)

Fixes: db8cd8e3 ("glcpp/tests: Convert shell scripts to a python
script")
Cc: Dylan Baker <dylan@pnwbakers.com>
Signed-off-by: Emil Velikov <emil.velikov@collabora.com>
(cherry picked from commit d589eddc)

2a0015de

mesa: fix glGetInteger64v for arrays of integers · dd14a0f3

Marek Olšák authored Jun 20, 2018



Cc: 18.1 <mesa-stable@lists.freedesktop.org>
Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>
(cherry picked from commit a2790b13)

dd14a0f3

cherry-ignore: Add 587e712e · fc0d0ad0
Dylan Baker authored Jun 22, 2018

fc0d0ad0

radv: fix HTILE metadata initialization in presence of subpass clears · b5f154a8

Samuel Pitoiset authored Jun 25, 2018



If the driver ends up by performing a slow depthstencil clear,
the HTILE metadata won't be initialized correctly.

This fixes random VM faults on Polaris while running CTS
with Bas's runner. This doesn't seem to regress performance.

CC: <mesa-stable@lists.freedesktop.org>
Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>
(cherry picked from commit 07cb1373)

b5f154a8

meson: Fix auto option for va · 6a1ef7cc

Dylan Baker authored Jun 21, 2018



The same as the previous two patches, but for the libva state tracker.

Fixes: 724916c8
       ("meson: dedup gallium-xvmc logic")
Signed-off-by: Dylan Baker <dylan.c.baker@intel.com>
Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>
(cherry picked from commit 94cf3970)

6a1ef7cc

meson: Fix auto option for xvmc · abe65eb5

Dylan Baker authored Jun 21, 2018



This fixes the same problem as the previous patch did for vdpau, but for
xvmc.

Fixes: 724916c8
       ("meson: dedup gallium-xvmc logic")
Signed-off-by: Dylan Baker <dylan.c.baker@intel.com>
Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>
(cherry picked from commit a6943bb4)

Squashed with:

meson: Fix typo that breaks -Dgalium-xvmc=false

_xmvc -> _xvmc. Sigh

Fixes: a6943bb4
       ("meson: Fix auto option for xvmc")
Signed-off-by: Dylan Baker <dylan.c.baker@intel.com>
Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>
Tested-by: Clayton Craft <clayton.a.craft@intel.com>
(cherry picked from commit ced3df56)

abe65eb5

meson: Correct behavior of vdpau=auto · f77cae2c

Dylan Baker authored Jun 21, 2018



Currently if vdpau is set to auto, it will be disabled only in cases
where gallium is disabled or the host OS is not supported (mac, haiku,
windows). However on (for example) Linux if libvdpau is not installed
then the build will error because of the unmet dependency. This corrects
auto to do the right thing, and not error if libvdpau is not installed.

Fixes: 992af0a4
       ("meson: dedup gallium-vdpau logic")
Signed-off-by: Dylan Baker <dylan.c.baker@intel.com>
Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>
(cherry picked from commit d9a8008a)

f77cae2c

radv: ignore pInheritanceInfo for primary command buffers · 0747f76b

Samuel Pitoiset authored Jun 25, 2018



From the Vulkan spec:
"If this is a primary command buffer, then this value is ignored."

CC: <mesa-stable@lists.freedesktop.org>
Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>
(cherry picked from commit ba5e25ed)

0747f76b

i965/gen6/gs: Handle case where a GS doesn't allocate VUE · f0f66ee4

Andrii Simiklit authored Jun 22, 2018



We can not use the VUE Dereference flags combination for EOT
message under ILK and SNB because the threads are not initialized
there with initial VUE handle unlike Pre-IL.
So to avoid GPU hangs on SNB and ILK we need
to avoid usage of the VUE Dereference flags combination.
(Was tested only on SNB but according to the specification
SNB Volume 2 Part 1: 1.6.5.3, 1.6.5.6
the ILK must behave itself in the similar way)

v2: Approach to fix this issue was changed.
Instead of different EOT flags in the program end
we will create VUE every time even if GS produces no output.

v3: Clean up the patch.
Signed-off-by: Andrii Simiklit <andrii.simiklit@globallogic.com>

Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=105399


CC: <mesa-stable@lists.freedesktop.org>
Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>
Tested-by: Mark Janes <mark.a.janes@intel.com>
(cherry picked from commit 232c5d75)

f0f66ee4

glsl: serialize data from glTransformFeedbackVaryings · 915d9166

Tapani Pälli authored Jun 14, 2018



While XFB has been enabled for cache, we did not serialize enough
data for the whole API to work (such as glGetProgramiv).

Fixes: 6d830940 "Allow shader cache usage with transform feedback"
Signed-off-by: Tapani Pälli <tapani.palli@intel.com>
Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=106907


Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>
(cherry picked from commit ab2643e4)

915d9166

nir: Handle call instructions in foreach_src · d96eecbd

Faith Ekstrand authored Jun 25, 2018

Even though they don't have regular sources, they do have derefs and
those may have implied sources that should be handled.

Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=106980


Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>
(Unique to 18.1)

d96eecbd

nir/validate: Use the type from the tail of call parameter derefs · bc67499b

Faith Ekstrand authored Jun 25, 2018

Otherwise, if what gets passed into the function call is a deref chain
longer than just a variable deref, we would use the type of the entire
variable rather than the type of the thing being dereferenced.

Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=106980


Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>
(Unique to 18.1)

bc67499b

cherry-ignore: Ignore cac7ab11 · 696be229
Dylan Baker authored Jun 26, 2018

696be229

radv: use separate bind points for the dynamic buffers · 07049c0b

Samuel Pitoiset authored Jun 26, 2018



The Vulkan spec says:

   "pipelineBindPoint is a VkPipelineBindPoint indicating whether
    the descriptors will be used by graphics pipelines or compute
    pipelines. There is a separate set of bind points for each of
    graphics and compute, so binding one does not disturb the other."

CC: <mesa-stable@lists.freedesktop.org>
Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>
(cherry picked from commit 7a57c827)

07049c0b

cherry-ignore: add a2f5292c · ed54f93d
Dylan Baker authored Jun 27, 2018

ed54f93d
VERSION: bump version to 18.1.3 · 765fdbe2
Dylan Baker authored Jun 29, 2018

765fdbe2
docs: Add release notes for 18.1.3 · f7e89b2f
Dylan Baker authored Jun 29, 2018

f7e89b2f