DirectXShaderCompiler

Граф коммитов

Автор	SHA1	Сообщение	Дата
David Neto	206133c9e7	HLMatrixLower: allow exceptions to propagate out (#6710 ) If an exception is thrown, don't block it in the TempOverloadPool destructor. Allow it to propagate out as a user-visible error. Explicitly clear the TempOverloadPool before returning from the HLMatrixLowerPass::runOnModule. In the normal case, when no exception is thrown, that will still verify that all the overloads actually have been lowered, and will assert out if they aren't.	2024-06-26 11:57:18 -04:00
Steven Perron	e5183a06b9	Set the layout rule explicitly for raw buffer operations (#6701 ) The first first fix in #5392 was not correct. It relied on the layout rule for the address to be the correct layout rule, but that is not always the case. The address is just an integer that could exist in any storage class. The correct solution is to explicitly set the layout rule for the BitCast operation when expanding the RawBuffer* functions. We know that the result of the BitCast is a pointer to the physical storage buffer storage class, so we know the layout need to be the storage buffer layout. Fixes #6554	2024-06-25 10:01:49 -04:00
Adam Yang	74205f8c19	Fix non-determinism in Reassociate caused by address coincidences (#6717 ) Originally @lizhengxing's PR. Retargeting main. This PR pulls the upstream change, Fix non-determinism in Reassociate caused by address coincidences (`ef8761fd3b`), into DXC. Here's the summary of the change: Between building the pair map and querying it there are a few places that erase and create Values. It's rare but the address of these newly created Values is occasionally the same as a just-erased Value that we already have in the pair map. These coincidences should be accounted for to avoid non-determinism. Thanks to Roman Tereshin for the test case. This is part 6 (the last part) of the fix for #6659. Co-authored-by: Zhengxing Li <zhengxingli@microsoft.com>	2024-06-24 17:40:12 -07:00
Raed Rizqie	0eebec69d4	CMake: Add initial support for MinGW-w64 (#6715 )	2024-06-24 15:25:14 -07:00
Zhengxing li	b79169bc85	[llc/opt] Add an option to run all passes twice (#6666 ) This PR pulls the following upstream changes into DXC: [llc/opt] Add an option to run all passes twice (`04464cf731`) > Lately, I have submitted a number of patches to fix bugs that only occurred when using the same pass manager to compile > multiple modules (generally these bugs are failure to reset some persistent state). > > Unfortunately I don't think there is currently a way to test that from the command line. This adds a very simple flag to both > llc and opt, under which the tools will simply re-run their respective > pass pipelines using the same pass manager on (a clone of the same module). Additionally, we verify that both outputs are > bitwise the same. > > Reviewers: yaron.keren [opt] Fix sanitizer complaints about r254774 (`38707c45be`) > `Out` can be null if no output is requested, so move any access > to it inside the conditional. Thanks to Justin Bogner for finding > this. This is for the test of this change (`ef8761fd3b`) to fix #6659. --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> Co-authored-by: Adam Yang <31109344+adam-yang@users.noreply.github.com>	2024-06-24 15:03:08 -07:00
Adam Yang	b197bec653	Add a new WeakVH value handle; NFC (#6703 ) Originally @lizhengxing's PR. Retargeting main. This PR pulls 2 upstream changes, Add a new WeakVH value handle; NFC (`f1c0eafd5b`) and Use a 2 bit pointer in ValueHandleBase::PrevPair; NFC (`b297bff1cc`), into DXC. Here's the summary of the changes: Add a new WeakVH value handle; NFC > WeakVH nulls itself out if the value it was tracking gets deleted, but it does not track RAUW. > > Reviewers: dblaikie, davide > > Subscribers: mcrosier, llvm-commits > > Differential Revision: https://reviews.llvm.org/D32267 Use a 2 bit pointer in ValueHandleBase::PrevPair; NFC > This was an omission in r301813. I had made the supporting changes to make this happen, but I forgot to actually update the > > PrevPair declaration. This is part 4 and 5 of the fix for #6659.	2024-06-21 11:55:24 -07:00
Steven Perron	8b18659aef	Avoid adding types to default namespace (#6700 ) Some of the types that have been added to the vk namespace were being added to the default namespace when compiling for DXIL. The if conditions were such that they would fall through to a default case. The solution is to explicitly add code that we should skip adding those builtin types when the vk namespace is not defined. Fixes #6646.	2024-06-21 15:20:16 +00:00
David Neto	1f8f79688e	CMake: Use find_package(Python3) (#6675 ) Fixes the CMake CMP0148 warning at configure time. find_package(Python3) is supported since CMake 3.12, and DXC requires at least CMake 3.17	2024-06-20 13:11:49 -04:00
Steven Perron	73d3ccf399	Update spirv deps (#6709 ) Update the SPIR-V Tools to the latest.	2024-06-20 13:04:49 -04:00
Zhengxing li	45018c752d	Rename WeakVH to WeakTrackingVH; NFC (#6663 ) This PR pulls the upstream change, Rename WeakVH to WeakTrackingVH; NFC (`e6bca0eecb`), into DXC. Here's the summary of the change: > I plan to use WeakVH to mean "nulls itself out on deletion, but does not track RAUW" in a subsequent commit. > > Reviewers: dblaikie, davide > > Reviewed By: davide > > Subscribers: arsenm, mehdi_amini, mcrosier, mzolotukhin, jfb, llvm-commits, nhaehnle > > Differential Revision: https://reviews.llvm.org/D32266 This is part 3 of the fix for #6659. --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>	2024-06-18 12:34:55 -07:00
dependabot[bot]	df87613a0f	Bump urllib3 from 2.0.7 to 2.2.2 in /utils/git (#6699 ) Bumps [urllib3](https://github.com/urllib3/urllib3) from 2.0.7 to 2.2.2. <details> <summary>Release notes</summary> <p><em>Sourced from <a href="https://github.com/urllib3/urllib3/releases">urllib3's releases</a>.</em></p> <blockquote> <h2>2.2.2</h2> <h2>🚀 urllib3 is fundraising for HTTP/2 support</h2> <p><a href="https://sethmlarson.dev/urllib3-is-fundraising-for-http2-support">urllib3 is raising ~$40,000 USD</a> to release HTTP/2 support and ensure long-term sustainable maintenance of the project after a sharp decline in financial support for 2023. If your company or organization uses Python and would benefit from HTTP/2 support in Requests, pip, cloud SDKs, and thousands of other projects <a href="https://opencollective.com/urllib3">please consider contributing financially</a> to ensure HTTP/2 support is developed sustainably and maintained for the long-haul.</p> <p>Thank you for your support.</p> <h2>Changes</h2> <ul> <li>Added the <code>Proxy-Authorization</code> header to the list of headers to strip from requests when redirecting to a different host. As before, different headers can be set via <code>Retry.remove_headers_on_redirect</code>.</li> <li>Allowed passing negative integers as <code>amt</code> to read methods of <code>http.client.HTTPResponse</code> as an alternative to <code>None</code>. (<a href="https://redirect.github.com/urllib3/urllib3/issues/3122">#3122</a>)</li> <li>Fixed return types representing copying actions to use <code>typing.Self</code>. (<a href="https://redirect.github.com/urllib3/urllib3/issues/3363">#3363</a>)</li> </ul> <p><strong>Full Changelog</strong>: <a href="https://github.com/urllib3/urllib3/compare/2.2.1...2.2.2">https://github.com/urllib3/urllib3/compare/2.2.1...2.2.2</a></p> <h2>2.2.1</h2> <h2>🚀 urllib3 is fundraising for HTTP/2 support</h2> <p><a href="https://sethmlarson.dev/urllib3-is-fundraising-for-http2-support">urllib3 is raising ~$40,000 USD</a> to release HTTP/2 support and ensure long-term sustainable maintenance of the project after a sharp decline in financial support for 2023. If your company or organization uses Python and would benefit from HTTP/2 support in Requests, pip, cloud SDKs, and thousands of other projects <a href="https://opencollective.com/urllib3">please consider contributing financially</a> to ensure HTTP/2 support is developed sustainably and maintained for the long-haul.</p> <p>Thank you for your support.</p> <h2>Changes</h2> <ul> <li>Fixed issue where <code>InsecureRequestWarning</code> was emitted for HTTPS connections when using Emscripten. (<a href="https://redirect.github.com/urllib3/urllib3/issues/3331">#3331</a>)</li> <li>Fixed <code>HTTPConnectionPool.urlopen</code> to stop automatically casting non-proxy headers to <code>HTTPHeaderDict</code>. This change was premature as it did not apply to proxy headers and <code>HTTPHeaderDict</code> does not handle byte header values correctly yet. (<a href="https://redirect.github.com/urllib3/urllib3/issues/3343">#3343</a>)</li> <li>Changed <code>ProtocolError</code> to <code>InvalidChunkLength</code> when response terminates before the chunk length is sent. (<a href="https://redirect.github.com/urllib3/urllib3/issues/2860">#2860</a>)</li> <li>Changed <code>ProtocolError</code> to be more verbose on incomplete reads with excess content. (<a href="https://redirect.github.com/urllib3/urllib3/issues/3261">#3261</a>)</li> </ul> <h2>2.2.0</h2> <h2>🖥️ urllib3 now works in the browser</h2> <p>🎉 <strong>This release adds experimental support for <a href="https://urllib3.readthedocs.io/en/stable/reference/contrib/emscripten.html">using urllib3 in the browser with Pyodide</a>!</strong> 🎉</p> <p>Thanks to Joe Marshall (<a href="https://github.com/joemarshall"><code>@joemarshall</code></a>) for contributing this feature. This change was possible thanks to work done in urllib3 v2.0 to detach our API from <code>http.client</code>. Please report all bugs to the <a href="https://github.com/urllib3/urllib3/issues">urllib3 issue tracker</a>.</p> <h2>🚀 urllib3 is fundraising for HTTP/2 support</h2> <p><a href="https://sethmlarson.dev/urllib3-is-fundraising-for-http2-support">urllib3 is raising ~$40,000 USD</a> to release HTTP/2 support and ensure long-term sustainable maintenance of the project after a sharp decline in financial support for 2023. If your company or organization uses Python and would benefit from HTTP/2 support in Requests, pip, cloud SDKs, and thousands of other projects <a href="https://opencollective.com/urllib3">please consider contributing financially</a> to ensure HTTP/2 support is developed sustainably and maintained for the long-haul.</p> <p>Thank you for your support.</p> <h2>Changes</h2> <ul> <li>Added support for <a href="https://urllib3.readthedocs.io/en/latest/reference/contrib/emscripten.html">Emscripten and Pyodide</a>, including streaming support in cross-origin isolated browser environments where threading is enabled. (<a href="https://redirect.github.com/urllib3/urllib3/issues/2951">#2951</a>)</li> <li>Added support for <code>HTTPResponse.read1()</code> method. (<a href="https://redirect.github.com/urllib3/urllib3/issues/3186">#3186</a>)</li> <li>Added rudimentary support for HTTP/2. (<a href="https://redirect.github.com/urllib3/urllib3/issues/3284">#3284</a>)</li> <li>Fixed issue where requests against urls with trailing dots were failing due to SSL errors when using proxy. (<a href="https://redirect.github.com/urllib3/urllib3/issues/2244">#2244</a>)</li> <li>Fixed <code>HTTPConnection.proxy_is_verified</code> and <code>HTTPSConnection.proxy_is_verified</code> to be always set to a boolean after connecting to a proxy. It could be <code>None</code> in some cases previously. (<a href="https://redirect.github.com/urllib3/urllib3/issues/3130">#3130</a>)</li> </ul> <!-- raw HTML omitted --> </blockquote> <p>... (truncated)</p> </details> <details> <summary>Changelog</summary> <p><em>Sourced from <a href="https://github.com/urllib3/urllib3/blob/main/CHANGES.rst">urllib3's changelog</a>.</em></p> <blockquote> <h1>2.2.2 (2024-06-17)</h1> <ul> <li>Added the <code>Proxy-Authorization</code> header to the list of headers to strip from requests when redirecting to a different host. As before, different headers can be set via <code>Retry.remove_headers_on_redirect</code>.</li> <li>Allowed passing negative integers as <code>amt</code> to read methods of <code>http.client.HTTPResponse</code> as an alternative to <code>None</code>. (<code>[#3122](https://github.com/urllib3/urllib3/issues/3122) <https://github.com/urllib3/urllib3/issues/3122></code>__)</li> <li>Fixed return types representing copying actions to use <code>typing.Self</code>. (<code>[#3363](https://github.com/urllib3/urllib3/issues/3363) <https://github.com/urllib3/urllib3/issues/3363></code>__)</li> </ul> <h1>2.2.1 (2024-02-16)</h1> <ul> <li>Fixed issue where <code>InsecureRequestWarning</code> was emitted for HTTPS connections when using Emscripten. (<code>[#3331](https://github.com/urllib3/urllib3/issues/3331) <https://github.com/urllib3/urllib3/issues/3331></code>__)</li> <li>Fixed <code>HTTPConnectionPool.urlopen</code> to stop automatically casting non-proxy headers to <code>HTTPHeaderDict</code>. This change was premature as it did not apply to proxy headers and <code>HTTPHeaderDict</code> does not handle byte header values correctly yet. (<code>[#3343](https://github.com/urllib3/urllib3/issues/3343) <https://github.com/urllib3/urllib3/issues/3343></code>__)</li> <li>Changed <code>InvalidChunkLength</code> to <code>ProtocolError</code> when response terminates before the chunk length is sent. (<code>[#2860](https://github.com/urllib3/urllib3/issues/2860) <https://github.com/urllib3/urllib3/issues/2860></code>__)</li> <li>Changed <code>ProtocolError</code> to be more verbose on incomplete reads with excess content. (<code>[#3261](https://github.com/urllib3/urllib3/issues/3261) <https://github.com/urllib3/urllib3/issues/3261></code>__)</li> </ul> <h1>2.2.0 (2024-01-30)</h1> <ul> <li>Added support for <code>Emscripten and Pyodide <https://urllib3.readthedocs.io/en/latest/reference/contrib/emscripten.html></code><strong>, including streaming support in cross-origin isolated browser environments where threading is enabled. (<code>[#2951](https://github.com/urllib3/urllib3/issues/2951) <https://github.com/urllib3/urllib3/issues/2951></code></strong>)</li> <li>Added support for <code>HTTPResponse.read1()</code> method. (<code>[#3186](https://github.com/urllib3/urllib3/issues/3186) <https://github.com/urllib3/urllib3/issues/3186></code>__)</li> <li>Added rudimentary support for HTTP/2. (<code>[#3284](https://github.com/urllib3/urllib3/issues/3284) <https://github.com/urllib3/urllib3/issues/3284></code>__)</li> <li>Fixed issue where requests against urls with trailing dots were failing due to SSL errors when using proxy. (<code>[#2244](https://github.com/urllib3/urllib3/issues/2244) <https://github.com/urllib3/urllib3/issues/2244></code>__)</li> <li>Fixed <code>HTTPConnection.proxy_is_verified</code> and <code>HTTPSConnection.proxy_is_verified</code> to be always set to a boolean after connecting to a proxy. It could be <code>None</code> in some cases previously. (<code>[#3130](https://github.com/urllib3/urllib3/issues/3130) <https://github.com/urllib3/urllib3/issues/3130></code>__)</li> <li>Fixed an issue where <code>headers</code> passed in a request with <code>json=</code> would be mutated (<code>[#3203](https://github.com/urllib3/urllib3/issues/3203) <https://github.com/urllib3/urllib3/issues/3203></code>__)</li> <li>Fixed <code>HTTPSConnection.is_verified</code> to be set to <code>False</code> when connecting from a HTTPS proxy to an HTTP target. It was set to <code>True</code> previously. (<code>[#3267](https://github.com/urllib3/urllib3/issues/3267) <https://github.com/urllib3/urllib3/issues/3267></code>__)</li> <li>Fixed handling of new error message from OpenSSL 3.2.0 when configuring an HTTP proxy as HTTPS (<code>[#3268](https://github.com/urllib3/urllib3/issues/3268) <https://github.com/urllib3/urllib3/issues/3268></code>__)</li> <li>Fixed TLS 1.3 post-handshake auth when the server certificate validation is disabled (<code>[#3325](https://github.com/urllib3/urllib3/issues/3325) <https://github.com/urllib3/urllib3/issues/3325></code>__)</li> <li>Note for downstream distributors: To run integration tests, you now need to run the tests a second time with the <code>--integration</code> pytest flag. (<code>[#3181](https://github.com/urllib3/urllib3/issues/3181) <https://github.com/urllib3/urllib3/issues/3181></code>__)</li> </ul> <h1>2.1.0 (2023-11-13)</h1> <ul> <li>Removed support for the deprecated urllib3[secure] extra. (<code>[#2680](https://github.com/urllib3/urllib3/issues/2680) <https://github.com/urllib3/urllib3/issues/2680></code>__)</li> <li>Removed support for the deprecated SecureTransport TLS implementation. (<code>[#2681](https://github.com/urllib3/urllib3/issues/2681) <https://github.com/urllib3/urllib3/issues/2681></code>__)</li> <li>Removed support for the end-of-life Python 3.7. (<code>[#3143](https://github.com/urllib3/urllib3/issues/3143) <https://github.com/urllib3/urllib3/issues/3143></code>__)</li> <li>Allowed loading CA certificates from memory for proxies. (<code>[#3065](https://github.com/urllib3/urllib3/issues/3065) <https://github.com/urllib3/urllib3/issues/3065></code>__)</li> <li>Fixed decoding Gzip-encoded responses which specified <code>x-gzip</code> content-encoding. (<code>[#3174](https://github.com/urllib3/urllib3/issues/3174) <https://github.com/urllib3/urllib3/issues/3174></code>__)</li> </ul> </blockquote> </details> <details> <summary>Commits</summary> <ul> <li><a href="`27e2a5c5a7`"><code>27e2a5c</code></a> Release 2.2.2 (<a href="https://redirect.github.com/urllib3/urllib3/issues/3406">#3406</a>)</li> <li><a href="`accff72ecc`"><code>accff72</code></a> Merge pull request from GHSA-34jh-p97f-mpxf</li> <li><a href="`34be4a57e5`"><code>34be4a5</code></a> Pin CFFI to a new release candidate instead of a Git commit (<a href="https://redirect.github.com/urllib3/urllib3/issues/3398">#3398</a>)</li> <li><a href="`da410581b6`"><code>da41058</code></a> Bump browser-actions/setup-chrome from 1.6.0 to 1.7.1 (<a href="https://redirect.github.com/urllib3/urllib3/issues/3399">#3399</a>)</li> <li><a href="`b07a669bd9`"><code>b07a669</code></a> Bump github/codeql-action from 2.13.4 to 3.25.6 (<a href="https://redirect.github.com/urllib3/urllib3/issues/3396">#3396</a>)</li> <li><a href="`b8589ec9f8`"><code>b8589ec</code></a> Measure coverage with v4 of artifact actions (<a href="https://redirect.github.com/urllib3/urllib3/issues/3394">#3394</a>)</li> <li><a href="`f3bdc55851`"><code>f3bdc55</code></a> Allow triggering CI manually (<a href="https://redirect.github.com/urllib3/urllib3/issues/3391">#3391</a>)</li> <li><a href="`52392654b3`"><code>5239265</code></a> Fix HTTP version in debug log (<a href="https://redirect.github.com/urllib3/urllib3/issues/3316">#3316</a>)</li> <li><a href="`b34619f94e`"><code>b34619f</code></a> Bump actions/checkout to 4.1.4 (<a href="https://redirect.github.com/urllib3/urllib3/issues/3387">#3387</a>)</li> <li><a href="`9961d14de7`"><code>9961d14</code></a> Bump browser-actions/setup-chrome from 1.5.0 to 1.6.0 (<a href="https://redirect.github.com/urllib3/urllib3/issues/3386">#3386</a>)</li> <li>Additional commits viewable in <a href="https://github.com/urllib3/urllib3/compare/2.0.7...2.2.2">compare view</a></li> </ul> </details> <br /> [![Dependabot compatibility score](https://dependabot-badges.githubapp.com/badges/compatibility_score?dependency-name=urllib3&package-manager=pip&previous-version=2.0.7&new-version=2.2.2)](https://docs.github.com/en/github/managing-security-vulnerabilities/about-dependabot-security-updates#about-compatibility-scores) Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting `@dependabot rebase`. [//]: # (dependabot-automerge-start) [//]: # (dependabot-automerge-end) --- <details> <summary>Dependabot commands and options</summary> <br /> You can trigger Dependabot actions by commenting on this PR: - `@dependabot rebase` will rebase this PR - `@dependabot recreate` will recreate this PR, overwriting any edits that have been made to it - `@dependabot merge` will merge this PR after your CI passes on it - `@dependabot squash and merge` will squash and merge this PR after your CI passes on it - `@dependabot cancel merge` will cancel a previously requested merge and block automerging - `@dependabot reopen` will reopen this PR if it is closed - `@dependabot close` will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually - `@dependabot show <dependency name> ignore conditions` will show all of the ignore conditions of the specified dependency - `@dependabot ignore this major version` will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself) - `@dependabot ignore this minor version` will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself) - `@dependabot ignore this dependency` will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself) You can disable automated security fix PRs for this repo from the [Security Alerts page](https://github.com/microsoft/DirectXShaderCompiler/network/alerts). </details> Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-06-18 12:00:24 -07:00
Steven Perron	4295b25934	Add error for invalid arguments to GetDimension (#6698 ) When processing the GetDimension member function for textures, we do not emit an error if the output variable is not an l-value. This change will add this error. Fixes #6689	2024-06-18 09:05:27 -07:00
Nathan Gauër	129c9f8760	[SPIR-V] Re-enable rich debug instructions for objects (#6696 ) This commit bumps SPIR-V tools version, and re-add support for objects debug instructions when using Vulkan's debug instructions. Because OpenCL debug instructions are not a non-semantic set, the SPIR-V spec would need to be modified, as today it does not allows forward references. Fixes #6691 --------- Signed-off-by: Nathan Gauër <brioche@google.com>	2024-06-18 10:15:29 +00:00
David Neto	393b759c8a	indvars: don't replace a phi when that breaks LCSSA (#6695 ) Induction variable simplification (indvars) tries to rewrite exit values; these appear as phi nodes in loop exit blocks. If the replacement for the phi is still in the loop, then that would break the LCSSA property. Don't do that. Add a test for this.	2024-06-17 20:56:16 -04:00
Chris B	98bb80a1ea	Remove Windows C++ redist hack (#6692 ) This removes the hack introduced in #6683 to workaround issues in the GitHub and ADO runner image: https://github.com/actions/runner-images/issues/10004 Rumor has it the runner images are now fixed... let's see. Fixes #6674	2024-06-17 16:38:50 -05:00
Steven Perron	8c3f40c0ae	Add warning when vk::offset is not correctly aligned (#6694 ) We will start issues a warning when `vk::offset` is not correctly aligned to make it easier for users to understand why their spir-v will not validate. Note that we do not treat this as an error because we want to allow someone to have the flexibility to do other things. For example, they could be targeting an API that does not follow any of the existing rules, which is why they are using `vk::offset`. Fixes #6171	2024-06-17 13:19:14 +02:00
David Neto	206b7c2e53	Remove unstructured loop exits: Don't introduce loops (#6676 ) Previously, if the latch exit was reachable from a different exit block for the loop, then the pass would introduce a loop involving that exit block and the latch exit. This is unwanted and unaccounted for. - Add a test for shared exits. - Add a test for non-dedicated latch exit	2024-06-14 09:43:50 -04:00
Nathan Gauër	56f3c40381	[SPIR-V] Emit OpUndef for undefined values (#6686 ) Before this change, OpConstantNull was emitted when an undef value was required. This causes an issue for some types which cannot have the OpConstantNull value. In addition, it mixed well-defined values with undefined values, which prevents any kind of optimization/analysis later on. Fixes #6653 --------- Signed-off-by: Nathan Gauër <brioche@google.com>	2024-06-13 09:21:34 +02:00
Helena Kotas	84d39b66cf	ExecutionTest::UnaryHalfOpTest#AcosHalf: Update tolerance (#6690 ) Update tolerance for ExecutionTest::UnaryHalfOpTest#AcosHalf test. Enables implementations to calculate `acos` for fp16 type by converting to fp32, doing fp32 math, and then converting back to fp16 using round-to-nearest-even conversing (RTNE) per D3D11 spec. For more details please see issue #6179. As mentioned in the linked issue, for these floating point tests a fixed point tolerance does not really make sense. It should vary based on the magnitude of the expected value. But we are already using this approach in many similar test cases and the simplest fix now is to update the tolerance to accommodate the fp32-to-ft16 conversion. Fixes #6179	2024-06-12 16:14:31 -07:00
Steven Perron	4353db3983	Make the location map run per entry point (#6688 ) The code that adds the input and output decoration in the entry points inputs and outputs assumes that there is a single entry point in the module. When using the `lib` profile that is not true. This commit modifies the code so that it groups the stage variables by entry point, and runs the current code on each group separably. I hesitate to make this change because it will change the locations for code that currently works, and will force users to update their applications accordingly. Or they could modify their shaders to use explicit locations attributes. Neither is great. However, the advantage is that this allows the implicit locations to match what would happen if the shader were compiled individually. It also makes the locations more predictable because change in another shader would change all shader after it. This is a better design, and worth the breakage. Fixes #6678 Fixes #5213	2024-06-12 15:26:34 +00:00
Nathan Gauër	80f6e46bf8	[SPIR-V] Fix GroupNonUniform capabilities+ext (#6687 ) [SPIR-V] Fix GroupNonUniform capabilities+ext Fixes emission of GroupNonUniform capabilities and related extensions, in particular SPV_NV_shader_subgroup_partitioned. Since this PR bumps SPIR-V headers + tools, some test changes are required due to opcode changes. Those are in a separate commit, but same PR. Fixes #6672 --------- Signed-off-by: Nathan Gauër <brioche@google.com>	2024-06-12 09:20:39 +02:00
Zhengxing li	a44c88e2b8	Emulate TrackingVH using WeakVH (#6662 ) This PR pulls the upstream change, Emulate TrackingVH using WeakVH (`8a6238201f`), into DXC. Here's the summary of the change: > This frees up one slot in the HandleBaseKind enum, which I will use later to add a new kind of value handle. The size of the HandleBaseKind enum is important because we store a HandleBaseKind in > the low two bits of a (in the worst case) 4 byte aligned pointer. > > Reviewers: davide, chandlerc > > Subscribers: mcrosier, llvm-commits > > Differential Revision: https://reviews.llvm.org/D32634 This is part 2 of the fix for #6659.	2024-06-11 17:49:44 -07:00
Antonio Maiorano	1c7cb4ffb8	Fix instcombine overflow check inserting inst at wrong place (#6679 ) When optimizing an overflow check of an add followed by a compare, the new instruction was being inserted at the compare, and the add removed. This produced invalid IR in cases where there were other uses of the former add between it and the compare. This fix makes sure to insert the new instruction at the old add location, rather than at the compare. Note that this was also fixed in LLVM: `6f5dca70ed`	2024-06-11 19:26:31 +00:00
Antonio Maiorano	3dc67421ba	Revert "Fix RemoveUnstructuredLoopExits when an exiting edge jumps out multiple levels of loops. (#6668 )" (#6685 ) This reverts commit `8206fbdc7f`. Reason for revert: since landing this, new asserts/crashes have been found.	2024-06-11 18:39:57 +00:00
Natalie Chouinard	40c76f7585	Fix another UAF in SimplifyCFG (#6680 ) In certain cases of unreachable code, SimplifyCFG could try to replace a phi node with a select where the phi node itself was the select's condition. This resulted in an ASAN use-after-free during SimplifyCFG. The test case added isn't quite ideal because by the end of the SimplifyCFG pass, the phi node is restored to its original state both before and after this fix. However, an ASAN build of `dxopt` or `check-clang-dxc` will identify a heap-use-after-free failure in the intermediary steps of this test without this patch and succeeds with it. This was also fixed in upstream LLVM: `602ab24833`	2024-06-11 13:58:30 -04:00
Chris B	0b9acdb75e	Workaround broken GitHub runner images (#6683 ) This PR contains two changes: 1) Moves a pragma to disable a warning, which seems to be required by the new compiler. 2) Adds a preprocessor define to workaround the crashes caused by the runner image mismatching C++ runtime versions. The second change we will want to revert once the runner images are fixed. The issue tracking the runner images is: https://github.com/actions/runner-images/issues/10004 Related #6668	2024-06-11 10:02:33 -05:00
Steven Perron	4b7993c78b	Add option to set the max id (#6654 ) Vulkan implementation can have different limits on the maximum value used as an id in a SPIR-V binary. SPIRV-Tools generall assumes this limit is 0x3FFFFF because all implementations must support at least that value for an id. Since many implementations can support larger values, the tools allows an option that will set a different limit. This commit add an option to DXC to do the same. Fixes #6636	2024-06-10 11:13:01 -07:00
Antonio Maiorano	1d196655b6	Fix crash in scalarrepl-param-hlsl when dynamically indexing a GEP of a constant indexed GEP (#6670 ) When processing global values to determine when to flatten vectors, this pass was only checking the immdiate users of the value for non-dynamic indexing of the vector. But this would fail in the case of a dynamic indexed GEP of a constant indexed GEP (e.g. h[0][a]) because the first level GEP was constant indexed, but not the second. We fix this by checking the full User tree of the value in `hasDynamicVectorIndexing`.	2024-06-06 14:56:20 -04:00
David Neto	8206fbdc7f	Fix RemoveUnstructuredLoopExits when an exiting edge jumps out multiple levels of loops. (#6668 ) Before doing any major surgery on an exit from loop L, ensure that if an exit edge from L goes to block X, then X is in L's parent loop or no loop at all. Add test cases: - a reduced test case where the exiting block does not dominate its own loop latch. - a reduced test case where the exiting block is the latch for its own loop. This reproduces the assert triggered by the original HLSL. - the original HLSL that triggered this bug fix. - the intermediate module from the original HLSL, taken just before the attempt to remove unstructured loop exits.	2024-06-06 10:56:13 -05:00
Zhengxing li	8408ae8829	Use accessors for ValueHandleBase::V; NFC (#6660 ) This PR pulls the upstream change, Use accessors for ValueHandleBase::V; NFC (`6f08789d30`), into DXC. Here's the summary of the change: > This changes code that touches ValueHandleBase::V to go through getValPtr and (newly added) setValPtr. This functionality will be used later, but also seemed like a generally good cleanup. > > I also renamed the field to Val, but that's just to make it obvious that I fixed all the uses. This is part 1 of the fix for #6659. --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>	2024-06-05 23:20:46 -07:00
Takuto Ikuta	978d36221d	fix compile error for newer VS toolchain (#6648 ) This is to fix compile error like ``` ../../third_party/dxc/utils/TableGen/AsmWriterEmitter.cpp(974): error C2666: '`anonymous-namespace'::IAPrinter::operator ==': overloaded functions have similar conversions ../../third_party/dxc/utils/TableGen/AsmWriterEmitter.cpp(739): note: could be 'bool `anonymous-namespace'::IAPrinter::operator ==(const `anonymous-namespace'::IAPrinter &)' ../../third_party/dxc/utils/TableGen/AsmWriterEmitter.cpp(739): note: or 'bool `anonymous-namespace'::IAPrinter::operator ==(const `anonymous-namespace'::IAPrinter &)' [synthesized expression 'y == x'] ../../third_party/dxc/utils/TableGen/AsmWriterEmitter.cpp(974): note: while trying to match the argument list '(`anonymous-namespace'::IAPrinter, `anonymous-namespace'::IAPrinter)' ``` in dawn project which is updating to newer VS toolchain. This imports fix from `4ab57cd9ab`. more context is in https://issues.chromium.org/issues/341890053#comment2	2024-05-30 10:21:18 -04:00
David Neto	3a78b67849	NFC: Comment, refactor, and test hlsl::RemoveUnstructuredLoopExits (#6655 ) Add a pass to run hlsl::RemoveUnstructuredLoopExits in isolation Example: opt -dxil-r-u-l-e a.ll -S Add some basic tests. No functional change to the pass itself.	2024-05-30 09:43:39 -04:00
Tex Riddell	128e6ce2be	NFC: ExtractIRForPassTest.py error reporting when pass not found (#6325 ) This just adds convenient error reporting to `ExtractIRForPassTest.py` when the specified pass is not found. No functional change.	2024-05-24 13:39:38 -07:00
Natalie Chouinard	cdc56031b5	[SPIR-V] Add error message for SamplerFeedback (#6640 ) Sampler feedback resource types are not supported by the SPIR-V backend, but they would previously fail silently until a function was called on them. This change makes the error message more explicit on the type. Related to #6614	2024-05-24 16:05:53 +02:00
David Neto	a1b945c1a3	Loop exit restructurizer: don't iterate over uses while mutating them (#6644 ) The SkipBlockWithBranch function does the following: - Splits the block into three blocks with an if-then-endif structure. - Moves most instructions from the original block into the "then" block - If any of those values are used outside the original block, they are propagated through newly-constructed phis in the 'endif' block. This algorithm had a bug where the uses of a value were being scanned while the uses were also being updated. In some cases a downstream out-of-block use could be skipped. That results in an invalid module because now the original definition is now in the 'then' block, which does not dominate the downstream out-of-block use. Add a test that demonstrates the problem.	2024-05-23 16:00:37 +00:00
Antonio Maiorano	b41d8a9478	Fix LoopDeletion incorrectly updating PHI with multiple duplicate inputs (#6643 ) LoopDeletion was incorrectly updating PHI nodes in the target block when it had duplicate input edges. This happens, for example, when deleting a loop that uses a switch with multiple cases that exit the same way. After determining that this was the bug, I found this fix in LLVM: https://reviews.llvm.org/D34516 and applied it here.	2024-05-23 10:21:30 -04:00
Greg Roth	a6f4025957	Calculate preferred alignment when lowering groupshared matrices (#6589 ) When flattening the global for a groupshared matrix, the alignment information was getting lost. As a result, the alignments of the loads and stores were calculating their own alignment based on preferred alignment and trailing zeros of the index. The preferred alignment switched to 16 when the type size was over 128 bits due to a heuristic whose rationale is lost to time. When the global has its own alignment, that gets used, so by calculating it at lowering, the alignments are consistent and reliable. Includes testing for a few matrix variants and a pass test. fixes #6416	2024-05-22 13:38:51 -07:00
Steven Perron	86da226c4f	Fix aligment for empty structs (#6635 ) We have a special case to that the the size and alignment for an empty struct is `{1,0}`. However that is not correct. See https://registry.khronos.org/vulkan/specs/1.3-extensions/html/vkspec.html#interfaces-alignment-requirements. > An empty structure has a base alignment equal to the size of the smallest scalar type permitted by the capabilities declared in the SPIR-V module. (e.g., for a 1 byte aligned empty struct in the StorageBuffer storage class, StorageBuffer8BitAccess or UniformAndStorageBuffer8BitAccess must be declared in the SPIR-V module. I'm not 100% sure how DXC handle this minimum alignment, but I figured I would inialize the alignment to 1. If there are not members, then it will remain 1, and I would let the rest of the logic happen. No special case. Fixes #2882	2024-05-22 12:44:02 -04:00
Nielsbishere	0bdd15b75f	Update SPIR-V.rst (#6642 ) Mini typo	2024-05-22 09:54:31 -04:00
Nathan Gauër	9ad095dfe6	[SPIR-V] Add support for SampleCmpLevel (#6618 ) SampleCmpLevel is similar to SampleCmpLevel0, except the LOD level can be specified using either a const-offset, or a variable. This should be available starting SM6.7 Fixes #6613 --------- Signed-off-by: Nathan Gauër <brioche@google.com>	2024-05-22 10:59:45 +02:00
Zhengxing li	9ee3f23d9c	More aggressive reassociations (#6626 ) Although DXC applied the upstream change, Reassociate: add global reassociation algorithm (https://github.com/llvm/llvm-project/commit/b8a330c) in this PR (https://github.com/microsoft/DirectXShaderCompiler/pull/6598), it still might overlook some obvious common factors. One case has been observed is: ``` %Float4_0 = call %dx.types.CBufRet.f32 @dx.op.cbufferLoadLegacy.f32(i32 59, %dx.types.Handle %1, i32 1) %Float4_0.w = extractvalue %dx.types.CBufRet.f32 %Float4_0, 3 %Float2_0 = call %dx.types.CBufRet.f32 @dx.op.cbufferLoadLegacy.f32(i32 59, %dx.types.Handle %1, i32 0) %Float2_0.y = extractvalue %dx.types.CBufRet.f32 %Float2_0, 1 /* %Float4_1 is redundant with %Float4_0 since they invokes cbufferLoadLegacy with same parameters / %Float4_1 = call %dx.types.CBufRet.f32 @dx.op.cbufferLoadLegacy.f32(i32 59, %dx.types.Handle %1, i32 1) / %Float4_1.w is redundant with %Float4_0.w / %Float4_1.w = extractvalue %dx.types.CBufRet.f32 %Float4_1, 3 / %Float2_1 is redundant with %Float2_0 since they invokes cbufferLoadLegacy with same parameters / %Float2_1 = call %dx.types.CBufRet.f32 @dx.op.cbufferLoadLegacy.f32(i32 59, %dx.types.Handle %1, i32 0) / %Float2_1.y is redundant with %Float2_0.y / %Float2_1.y = extractvalue %dx.types.CBufRet.f32 %Float2_1, 1 .... %11 = fmul fast float %Float4_0.w, %10 %12 = fmul fast float %11, %Float2_0.y .... %14 = fmul fast float %Float4_1.w, %13 %15 = fmul fast float %14, %Float2_1.y (%Float4_0.w %Float2_0.y) equals to (%Float4_1.w * %Float2_1.y), they should be reassociated to a common factor ``` The upstream change can't identify this common factor because DXC doesn't know (%Float4_0.w, %Float4_1.w) and (%Float2_0.y, %Float2_1.y) are redundant when running Reassociate pass. Those redundancies will be eliminated in GVN pass. For DXC can identify more common factors, this PR will aggressively run Reassociate pass again after GVN pass and then run GVN pass again to remove the redundancies generared in this run of Reassociate pass. Changing the order of floating point operations causes the precision issue. In case some shaders get unexpected results due to this PR, use "-opt-disable aggressive-reassociation" to disable this PR and roll back. This is part 3 of the fix for https://github.com/microsoft/DirectXShaderCompiler/issues/6593.	2024-05-21 15:37:04 -07:00
Zhengxing li	1ee70fdc64	Add a flag for the upstream global reassociation algorithm change (#6625 ) This PR (https://github.com/microsoft/DirectXShaderCompiler/pull/6598) pulls the upstream global reassociation algorithm change in DXC and can reduce redundant calculations obviously. However, from the testing result of a large offline suite of shaders, some shaders got worse compilation results and couldn't benefit from this upstream change. This PR adds a flag for the upstream global reassociation change. It would be easier to roll back if a shader get worse compilation result due to this upstream change. This is part 2 of the fix for #6593.	2024-05-21 13:45:38 -07:00
dependabot[bot]	6a34e29175	Bump requests from 2.31.0 to 2.32.0 in /utils/git (#6638 ) Bumps [requests](https://github.com/psf/requests) from 2.31.0 to 2.32.0. Release notes Sourced from https://github.com/psf/requests/releases requests's releases. v2.32.0 Fixed an issue where setting `verify=False` on the first request from a Session will cause subsequent requests to the _same_ origin to also ignore cert verification, regardless of the value of `verify`. ( https://github.com/psf/requests/security/advisories/GHSA-9wx4-h78v-vm56 Improvements `verify=True` now reuses a global SSLContext which should improve request time variance between first and subsequent requests. It should also minimize certificate load time on Windows systems when using a Python version built with OpenSSL 3.x. ( https://redirect.github.com/psf/requests/issues/6667) Requests now supports optional use of character detection (`chardet` or `charset_normalizer`) when repackaged or vendored. This enables `pip` and other projects to minimize their vendoring surface area. The `Response.text()` and `apparent_encoding` APIs will default to `utf-8` if neither library is present. (https://redirect.github.com/psf/requests/issues/6702) [![Dependabot compatibility score](https://dependabot-badges.githubapp.com/badges/compatibility_score?dependency-name=requests&package-manager=pip&previous-version=2.31.0&new-version=2.32.0)](https://docs.github.com/en/github/managing-security-vulnerabilities/about-dependabot-security-updates#about-compatibility-scores) Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting `@dependabot rebase`. Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-05-21 12:39:35 -07:00
Zhengxing li	6f9c107b78	Reassociate: add global reassociation algorithm (#6598 ) This PR pulls the upstream change, Reassociate: add global reassociation algorithm (`b8a330c42a`), into DXC with miminal changes. For the code below: foo = (a * b) * c bar = (a * d) * c As the upstream change states, it can identify the a*c is a common factor and redundant. This is part 1 of the fix for #6593.	2024-05-21 11:55:40 -07:00
Natalie Chouinard	66acf8de22	[SPIR-V] Remove always disabled test (#6634 ) This test was marked DISABLED_ in gtest at the time it was added in PR #3155, so it appears that it was never passing. Specifically, the CHECK for `DebugFunction [[func1]]` fails. I don't think it's a priority to implement debug info for unreferenced functions at this point, so opting to simply remove it. This is the last test in the unsupported directory so it can now be removed entirely. Fixes #6616	2024-05-21 11:47:22 +02:00
Natalie Chouinard	1658b068b5	[SPIR-V] Enable more unsupported tests (#6630 ) The following tests only required minor test syntax changes to pass: - tools/clang/test/CodeGenSPIRV/cast.2float.interlocked.hlsl - tools/clang/test/CodeGenSPIRV/meshshading.nv.error.fncall.amplification.vulkan1.2.hlsl (+ replacing NV ext with EXT) - tools/clang/test/CodeGenSPIRV/var.init.extvector.hlsl Issue #6621 has been filed to track the failure of tools/clang/test/CodeGenSPIRV/oo.class.static.member.hlsl. Related to #6616	2024-05-17 15:06:11 +00:00
Natalie Chouinard	2432517221	[SPIR-V] Implement WaveMutliPrefix* (#6608 ) Implements the Shader Model 6.5 WaveMultiPrefix* intrinsic functions using the group operation from SPV_NV_shader_subgroup_partitioned, PartitionedExclusiveScanNV, which performs a partitioned exclusive scan operation across a subset of invocations ("lanes") in a subgroup ("wave"). The subset of the partition is determined by the provided ballot ("mask") parameter, which follows the same requirements for valid partitioning and active invocations/lanes as the HLSL parameter. Note that WaveMultiPrefixCountBits remains unimplemented because it does not directly map to a SPIR-V GroupNonUniformArithmetic instruction that accepts the PartitionedExclusiveScanNV Group Operation. DirectX Spec: https://microsoft.github.io/DirectX-Specs/d3d/HLSL_ShaderModel6_5.html#wavemultiprefix-functions SPIR-V Extension: https://htmlpreview.github.io/?https://github.com/KhronosGroup/SPIRV-Registry/blob/main/extensions/NV/SPV_NV_shader_subgroup_partitioned.html Depends on #6596 Fixes #6600	2024-05-17 09:59:35 -04:00
Natalie Chouinard	d64f5f2699	[SPIR-V] Enable rayquery tests (#6627 ) These tests were previously marked as "unsupported" because they were misconfigured at the time they were added and never run. Minor changes have been made to make them passing tests. Note that rayquery_assign.cs.hlsl has been removed because it no longer produces an error. A similar non-erroring check exists in rayquery_init_expr.hlsl Related to #6616	2024-05-17 09:33:42 -04:00
Natalie Chouinard	83ba82e1f3	[SPIR-V] Remove bad test (#6632 ) This test is marked as unsupported (ignored), but it currently fails to compile both for SPIR-V and DXIL with the same Sema error: "cannot implicitly convert from 'SecondStruct' to 'FirstStruct'". If it would have succeeded for SPIR-V at some point in the past it's no longer valid anyways, so removing it. Related to #6616	2024-05-17 09:07:59 -04:00
Antonio Maiorano	348040254e	Fix use-after-free in SimplifyCFG (#6628 ) When SimplifySwitchOnSelect calls SimplifyTerminatorOnSelect, it holds onto the select's condition value to use for the conditional branch it replaces the switch with. When removing the switch's unused predecessors, it must make sure not to delete PHIs in case one of them is used by the condition value, otherwise the condition value itself may get deleted, resulting in an use-after-free. Note that this was fixed in LLVM as well: `dc3b67b4ca`	2024-05-17 03:22:52 +00:00

1 2 3 4 5 ...

4649 Коммитов Все ветки Поиск

4649 Коммитов

Все ветки