[AutoBump] Merge with 1901f4ac (Feb 05) (29)#945
Open
jorickert wants to merge 53 commits intobump_to_58a88001from
Open
[AutoBump] Merge with 1901f4ac (Feb 05) (29)#945jorickert wants to merge 53 commits intobump_to_58a88001from
jorickert wants to merge 53 commits intobump_to_58a88001from
Conversation
This patch adds an initial implementation of VPInstruction::computeCost with support for only one instruction so far - VPInstruction::AnyOf. This is only used when vectorising loops with uncountable early exits.
…orithms (#122410) This PR addresses an undefined behavior that arises when using the `std::fill` and `std::fill_n` algorithms, as well as their ranges counterparts `ranges::fill` and `ranges::fill_n`, with `vector<bool, Alloc>` that utilizes a custom-sized allocator with small integral types.
…#125869) We only initialize the final field above, so make sure we're marking the links in the chain on the way there as initialized as well.
This fixes an issue where your host triple is not compatible with the 64 bit ptx being the offload architecture. At least, that's my guess. This failed on our Arm 32 bit bot: https://lab.llvm.org/buildbot/#/builders/154/builds/11413/steps/5/logs/FAIL__Clang__offload-Xarch_c Crucially it outputted: clang: warning: argument unused during compilation: '-Xarch_nvptx64 -O3' [-Wunused-command-line-argument] Making the triple always something 64 bit means this will work everywhere. Fixes 455cedc / llvm/llvm-project#125421.
…n targets (#125849) Fixes the failure of the [Solaris/sparcv9 buildbot](https://lab.llvm.org/buildbot/#/builders/13/builds/5103) caused by #125578. cc @rorth @tromey @dwblaikie
Fixed imports for all files *within* `libc/src/string/memory_utils`. Note: This doesn't include **all** files that need to be fixed. Fixes #86579
A few tests generate a statically-linked position-independent executable with `-nostdlib -Wl,--unresolved-symbols=ignore-all -pie` (`%clang`) and test PLT handling. (--unresolved-symbols=ignore-all suppresses undefined symbol errors and serves as a convenience hack.) This relies on an unguaranteed linker behavior: a statically-linked PIE does not necessarily generate PLT entries. While current lld generates a PLT entry, it will change to suppress the PLT entry to simplify internal handling and improve consistency. (The behavior has no consistency in GNU ld, some ports generated a .dynsym entry while some don't. While most seem to generate a PLT entry but some ports use a weird `R_*_NONE` relocation.)
…(#123792) This continues the work on dot product instructions already started in 3cdac06. This change adds support for all OpenCL integer dot product builtins under `cl_khr_integer_dot_product` extension, namely: ``` * dot * dot_acc_sat * dot_4x8packed_(uu/ss/su/us)_(u)int * dot_acc_sat_4x8packed_(uu/ss/su/us)_(u)int ```
…#125894) Another follow up fix to llvm/llvm-project#123910 to fix a build failure that sometimes happens in shared library builds: https://lab.llvm.org/buildbot/#/builders/50/builds/9724 In file included from /home/tcwg-buildbot/worker/flang-aarch64-dylib/llvm-project/mlir/test/lib/Transforms/TestInlining.cpp:16: /home/tcwg-buildbot/worker/flang-aarch64-dylib/llvm-project/mlir/test/lib/Transforms/../Dialect/Test/TestOps.h:148:10: fatal error: 'TestOps.h.inc' file not found 148 | #include "TestOps.h.inc" | ^~~~~~~~~~~~~~~ 1 error generated.
Updates the databease to the Unicode release 16.0.0. The algorithms of the Grapheme clustering rules have not changed.
…5572) This patch does a few things: - replace macro definitions with an inclusion of the linux/wait.h kernel header. - WNOHANG - WUNTRACED - WEXITED - WCONTINUED - WSTOPPED - P_ALL - P_PID - P_PGID - P_PIDFD - Add missing macro definitions mandated by POSIX. Some are needed to build LLVM. - WCOREDUMP - WIFCONTINUED - WIFSIGNALELD - WIFSTOPPED - WSTOPSIG - Remove glibc style __W* macros. Users should stick with the POSIX macros. We can re-add them if necessary. - __WEXITSTATUS - __WTERMSIG - __WIFEXITED - __WIFSIGNALED - __WIFSTOPPED - __WIFCONTINUED - __WCOREDUMP - __W_EXITCODE - __W_STOPCODE - __W_CONTINUED - __WCOREFLAG Fixes: #124944
This patch adds the ability to open URLs from lldb on macOS. This can be a website URL or a deep link to a particular application. rdar://143327616 Signed-off-by: Med Ismail Bennani <ismail@bennani.ma>
…ent (#125621) From OpenMP 6.0 features list - OpenMP directives in concurrent loop regions - atomics constructs on concurrent loop regions - Lift nesting restriction on concurrent loop Testing - Updated test/OpenMP/for_order_messages.cpp - check-all
check if op_sel is consistent with .l/.h syntax if both are presented reopen this llvm/llvm-project#123250 since problem is resolved in llvm/llvm-project#125561
This patch makes the following improvements: * Corrects the suggestion that `bbenv` needs to be made within an llvm-zorg checkout. * Gives workarounds for following the instructions on a system with Python 3.13 (it removed some long-deprecated libraries, which causes problems). * Adds a note about how some builder workflows involve checking out llvm-zorg to retrieve additional scripts and gives guidance on how you can still make and test local changes to those scripts when that's the case.
Summary: The CUDA impelementation has long supported the `width` argument on its shuffle instrucitons, which makes it more difficult to replace those uses with this helper. This patch just correctly implements that for AMDGPU and NVPTX so it's equivalent to `__shfl_sync` in CUDA. This will ease porting. Fortunately these get optimized out correctly when passing in known widths.
…23521) This adds the basic lexer, with unittests, for the Data Inspection Language (DIL) -- see https://discourse.llvm.org/t/rfc-data-inspection-language/69893 This version of the lexer only handles local variables and namespaces, and is designed to work with llvm/llvm-project#120971.
So that users can set these manually if needed.
LLVM has started to emit AArch64 build attributes sections called .ARM.attributes. LLD does not yet have support for these so they are accumulating in the ELF output. As the first part of that support discard all the .ARM.attributes sections. This can be built upon by the full implementation in LLD. The build attributes specification only defines build attributes for relocatable objects. The intention for LLD is that files of type ET_EXEC and ET_SHARED will not have a build attributes in the output. A relocatable link with -r will need a merged build attributes, but until the merge is implemented it is better to discard.
Add declarations of SDTypeConstraint's operator== and operator< to the llvm namespace. These are declared as friends inside the class which makes them part of the enclosing namespace, but gcc wants it to be more explicit. Fixes #125537.
Simple syscall. Fixes: #124647
`syncthreads_and`, `syncthreads_count`, `syncthreads_or`, `synwrap` must take their argument by value. This patch updates the interfaces and makes sure these functions can be called inside a cuff kernel as well.
…ilds (#125871) Including #124987, we have failures on Windows on x86 with `/O2` builds, similar to Darwin. This disables the test for MSVC-x86.
riscv32 specifically doesn't provide EITHER SYS_poll or SYS_ppoll. We may be able to reimplement poll in terms of syscalls to SYS_ppoll_time64, but will leave that as a TODO for the future. (Such as when we want to be able to cross compile for riscv32). Link: #125940 Fixes: #125118
Missed this one in llvm/llvm-project#125936
An upcoming change will need to use add additional tests to this file, so this patch updates the RUN line to use a test prefix.
… (#125897) Fixes #125088. When splitBB is called with createBranch=true, it creates a branch instruction in the old block. But no debug loc is set on that branch instruction. If that is used as InsertPoint in the restoreIP, it has the potential to set the current debug location to null and subsequent instruction will come out without a debug location. This caused the verification check to fail as shown in the bug report. This PR changes splitBB and spliceBB function to also take a debugLoc parameter which can be used to set the debug location of the branch instruction.
This is to ensure that calls to `setjmp(3)` result in correct code generation that respects `setjmp(3)`'s `returns_twice` behavior. Otherwise, we might run into bugs (for example, Clang may perform tail-call optimization on this function if `-fno-builtins` is set (#122840)). --------- Co-authored-by: Nick Desaulniers <nick.desaulniers@gmail.com>
Reviewers: aeubanks, nico Reviewed By: aeubanks Pull Request: llvm/llvm-project#124679
The APInt constructor asserts if bits are set past the size of the APInt unless it is signed. This currently fails on RV32 because more than XLen bits are set.
When building on Windows, dealing with the BlocksRuntime is slightly more complicated. As we are not guaranteed a formward declaration for the blocks runtime ABI symbols, we may generate the declarations for them. In order to properly link against the well-known types, we always annotated them as `__declspec(dllimport)`. This would require the dynamic linking of the blocks runtime under all conditions. However, this is the only the only possible way to us the library. We may be building a fully sealed (static) executable. In such a case, the well known symbols should not be marked as `dllimport` as they are assumed to be statically available with the static linking to the BlocksRuntime. Introduce a new driver/cc1 option `-static-libclosure` which mirrors the myriad of similar options (`-static-libgcc`, `-static-libstdc++`, -static-libsan`, etc).
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
No description provided.