|
 | llvm/lib/Target/AArch64/AArch64ISelLowering.h (diff) |
 | llvm/lib/Target/AArch64/AArch64SVEInstrInfo.td (diff) |
 | llvm/lib/Target/AArch64/AArch64ISelLowering.cpp (diff) |
 | llvm/lib/Target/AArch64/SVEInstrFormats.td (diff) |
 | llvm/test/CodeGen/AArch64/sve-fp.ll (diff) |
Commit
bb0344644a656734d707ab9c0baf6eb0533ac905
by listmail
[memcpyopt] Conservatively handle non-integral pointers
If we allow non-integral pointers to become memset and memcpy, we lose the ability to reason about pointer propagation. This patch is modeled on changes we have carried downstream for a long time; it seemed worth being equally conservative for other users. There is room to refine the semantics and handling here if anyone is motivated.
|
 | llvm/lib/Transforms/Scalar/MemCpyOptimizer.cpp (diff) |
 | llvm/test/Transforms/MemCpyOpt/non-integral.ll |
Commit
75a5ec1bad18ae1d741830cc46946da00fed6ed9
by pklausler
[flang][msvc] Rework a MSVC work-around to avoid clang warning
A recent MSVC work-around patch is eliciting unused variable warnings from clang; package the lambda reference arguments into a struct to avoid the warning.
Differential revision: https://reviews.llvm.org/D88695
|
 | flang/lib/Evaluate/fold-implementation.h (diff) |
Commit
61687f3a48c254436cbdd55e10bfb23b727f3eb5
by pklausler
[flang] Fix buffering read->write transition
The buffer needs to be Reset() after a Flush(), since the Flush() can be a no-op after a read->write transition. And record numbers are 1-based, not 0-based. This fixes a bug with rewrites of records that have been recently read.
Differential revision: https://reviews.llvm.org/D88612
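The 1-based record numbering mentioned above matters whenever a record number is turned into a file position. A minimal sketch, assuming fixed-length records; `recordOffset` is a hypothetical helper for illustration and is not taken from the flang runtime:

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper (not flang runtime code): byte offset of a
// fixed-length record, given a 1-based record number.
std::int64_t recordOffset(std::int64_t recordNumber, std::int64_t recordLength) {
  assert(recordNumber >= 1 && "record numbers start at 1, not 0");
  assert(recordLength > 0);
  // Record 1 starts at offset 0, record 2 at recordLength, and so on.
  return (recordNumber - 1) * recordLength;
}
```

Treating the record number as 0-based would shift every access by one full record, which is the kind of error that shows up when a recently read record is rewritten.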
|
 | flang/runtime/io-api.cpp (diff) |
 | flang/runtime/buffer.h (diff) |
Commit
78a9e62aa6f8f39fe8141e5486fca6db29947ecf
by jasonliu
[XCOFF] Enable -fdata-sections on AIX
Summary: Some design decisions worth noting:
I noticed a recent mailing-list discussion about why string literals are not affected by -fdata-sections for ELF targets: http://lists.llvm.org/pipermail/llvm-dev/2020-September/145121.html
But on AIX, our linker cannot split mergeable strings the way linkers for other targets can. So I think it makes more sense for us to emit a separate csect for every mergeable string in -fdata-sections mode, as there might be no other way for the linker to garbage-collect unused mergeable strings.
Reviewed By: daltenty, hubert.reinterpretcast
Differential Revision: https://reviews.llvm.org/D88339
|
 | llvm/lib/CodeGen/TargetLoweringObjectFileImpl.cpp (diff) |
 | llvm/lib/Target/PowerPC/PPCAsmPrinter.cpp (diff) |
 | llvm/test/CodeGen/PowerPC/aix-xcoff-data-sections.ll |
Commit
a94d943f1a3f42efede7e908bb250c84f9f442b1
by pklausler
[flang] Fix actions at end of output record
It turns out that unformatted fixed-size output records do need to be padded out if short, in order to avoid a spurious EOF crash on a short record at the end of the file. While here in AdvanceRecord(), move the unformatted variable-length record header/footer writing code to here from EndIoStatement().
Differential revision: https://reviews.llvm.org/D88685
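The padding of short fixed-size records described above can be sketched as follows. This is only an illustration: `padRecord` is a hypothetical name, and padding with zero bytes is an assumption made here, not something the commit message states.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical sketch (not flang runtime code): extend a short record to the
// full fixed record length so that the file never ends in a partial record.
// Assumption: the pad byte is zero.
std::vector<char> padRecord(std::vector<char> record, std::size_t recordLength) {
  assert(record.size() <= recordLength && "record longer than the fixed size");
  record.resize(recordLength, '\0');  // appends zero bytes only if short
  return record;
}
```

With every record written at full length, a later read of the last record finds all the bytes it expects instead of hitting end-of-file early.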
|
 | flang/runtime/io-stmt.h (diff) |
 | flang/runtime/io-stmt.cpp (diff) |
 | flang/runtime/unit.cpp (diff) |
Commit
3261aefc72b3769e8b3eccbb67e1145e195ffa8d
by pklausler
[flang] Extend runtime API for PAUSE to allow a stop code
Support integer and default character stop codes on PAUSE statements. Add length argument to STOP statement with a character stop code.
Differential revision: https://reviews.llvm.org/D88692
|
 | flang/runtime/stop.cpp (diff) |
 | flang/runtime/stop.h (diff) |
Commit
c1dcb573a861dc45be6e4cfc598b340c9079fc1f
by clementval
[flang][openacc] Update loop construct lowering
Update the loop construct lowering to support multiple occurrences of the same clause, such as private. Add some utility functions used by other constructs.
Upstreaming part of https://github.com/flang-compiler/f18-llvm-project/pull/438/
Reviewed By: schweitz
Differential Revision: https://reviews.llvm.org/D88253
|
 | flang/lib/Lower/OpenACC.cpp (diff) |
Commit
82453e759c77941cf2281ade79fb9b945b7e9458
by jhuber6
[OpenMP] Add Missing Runtime Call for Globalization Remarks
Summary: Add a missing runtime call to perform data globalization checks.
Reviewers: jdoerfert
Subscribers: guansong hiraditya llvm-commits sstefan1 yaxunl
Tags: #LLVM #OpenMP
Differential Revision: https://reviews.llvm.org/D88621
|
 | llvm/lib/Transforms/IPO/OpenMPOpt.cpp (diff) |
 | llvm/test/Transforms/OpenMP/globalization_remarks.ll (diff) |
Commit
c4690b007743d2f564bc1156fdbdbcaad2adddcc
by esme.yi
[PowerPC] Put the CR field in low bits of GRC during copying CRRC to GRC.
Summary: We currently copy CRRC to GRC with a single MFOCRF, which copies the contents of CR field n (CR bits 4×n+32:4×n+35) into bits 4×n+32:4×n+35 of the GRC register. That is not correct, because the destination register is expected to hold the same value as the source, so the contents of the CR field must be placed in the lowest 4 bits. This patch adds an RLWINM after the MFOCRF to achieve that. The problem came up when adding builtins for xvtdivdp, xvtdivsp, xvtsqrtdp, and xvtsqrtsp, as posted in D88278. We need to move the outputs (in a CR register) to GRC. However, the outputs of these instructions may not be in a fixed CR# register, so we cannot add a rotate instruction directly in the .td patterns and must wait until the CR register is determined. We then confirmed that this is a bug in the post-RA pseudo pass.
Reviewed By: nemanjai, shchenz
Differential Revision: https://reviews.llvm.org/D88274
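The bit arithmetic behind the rotate can be sketched in plain C++. This is a model under stated assumptions, not the patch's code: the function names are hypothetical, only the low 32 bits of the register are modeled, the bits outside the copied field are taken to be zero, and the RLWINM operands (SH = 4n+4, MB = 28, ME = 31) are derived here from the bit positions in the summary rather than quoted from the patch.

```cpp
#include <cassert>
#include <cstdint>

// Rotate a 32-bit value left by sh bits (sh is taken modulo 32).
constexpr std::uint32_t rotl32(std::uint32_t x, unsigned sh) {
  return (sh & 31u) == 0 ? x : (x << (sh & 31u)) | (x >> (32u - (sh & 31u)));
}

// Hypothetical model of the low word after MFOCRF: the 4-bit CR field n sits
// at bits 4n..4n+3 counted from the most significant bit (bit 0).
// Assumption: all other bits are zero in this model.
std::uint32_t mfocrfModel(unsigned n, std::uint32_t field) {
  assert(n < 8 && field < 16);
  return field << (28u - 4u * n);
}

// Hypothetical model of RLWINM with SH = 4n+4, MB = 28, ME = 31: rotate the
// field into the lowest 4 bits and mask everything else off.
std::uint32_t crFieldToLowBits(std::uint32_t gpr, unsigned n) {
  assert(n < 8);
  return rotl32(gpr, 4u * n + 4u) & 0xFu;
}
```

For CR field 7 the rotate amount is 32, which is a rotate by 0: that field already sits in the lowest 4 bits, so only the other fields are actually moved.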
|
 | llvm/lib/Target/PowerPC/PPCInstrInfo.cpp (diff) |
 | llvm/test/CodeGen/PowerPC/htm-ttest.ll (diff) |
 | llvm/lib/Target/PowerPC/PPCInstrHTM.td (diff) |
|
 | llvm/test/CodeGen/AMDGPU/si-lower-control-flow.mir (diff) |
 | llvm/lib/CodeGen/MachineBasicBlock.cpp (diff) |
Commit
a1e97923a025d09934b557ca4343d8e4b5a9973d
by Jason Molenda
Have kernel binary scanner load dSYMs as binary+dSYM if best thing found
lldb's PlatformDarwinKernel scans the local filesystem (well known locations, plus user-specified directories) for kernels and kexts when doing kernel debugging, and loads them automatically. Sometimes kernel developers want to debug with *only* a dSYM, in which case they give lldb the DWARF binary + the dSYM as a binary and symbol file. This patch adds code to lldb to do this automatically if that's the best thing lldb can find.
A few other bits of cleanup in PlatformDarwinKernel that I undertook at the same time:
1. Remove the 'platform.plugin.darwin-kernel.search-locally-for-kexts' setting. When I added the local filesystem index at start of kernel debugging, I thought people might object to the cost of the search and want a way to disable it. No one has.
2. Change the behavior of 'plugin.dynamic-loader.darwin-kernel.load-kexts' setting so it does not disable the local filesystem scan, or use of the local filesystem binaries.
3. Split PlatformDarwinKernel::GetSharedModule into GetSharedModuleKext and GetSharedModuleKernel for easier readability & maintenance.
4. Added accounting of .dSYM.yaa files (an archive format akin to tar) that I come across during the scan. I'm not using these for now; it would be very expensive to expand the archives & see if the UUID matches what I'm searching for.
<rdar://problem/69774993>
Differential Revision: https://reviews.llvm.org/D88632
|
 | lldb/source/Plugins/Platform/MacOSX/PlatformDarwinKernel.h (diff) |
 | lldb/source/Plugins/Platform/MacOSX/PlatformMacOSXProperties.td (diff) |
 | lldb/source/Plugins/Platform/MacOSX/PlatformDarwinKernel.cpp (diff) |
 | lldb/source/Plugins/DynamicLoader/Darwin-Kernel/DynamicLoaderDarwinKernel.cpp (diff) |
Commit
2ef9d21e1a3cf8a58049921c785de1487fbcd7e1
by carl.ritson
[AMDGPU] SIInsertSkips: Tidy block splitting to use splitAt
Convert to use new MachineBasicBlock splitAt function. Place code in splitBlock function for reuse in future changes. Should yield no functional change.
Reviewed By: arsenm
Differential Revision: https://reviews.llvm.org/D88537
|
 | llvm/lib/Target/AMDGPU/SIInsertSkips.cpp (diff) |
Commit
f29645e7afdbb8d1fc2dd603c0b128bac055625c
by listmail
[gvn] Handle a corner case w/vectors of non-integral pointers
If we try to coerce a vector of non-integral pointers to a narrower type (either a narrower vector or a single pointer), we use inttoptr and violate the semantics of non-integral pointers. In theory, we can handle many of these cases; we just need to use a different code idiom to convert without going through inttoptr and back.
This shows up as wrong code bugs, and in some cases, crashes due to failed asserts. Modeled after a change which has lived downstream for a couple years, though completely rewritten to be more idiomatic.
|
 | llvm/lib/Transforms/Utils/VNCoercion.cpp (diff) |
 | llvm/test/Transforms/GVN/non-integral-pointers.ll (diff) |
Commit
b8ac19cf1cca5faec8b4404bb0f666cb63c9e1de
by mkazantsev
[SCEV] Limited support for unsigned preds in isImpliedViaOperations
The logic there only considers `SLT/SGT` predicates. We can use the same logic for proving `ULT/UGT` predicates if all involved values are non-negative.
Adding full-scale support for unsigned predicates might be challenging because of the amount of code involved, so we can consider this in the future.
Differential Revision: https://reviews.llvm.org/D88087
Reviewed By: reames
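The reason signed logic carries over to unsigned predicates can be checked on small integers: when both operands are non-negative, the signed and unsigned orderings coincide. A minimal sketch, not the SCEV code itself; both function names are hypothetical and 8-bit values are used only so the check can be exhaustive.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper: answers an unsigned "less than" query with a signed
// comparison. Only valid when both operands are known to be non-negative.
bool unsignedLessViaSigned(std::int8_t a, std::int8_t b) {
  assert(a >= 0 && b >= 0 && "only valid for non-negative operands");
  return a < b;
}

// Exhaustively compares the helper against a real unsigned comparison over
// all pairs of non-negative 8-bit values.
bool agreesOnAllNonNegativePairs() {
  for (int a = 0; a <= INT8_MAX; ++a) {
    for (int b = 0; b <= INT8_MAX; ++b) {
      const auto sa = static_cast<std::int8_t>(a);
      const auto sb = static_cast<std::int8_t>(b);
      const bool unsignedLess =
          static_cast<std::uint8_t>(sa) < static_cast<std::uint8_t>(sb);
      if (unsignedLessViaSigned(sa, sb) != unsignedLess)
        return false;
    }
  }
  return true;
}
```

The non-negativity condition is essential: -1 is less than 0 as a signed value, but its 8-bit pattern 0xFF is greater than 0 as an unsigned value.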
|
 | llvm/lib/Analysis/ScalarEvolution.cpp (diff) |
 | llvm/unittests/Analysis/ScalarEvolutionTest.cpp (diff) |
Commit
47df8c57e4ed01fa0101aa0b320fc7cf5a90df28
by stephen.neuendorffer
[MLIR] Updates around MemRef Normalization
The documentation for the NormalizeMemRefs pass and the associated MemRefsNormalizable traits was confusing and not on the website. This update clarifies the language around the difference between a MemRef type and an operation that accesses a value of MemRef type, and better documents the limitations of the current implementation. This patch also includes some basic debugging information for the pass so people might have a chance of figuring out why it doesn't work on their code.
Differential Revision: https://reviews.llvm.org/D88532
|
 | mlir/lib/Transforms/NormalizeMemRefs.cpp (diff) |
 | mlir/docs/Traits.md (diff) |
 | mlir/include/mlir/IR/OpDefinition.h (diff) |
 | mlir/include/mlir/Transforms/Passes.td (diff) |