|
 | llvm/lib/ExecutionEngine/Orc/TargetProcess/SimpleExecutorMemoryManager.cpp (diff) |
|
 | compiler-rt/lib/profile/InstrProfilingPlatformLinux.c (diff) |
Commit
e9d1a679a1c9cb309aea8c5d944e55865d38b867
by hoy[CSSPGO] Do not pass -fpseudo-probe-for-profiling to the linker.
The correponding linker switch has been removed by https://reviews.llvm.org/D110209, so do not pass it in clang.
Reviewed By: wenlei
Differential Revision: https://reviews.llvm.org/D110371
|
 | clang/test/Driver/pseudo-probe-lto.c |
 | clang/lib/Driver/ToolChains/CommonArgs.cpp (diff) |
|
 | llvm/test/CodeGen/RISCV/addimm-mulimm.ll (diff) |
 | llvm/lib/Target/RISCV/RISCVISelLowering.cpp (diff) |
|
 | llvm/test/Analysis/CostModel/AMDGPU/add-sub.ll (diff) |
 | llvm/test/Analysis/CostModel/AMDGPU/control-flow.ll (diff) |
 | llvm/test/Analysis/CostModel/AMDGPU/fabs.ll (diff) |
 | llvm/test/Analysis/CostModel/AMDGPU/fadd.ll (diff) |
 | llvm/test/Analysis/CostModel/AMDGPU/reduce-and.ll (diff) |
 | llvm/test/Analysis/CostModel/AMDGPU/addrspacecast.ll (diff) |
 | llvm/test/Analysis/CostModel/AMDGPU/shufflevector.ll (diff) |
 | llvm/test/Analysis/CostModel/AMDGPU/fsub.ll (diff) |
 | llvm/test/Analysis/CostModel/AMDGPU/fdiv.ll (diff) |
 | llvm/test/Analysis/CostModel/AMDGPU/shifts.ll (diff) |
 | llvm/test/Analysis/CostModel/AMDGPU/fma.ll (diff) |
 | llvm/test/Analysis/CostModel/AMDGPU/logicalop.ll (diff) |
 | llvm/test/Analysis/CostModel/AMDGPU/fmul.ll (diff) |
 | llvm/test/Analysis/CostModel/AMDGPU/fneg.ll (diff) |
 | llvm/test/Analysis/CostModel/AMDGPU/reduce-or.ll (diff) |
 | llvm/test/Analysis/CostModel/AMDGPU/extractelement.ll (diff) |
 | llvm/test/Analysis/CostModel/AMDGPU/bit-ops.ll (diff) |
 | llvm/test/Analysis/CostModel/AMDGPU/fused_costs.ll (diff) |
 | llvm/test/Analysis/CostModel/AMDGPU/insertelement.ll (diff) |
 | llvm/test/Analysis/CostModel/AMDGPU/mul.ll (diff) |
Commit
7647a8413be55568a8a80fae379a872b7359f5b5
by iFix -fno-unwind-tables -fasynchronous-unwind-tables to emit unwind tables
This matches GCC.
Change the CC1 option to encode the unwind table level (1: needed by exceptions, 2: asynchronous) so that we can support two modes in the future.
|
 | clang/test/CodeGen/asan-globals.cpp (diff) |
 | clang/test/Driver/fuchsia.c (diff) |
 | clang/test/CodeGenCXX/linetable-eh.cpp (diff) |
 | clang/test/CodeGenCXX/thunks-ehspec.cpp (diff) |
 | clang/test/CodeGenCXX/thunks.cpp (diff) |
 | clang/test/CodeGenCXX/exceptions-seh-filter-uwtable.cpp (diff) |
 | clang/include/clang/Driver/Options.td (diff) |
 | clang/test/Driver/sanitize_unwind_tables.c (diff) |
 | clang/test/Driver/aarch64-features.c (diff) |
 | clang/test/Driver/ppc-features.cpp (diff) |
 | clang/lib/Driver/ToolChains/Clang.cpp (diff) |
 | clang/test/Driver/clang-translation.c (diff) |
 | clang/test/Driver/win-macho-unwind.c (diff) |
 | clang/test/Driver/freebsd.c (diff) |
 | clang/test/Driver/windows-exceptions.cpp (diff) |
 | clang/test/Preprocessor/unwind-tables.c (diff) |
 | clang/include/clang/Basic/CodeGenOptions.def (diff) |
Commit
76d845cb169f048cb6f2176c3e7a6534dc5af097
by nemanja.i.ibm[clang-format] Fix unittest failures with -Werror
Commit a44ab1702539 added a unit test that fails to build with -Werror which causes build bot breaks on bots that include that option in their build. This patch just adds the necessary casts to silence the warnings.
|
 | clang/unittests/Format/QualifierFixerTest.cpp (diff) |
Commit
c965fde7c234abbb311ab9f58e8684970a69c916
by Lang Hames[ORC] Shut down services in SimpleRemoteEPCServer.
This should have been included with ExecutorBootstrapService in 78b083dbb72, but was accidentally left out. It give services a chance to release any resources that they have acquired.
|
 | llvm/lib/ExecutionEngine/Orc/TargetProcess/SimpleRemoteEPCServer.cpp (diff) |
Commit
c579c658cd42034449d4fa19f28b43f2082c0991
by leonardchan[compiler-rt][profile] Make corrupted-profile.c more robust
This test specifically checks that profiles are not mergeable if there's a change in the CounterPtr in the profile header. The test manually changes CounterPtr by explicitly calling memset on some offset into the profile file. This test would fail if binary IDs were emitted because the offset calculation does not take into account the binary ID sizes.
This patch updates the test to use types provided in profile/InstrProfData.inc to make it more resistant to profile layout changes.
Differential Revision: https://reviews.llvm.org/D110277
|
 | compiler-rt/test/profile/Linux/corrupted-profile.c (diff) |
Commit
2c1defeee40cf643ea6f0fa5e01164c9a4c48c30
by tejohnson[ThinLTO] Don't emit original GUID for locals to distributed indexes
In ThinLTO for locals we normally compute the GUID from the name after prepending the source path to get a unique global id. SamplePGO indirect call profiles contain the target GUID without this uniquification, however (unless compiling with -funique-internal-linkage-names). Therefore, the index contains the original GUID of the local symbols (without module path prepended to uniquify), in order to correctly handle the call edges added for these indirect call profile targets with SamplePGO.
We were emitting these to the combined index when writing it out as bitcode, which is unnecessary and causes overhead when writing out the indexes for distributed backends. The only use of the original GUID name is in the thin link. Suppress it in that case. This reduced the thin link time for a large distributed build by about 7%, and the aggregate size of the serialized indexes by over 2%.
Continue to print it when writing out the full index, since that is just used for debugging and testing.
Update a distributed thinlto index test to contain a local and ensure that we don't get a COMBINED_ORIGINAL_NAME record.
Differential Revision: https://reviews.llvm.org/D110296
|
 | llvm/test/ThinLTO/X86/distributed_indexes.ll (diff) |
 | llvm/lib/Bitcode/Writer/BitcodeWriter.cpp (diff) |
Commit
7da4ee2df088d39c7ca6531d80172af7d973bb67
by tejohnson[ThinLTO] Fix bot failures
Fix bot failures after 2c1defeee40cf643ea6f0fa5e01164c9a4c48c30. The new GUID I added isn't matching because it is a local with the source path prepended. There isn't much use in matching the GUID's exactly anyway, so remove those from the patterns.
|
 | llvm/test/ThinLTO/X86/distributed_indexes.ll (diff) |
Commit
8dc16ba8d2b429261dd95e88496b2a866dc18ae5
by springerm[mlir][linalg] Merge all tiling passes into a single one.
Passes such as `linalg-tile-to-tiled-loop` are merged into `linalg-tile`.
Differential Revision: https://reviews.llvm.org/D110214
|
 | mlir/include/mlir/Dialect/Linalg/Transforms/Transforms.h (diff) |
 | mlir/test/Dialect/Linalg/tile-conv.mlir (diff) |
 | mlir/include/mlir/Dialect/Linalg/Passes.h (diff) |
 | mlir/test/Dialect/Linalg/tile-tensors.mlir (diff) |
 | mlir/test/Dialect/Linalg/tile-pad-tensor-op.mlir (diff) |
 | mlir/test/Integration/Dialect/Linalg/CPU/test-conv-3d-ndhwc-dhwcf-call.mlir (diff) |
 | mlir/include/mlir/Dialect/Linalg/Passes.td (diff) |
 | mlir/test/Dialect/Linalg/tile-conv-padding.mlir (diff) |
 | mlir/test/Integration/Dialect/Linalg/CPU/test-conv-3d-call.mlir (diff) |
 | mlir/test/Integration/Dialect/Linalg/CPU/test-conv-2d-call.mlir (diff) |
 | mlir/test/Dialect/Linalg/tile-simple-conv.mlir (diff) |
 | mlir/test/Dialect/Linalg/tile-parallel.mlir (diff) |
 | mlir/test/Integration/Dialect/Linalg/CPU/test-conv-1d-nwc-wcf-call.mlir (diff) |
 | mlir/test/Integration/Dialect/Linalg/CPU/test-conv-2d-nhwc-hwcf-call.mlir (diff) |
 | mlir/test/Integration/Dialect/Linalg/CPU/test-conv-1d-call.mlir (diff) |
 | mlir/include/mlir/Dialect/Linalg/Utils/Utils.h (diff) |
 | mlir/lib/Dialect/Linalg/Transforms/Tiling.cpp (diff) |
 | mlir/test/Integration/Dialect/Linalg/CPU/test-tensor-matmul.mlir (diff) |
 | mlir/test/Dialect/Linalg/tile-parallel-reduce.mlir (diff) |
 | mlir/test/Dialect/Linalg/tile-indexed.mlir (diff) |
 | mlir/test/Dialect/Linalg/tile.mlir (diff) |
|
 | mlir/test/Dialect/Linalg/tile-and-peel-tensors.mlir (diff) |
 | mlir/test/lib/Dialect/Linalg/TestLinalgTransforms.cpp (diff) |
 | mlir/lib/Dialect/Linalg/Transforms/Transforms.cpp (diff) |
Commit
83f3c615dde3fce5c0560c19316b08c1e6aa8c27
by joker.ephAdd missing storageType to AttrDef to ODS
This is only noticeable when using an attribute across dialects I think. Previously the namespace would be ommited, but it wouldn't matter as long as the generated code stays within a single namespace.
Differential Revision: https://reviews.llvm.org/D110367
|
 | mlir/test/mlir-tblgen/op-attribute.td (diff) |
 | mlir/include/mlir/IR/OpBase.td (diff) |
Commit
e470f9268a448fedea25289ec343f82ff52ccc36
by llvm-project[Polly] Implement user-directed loop distribution/fission.
This is a simple version without the possibility to define distribute points or followup-transformations. However, it is the first transformation that has to check whether the transformation is correct.
It interprets the same metadata as the LoopDistribute pass.
Re-apply after revert in c7bcd72a38bcf99e03e4651ed5204d1a1f2bf695 with fix: Take isBand out of #ifndef NDEBUG since it now is used unconditionally.
|
 | polly/lib/Analysis/DependenceInfo.cpp (diff) |
 | polly/lib/Transform/ScheduleOptimizer.cpp (diff) |
 | polly/include/polly/DependenceInfo.h (diff) |
 | polly/test/ScheduleOptimizer/ManualOptimization/distribute_illegal_looploc.ll |
 | polly/lib/Transform/ScheduleTreeTransform.cpp (diff) |
 | polly/include/polly/ScheduleTreeTransform.h (diff) |
 | polly/test/ScheduleOptimizer/ManualOptimization/distribute_illegal_pragmaloc.ll |
 | polly/include/polly/ManualOptimizer.h (diff) |
 | polly/lib/Transform/ManualOptimizer.cpp (diff) |
 | polly/test/ScheduleOptimizer/ManualOptimization/distribute_heuristic.ll |
|
 | clang/lib/Driver/ToolChains/Gnu.cpp (diff) |
 | clang/test/Driver/clang-translation.c (diff) |
Commit
7a62a5b56d670c4e152159740cd7fc4030a9470f
by Christudasan.Devadasan[AMDGPU] Legalize initialized LDS variables
We don't allow an initializer for LDS variables and there is an early abort during instruction selection. This patch legalizes them by ignoring the init values. During assembly emission, proper error reporting already exists for such instances.
Reviewed By: arsenm
Differential Revision: https://reviews.llvm.org/D109901
|
 | llvm/lib/Target/AMDGPU/AMDGPULegalizerInfo.cpp (diff) |
 | llvm/test/CodeGen/AMDGPU/lds-zero-initializer.ll (diff) |
 | llvm/lib/Target/AMDGPU/AMDGPUISelLowering.cpp (diff) |
 | llvm/test/CodeGen/AMDGPU/GlobalISel/lds-zero-initializer.ll (diff) |
Commit
25ac0d3c73d68c017546eb622ba7632c6b581bfb
by dblaikieDebugInfo: Implement the -gsimple-template-names functionality
This excludes certain names that can't be rebuilt from the available DWARF:
* Atomic types - no DWARF differentiating int from atomic int. * Vector types - enough DWARF (an attribute on the array type) to do this, but I haven't written the extra code to add the attributes required for this * Lambdas - ambiguous with any other unnamed class * Unnamed classes/enums - would need column info for the type in addition to file/line number * noexcept function types - not encoded in DWARF
|
 | clang/lib/CodeGen/CGDebugInfo.cpp (diff) |
 | clang/test/CodeGenCXX/debug-info-simple-template-names.cpp |
Commit
a2c1cf09dfaaa6d2161fee00f8317005bf955d64
by Lang Hames[ORC] Introduce EPCGenericDylibManager / SimpleExecutorDylibManager.
EPCGenericDylibManager provides an interface for loading dylibs and looking up symbols in the executor, implemented using EPC-calls to functions in the executor.
SimpleExecutorDylibManager is an executor-side service that provides the functions used by EPCGenericDylibManager.
SimpleRemoteEPC is updated to use an EPCGenericDylibManager instance to implement the ExecutorProcessControl loadDylib and lookup methods. In a future commit these methods will be removed, and clients updated to use EPCGenericDylibManagers directly.
|
 | llvm/include/llvm/ExecutionEngine/Orc/Shared/OrcRTBridge.h (diff) |
 | llvm/include/llvm/ExecutionEngine/Orc/TargetProcess/SimpleRemoteEPCServer.h (diff) |
 | llvm/include/llvm/ExecutionEngine/Orc/TargetProcess/SimpleExecutorDylibManager.h |
 | llvm/lib/ExecutionEngine/Orc/TargetProcess/CMakeLists.txt (diff) |
 | llvm/lib/ExecutionEngine/Orc/Shared/OrcRTBridge.cpp (diff) |
 | llvm/include/llvm/ExecutionEngine/Orc/EPCGenericDylibManager.h |
 | llvm/lib/ExecutionEngine/Orc/EPCGenericDylibManager.cpp |
 | llvm/lib/ExecutionEngine/Orc/TargetProcess/SimpleExecutorDylibManager.cpp |
 | llvm/lib/ExecutionEngine/Orc/SimpleRemoteEPC.cpp (diff) |
 | llvm/lib/ExecutionEngine/Orc/TargetProcess/SimpleRemoteEPCServer.cpp (diff) |
 | llvm/include/llvm/ExecutionEngine/Orc/SimpleRemoteEPC.h (diff) |
 | llvm/lib/ExecutionEngine/Orc/CMakeLists.txt (diff) |
|
 | llvm/utils/gn/secondary/llvm/lib/ExecutionEngine/Orc/TargetProcess/BUILD.gn (diff) |
 | llvm/utils/gn/secondary/llvm/lib/ExecutionEngine/Orc/BUILD.gn (diff) |
Commit
40ddde5d1fa7e5eadb76f6c3cc37dae2f80a8ca2
by Christudasan.Devadasan[TableGen] Allow targets to entirely ignore Psets for registers
Tablegen currently expects targets to have at least one pressure set for every broader register category. AMDGPU's VGPR or AGPR, for instance, seemed to work correctly without any pset, though we have forced one for each type to avoid the assertion in computeRegUnitSets. However, psets can not be entirely empty. At least one set is mandatory for every target. This patch bypasses the assertion for the classes when GeneratePressureSet is zero while ensuring the RegUnitSets are not empty.
Reviewed By: arsenm, rampitec
Differential Revision: https://reviews.llvm.org/D110305
|
 | llvm/test/TableGen/empty-psets.td |
 | llvm/test/TableGen/bare-minimum-psets.td |
 | llvm/utils/TableGen/CodeGenRegisters.cpp (diff) |
|
 | compiler-rt/lib/sanitizer_common/sanitizer_common_interceptors.inc (diff) |
 | compiler-rt/test/sanitizer_common/TestCases/Linux/ttyent.cpp |
 | compiler-rt/lib/sanitizer_common/sanitizer_platform_limits_freebsd.cpp (diff) |
 | compiler-rt/lib/sanitizer_common/sanitizer_platform_limits_freebsd.h (diff) |
 | compiler-rt/test/sanitizer_common/TestCases/NetBSD/ttyent.cpp |
 | compiler-rt/lib/sanitizer_common/sanitizer_platform_interceptors.h (diff) |
|
 | llvm/docs/CommandGuide/opt.rst (diff) |
Commit
ef391df2b63320c9aec61045d8b96917081f0cb2
by Lang Hames[ORC] Rename ExecutorAddress to ExecutorAddr.
Removing the 'ess' suffix improves the ergonomics without sacrificing clarity. Since this class is likely to be used more frequently in the future it's worth some short term pain to fix this now.
|
 | llvm/include/llvm/ExecutionEngine/Orc/EPCGenericJITLinkMemoryManager.h (diff) |
 | llvm/lib/ExecutionEngine/Orc/TargetProcess/SimpleRemoteEPCServer.cpp (diff) |
 | llvm/lib/ExecutionEngine/Orc/TargetProcess/OrcRTBootstrap.cpp (diff) |
 | llvm/unittests/ExecutionEngine/Orc/LookupAndRecordAddrsTest.cpp (diff) |
 | llvm/include/llvm/ExecutionEngine/Orc/EPCGenericDylibManager.h (diff) |
 | llvm/unittests/ExecutionEngine/Orc/WrapperFunctionUtilsTest.cpp (diff) |
 | llvm/lib/ExecutionEngine/Orc/MachOPlatform.cpp (diff) |
 | llvm/lib/ExecutionEngine/Orc/EPCGenericDylibManager.cpp (diff) |
 | llvm/include/llvm/ExecutionEngine/Orc/OrcRPCExecutorProcessControl.h (diff) |
 | llvm/include/llvm/ExecutionEngine/Orc/Shared/ExecutorAddress.h (diff) |
 | llvm/include/llvm/ExecutionEngine/Orc/EPCGenericMemoryAccess.h (diff) |
 | llvm/include/llvm/ExecutionEngine/Orc/TargetProcess/SimpleExecutorDylibManager.h (diff) |
 | llvm/include/llvm/ExecutionEngine/Orc/Shared/OrcRTBridge.h (diff) |
 | llvm/lib/ExecutionEngine/Orc/EPCGenericJITLinkMemoryManager.cpp (diff) |
 | llvm/lib/ExecutionEngine/Orc/TargetProcess/SimpleExecutorMemoryManager.cpp (diff) |
 | llvm/include/llvm/ExecutionEngine/Orc/TargetProcess/SimpleExecutorMemoryManager.h (diff) |
 | llvm/unittests/ExecutionEngine/Orc/SimpleExecutorMemoryManagerTest.cpp (diff) |
 | llvm/include/llvm/ExecutionEngine/Orc/LookupAndRecordAddrs.h (diff) |
 | llvm/lib/ExecutionEngine/Orc/EPCEHFrameRegistrar.cpp (diff) |
 | llvm/lib/ExecutionEngine/Orc/LookupAndRecordAddrs.cpp (diff) |
 | llvm/include/llvm/ExecutionEngine/Orc/ELFNixPlatform.h (diff) |
 | llvm/include/llvm/ExecutionEngine/Orc/TargetProcess/SimpleRemoteEPCServer.h (diff) |
 | llvm/lib/ExecutionEngine/Orc/Shared/SimpleRemoteEPCUtils.cpp (diff) |
 | llvm/include/llvm/ExecutionEngine/Orc/Shared/WrapperFunctionUtils.h (diff) |
 | llvm/lib/ExecutionEngine/Orc/ExecutorProcessControl.cpp (diff) |
 | llvm/lib/ExecutionEngine/Orc/TargetProcess/OrcRTBootstrap.h (diff) |
 | llvm/lib/ExecutionEngine/Orc/TargetProcess/JITLoaderGDB.cpp (diff) |
 | llvm/include/llvm/ExecutionEngine/Orc/MachOPlatform.h (diff) |
 | llvm/lib/ExecutionEngine/Orc/ELFNixPlatform.cpp (diff) |
 | llvm/include/llvm/ExecutionEngine/Orc/Shared/SimpleRemoteEPCUtils.h (diff) |
 | llvm/include/llvm/ExecutionEngine/Orc/Shared/TargetProcessControlTypes.h (diff) |
 | llvm/unittests/ExecutionEngine/Orc/EPCGenericMemoryAccessTest.cpp (diff) |
 | llvm/include/llvm/ExecutionEngine/Orc/SimpleRemoteEPC.h (diff) |
 | llvm/unittests/ExecutionEngine/Orc/EPCGenericJITLinkMemoryManagerTest.cpp (diff) |
 | llvm/lib/ExecutionEngine/Orc/TargetProcess/SimpleExecutorDylibManager.cpp (diff) |
 | llvm/include/llvm/ExecutionEngine/Orc/ExecutorProcessControl.h (diff) |
 | llvm/include/llvm/ExecutionEngine/Orc/TargetProcess/ExecutorBootstrapService.h (diff) |
 | llvm/include/llvm/ExecutionEngine/Orc/Shared/SimplePackedSerialization.h (diff) |
 | llvm/lib/ExecutionEngine/Orc/TargetProcess/RegisterEHFrames.cpp (diff) |
 | llvm/lib/ExecutionEngine/Orc/SimpleRemoteEPC.cpp (diff) |
 | llvm/lib/ExecutionEngine/Orc/EPCDebugObjectRegistrar.cpp (diff) |
|
 | clang/test/CodeGenCXX/debug-info-simple-template-names.cpp (diff) |
|
 | clang/test/CodeGenCXX/debug-info-simple-template-names.cpp (diff) |
 | clang/lib/CodeGen/CGDebugInfo.cpp (diff) |
|
 | clang/test/CodeGenCXX/debug-info-simple-template-names.cpp (diff) |
Commit
840afbde48e90213028174fdba715e10aa39dd85
by albionapc[PowerPC] SemaChecking for darn family of builtins
The __darn family of builtins are only available on Pwr9, and only __darn_32 is available on both 64 and 32 bit, while the rest are only available on 64 bit. The patch adds sema checking for these builtins and separate the __darn_32's 32 bit test cases.
Differential revision: https://reviews.llvm.org/D110282
|
 | clang/test/CodeGen/builtins-ppc-xlcompat-error.c (diff) |
 | clang/test/CodeGen/builtins-ppc-xlcompat-darn.c (diff) |
 | clang/lib/Sema/SemaChecking.cpp (diff) |
 | clang/test/CodeGen/builtins-ppc.c (diff) |
Commit
0820fbab99a1f1986a8c6e0a8e5cb4542ec52515
by Lang Hames[ORC-RT] Rename ExecutorAddress to ExecutorAddr.
This is an ORC-runtime counterpart to LLVM commit ef391df2b63, and the motivation is the same: to move to a shorter name to improve the ergonomics of this type before it's more widely adopted.
|
 | compiler-rt/lib/orc/simple_packed_serialization.h (diff) |
 | compiler-rt/lib/orc/macho_platform.cpp (diff) |
 | compiler-rt/lib/orc/elfnix_platform.h (diff) |
 | compiler-rt/lib/orc/macho_platform.h (diff) |
 | compiler-rt/lib/orc/elfnix_platform.cpp (diff) |
 | compiler-rt/lib/orc/executor_address.h (diff) |
Commit
661ab70314008b609dda792e2c9dc57afa1f8864
by Amara Emerson[AArch64][GlobalISel] Fix crash in the extend(extract_vector_elt) optimization.
It was assuming that GPR extends could only have destination sizes of 32 or 64 bits, but for AArch64 we allow < 32 bits even without matching size physregs.
|
 | llvm/test/CodeGen/AArch64/GlobalISel/select-extract-vector-elt-with-extend.mir (diff) |
 | llvm/lib/Target/AArch64/GISel/AArch64InstructionSelector.cpp (diff) |
|
 | clang/docs/ClangOffloadBundler.rst (diff) |