Commit
67089a39a23b6ff4d1e2c16502cdf627cb56e6fc
by listmail[LSR] Regen tests to adjust for naming in SCEVExpander [nfc]
|
 | llvm/test/Transforms/LoopStrengthReduce/lsr-term-fold.ll |
 | llvm/test/Transforms/LoopStrengthReduce/lsr-term-fold-negative-testcase.ll |
Commit
edc03550063ce1c39bb47bf94937cf036359b487
by jhuber6[Libomptarget] Add missing explicit moves on llvm::Error
Summary: Some older compilers, which we still support, have problems handling the copy elision that allows us to directly move an `Error` to an `Expected`. This patch adds explicit moves to remove the error.
|
 | openmp/libomptarget/plugins-nextgen/amdgpu/src/rtl.cpp |
 | openmp/libomptarget/plugins-nextgen/common/PluginInterface/PluginInterface.cpp |
Commit
ad9f751a6e9085d45d7c33877f4a5185463c617f
by jhuber6[Libomptarget] Add missing explicit moves on llvm::Error
Summary: Some older compilers, which we still support, have problems handling the copy elision that allows us to directly move an `Error` to an `Expected`. This patch adds explicit moves to remove the error. Same as last patch but I forgot this one.
|
 | openmp/libomptarget/plugins-nextgen/amdgpu/src/rtl.cpp |
Commit
34f1da3468ab70b182072ddbe9e2784b3bde9923
by asb[RISCV] Make Zfh PseudoQuietFCMP definitions predicated on HasStdExtZfh
These are currently in a `Predicates = [HasStdExtZfhOrZfhmin]` block, but Zfhmin has no fcmp instructions so the definition makes no sense for Zfhmin.
Differential Revision: https://reviews.llvm.org/D146435
|
 | llvm/lib/Target/RISCV/RISCVInstrInfoZfh.td |
Commit
dfb40d3fd7a261d7e3f697242df4680f1e6780ff
by tejohnson[SimpleLoopUnswitch] Skip non-trivial unswitching of cold loop nests
This fixes a compile time issue due to guarding loop unswitching based on whether the enclosing function is cold. That approach is very inefficient in the case of large cold functions that contain numerous loops, since the loop pass calls isFunctionColdInCallGraph once per loop, and that function walks all BBs in the function (twice for Sample PGO) looking for any non-cold blocks.
Originally, this code only checked if the current Loop's header was cold (D129599). However, that apparently caused a slowdown on a SPEC benchmark, and the example given was that of a cold inner loop nested in a non-cold outer loop (see comments in D129599). The fix was to check if the whole function is cold, done in D133275.
This is overkill, and we can simply check if the header of any loop in the current loop's loop nest is non-cold (looking at both outer and inner loops). This patch drops the compile time for a large module by 40% with this approach.
I also updated PGO-nontrivial-unswitch2.ll since it only had one cold loop in a non-cold function, so that it instead had IR based off the example given in the comments relating to the SPEC degradation in D129599. I confirmed that the new version of the test fails with the original check done in D129599 of only the current loop's header coldness.
Similarly updated test PGO-nontrivial-unswitch.ll to contain a cold loop in a cold loop nest, and created PGO-nontrivial-unswitch3.ll to contain a non-cold loop in a non-cold loop nest.
Differential Revision: https://reviews.llvm.org/D146383
|
 | llvm/test/Transforms/SimpleLoopUnswitch/PGO-nontrivial-unswitch3.ll |
 | llvm/test/Transforms/SimpleLoopUnswitch/PGO-nontrivial-unswitch2.ll |
 | llvm/lib/Transforms/Scalar/SimpleLoopUnswitch.cpp |
 | llvm/test/Transforms/SimpleLoopUnswitch/PGO-nontrivial-unswitch.ll |
Commit
4e298c32d558cbe7059bb90e4306c22c6702016e
by mcgrathr[libc] Make string tests compatible with the Fuchsia build
Some test code was doing loose conversions caught by compiler warnings in the Fuchsia build. This included duplicated code in a few tests that was reconsolidated with the existing header file copy of the same functions.
The MemoryMatcher abstraction presumes gtest-style matcher support, which is not available in Fuchsia's zxtest library. It's avoided in favor of simpler memory-comparing assertions.
Reviewed By: abrachet
Differential Revision: https://reviews.llvm.org/D146343
|
 | libc/test/src/string/strsignal_test.cpp |
 | libc/test/src/string/bcopy_test.cpp |
 | libc/test/src/string/memmove_test.cpp |
 | libc/test/UnitTest/MemoryMatcher.h |
 | libc/test/src/string/memory_utils/memory_check_utils.h |
Commit
e4f62da8129d9632d77fd4db5cf8caede0a1e5a6
by Jonas Devlieghere[lldb] Sidestep -Wformat warning by using LLDB_LOG instead of LLDB_LOGF
Fixes warning: format specifies type 'unsigned long' but the argument has type 'DataType' (aka 'unsigned long long') [-Wformat]
|
 | lldb/source/Plugins/SymbolFile/DWARF/DWARFDebugAranges.cpp |
Commit
cd11f55a0c6d8524f9ac494d26514d4d72d83435
by gribozavr[clang][dataflow] Fix indentation in a test
|
 | clang/unittests/Analysis/FlowSensitive/DataflowEnvironmentTest.cpp |
Commit
72073fc95cd4793a853925ddc8cc3fb2118808a5
by john.brawn[Serialization] Place command line defines in the correct file
Fix several problems related to serialization causing command line defines to be reported as being built-in defines: * When serializing the <built-in> and <command line> files don't convert them into absolute paths. * When deserializing SM_SLOC_BUFFER_ENTRY we need to call setHasLineDirectives in the same way as we do for SM_SLOC_FILE_ENTRY. * When created suggested predefines based on the current command line options we need to add line markers in the same way that InitializePreprocessor does. * Adjust a place in clangd where it was implicitly relying on command line defines being treated as builtin.
Differential Revision: https://reviews.llvm.org/D144651
|
 | clang-tools-extra/clangd/index/SymbolCollector.cpp |
 | clang/lib/Serialization/ASTReader.cpp |
 | clang/lib/Serialization/ASTWriter.cpp |
 | clang/test/PCH/ms-pch-macro.c |
 | clang/docs/ReleaseNotes.rst |
 | clang/test/PCH/macro-cmdline.c |
Commit
b9521484ec72142bce8124a171206fc577abfd3f
by listmail[LSR] Rewrite IV match for term-fold using existing utilities
Main benefit here is making the logic easier to follow, slightly more efficient, and more in line with LFTR. This is not NFC. There are three semantic changes here.
First, we drop handling for constants on the LHS of the comparison. These are non-canonical, and we're very late in the optimization pipeline here, so there's no point in supporting this. I removed a test which covered this case.
Second, we don't need the almost dead IV to be an addrec. We just need SCEV to be able to compute a trip count for it.
Third, we require a simple IV for the almost dead IV. In theory, this removes cases we could have previously handled, but given a) zero testing and b) multiple known correctness issues, I'm adopting an attidute of narrowing this down to something which works correctly, and *then* expanding.
|
 | llvm/lib/Transforms/Scalar/LoopStrengthReduce.cpp |
 | llvm/test/Transforms/LoopStrengthReduce/lsr-term-fold.ll |
 | llvm/test/Transforms/LoopStrengthReduce/lsr-term-fold-negative-testcase.ll |
Commit
c47da7f109468efbe77c27d436fd69fa2b3ad991
by Alex Langford[lldb] Introduce CMake variable LLDB_ENFORCE_STRICT_TEST_REQUIREMENTS
The goal of this patch is to add the ability for the CMake configure to fail when some optional test dependencies are not met. LLDB tries to be flexible when test dependencies are not present but there are cases where it would be useful to know that these dependencies are missing before we run the test suite.
The intent here is to apply this setting on CI machines and make sure that they have useful optional dependencies installed. We recently hit a case where some CI machines were timing out while running the test suite because a few tests were hanging. With this option, we'll be able to know if the machine does not have psutil installed so we can install it and avoid the timeout scenario altogether.
rdar://103194447
Differential Revision: https://reviews.llvm.org/D146335
|
 | lldb/test/CMakeLists.txt |
 | lldb/cmake/modules/LLDBConfig.cmake |
 | lldb/cmake/modules/AddLLDB.cmake |
Commit
60b117aa81493e4f91df11ec2322957ec7800f55
by maryammo[PowerPC] Adding test coverage for vector compatibility warning
This is to test D143210 patch to have the same vector compatibility logic for error and warning diagnostics.
Reviewed By: lei
Differential Revision: https://reviews.llvm.org/D144611
|
 | clang/test/Parser/lax-conv.cpp |
Commit
6ac632ad83fb0f1f91cfca2d05e074712aebce79
by prabhdeep.singh.soni3[MLIR][Linalg] Generate unique LibraryCallName for LinalgOps.
When lowering LinalgToStandard for named UnaryFn/BinaryFn ops, ensure the fun name appears in the generated library name. Further, for linalg.copy to/from different address spaces, ensure the to/from address spaces are appended onto the library name for uniqueness. This fixes the lowering error with the linalg.copy testcase shown in this patch.
Reviewed By: ftynse
Differential Revision: https://reviews.llvm.org/D145467
|
 | mlir/lib/Dialect/Linalg/IR/LinalgOps.cpp |
 | mlir/test/Dialect/Linalg/library-calls.mlir |
Commit
5259da75b9352592cd12fc2c6b6b875567989867
by michaelfrancis[AIX][Clang] Respect -r when invoking the linker
On AIX, libraries are still being linked when `-r` is passed to the driver. This patch corrects this error.
Differential Revision: https://reviews.llvm.org/D145899
|
 | clang/test/Driver/aix-ld.c |
 | clang/lib/Driver/ToolChains/AIX.cpp |
Commit
fb3f6a95393f33bc8d8550a5ac62c18e488a9b6f
by kadircetRevert "[clangd] Fix AddUsing in the face of typo-correction"
This reverts commit 6f23fee4ef98a695062aa128a177478ba7d742d4. Breaks windows buildbots
|
 | clang-tools-extra/clangd/refactor/tweaks/AddUsing.cpp |
 | clang-tools-extra/clangd/unittests/tweaks/AddUsingTests.cpp |
Commit
427136dc355a2c70e7aae96460944816b4326111
by a.bataev[SLP][NFC]Add a test with missed buildvector node, matching the vectorized node.
|
 | llvm/test/Transforms/SLPVectorizer/X86/reused-buildvector-matching-vectorized-node.ll |
Commit
0c0468e6df2bcabd207858891c2387357857b0bc
by flo[InstCombine] Add test for issue fixed by 50fe87a5c8597e.
Extra test that was fixed by 50fe87a5c8597e to make sure it doesn't regress again.
|
 | llvm/test/Transforms/InstCombine/sincospi.ll |
Commit
d1e00b6f136ec71a4c95a7eb4fd81ec0ab547962
by Jan Svoboda[clang][deps] Only cache files with specific extension
In the scanner's VFS, we cache all files by default and only avoid caching stat failures for certain files. This tanks the performance of scanning with pre-populated module cache. When there is a stale PCM file, it gets cached by the scanner at the start and the rebuilt version never makes it through the VFS again. The TU invocation that rebuilds the PCM only sees the copy in its InMemoryModuleCache, which is invisible to other invocations. This means the PCM gets rebuilt for every TU given to the scanner.
This patch fixes the situation by flipping the default, only caching files that are known to be important, and letting everything else fall through to the underlying VFS.
rdar://106376153
Reviewed By: Bigcheese
Differential Revision: https://reviews.llvm.org/D146328
|
 | clang/unittests/Tooling/DependencyScannerTest.cpp |
 | clang/include/clang/Tooling/DependencyScanning/DependencyScanningFilesystem.h |
 | clang/lib/Tooling/DependencyScanning/DependencyScanningFilesystem.cpp |
Commit
6bd4d717d577b88e83a92ea865bb96dd5df45936
by jhuber6[libc] Add environment variables to GPU libc test for AMDGPU
This patch performs the same operation to copy over the `argv` array to the `envp` array. This allows the GPU tests to use environment variables.
Reviewed By: sivachandra
Differential Revision: https://reviews.llvm.org/D146322
|
 | libc/startup/gpu/amdgpu/start.cpp |
 | libc/test/integration/startup/gpu/CMakeLists.txt |
 | libc/utils/gpu/loader/amdgpu/Loader.cpp |
 | libc/test/integration/startup/gpu/args_test.cpp |
 | libc/utils/gpu/loader/Main.cpp |
 | libc/utils/gpu/loader/Loader.h |
Commit
41e9c4b88c28b0a6f3820b45000cedeced89206c
by Amara Emerson[NFC][Outliner] Delete default ctors for Candidate & OutlinedFunction.
I think it's good practice to avoid having default ctors unless they're really valid/useful. For OutlinedFunction the default ctor was used to represent a bail-out value for getOutliningCandidateInfo(), so I changed the API to return an optional<getOutliningCandidateInfo> instead which seems a tad cleaner.
Differential Revision: https://reviews.llvm.org/D146375
|
 | llvm/include/llvm/CodeGen/TargetInstrInfo.h |
 | llvm/lib/Target/X86/X86InstrInfo.cpp |
 | llvm/lib/Target/RISCV/RISCVInstrInfo.cpp |
 | llvm/lib/Target/AArch64/AArch64InstrInfo.cpp |
 | llvm/include/llvm/CodeGen/MachineOutliner.h |
 | llvm/lib/Target/ARM/ARMBaseInstrInfo.h |
 | llvm/lib/Target/RISCV/RISCVInstrInfo.h |
 | llvm/lib/CodeGen/MachineOutliner.cpp |
 | llvm/lib/Target/X86/X86InstrInfo.h |
 | llvm/lib/Target/AArch64/AArch64InstrInfo.h |
 | llvm/lib/Target/ARM/ARMBaseInstrInfo.cpp |
Commit
272ebd6957ef7bd39a6c6d2aaf7249d86e09791b
by listmail[LSR] Inline getAlternateIVEnd and simplify [nfc]
Also, add a comment to highlight that the "good" result on this test is accidental, and not based on a principled decision. I matched the original behavior to make this nfc, but selecting the last legal IV is not well motivated here.
|
 | llvm/test/Transforms/LoopStrengthReduce/lsr-term-fold-negative-testcase.ll |
 | llvm/lib/Transforms/Scalar/LoopStrengthReduce.cpp |
Commit
92416b63a57b74689abc175bcafd97b674ff9728
by kubak[ADT] Work around `enumerate` compilation error with modules enabled
This manifests on Apple clang 14 with `-DLLVM_ENABLE_MODULES=1` and `-DLLVM_ENABLE_ASSERTIONS=1` and seems like a host compiler bug.
Sample compilation failure: https://green.lab.llvm.org/green/view/LLDB/job/lldb-cmake/52513/consoleFull#-458239162a1ca8a51-895e-46c6-af87-ce24fa4cd561.
Reviewed By: aprantl
Differential Revision: https://reviews.llvm.org/D146340
|
 | llvm/include/llvm/ADT/STLExtras.h |
Commit
2d373e4dc7e97b0b2d7022d8d04e64834204187c
by Akash.Banerjee[MLIR][OpenMP] Added OMPIRBuilder support for Target Data directives
This patch adds OpenMP IRBuilder support for the Target Data directives to allow lowering to LLVM IR.
The mlir::Translation is responsible for generating supporting code for processing the map_operands through the processMapOperand function, and also generate code for the r>
The OMPIRBuilder is responsible for generating the begin and end mapper function calls.
Limitations: - use_device_ptr and use_device_addr clauses are NOT supported for Target Data operation. - nowait clauses are NOT supported for Target Enter and Exit Data operations. - Only LLVMPointerType is supported for map_operands.
Differential Revision: https://reviews.llvm.org/D142914
|
 | mlir/include/mlir/Target/LLVMIR/Dialect/OpenMPCommon.h |
 | llvm/include/llvm/Frontend/OpenMP/OMPIRBuilder.h |
 | llvm/include/llvm/Frontend/OpenMP/OMPConstants.h |
 | mlir/lib/Target/LLVMIR/CMakeLists.txt |
 | mlir/lib/Target/LLVMIR/Dialect/OpenMP/OpenMPToLLVMIRTranslation.cpp |
 | mlir/test/Target/LLVMIR/omptarget-llvm.mlir |
 | mlir/lib/Target/LLVMIR/Dialect/OpenMPCommon.cpp |
 | clang/lib/CodeGen/CGOpenMPRuntime.cpp |
 | llvm/lib/Frontend/OpenMP/OMPIRBuilder.cpp |
 | mlir/lib/Target/LLVMIR/Dialect/OpenACC/OpenACCToLLVMIRTranslation.cpp |
 | llvm/unittests/Frontend/OpenMPIRBuilderTest.cpp |
Commit
57bfe25574a0bfa9b6701e645d1f1ec966f838d9
by zeratul976[clangd] Remove reundant use of getSpellingLoc()
getFileLoc() is guaranteed to return a file loc, and getSpellingLoc() on a file loc is a no-op.
Differential Revision: https://reviews.llvm.org/D146377
|
 | clang-tools-extra/clangd/XRefs.cpp |
 | clang-tools-extra/clangd/FindSymbols.cpp |
Commit
5452d8607185a6328dae766330c023cbf1105c95
by efriedma[llvm-readobj] Pretty-print IMAGE_WEAK_EXTERN_ANTI_DEPENDENCY.
|
 | llvm/tools/llvm-readobj/COFFDumper.cpp |
Commit
59ff9d3777701ebbe6a59ab2edb8792ef3d2873f
by a.bataev[SLP]Fix PR61554: use of missing vectorized value in buildvector nodes.
If the buildvector node matches the vector node, it reuse the vector value from this vector node, but its VectorizedValue field is not updated. Need to update this field to avoid misses during the analysis of the reused gather/buildvector nodes.
|
 | llvm/lib/Transforms/Vectorize/SLPVectorizer.cpp |
 | llvm/test/Transforms/SLPVectorizer/X86/reused-buildvector-matching-vectorized-node.ll |
Commit
f67b481098cc30567d0f50a2b21f8f57b92052bd
by mnadeem[Flang] Exit gracefully with a useful message when we fail to lookup a target
Without this patch we were asserting with a generic message `Failed to create Target`, but we already have a detailed error message stored in the variable `error` after calling `lookupTarget()` but this error was not getting used/printed.
With this patch we will emit a message with more details instead of a stack dump with a generic message.
Differential Revision: https://reviews.llvm.org/D146333
Change-Id: I7ddee917cf921a2133ca3e6b35791b2142f770a2
|
 | flang/include/flang/Frontend/FrontendActions.h |
 | flang/test/Driver/target-machine-error.f90 |
 | flang/lib/Frontend/FrontendActions.cpp |
Commit
82ac02e4a86070cf9924c245ff340aba1f62b45b
by Krzysztof.DrewniakAdd scalar support for amdgpu.raw_buffer_{load,store}
Introduce the possibility to load/store scalars via amdgpu.raw_buffer_{load,store}
Reviewed By: krzysz00
Differential Revision: https://reviews.llvm.org/D146413
|
 | mlir/test/Dialect/AMDGPU/ops.mlir |
 | mlir/lib/Conversion/AMDGPUToROCDL/AMDGPUToROCDL.cpp |
 | mlir/test/Conversion/AMDGPUToROCDL/amdgpu-to-rocdl.mlir |
 | mlir/include/mlir/Dialect/AMDGPU/AMDGPU.td |
Commit
da40f7e8b1b1c386cc4801f03082be582de93a65
by nemanja.i.ibm[PowerPC][NFC] Pre-commit a test case for upcoming patch
|
 | llvm/test/CodeGen/PowerPC/pr61315.ll |
Commit
eecb8c5f06149baf970fa0943e9fb9a6afe00207
by aeubanks[SampleProfile] Use LazyCallGraph instead of CallGraph
The function order in some tests had to be changed because they relied on ordering of functions returned in an SCC which is consistent but unspecified.
|
 | llvm/test/Other/new-pm-thinlto-prelink-samplepgo-defaults.ll |
 | llvm/test/Transforms/SampleProfile/profile-topdown-order.ll |
 | llvm/test/Other/new-pm-thinlto-postlink-samplepgo-defaults.ll |
 | llvm/lib/Transforms/IPO/SampleProfile.cpp |
 | llvm/test/Transforms/SampleProfile/profile-context-order.ll |
Commit
091422adc1d7478b126a967c795414840c5c0c97
by listmail[LSR] Fix wrapping bug in lsr-term-fold logic
The existing logic was unsound, in two ways.
First, due to wrapping on the trip count computation, it could compute a value which convert a loop which exiting on iteration 256, to one which exited at 255. (With i8 trip counts.)
Second, it allowed rewriting when the trip count implies wrapping around the alternate IV. As a trivial example, it allowed rewriting an i128 exit test in terms of an i64 IV. This is obviously wrong.
Note that the test change is fairly minimal - i.e. only the targeted test - but that's only because I precommitted a change which switched the test from 32 to 64 bit pointers. For 32 bit point architectures with 32 bit primary inductions, this transform is almost always unsound to perform.
Differential Revision: https://reviews.llvm.org/D146429
|
 | llvm/test/Transforms/LoopStrengthReduce/lsr-term-fold.ll |
 | llvm/lib/Transforms/Scalar/LoopStrengthReduce.cpp |
Commit
3af1c48c66a3cf906bd35c2a6e9d96b28ad19438
by ravishankarmChanges to `SCFFuseProducerOfSliceResult` to also return the operations created during fusion.
This is follow up to https://reviews.llvm.org/D145133 that allows propogating information about ops that are fused back to the caller.
Reviewed By: hanchung
Differential Revision: https://reviews.llvm.org/D146254
|
 | mlir/lib/Dialect/SCF/Transforms/TileUsingInterface.cpp |
 | mlir/include/mlir/Dialect/SCF/Transforms/TileUsingInterface.h |
Commit
fb1b9945be7657a3a25b727eaf0eeb3f74525210
by Lang Hames[JITLink][ELF][AArch64] Add support for ELF R_AARCH64_ABS32 relocation.
This relocation is commonly used in debug sections. Failure to handle it caused the test failure in https://lab.llvm.org/buildbot/#/builders/197/builds/4272, which forced the reversion, in f721fcb6ed0, of 57aeb305460 ("[JITLink][ELF] Don't skip debug info sections by default"). This fix should allow us to re-land 57aeb305460.
|
 | llvm/lib/ExecutionEngine/JITLink/ELF_aarch64.cpp |
 | llvm/test/ExecutionEngine/JITLink/AArch64/ELF_R_AARCH64_ABS32.yaml |
Commit
c21e88cc02617e0f04807a8dcf164b405d67d5e4
by ravishankarm[mlir][Tensor] Avoid dropping attributes for `tensor.pad` operations during canonicalization.
Reviewed By: hanchung
Differential Revision: https://reviews.llvm.org/D146440
|
 | mlir/include/mlir/Dialect/Linalg/Utils/Utils.h |
 | mlir/lib/Dialect/Tensor/IR/TensorOps.cpp |
 | mlir/lib/Dialect/Utils/StructuredOpsUtils.cpp |
 | mlir/include/mlir/Dialect/Tensor/IR/TensorOps.td |
 | mlir/include/mlir/Dialect/Utils/StructuredOpsUtils.h |
Commit
c71fe7c9e83c16c8304d5cbc0dc3936c41e3e703
by anlunx[mlir][openMP] Add bazel deps
Differential Revision: https://reviews.llvm.org/D146455
|
 | utils/bazel/llvm-project-overlay/mlir/BUILD.bazel |
Commit
d3d6a5ff184d4d9c7ac7bcd281281a3b53ed058b
by Vitaly Buka[NFC] Rename ArgSize to SpellingSize, and add ArgStringSize.
Differential Revision: https://reviews.llvm.org/D146394
|
 | llvm/lib/Option/Option.cpp |
Commit
38fc680c2dc31ae7c397faa919e1881fb9e8f82a
by Vitaly BukaReplace strlen with StringRef::size
Replace multiple strlen calls with a StringRef constructor and a StringRef::size call.
Differential Revision: https://reviews.llvm.org/D146394
|
 | llvm/lib/Option/Option.cpp |
Commit
d0e507f56990fa09fe9ce326419dda8d54486c0a
by ravishankarm[mlir][Tensor] Fix build error due to missing `<>` in D146440.
Differential Revision: https://reviews.llvm.org/D146458
|
 | mlir/lib/Dialect/Utils/StructuredOpsUtils.cpp |
Commit
411b1d8f079533860a990ee615abae3b0e6dbd8b
by ravishankarm[mlir][Tensor] Make `TilingInterface` implementation only return handle to the created `pad` operation.
Pad tiling implementation only needs to return the tiled pad operation. The rest of the generated code is related to handling boundary conditions.
Reviewed By: hanchung
Differential Revision: https://reviews.llvm.org/D146439
|
 | mlir/lib/Dialect/Tensor/IR/TensorTilingInterfaceImpl.cpp |
Commit
d03805f2ee0bdaa2513fbc3efb9e404e128bdbb3
by peiming[mlir][sparse] add merger/topo sort support for slice-based affine sparse index codegen
Reviewed By: aartbik
Differential Revision: https://reviews.llvm.org/D142928
|
 | mlir/include/mlir/Dialect/SparseTensor/Utils/Merger.h |
 | mlir/lib/Dialect/SparseTensor/Transforms/CodegenEnv.h |
 | mlir/lib/Dialect/SparseTensor/Utils/Merger.cpp |
 | mlir/lib/Dialect/SparseTensor/Transforms/Sparsification.cpp |