|
 | clang/test/SemaSYCL/bf16.cpp (diff) |
Commit
55cf0de35efd6cb81d6a21fee35186f6fb6864c2
by nicolas.vasilache[mlir][Linalg] Adding a greedy packing transform dialect op.
This PR adds a `pack_greedily` transform operation that infers the packing for gemm subcomputations embedded within in any LinalgOp and packs accordingly. A normalization step guarantees that we get the innermost op dimensions in one of `8` possible `(m, n, k)` orders, specified as a parameter, from which we can emit all packed forms.
The current implementation takes an arbitrary LinalgOp and tries to pack it along the specified dimensions with specified sizes and inner dim permutation.
This achieves a new level of normalization and generalization for any `n-D` LinalgOp that contains a gemm embedded within it: we will always see a predictable packed form for any of these ops.
Differential Revision: https://reviews.llvm.org/D142661
|
 | mlir/test/Dialect/Linalg/transform-pack-greedily.mlir |
 | mlir/lib/Dialect/Linalg/TransformOps/LinalgTransformOps.cpp (diff) |
 | mlir/include/mlir/Dialect/Linalg/TransformOps/LinalgTransformOps.td (diff) |
Commit
e054e0da9fd7055142188036ee713e8c0697324b
by szakharin[flang] Fixed uninitialized std::unique_ptr dereference.
This fixes unittest failures reported in D142279: flang-Unit :: Frontend/./FlangFrontendTests/5/7 flang-Unit :: Frontend/./FlangFrontendTests/6/7
|
 | flang/lib/Frontend/FrontendActions.cpp (diff) |
Commit
02445263e2f533573a935c1bd502d848bbe6bb27
by szakharin[flang] Fixed restrictions checking for OpenACC loop-associated constructs.
CheckDoConcurrentClauseRestriction and CheckTileClauseRestriction expect that the construct has associated DoConstruct, while it is not set when the do-loop has no loop control. The change is to skip the clauses checks, when the do-loop does not have the loop control.
An alternative fix would be to associate the DoConstruct even when the do-loop has no loop control and let Check*ClauseRestriction run their checks, but I am not sure if associating invalid DoConstruct is a good idea.
This fixes failure in Semantics/OpenACC/acc-canonicalization-validity.f90 reported in D142279.
Reviewed By: clementval
Differential Revision: https://reviews.llvm.org/D142652
|
 | flang/lib/Semantics/canonicalize-acc.cpp (diff) |
|
 | utils/bazel/llvm-project-overlay/mlir/BUILD.bazel (diff) |
Commit
dc682ed590f73f7b7417ecf66d18b303ab957c62
by rupprecht[test][lldb-vscode] Fix unclean shutdown in test_terminate_commands
This test manually sends `terminateCommands` to shutdown, but then the test attempts another `terminateCommands` request in test tear down, which fails since it's already torn down. This error is ignored in LLDB's old fork of unittest2, but will be reported as a test failure in `unittest` from the Python standard library. Use `disconnectAutomatically` to avoid terminating twice.
|
 | lldb/test/API/tools/lldb-vscode/launch/TestVSCode_launch.py (diff) |
|
 | llvm/utils/release/test-release.sh (diff) |
Commit
3df4c5a92f7f68d97ada6886c13d8900a5c7eef1
by jezng[NFC] Optimize vector usage in lld
By using emplace_back, as well as converting some loops to for-each, we can do more efficient vectorization.
Make copy constructor for TemporaryFile noexcept.
Reviewed By: #lld-macho, int3
Differential Revision: https://reviews.llvm.org/D139552
|
 | lld/COFF/Writer.cpp (diff) |
 | lld/ELF/InputFiles.cpp (diff) |
 | lld/ELF/OutputSections.cpp (diff) |
 | lld/MachO/SyntheticSections.h (diff) |
 | lld/COFF/MinGW.cpp (diff) |
 | lld/ELF/MapFile.cpp (diff) |
 | lld/COFF/DriverUtils.cpp (diff) |
 | lld/COFF/Driver.cpp (diff) |
 | lld/MachO/Arch/ARM64.cpp (diff) |
 | lld/COFF/Chunks.cpp (diff) |
|
 | mlir/lib/Dialect/Tensor/IR/TensorTilingInterfaceImpl.cpp (diff) |
 | mlir/include/mlir/Dialect/Tensor/IR/TensorTilingInterfaceImpl.h (diff) |