Commit

Merge pull request #1895 from eeprude/axpby_improvement
Axpby: improvement on unification attempt logic and on the execution of a diversity of situations
lucbv authored Oct 23, 2023
2 parents f8aa101 + 12d1fd4 commit 10cbf87
Showing 11 changed files with 4,748 additions and 1,188 deletions.
538 changes: 262 additions & 276 deletions blas/impl/KokkosBlas1_axpby_impl.hpp

Large diffs are not rendered by default.

1,550 changes: 854 additions & 696 deletions blas/impl/KokkosBlas1_axpby_mv_impl.hpp

Large diffs are not rendered by default.

332 changes: 239 additions & 93 deletions blas/impl/KokkosBlas1_axpby_spec.hpp

Large diffs are not rendered by default.

942 changes: 942 additions & 0 deletions blas/impl/KokkosBlas1_axpby_unification_attempt_traits.hpp

Large diffs are not rendered by default.

341 changes: 229 additions & 112 deletions blas/src/KokkosBlas1_axpby.hpp

Large diffs are not rendered by default.

1 change: 1 addition & 0 deletions blas/unit_test/Test_Blas.hpp
@@ -24,6 +24,7 @@
#include "Test_Blas1_asum.hpp"
#include "Test_Blas1_axpby.hpp"
#include "Test_Blas1_axpy.hpp"
+#include "Test_Blas1_axpby_unification.hpp"
#include "Test_Blas1_dot.hpp"
#include "Test_Blas1_iamax.hpp"
#include "Test_Blas1_mult.hpp"
2 changes: 0 additions & 2 deletions blas/unit_test/Test_Blas1_axpby.hpp
@@ -109,8 +109,6 @@ void impl_test_axpby_mv(int N, int K) {
Kokkos::deep_copy(org_y.h_base, y.d_base);
Kokkos::deep_copy(x.h_base, x.d_base);

-Kokkos::View<ScalarB*, Kokkos::HostSpace> r("Dot::Result", K);
-
KokkosBlas::axpby(a, x.d_view, b, y.d_view);
Kokkos::deep_copy(y.h_base, y.d_base);
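For readers unfamiliar with the operation this test exercises: axpby updates y in place as y = a*x + b*y. The block below is a minimal plain C++ sketch of that elementwise update, not code from this commit or from Kokkos Kernels. The name `axpby_reference` and the use of `std::vector` instead of Kokkos views are assumptions made purely for illustration.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical reference helper (not part of Kokkos Kernels).
// Updates y in place so that y[i] = a * x[i] + b * y[i] for every i.
template <class Scalar>
void axpby_reference(Scalar a, const std::vector<Scalar>& x, Scalar b,
                     std::vector<Scalar>& y) {
  // Both vectors must have the same length for the elementwise update.
  assert(x.size() == y.size());
  for (std::size_t i = 0; i < y.size(); ++i) {
    y[i] = a * x[i] + b * y[i];
  }
}
```

With x = {1, 2, 3}, y = {10, 20, 30}, a = 2 and b = 0.5, the call `axpby_reference(2.0, x, 0.5, y)` leaves y = {7, 14, 21}. The `KokkosBlas::axpby(a, x.d_view, b, y.d_view)` call in the test performs the same update on Kokkos views, and a host-side loop like this one is one way a test could compute expected values for comparison.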
