add joint optimize W and alpha #15

youdongguo · 2026-02-05T06:01:41Z

not ready for review

youdongguo · 2026-02-05T19:03:56Z

test/runtests.jl

+    _, W_gsvd, H_gsvd = gsvdnmf(X, 9=>10; alg = :cd, maxiter = 10^5, tol_final=1e-4, tol_intermediate = 1e-4);
    img_tol_int = sum(abs2, X)
    @test size(W_gsvd, 2) == 10
-    @test sum(abs2, X-standard_nmf.W*standard_nmf.H)/sum(abs2, X) > sum(abs2, X-W_gsvd*H_gsvd)/sum(abs2, X)


I deleted this test because the standard NMF also generate perfect results on my machine. On RIS, it generate a bad results. One question: in the document of this repo, should we keep the standard NMF result?

I might need to have you walk me through the issues.

youdongguo · 2026-02-05T19:06:05Z

test/runtests.jl

+    _, W_gsvd_1, H_gsvd_1 = gsvdnmf(X, 10; alg=:cd)
+    _, W_gsvd_2, H_gsvd_2 = gsvdnmf(X, 9 => 10; alg=:cd)
    @test sum(abs2, W_gsvd_1-W_gsvd_2) <= 1e-12
    @test sum(abs2, H_gsvd_1-H_gsvd_2) <= 1e-12


This test is also interesting. One RIS, when I include the test file, it cannot pass and the sum(abs2, W_gsvd_1-W_gsvd_2)=1e-7. However, when I copy paste these lines in REPL, it always passes. On my machine and github, these tests always pass.

It probably depends on the specific random numbers. I think the seed gets saved in the testset, if that helps debug.

youdongguo · 2026-02-05T19:08:56Z

src/GsvdInitialization.jl

    else
        # @show alg
-        W_recover, H_recover, _ = gsvdrecover(X, copy(W), copy(H), kadd, f)
+        W_recover, H_recover, _ = gsvdrecover(X, copy(W), copy(H), kadd, f; initW=initW)


This version is ready for review. I have not written the document yet and after we merge this code change (or we finish the code change), I will update the documentation.

i added a keyword argument to add a new method for our jointly optimizing Wadd and alpha approach. The default approach is separately computing Wadd and alpha.

youdongguo · 2026-02-05T19:10:43Z

src/GsvdInitialization.jl

-end
-
 function gsvdrecover(X::AbstractArray, W0::AbstractArray, H0::AbstractArray, kadd::Int, f::Tuple; initW::Symbol = :standard, kwargs...)
    m, n = size(W0)


This is the main function change. A split of change is added.

timholy

A few things to think about.

timholy · 2026-02-08T22:56:02Z

src/GsvdInitialization.jl

+        # @show alg
+        W_recover, H_recover, _ = gsvdrecover(X, copy(W), copy(H), kadd, f; initW=initW)
+        if alg == :multmse
+            @show alg


timholy · 2026-02-08T22:56:34Z

src/GsvdInitialization.jl

+        W_recover, H_recover, _ = gsvdrecover(X, copy(W), copy(H), kadd, f; initW=initW)
+        if alg == :multmse
+            @show alg
+            W_recover, H_recover = max.(W_recover, 1e-5), max.(H_recover, 1e-5)


Should the 1e-5 be hard-coded or a kwarg?

timholy · 2026-02-08T23:00:11Z

src/GsvdInitialization.jl

+    m, r0 = size(W0)
+    k = size(Hadd, 1)
+    b = zeros(Float64, m*k + r0)
+    b[1:m*k] = vec(X * Hadd')
+    b[m*k+1:end] = diag(W0' * X * H0')


Suggested change

m, r0 = size(W0)

k = size(Hadd, 1)

b = zeros(Float64, m*k + r0)

b[1:m*k] = vec(X * Hadd')

b[m*k+1:end] = diag(W0' * X * H0')

b = vcat(vec(X * Hadd'), diag(W0' * X * H0'))

Though computing all of W0' * X * X0' and then keeping only the diagonal seems wasteful?

timholy · 2026-02-08T23:01:34Z

test/runtests.jl

+    _, W_gsvd, H_gsvd = gsvdnmf(X, 9=>10; alg = :cd, maxiter = 10^5, tol_final=1e-4, tol_intermediate = 1e-4);
    img_tol_int = sum(abs2, X)
    @test size(W_gsvd, 2) == 10
-    @test sum(abs2, X-standard_nmf.W*standard_nmf.H)/sum(abs2, X) > sum(abs2, X-W_gsvd*H_gsvd)/sum(abs2, X)


I might need to have you walk me through the issues.

timholy · 2026-02-08T23:02:24Z

test/runtests.jl

+    _, W_gsvd_1, H_gsvd_1 = gsvdnmf(X, 10; alg=:cd)
+    _, W_gsvd_2, H_gsvd_2 = gsvdnmf(X, 9 => 10; alg=:cd)
    @test sum(abs2, W_gsvd_1-W_gsvd_2) <= 1e-12
    @test sum(abs2, H_gsvd_1-H_gsvd_2) <= 1e-12


It probably depends on the specific random numbers. I think the seed gets saved in the testset, if that helps debug.

timholy · 2026-02-08T23:03:38Z

test/runtests.jl

+    X = W*H
+    standard_nmf = nnmf(X, 10; alg = :cd, init=:nndsvd, tol=1e-4, initdata = svd(float(X)))
+    _, W_gsvd, H_gsvd = gsvdnmf(X, 9=>10; alg = :cd, maxiter = 10^5, tol_final=1e-4, tol_intermediate = 1e-4, initW=:joint);
+    img_tol_int = sum(abs2, X)


This doesn't seem used, though you recompute the same quantity a couple lines down.

timholy · 2026-02-08T23:04:13Z

Project.toml

 version = "1.0.0"

 [deps]
+Kronecker = "2c470bb0-bcc8-11e8-3dad-c9649493f05e"


Don't forget to update [compat]

youdongguo added 2 commits February 5, 2026 00:00

add joint optimize W and alpha

9025350

add test for joint optimize W and alpha

8a2396d

youdongguo commented Feb 5, 2026

View reviewed changes

youdongguo requested a review from timholy February 5, 2026 19:11

timholy reviewed Feb 8, 2026

View reviewed changes

add joint optimize W and alpha #15

Are you sure you want to change the base?

add joint optimize W and alpha #15

Uh oh!

Conversation

youdongguo commented Feb 5, 2026

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

timholy left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants