SBiCGの非対角同時計算とLOBPCG/SBiCG/TPQに対する複ベクトル高速化 by mitsuaki1987 · Pull Request #148 · issp-center-dev/HPhi

mitsuaki1987 · 2023-06-08T04:54:56Z

以前のもの
#66
と違って、LanczosとConteneous memory accessは残しています。

# Conflicts: # src/PairExSpin.c

…VMC case

… it is used in LOBPCG.

lobcg_kondo*.sh: Indices for itenerant was incorrect. this was fixed before, but not applied in these tests. spectrum_spin_kagome.sh: S+S- excitation lobcg_genspin_ladder.sh: D-term with general spin spectrum_spin_kagome.sh

…tion. Adopt the warning message by -Wall of gcc (delete unused variables and arguments.

Mesh plot -> line segment plot (2 blank lines)

Remove Komega

dynamicalr2k, greenr2k: temperature dependent

…mura1

…ix_memalign (NOT malloc/calloc). Otherwise HPhi crashes when we use SVE.

k-yoshimi · 2024-01-31T08:12:52Z

@mitsuaki1987
スペクトル計算の仕様が変わってしまったため、tutorial4が動かなくなっているようです。元の計算と同じ計算ができるよう、そちらも修正していただけるでしょうか？

modification of the file-format change in spectrum function.

mitsuaki1987 · 2024-02-01T08:25:58Z

Tutorial 4が動くようにpythonスクリプトとAll.shを変更しました。
またCalcSpectrum.cのファイル読み込みのところにバグを入れてしまっていたので直しました。

The resulting vectors of subspace diagonalization should be the same across processes. This caused error in Fugaku with SVE. Also fix typo of overlap

Copilot

Pull request overview

This PR implements optimizations for complex vector operations in SBiCG, LOBPCG, and TPQ calculations by introducing simultaneous off-diagonal computations and vectorized operations. The changes remove the komega library dependency and refactor the codebase to use multi-state vector arrays instead of single vectors.

Changes:

Replaced single-state vector operations with multi-state arrays throughout the codebase
Removed komega library files and dependencies
Updated documentation to reflect new dynamical Green's function calculation modes and Fourier transformation utilities

Reviewed changes

Copilot reviewed 129 out of 186 changed files in this pull request and generated 4 comments.

Show a summary per file

File	Description
src/mltply.c	Refactored to use 2D arrays for multi-state vectors and BLAS operations
src/matrixscalapack.c	Changed eigenvalue array from complex to real, removed unused variables
src/matrixlapack.c	Removed unused functions, updated eigenvalue handling to use real arrays
src/lapack_diag.c	Updated to use corrected eigenvector indexing and real energy arrays
src/komega/*	Removed entire komega library directory and files
src/input.c	Updated array indexing for Hamiltonian input
src/include/*	Updated function signatures to accept multi-state arrays
src/global.c	Changed vector declarations from 1D to 2D arrays
src/eigenIO.c	Commented out unused I/O functions
src/common/setmemory.*	Added allocation functions for unsigned int arrays and 4D complex arrays
src/check.c	Changed variable types and updated memory estimates
src/bitcalc.c	Changed return type of GetBitGeneral to unsigned int
src/StdFace	Updated submodule reference
src/SingleExHubbard.c	Updated to use multi-state arrays and BLAS operations
src/SingleEx.c	Updated function calls with new signatures
src/PowerLanczos.c	Removed entire file
src/PairEx.c	Updated function calls with new signatures
src/Multiply.c	Refactored for multi-state operations and norm calculations
src/MakeIniVec.c	Updated to support multiple random vectors
src/Lanczos_EigenVector.c	Updated vector operations for 2D arrays
src/Lanczos_EigenValue.c	Updated vector operations and added matrixlapack.h include
src/HPhiTrans.c	Changed variable types to unsigned int
src/HPhiMain.c	Added CALCSPEC_SCRATCH mode support
src/FirstMultiply.c	Updated for multi-state operations and added expectation value calculations
src/CheckMPI.c	Changed variable type to unsigned int
src/CalcSpectrum*.c	Updated function signatures for multi-state arrays
src/CalcByTEM.c	Updated for multi-state operations
src/CalcByLanczos.c	Updated vector operations and array indexing
src/CalcByFullDiag.c	Major refactoring for Lehmann representation calculation
src/CMakeLists.txt	Removed komega subdirectory and dependency
src/CG_EigenVector.c	Updated for 2D vector arrays
samples/tutorial_4.2/*	Updated for new output file format
samples/Spectrum/Scratch/*	Added new sample files
doc/*	Extensive documentation updates for new features

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-02-26T14:59:03Z

src/matrixlapack.c

+  k = 0;
+  for (i = 0; i < xNsize; i++) {
+    for (j = 0; j < xNsize; j++) {
+      vec[j][i] = a[k];


The eigenvector assignment appears to have transposed indices. Based on the Fortran column-major to C row-major conversion pattern used elsewhere in the file, this should be vec[i][j] = a[k]; to maintain consistency with the column-major output from zheev_.

Suggested change

vec[j][i] = a[k];

vec[i][j] = a[k];

Copilot · 2026-02-26T14:59:04Z

src/lapack_diag.c

  for (i = 0; i < i_max; i++) {
    for (j = 0; j < i_max; j++) {
-      Ham[i][j] = Ham[i + 1][j + 1];
+      v0[i][j] = v0[i + 1][j];


This line copies from v0[i+1][j] which appears to be accessing an out-of-bounds index when i reaches i_max-1. The intent seems unclear - if this is meant to shift data, the loop bounds or indexing should be corrected.

Copilot · 2026-02-26T14:59:04Z

src/input.c

+    v0[ham_i][ham_j-1]=dHam_re+I*dHam_im;
+    v0[ham_j][ham_i-1]=conj(v0[ham_i][ham_j-1]);


The indexing uses ham_j-1 and ham_i-1 which will cause an out-of-bounds access when ham_i or ham_j equals 0. This appears to be an incorrect transformation from the original 1-based indexing.

Suggested change

v0[ham_i][ham_j-1]=dHam_re+I*dHam_im;

v0[ham_j][ham_i-1]=conj(v0[ham_i][ham_j-1]);

/* Validate 1-based indices from file before converting to 0-based */

if (ham_i < 1 || ham_i > imax || ham_j < 1 || ham_j > imax) {

fprintf(stdoutMPI, "Error: Invalid Hamiltonian index: ham_i=%ld, ham_j=%ld (valid range 1..%ld)\n",

ham_i, ham_j, imax);

fclose(fp);

return -1;

}

/* Convert from 1-based (file) to 0-based (C arrays) consistently */

v0[ham_i-1][ham_j-1] = dHam_re + I * dHam_im;

v0[ham_j-1][ham_i-1] = conj(v0[ham_i-1][ham_j-1]);

Copilot · 2026-02-26T14:59:04Z

src/check.c

    NLocSpn = X->Def.NLocSpn;
    //4^Nc*2^Ns
-    for(i=0;i<(2*NCond+NLocSpn);i++){
+    for(u_loc=0;u_loc <(2*NCond+NLocSpn); u_loc++){


The loop variable u_loc is used but was previously declared for a different purpose in the Kondo case above. This variable reuse makes the code confusing and the variable name doesn't match the loop's purpose of calculating powers of 2.

mitsuaki1987 added 30 commits February 15, 2019 17:16

Backup

c470af1

Merge branch 'develop' into kawamura1

8eca673

# Conflicts: # src/PairExSpin.c

Backup

94b427e

Backup

259bda0

Backup

b84abfd

Backup

2ae1014

Backup

576e78f

Merge branch 'develop' into kawamura1

3350cc8

Backup

7a47d8a

Backup

8913dd9

Backup

d84960d

Backup

0b9cfb1

Backup

381d8a0

Backup

c028534

Backup

04bd7e1

backup

55dd4fd

Backup

bd22ba1

Backup

f47ad16

Backup

f76a0b7

Backup

7082cf6

Backup

4b8c549

BugFix in TPQ

fb232a9

Backup

c5351d7

greenr2k output TPQ correlation function and its eerror like in the m…

f2089fa

…VMC case

The first line has infinite temperature.

0576026

Backup

54e1af6

Bagfix

f10f5ff

Backup

5b104c3

BagFix : When building without MPI, the v1buf was not allocated while…

3a11864

… it is used in LOBPCG.

Backup

26daa11

mitsuaki1987 added 19 commits April 12, 2019 10:53

Implement CMA

4407225

Merge branch 'sz_omp' into kawamura1

3c25813

Merge remote-tracking branch 'remotes/origin/develop' into kawamura1

bd3466c

Restore the deleted submodule

c203e2b

To pass the tests.

ebf21cd

lobcg_kondo*.sh: Indices for itenerant was incorrect. this was fixed before, but not applied in these tests. spectrum_spin_kagome.sh: S+S- excitation lobcg_genspin_ladder.sh: D-term with general spin spectrum_spin_kagome.sh

Previous commit was failed in macos because of absence of inclusion.

32c5b57

FullDiag by ScaLAPACK did not work.

69febe0

Precious verion was not build in macos because of incomplete decleara…

8c598ee

…tion. Adopt the warning message by -Wall of gcc (delete unused variables and arguments.

omega in spectrum appears as (w - e_j + e_i), not (w - e_j).

fa5ce0e

Name of dynamical green's function was changed.

2e3040c

Mesh plot -> line segment plot (2 blank lines)

Backup

22bc1de

BugFix: All test passes

b52fccd

Merge remote-tracking branch 'remotes/origin/develop' into kawamura1

84b2ad9

Bugfix in spectrum by BiCG with multiple eigenstates

68d9290

Remove Komega

Update manual for dynamical green's function.

6c591ad

dynamicalr2k, greenr2k: temperature dependent

Manual for dynamical correlation functions are updated

a434670

This preamble must be included to avoid error in make tutorial-en-pdf

a3610f9

Dry run should be executed in serial

464912d

Merge branch 'kawamura1' of github.com:issp-center-dev/HPhi into kawa…

225d787

…mura1

mitsuaki1987 assigned k-yoshimi Oct 20, 2023

mitsuaki1987 mentioned this pull request Oct 20, 2023

SBiCGの非対角同時計算とLOBPCG/SBiCG/TPQに対する複ベクトル高速化 #150

Open

In Fugaku and FX1000, "work" variable for ZHEEVD mst be allocated pos…

8de7984

…ix_memalign (NOT malloc/calloc). Otherwise HPhi crashes when we use SVE.

(Bug Fix) Tutorial for the spectrum calculation did not work after the

84a1aea

modification of the file-format change in spectrum function.

mitsuaki1987 added 3 commits April 19, 2024 14:04

Fix unstable behaviorin LOBPCG:

d9e2072

The resulting vectors of subspace diagonalization should be the same across processes. This caused error in Fugaku with SVE. Also fix typo of overlap

Merge remote-tracking branch 'remotes/origin/develop' into kawamura1

23ae41c

Merge tag 'v3.5.2' into kawamura1

018bd55

k-yoshimi requested a review from Copilot February 26, 2026 14:57

Copilot AI reviewed Feb 26, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

SBiCGの非対角同時計算とLOBPCG/SBiCG/TPQに対する複ベクトル高速化#148

SBiCGの非対角同時計算とLOBPCG/SBiCG/TPQに対する複ベクトル高速化#148
mitsuaki1987 wants to merge 58 commits intodevelopfrom
kawamura1

mitsuaki1987 commented Jun 8, 2023

Uh oh!

k-yoshimi commented Jan 31, 2024

Uh oh!

mitsuaki1987 commented Feb 1, 2024

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Feb 26, 2026

Uh oh!

Copilot AI Feb 26, 2026

Uh oh!

Copilot AI Feb 26, 2026

Uh oh!

Copilot AI Feb 26, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

		v0[ham_i][ham_j-1]=dHam_re+I*dHam_im;
		v0[ham_j][ham_i-1]=conj(v0[ham_i][ham_j-1]);

-    v0[ham_i][ham_j-1]=dHam_re+I*dHam_im;
-    v0[ham_j][ham_i-1]=conj(v0[ham_i][ham_j-1]);
+    /* Validate 1-based indices from file before converting to 0-based */
+    if (ham_i < 1 || ham_i > imax || ham_j < 1 || ham_j > imax) {
+      fprintf(stdoutMPI, "Error: Invalid Hamiltonian index: ham_i=%ld, ham_j=%ld (valid range 1..%ld)\n",
+              ham_i, ham_j, imax);
+      fclose(fp);
+      return -1;
+    }
+    /* Convert from 1-based (file) to 0-based (C arrays) consistently */
+    v0[ham_i-1][ham_j-1] = dHam_re + I * dHam_im;
+    v0[ham_j-1][ham_i-1] = conj(v0[ham_i-1][ham_j-1]);

Conversation

mitsuaki1987 commented Jun 8, 2023

Uh oh!

k-yoshimi commented Jan 31, 2024

Uh oh!

mitsuaki1987 commented Feb 1, 2024

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Copilot AI Feb 26, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Feb 26, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Feb 26, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Feb 26, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants