refactor: clean up legacy load_state_dict for linear layers by guocuimi · Pull Request #503 · vectorch-ai/ScaleLLM

guocuimi · 2025-09-24T06:26:03Z

No description provided.

Copilot

Pull Request Overview

This PR refactors the legacy load_state_dict method for linear layers by removing the transform function variant and updating method calls to use simplified loading mechanisms. The changes also improve code structure by using structured bindings for QKV projections and updating parameter registration to use sharded parameters.

Removes the legacy load_state_dict method with transform functions from linear layer implementations
Updates QKV projection calls across multiple model architectures to use structured bindings instead of array indexing
Refactors parameter registration in quantized linear layers to use register_sharded_parameter instead of register_parameter

Reviewed Changes

Copilot reviewed 11 out of 11 changed files in this pull request and generated no comments.

Show a summary per file

File	Description
src/quantization/qlinear_impl_test.cpp	Updates test calls to use `load()` and `verify()` methods instead of legacy state dict methods
src/quantization/qlinear_impl.cpp	Replaces parameter registration calls with sharded parameter registration including rank and world_size
src/models/meta/llama.h	Replaces QKV array indexing with structured binding for cleaner code
src/models/google/gemma2.h	Replaces QKV array indexing with structured binding for cleaner code
src/models/google/gemma.h	Replaces QKV array indexing with structured binding for cleaner code
src/models/alibaba/qwen2.h	Replaces QKV array indexing with structured binding for cleaner code
src/layers/qkv_linear_test.cpp	Updates test to use structured binding for QKV outputs
src/layers/qkv_linear.h	Updates forward method signature to return tuple instead of vector
src/layers/linear_impl.h	Removes legacy transform function overload declaration
src/layers/linear_impl.cpp	Removes legacy transform function implementation and simplifies loading
src/layers/linear.h	Removes virtual transform function method from base class

_{Tip: Customize your code reviews with copilot-instructions.md. Create the file or learn how to get started.}

guocuimi · 2025-10-07T22:30:45Z

src/models/meta/llama.h

  torch::Tensor forward(torch::Tensor x) {
-    const auto gate_up = gate_up_proj_(x);
-    return down_proj_(act_func_(gate_up[0]) * gate_up[1]);
+    // const auto gate_up = gate_up_proj_(x);


guocuimi added 2 commits September 23, 2025 23:25

refactor: clean up legacy load_state_dict for linear layers

2972034

update qlinear

b4cc8f2

guocuimi requested a review from Copilot September 25, 2025 01:25

Copilot AI reviewed Sep 25, 2025

View reviewed changes

clean up load_state_dict for linear layers

31afc66

guocuimi force-pushed the linear_refactor branch from 3c883b4 to 31afc66 Compare October 7, 2025 17:51

rename.

36449ad

guocuimi commented Oct 7, 2025

View reviewed changes

guocuimi added 9 commits October 7, 2025 15:32

revert

93c4409

move linear into linear folder

8a102cd

refactor

0ca92b5

remove linear.h/cpp

8e4bacb

move quantization into layers folder

16c2674

refactor ModuleHolder

8d49372

add ref link

2500606

add unittests

8cbcd1c

move '"module' into layers folder

3e0d016

guocuimi merged commit f29965e into main Oct 8, 2025
3 checks passed

guocuimi deleted the linear_refactor branch October 8, 2025 00:21

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

refactor: clean up legacy load_state_dict for linear layers#503

refactor: clean up legacy load_state_dict for linear layers#503
guocuimi merged 13 commits intomainfrom
linear_refactor

guocuimi commented Sep 24, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

guocuimi Oct 7, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

guocuimi commented Sep 24, 2025

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

guocuimi Oct 7, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants