
Conversation

@pabvald
Collaborator

@pabvald pabvald commented Oct 6, 2024

@milankl
I have the basic implementation:

# minimum and maximum representable value of type T
Tmin, Tmax = Float64(typemin(T)), Float64(typemax(T))

# LinQuantization
Δ⁻¹ = (Tmax-Tmin)/(Amax-Amin) # (Tmax - Tmin) == 2^(8*sizeof(T)) - 1, but imo Tmax - Tmin is more informative and the values are used again afterwards anyway

Q[i] = round(T, clamp((A[i]-Amin)*Δ⁻¹ + Tmin, Tmin, Tmax)) 
...

# Base.Array
Δ = (Qmax-Qmin)/(Tmax-Tmin)
A[i] = Qmin + (Q[i] - Tmin)*Δ
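Putting the two pieces together, a self-contained round-trip sketch could look like this (the function names `quantize`/`dequantize` are hypothetical, not the package's API):

```julia
# minimal sketch of signed-integer linear quantization, following the formulas above
function quantize(::Type{T}, A::AbstractArray) where {T<:Integer}
    Amin, Amax = Float64(minimum(A)), Float64(maximum(A))
    Tmin, Tmax = Float64(typemin(T)), Float64(typemax(T))
    Δ⁻¹ = (Tmax - Tmin) / (Amax - Amin)               # inverse quant spacing
    Q = [round(T, clamp((a - Amin)*Δ⁻¹ + Tmin, Tmin, Tmax)) for a in A]
    return Q, Amin, Amax                              # keep the extrema for dequantization
end

function dequantize(Q::AbstractArray{T}, Qmin, Qmax) where {T<:Integer}
    Tmin, Tmax = Float64(typemin(T)), Float64(typemax(T))
    Δ = (Qmax - Qmin) / (Tmax - Tmin)                 # quant spacing
    return [Qmin + (Float64(q) - Tmin)*Δ for q in Q]
end
```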

And I have two questions:

  1. Could I remove the n::Integer argument from Base.Array ? In the end, the range of values can be calculated as typemax(eltype(Q)) - typemin(eltype(Q)) instead of 2^n - 1.
  2. Does it make sense to create aliases for LinQuantization and LinQuantArray, since the alias is equally long as the original call? e.g.
LinQuantInt8Array(A::AbstractArray{T,N},dim::Int,e::Option{Tuple}) where {T,N} = LinQuantArray(Int8,A,dim,e)

Also I am facing some issues. The back-and-forth conversion with signed integers is not equal for some values, with a difference of 1.0 or 2.0. For Float16 and Int16 some values of A2 end up being -∞ or . I don't know exactly what the problem is.

for T in [Float64, Float32, Float16]
    for s in [(100,), (10,20), (13,14,15), (23,17,12,5)]
        A = rand(T, s...)

        for U in [Int8, Int16, Int32]
            A2 = Array{T}(LinQuantization(U, A))
            B = Array{T}(LinQuantization(U, A2))

            if A2 != B
                println("T = ", T, " U = ", U)
            end
            for i in eachindex(A2)
                if A2[i] != B[i]
                    println(" A2[i] = ", A2[i], " B[i] = ", B[i])
                end
            end
        end
    end
end

# T = Float32 U = Int32
#  A2[i] = 2.0507566e7 B[i] = 2.0507564e7
# T = Float32 U = Int32
#  A2[i] = 8.266649e6 B[i] = 8.266648e6
# T = Float32 U = Int32
#  A2[i] = 6.41308e6 B[i] = 6.413079e6
#  A2[i] = 848596.0 B[i] = 848595.0
#  A2[i] = 3.2166338e7 B[i] = 3.2166336e7
#  A2[i] = 1.717558e6 B[i] = 1.717557e6
#  A2[i] = 1.5504138e7 B[i] = 1.5504137e7
#  A2[i] = 3.065833e6 B[i] = 3.065832e6
#  A2[i] = 6.550137e6 B[i] = 6.550136e6
#  A2[i] = 9.974494e6 B[i] = 9.974493e6
# T = Float32 U = Int32
#  A2[i] = 1.3689715e7 B[i] = 1.3689714e7
#  A2[i] = 7.192029e6 B[i] = 7.192028e6
#  A2[i] = 8.769405e6 B[i] = 8.769404e6
#  A2[i] = 6.903743e6 B[i] = 6.903742e6
#  A2[i] = 2.544131e6 B[i] = 2.54413e6
#  A2[i] = 1.0867795e7 B[i] = 1.0867794e7
# ...

@pabvald
Collaborator Author

pabvald commented Oct 6, 2024

TODOs:

  • Fix issues
  • Extend tests
  • Extend README

@pabvald pabvald marked this pull request as draft October 6, 2024 20:05
@milankl
Owner

milankl commented Oct 7, 2024

Could I remove the n::Integer argument from Base.Array ? In the end, the range of values can be calculated as typemax(eltype(Q)) - typemin(eltype(Q)) instead of 2^n - 1.

Yes, I agree with that. See comments.

Does it make sense to create aliases for LinQuantization and LinQuantArray, since the alias is equally long as the original call? e.g.

LinQuantInt8Array(A::AbstractArray{T,N},dim::Int,e::Option{Tuple}) where {T,N} = LinQuantArray(Int8,A,dim,e)

Yeah no, I'd prefer not to export 8 new function names; keeping `LinQuantArray(Int8, ...)` sounds better to me.

Also I am facing some issues. The back and forth conversion with signed integers is not equal for some values

That's expected given the lossy compression. The information loss happens in the round function; it should be round-to-nearest, ties-to-even, but I'm not sure we actually test that. Only for larger integer types does that error become smaller, but it also depends on the range of values. We could also formulate an idempotency test, because that loss should only happen on the first round trip.
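The idempotency test suggested here could be sketched like this (assuming `LinQuantization` round trips as in the snippet earlier in the thread):

```julia
using Test

A  = rand(Float32, 100)
A2 = Array{Float32}(LinQuantization(Int16, A))   # first round trip: lossy
A3 = Array{Float32}(LinQuantization(Int16, A2))  # second round trip: should be lossless
@test A2 == A3   # A ≈ A2 only up to Δ/2, but A2 == A3 exactly if quantization is idempotent
```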

with a difference of 1.0 or 2.0.

Linear quantization introduces an absolute error that is bounded by $\Delta/2$ with $\Delta$ being the spacing between quants that's also computed for the (de-)quantization.
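For concreteness, a back-of-the-envelope check of that bound:

```julia
# values in [0, 1] quantized to Int8: 256 quants, so the spacing is
Δ = (1.0 - 0.0) / (Float64(typemax(Int8)) - Float64(typemin(Int8)))  # = 1/255 ≈ 0.0039
# and any single value can be off by at most Δ/2 ≈ 0.002 after a round trip
```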

For Float16 and Int16 some values of A2 end up being -∞ or . I don't exactly know what the problem is.

Okay yeah, that shouldn't happen: if the inputs are finite, the quantization should create finite values too. But if you start in integer quantization space, you can create numbers that are perfectly representable as quants but aren't representable in Float16.
E.g. you can represent -32768:32767 with Int16 (the typemin-typemax range), but for a spacing of $\Delta = 2$ the largest number you can represent becomes 65534, which is just larger than floatmax(Float16) = 65504.

julia> Float16(typemax(Int16)*2)
Inf16

So finite floats should be quantized into finite (signed) integers, but vice versa can trigger overflow for floats!
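One possible guard against that float overflow (a sketch; `safe_dequant` is not part of the package) is to saturate at floatmax before converting:

```julia
# saturate instead of overflowing to ±Inf when converting back to a narrow float
safe_dequant(x, ::Type{T}) where {T<:AbstractFloat} = T(clamp(x, -floatmax(T), floatmax(T)))

safe_dequant(Float64(typemax(Int16)) * 2, Float16)  # 65534 clamps to floatmax(Float16) = 65504
```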

@milankl milankl added the enhancement New feature or request label Oct 7, 2024
@milankl milankl linked an issue Oct 7, 2024 that may be closed by this pull request
@pabvald
Collaborator Author

pabvald commented Oct 7, 2024

Okay yeah that shouldn't happen, if inputs are finite, the quantization should create finite values too. But if you start in integer-quantization-space you can create numbers that are quantized perfectly representable but aren't in Float16.

I have implemented your feedback but I still have to check the issues with the overflowing of Float16.

Edit: Everything seems to be working now 😃

@milankl
Owner

milankl commented Oct 9, 2024

Fails because the tuples aren't equal: the elements print the same but are of different float types, so they are `==`-unequal (not merely not `===`-identical) due to rounding.

   Expression: Base.extrema(A2) == ext
   Evaluated: (0.22f0, 0.75f0) == (0.22, 0.75)
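The failure boils down to exact `==` across float types; a short sketch of the issue and one possible tolerance-based fix:

```julia
0.22f0 == 0.22   # false: Float32(0.22) and Float64(0.22) round to different values

all(map(isapprox, (0.22f0, 0.75f0), (0.22, 0.75)))  # true: elementwise approximate comparison passes
```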

@pabvald pabvald marked this pull request as ready for review October 23, 2024 07:57
@pabvald
Collaborator Author

pabvald commented Oct 23, 2024

@milankl I have updated the README, although the benchmarking is missing. Do you happen to still have the code for benchmarking?

@milankl
Owner

milankl commented Oct 24, 2024

I don't have the benchmarking anymore, I believe I did something like

julia> A = rand(10_000_000);   # 80MB vector
julia> sizeof(A)/1000^2
80.0

julia> using BenchmarkTools

julia> @btime LinQuant16Array($A);
  37.425 ms (7 allocations: 20.27 MiB)

julia> 80/40e-3    # Float64 -> UInt16 at 2000MB/s
2000.0

and coming from single precision

julia> A = rand(Float32, 10_000_000);

julia> @btime LinQuant16Array($A);
  42.537 ms (7 allocations: 20.27 MiB)

julia> 40/40e-3    # Float32 -> UInt16 at 1000MB/s
1000.0

@pabvald
Collaborator Author

pabvald commented Oct 24, 2024

It has complete backwards compatibility. I would like to extend the README with a use case on using signed-integer quantization to quantize document embeddings for a RAG application, but I didn't have the time. When dealing with document/word embeddings it makes sense to keep negative values, and therefore to use signed-integer quantization.
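A minimal sketch of that use case (the shapes and the `dims` keyword semantics are assumptions on my part):

```julia
# hypothetical: Int8-quantize a matrix of document embeddings
E = randn(Float32, 384, 1000)          # 1000 embeddings of dimension 384, centred on 0
Q = LinQuantArray(Int8, E; dims=2)     # signed quants keep the negative components
E2 = Array{Float32}(Q)                 # dequantize before cosine-similarity search
```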

@pabvald
Collaborator Author

pabvald commented Nov 4, 2024

@milankl When do you think you will be able to review it?

@milankl
Owner

milankl commented Nov 6, 2024

This looks fantastic, thanks so much for all your work on this pull request. I'm on it, I'll do a review today/tomorrow. Found a few typos otherwise it looks like it's ready to go, but I'll review it more thoroughly!

@pabvald
Collaborator Author

pabvald commented Nov 15, 2024

@milankl I have refactored the code to use a keyword argument dims instead of dim. Check it out and let me know if there's something that needs fixing. Otherwise, looking forward to merging this!

P.S. I have modified the version to 1.0.0 because it now includes breaking changes

@pabvald
Collaborator Author

pabvald commented Nov 26, 2024

@milankl Could you please run the test pipeline and merge it if everything is green 🙏🏻 ?

@milankl
Owner

milankl commented Nov 26, 2024

Running now 🥳

@pabvald
Collaborator Author

pabvald commented Dec 2, 2024

@milankl There was a typo using the old notation without the keyword argument dims. Should work now.

@milankl
Owner

milankl commented Dec 3, 2024

Shoot yeah, I forgot the renaming from dim to dims. I did put in the dim functionality but didn't use it much, so I intuitively didn't consider it to be public API. But yes, in that case can we go to v0.3 first? And then if you argue for going straight to v1, I'm happy to consider that too.

@pabvald pabvald force-pushed the linear-quantization-for-signed-integers branch from 5a05c88 to 4bf816a December 3, 2024 12:37
@pabvald
Collaborator Author

pabvald commented Dec 3, 2024

Version 0.3.0 is ready

@pabvald
Collaborator Author

pabvald commented Dec 4, 2024

@milankl The tests for logarithmic quantization were using signed integers, I guess I fixed it in a later commit. It should pass now.

I have also added benchmarking for Base.extrema.

@milankl
Owner

milankl commented Dec 4, 2024

Awesome, so we merge this; dim is dim, not dims yet. I would then actually just merge #15 too, and tag both as v0.3, denoting nothing breaking plus new features. Deciding on v1.0 we can then still do?

@milankl milankl merged commit de6436e into milankl:main Dec 4, 2024
1 check passed
@pabvald pabvald deleted the linear-quantization-for-signed-integers branch December 5, 2024 07:25
Successfully merging this pull request may close these issues: Extending Linear Quantization for Signed Integers.