Releases: GreenBitAI/gbx-lm

Release v0.4.1

05 May 19:45

Added

  • support for Qwen3 models.
  • better dependency management and setup configuration.
  • model evaluation method.
  • support for MoE (mixture-of-experts) model architectures.

Release v0.4.0

17 Mar 16:09
cc54f57

Updated

  • improved fastapi-server.
  • improved bge embedding model serving method.

Release v0.3.8

19 Feb 17:04

Updated

  • fixed issues in the fastapi-server and the langchain pipeline.

Release v0.3.7

18 Feb 22:07

Updated

  • synchronized with mlx==0.23.0 and mlx-lm==0.21.4
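
A quick way to see whether a local environment matches the two pins named in this note is sketched below. This is only an illustration and not part of gbx-lm: the helper name `find_mismatches` and the `PINS` dictionary are hypothetical, and the check relies solely on the standard-library `importlib.metadata` module.

```python
from importlib import metadata

# Pins copied from the v0.3.7 release note; adjust for other releases.
PINS = {"mlx": "0.23.0", "mlx-lm": "0.21.4"}


def find_mismatches(pins, get_version=metadata.version):
    """Return {package: installed version or None} for each unsatisfied pin.

    `get_version` is injectable so the check can be exercised without
    the packages being installed (hypothetical helper, not a gbx-lm API).
    """
    mismatches = {}
    for name, wanted in pins.items():
        try:
            installed = get_version(name)
        except metadata.PackageNotFoundError:
            installed = None  # package is not installed at all
        if installed != wanted:
            mismatches[name] = installed
    return mismatches


if __name__ == "__main__":
    for name, installed in find_mismatches(PINS).items():
        found = installed or "not installed"
        print(f"{name}: expected {PINS[name]}, found {found}")
```

An empty result means both packages are installed at exactly the pinned versions.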

Release v0.3.6

15 Jan 22:22

Updated

  • created async_generate_step in the FastAPI server
  • added token usage information to the FastAPI server
  • extended libra router types

Release v0.3.5

25 Nov 09:16

Updated

  • improved the FastAPI server
  • added support for the libra confidence router

Release v0.3.4

15 Oct 08:44

Updated

  • improved the hidden states generation method
  • refactored the project structure

Release v0.3.3

10 Oct 20:15
9750cda

Added

  • langchain integration
  • local_rag and graph_rag example

Updated

  • generate method now supports hidden states output

Release v0.3.2

13 Sep 16:16

Added

  • model management, FastAPI-server
  • unit tests

Updated

  • synchronized with mlx-lm
  • simplified README

Release v0.3.1

14 Jun 14:19

Updated

  • updated mlx_fastchat_worker to support mlx >= 0.14.
  • updated conda config.