issues with regression suite for nlp_text branch #20

@zurabksmlp

The SMLP regression suite currently includes tests 229 to 262, which cover the NLP and LLM capabilities of SMLP. The LLM-related modes llm, finetune and rag can also reuse models trained and saved by SMLP to perform LLM tasks such as generation, question answering and summarization. Most of these saved models contain large files that exceed git's allowed file size limit (or at least some default limit). The saved models, which are directories that may contain multiple sub-directories and files, were therefore not added to git and are not uploaded to the main repository; they are listed in the local .gitignore file for the nlp_text branch.

The model directories included in .gitignore are:
smlp/repo/smlp/regr_smlp/rag_models
smlp/repo/smlp/regr_smlp/finetune_models
smlp/repo/smlp/regr_smlp/llm_models

The tests that load the saved llm, finetune and rag models are:
RAG: 240,242,244,246,248
Finetune: 250,252,254,256
LLM: 260,262.

In addition, test 245, which trains a RAG model, has relatively high memory requirements. When run from the command line on a laptop, the test usually succeeds; when it is run through smlp_regr.py as part of the regression, the process gets killed.
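
To help confirm whether memory is the cause, one option is to run the test command as a child process and record its exit status and peak resident memory. This is a rough sketch under several assumptions: it relies on the Unix-only `resource` module, the function names are hypothetical, and the actual command line for test 245 is not given here, so `cmd` must be supplied by the caller. The issue does not state which signal kills the process; a return code of -9 (SIGKILL) would be consistent with the kernel's out-of-memory killer but does not prove it.

```python
import resource
import signal
import subprocess


def describe_exit(returncode):
    """Turn a subprocess return code into a short human-readable description."""
    if returncode < 0:
        # On POSIX, a negative return code means the child died from that signal.
        return f"killed by signal {signal.Signals(-returncode).name}"
    return f"exited with code {returncode}"


def run_and_report_peak_rss(cmd):
    """Run cmd and return (returncode, peak_rss).

    peak_rss is ru_maxrss for waited-for children of this process:
    kilobytes on Linux, bytes on macOS. It is the maximum over all children
    reaped so far, so call this from a fresh process for a clean reading.
    """
    completed = subprocess.run(cmd)
    peak_rss = resource.getrusage(resource.RUSAGE_CHILDREN).ru_maxrss
    return completed.returncode, peak_rss
```

Comparing the reported peak for a standalone run with the one observed under smlp_regr.py may show whether the regression run has less memory available, for example because other tests are running at the same time.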
