Fix HDFS training failure with small datasets (Issue #51)#54

Open
LazyCr0w wants to merge 1 commit into HelenGuohx:main from LazyCr0w:main


@LazyCr0w

This commit fixes Issue #51, where HDFS training fails when the dataset contains sequences shorter than min_len.

Changes:

  1. Modified fixed_window to only filter sequences when min_len > 0
  2. Added handling for empty sequences in generate_train_valid
  3. Fixed division-by-zero errors in calculate_center and the iteration methods
  4. Set min_len=0 in HDFS/logbert.py to allow any sequence length
  5. Added proper error handling throughout the code
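
For reviewers, the guards described in items 1 and 3 can be sketched roughly as follows. This is a minimal illustration, not the code in this PR: the helper names `filter_by_min_len` and `safe_mean` are hypothetical, and the actual changes live inside fixed_window, calculate_center and the iteration methods.

```python
from typing import List, Sequence


def filter_by_min_len(sequences: Sequence[Sequence], min_len: int = 0) -> List[Sequence]:
    """Hypothetical helper: drop short sequences only when min_len > 0."""
    if min_len <= 0:
        # min_len=0 disables filtering, so small datasets keep every sequence.
        return list(sequences)
    return [seq for seq in sequences if len(seq) >= min_len]


def safe_mean(total: float, count: int, default: float = 0.0) -> float:
    """Hypothetical helper: return a default instead of dividing by zero."""
    if count == 0:
        return default
    return total / count


if __name__ == "__main__":
    sessions = [["E5", "E22"], ["E5"], []]
    print(filter_by_min_len(sessions, min_len=0))  # all three sequences kept
    print(filter_by_min_len(sessions, min_len=2))  # only the two-event sequence kept
    print(safe_mean(0.0, 0))                       # 0.0 rather than ZeroDivisionError
```

With min_len=0 nothing is filtered out, which is why empty sequences and zero-length batches then need the explicit handling listed in items 2 and 3.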
