Support generation tasks and various improvements #3
ZhaofengWu wants to merge 3 commits into msclar:main
Conversation
  # SuperNaturalInstructions Tasks without a defined format
- if any(t in args.task_filename for t in SUPERNATURAL_INSTRUCTIONS_TASKS_WITH_NO_FORMAT):
+ if any(t in args.task_filename for t in SUPERNATURAL_INSTRUCTIONS_TASKS_WITH_NO_FORMAT + OPEN_GENERATION_SUPERNATURAL_INSTRUCTIONS_TASKS):
Seems like _setup_non_formatted_dataset_with_one_field_only works well for generation tasks out of the box.
- inputs = tokenizer(prompt_list, padding=True, return_tensors='pt', return_token_type_ids=False).to('cuda')
+ if tokenizer.chat_template is None:
+     inputs = tokenizer(prompt_list, padding=True, return_tensors='pt', return_token_type_ids=False).to('cuda')
+ else:
- generated_answer_list = [s.lower() for s in tokenizer.batch_decode(outputs['sequences'], skip_special_tokens=True)]
+ sequences = outputs['sequences']
+ if tokenizer.chat_template is not None:
The truncation of chat models is a bit different from regular models and needs to be done separately. Also got rid of lowercasing for generation tasks (it is still done below for classification tasks).
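A minimal sketch of the branching described above. The helper and the toy template string are hypothetical, not the PR's actual code: chat models wrap each prompt with their chat template before tokenization, so the chat path has to be handled separately from the plain-text path (the real code would call the tokenizer's apply_chat_template).

```python
def build_model_inputs(prompt_list, chat_template=None):
    """Return the strings that would be passed to the tokenizer.

    Illustrative stand-in: with no chat template, raw prompts are tokenized
    directly; with one, each prompt is wrapped as a single-turn user message.
    """
    if chat_template is None:
        # Regular models: tokenize the raw prompts as-is.
        return list(prompt_list)
    # Chat models: wrap the prompt in the (toy) template first.
    return [chat_template.format(user=p) for p in prompt_list]

plain = build_model_inputs(["2+2="])
chat = build_model_inputs(["2+2="], chat_template="<|user|>{user}<|assistant|>")
```

Because the chat template adds tokens around the prompt, any later truncation of the generated sequence back to the answer span must account for that wrapper, which is why the two paths diverge.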
  full_prompt_string_list = []
  for input_element, idx in zip(inputs, selected_dataset_ids):
-     full_prompt_string_list.append(input_element if n_shot == 0 else demonstration_string + "\n\n" + input_element)
+     full_prompt_string_list.append(demonstration_string + "\n\n" + input_element)
I think even for 0-shot, the instruction (demonstration_definition) should still be there.
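A sketch of the prompt construction being changed (the helper name is illustrative): before the fix, n_shot == 0 dropped demonstration_string entirely; after it, the instruction is always prepended, so the model sees the task definition even with no demonstrations.

```python
def build_prompts(inputs, demonstration_string, n_shot):
    # n_shot no longer gates the instruction: even at n_shot == 0,
    # demonstration_string (which carries the task definition) is kept.
    return [demonstration_string + "\n\n" + x for x in inputs]

prompts = build_prompts(
    ["Translate: hola"], "Definition: translate to English.", n_shot=0
)
```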
  # 2. update the output values if needed, i.e. if the multiple choice classes now have different names
- assert all(len(dataset[idx]['output']) == 1 for idx in selected_dataset_ids)
+ assert all(len(dataset[idx]['output']) >= 1 for idx in selected_dataset_ids)
Be a bit lenient here: this doesn't hold for some generation tasks. When that happens, only take the first output.
Wouldn't it be better to raise some kind of warning in the > 1 case? Otherwise the code will silently take the first output and you might never know it happened.
Yeah makes sense. Added a warning below.
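The lenient check plus the warning agreed on above could look like the following sketch (function name and message are illustrative, not the PR's exact code): generation tasks may list several reference outputs, so keep the first but warn rather than fail silently.

```python
import warnings

def first_output(example):
    """Return the first reference output, warning if more exist."""
    outputs = example['output']
    # Lenient: require at least one output instead of exactly one.
    assert len(outputs) >= 1
    if len(outputs) > 1:
        # Make the truncation visible instead of silent.
        warnings.warn(
            f"Example has {len(outputs)} outputs; using only the first."
        )
    return outputs[0]
```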
bertscore added
Amazing, thank you for extending FormatSpread for generation tasks!!!
I will add bertscore later. But you don't have to merge this since it doesn't block me; only if you think it's helpful.