Context
According to this paper, ChatGPT (and likely other LLMs) suffers from a recency bias: whichever class appears last in the prompt has a higher probability of being selected.
Issue
Currently, scikit-llm constructs prompts based on the order of the training data.
Since we are advised to restrict the size of the training data, I would usually do something like this:
df = df.groupby(label_col).apply(lambda x: x.sample(n_samples))
df = df.reset_index(drop=True)
This returns a DataFrame sorted by label_col. Even if sort=False is passed to groupby, the instances are still clustered by label.
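A minimal sketch of the problem (column names and data are made up for illustration): the stratified subsampling above yields rows grouped label by label, so in the resulting prompt one class always comes last.

```python
import pandas as pd

# Toy dataset with interleaved labels (illustrative only).
df = pd.DataFrame({
    "text": [f"sample {i}" for i in range(6)],
    "label": ["b", "a", "b", "a", "b", "a"],
})

n_samples = 2
sub = (
    df.groupby("label", group_keys=False)
      .apply(lambda x: x.sample(n_samples, random_state=0))
      .reset_index(drop=True)
)

# All "a" rows now precede all "b" rows: the subsample is clustered by label.
print(sub["label"].tolist())  # ['a', 'a', 'b', 'b']
```

So whichever label the groupby emits last would occupy the tail of the prompt, which is exactly where the recency bias hits.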
Question/Solution
Should a method be implemented that randomizes the order of samples in the prompt / training data, or should users take care of that themselves?
The most straightforward option would be to shuffle the rows as part of sampling, which leaves it up to chance to produce a reasonably balanced ordering.
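A sketch of what such a shuffle could look like, assuming the df/label setup from above (names are illustrative, not scikit-llm's actual API):

```python
import pandas as pd

# Label-clustered subsample, as produced by the groupby above.
df = pd.DataFrame({
    "text": ["a1", "a2", "b1", "b2"],
    "label": ["a", "a", "b", "b"],
})

# Shuffle all rows; sample(frac=1) draws every row in random order,
# breaking up the per-label clusters before the prompt is built.
shuffled = df.sample(frac=1, random_state=42).reset_index(drop=True)
```

This keeps the class balance intact and only randomizes the ordering, but it does not guarantee that the labels are evenly interleaved, hence "up to chance".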