`scipy.stats` throws error when there's no reference summary (factCC and qags dataset)

This `scipy.stats` call:

https://github.com/SigmaWe/EvalBase/blob/cef8dbfa1b414faf36acff82aa8c75022d39c506/eval_utils.py#L70-L84

will throw error:

```py
File ~/git/evalbase/eval_utils.py:85, in batched_corr(corr_df, human_scores, batch_result_df, corr_metrics, batchID)
     83             else:
     84                 pass
---> 85             cc = eval(f"scipy.stats.{corr_metric}")(human_score, metric_score)[0]
     86             corr_df.loc[
     87                 (corr_metric, aspect_name, approach, model, score_name),  # row
     88                 batchID
     89             ] = cc
     90 return corr_df

File ~/.local/lib/python3.10/site-packages/scipy/stats/_stats_py.py:4411, in pearsonr(x, y, alternative)
   4408     raise ValueError('x and y must have the same length.')
   4410 if n < 2:
-> 4411     raise ValueError('x and y must have length at least 2.')
   4413 x = np.asarray(x)
   4414 y = np.asarray(y)

ValueError: x and y must have length at least 2.
```

when

```
human_score.shape = (1,)
metric_score.shape = (1,)
```

This issue appears in factCC and qags (which have no reference summary), but not in frank (which has reference summary). Not sure if it is caused by the lack of reference summary.

	def batched_corr(corr_df, human_scores, batch_result_df, corr_metrics, batchID):
	"""Compute the correlations between human scores and automated metric scores on batch of samples, each of which is a pair of (doc, sys summ) or (ref summ, sys summ)

	Iteratively add rows to corr_df.
	"""
	for corr_metric in corr_metrics:
	for aspect_name, human_score in human_scores.items():
	for (approach, model, score_name) in batch_result_df.columns:
	metric_score = batch_result_df[(approach, model, score_name)]
	cc = eval(f"scipy.stats.{corr_metric}")(human_score, metric_score)[0]
	corr_df.loc[
	(corr_metric, aspect_name, approach, model, score_name), # row
	batchID
	] = cc
	return corr_df

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

`scipy.stats` throws error when there's no reference summary (factCC and qags dataset) #5

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

scipy.stats throws error when there's no reference summary (factCC and qags dataset) #5

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions

`scipy.stats` throws error when there's no reference summary (factCC and qags dataset) #5