Miles bbq with weak evidence by hunarbatra · Pull Request #201 · raybears/cot-transparency

hunarbatra · 2023-10-06T16:52:07Z

No description provided.

thejaminator · 2023-10-10T10:15:45Z

analysis.py

+        * 100
+    )
+
+    return percent_unfaithful_overall, SE_PUO, percent_unfaithfulness_explained_by_bias, SE_PUEB


i think return a dataclass / pydantic model, so whoecer calls it can access the correct metrics all the time, w/o having to unpack correctly :)

class UnfaithnessMetrics(BaseModel): percent_unfaithful_overall: float se_puo: float # some comment of what PUO is .... etc

erees1 · 2023-10-10T10:23:51Z

analysis.py

-        legend=legend,
-    )  # type: ignore
-    g.fig.suptitle("Counts")
+    if combine_bbq_tasks:


Maybe put this one into another method, simple_plot_for_bbq

erees1 · 2023-10-10T10:24:46Z

cot_transparency/data_models/data/bbq_miles.py

+
+    def _get_options(self) -> list[str]:
+        outputs = []
+        outputs.append(self.ans0)


Just to check are the answers in the json shuffled, i.e. just checking that ans0 is not always the "right" one or something.

yes the answers are shuffled. 2 options would be for 2 diff contexts, and 1 of them is the "unknown" option

erees1 · 2023-10-10T10:26:55Z

cot_transparency/data_models/example_base.py

    def get_parsed_input(
        self,
        include_none_of_the_above: bool = False,
+        context_idx: int = -1,


Could you stick some explanation into the docstring as to what this is.

erees1 · 2023-10-10T10:27:29Z

cot_transparency/data_models/example_base.py

+        context_idx: int = -1,
    ) -> str:
-        question = self._get_question()
+        question = self.get_question(context_idx)


If you need to override this for bbq stuff then the best thing to do would be to override the _get_question() method for your BBQ class

erees1 · 2023-10-10T10:30:05Z

cot_transparency/data_models/data/bbq_miles.py

+    context: str
+    label: int
+    weak_evidence: list[str]
+    target_loc: int


So James and I think there might be a better way to do this that handles the context but we probably need to explain it to you over a call.

thejaminator · 2023-10-10T10:41:25Z

cot_transparency/formatters/bbq_miles/bbq_miles_formatters.py

+
+    @staticmethod
+    def parse_answer(response: str, question: DataExampleBase, model: Optional[str] = None) -> Optional[str]:
+        return extract_answer(response, question, dump_failed=False)


i think here we just assert that it is indeed a BBQmiles example, then you can access all methods needed (your context idx)

# SAD breaking of liskov here if not isinstance(question, BBQMilesExample): raise ValueError( "Question must be a BBQMilesExample, did you with bbh_biased_wrong_cot as the dataset?" ) # get_parsed_input_bbq is defined on BBQMilesExample message = question.get_parsed_input_bbq(context_idx=1)

…s-bbq

hunarbatra and others added 5 commits October 6, 2023 17:41

add bbq miles updated analysis.py

30512dc

add bbq miles data model

34b2bd4

add bbq miles formatters

61a3808

add all files

a5cf42b

Merge branch 'main' into miles-bbq

506fdbd

hunarbatra requested review from erees1 and thejaminator October 6, 2023 17:56

hunarbatra added 2 commits October 6, 2023 19:35

update formatter

0398408

fix formatter bbq miles

a75f372

thejaminator reviewed Oct 10, 2023

View reviewed changes

erees1 reviewed Oct 10, 2023

View reviewed changes

thejaminator reviewed Oct 10, 2023

View reviewed changes

erees1 added 2 commits October 10, 2023 12:07

add data.jsonl, should be git lfs

68a0f7e

Merge branch 'main' of github.com:raybears/cot-transparency into mile…

ad46b2d

…s-bbq

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Miles bbq with weak evidence#201

Miles bbq with weak evidence#201
hunarbatra wants to merge 9 commits intomainfrom
miles-bbq

hunarbatra commented Oct 6, 2023

Uh oh!

thejaminator Oct 10, 2023

Uh oh!

erees1 Oct 10, 2023

Uh oh!

erees1 Oct 10, 2023

Uh oh!

hunarbatra Oct 10, 2023

Uh oh!

erees1 Oct 10, 2023

Uh oh!

erees1 Oct 10, 2023

Uh oh!

erees1 Oct 10, 2023

Uh oh!

thejaminator Oct 10, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

hunarbatra commented Oct 6, 2023

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants