logic issue with hierarchical context compression?

https://github.com/NVlabs/ToolOrchestra/blob/f9a2a22d6458f12930e4887f8a1b805b2bf556cb/training/lead_agent/llm_agent/generation_quick3.py#L1124

My understanding of the linked piece of code is:
- we only consider adding documents to the context if the pre-truncation context (code + attempt) is under 24k tokens, which makes sense
- but if it is not, we use the (code + attempt) sequence truncated to 16k. Shouldn't we truncate code + attempt to 24k instead, to take advantage of the larger context? Or alternatively keep 16k of code + attempt and add 8k of documents at the beginning?

With the current logic, it looks like long trajectories do not get all of the context they could have?
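
To make the question concrete, here is a minimal sketch of how I read the logic, next to the two alternatives. This is not the repository code: the function names, the token-list representation, keeping the tail when truncating, and prepending documents up to the remaining budget are all assumptions on my side. Only the 24k and 16k figures come from the description above.

```python
from typing import List

FULL_BUDGET = 24_000   # threshold from the description above
SHORT_BUDGET = 16_000  # truncation length from the description above


def keep_tail(tokens: List[int], limit: int) -> List[int]:
    """Keep the last `limit` tokens (truncation direction is an assumption)."""
    return tokens[-limit:] if limit > 0 else []


def current_logic(docs: List[int], code_attempt: List[int]) -> List[int]:
    """My reading of the current behaviour (hypothetical reconstruction)."""
    if len(code_attempt) < FULL_BUDGET:
        # Short trajectory: documents fill whatever room is left (assumed).
        room = FULL_BUDGET - len(code_attempt)
        return docs[:room] + code_attempt
    # Long trajectory: only 16k of code + attempt, no documents.
    return keep_tail(code_attempt, SHORT_BUDGET)


def alternative_a(docs: List[int], code_attempt: List[int]) -> List[int]:
    """Alternative 1: truncate code + attempt to the full 24k instead."""
    if len(code_attempt) < FULL_BUDGET:
        return current_logic(docs, code_attempt)
    return keep_tail(code_attempt, FULL_BUDGET)


def alternative_b(docs: List[int], code_attempt: List[int]) -> List[int]:
    """Alternative 2: 16k of code + attempt, plus 8k of documents in front."""
    if len(code_attempt) < FULL_BUDGET:
        return current_logic(docs, code_attempt)
    doc_room = FULL_BUDGET - SHORT_BUDGET
    return docs[:doc_room] + keep_tail(code_attempt, SHORT_BUDGET)


# Dummy token ids: a 30k-token trajectory and 10k tokens of documents.
long_traj = list(range(30_000))
documents = list(range(10_000))
print(len(current_logic(documents, long_traj)))
print(len(alternative_a(documents, long_traj)))
print(len(alternative_b(documents, long_traj)))
```

Under these assumptions, a long trajectory ends up with 8k fewer tokens of context in the current branch than in either alternative.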

If that's a cautious design decision, I'd love to learn more about the motivations! Thanks


logic issue with hierarchical context compression? #13
