Initially to be implemented for file utilities, llm calls and document parsing, as these are the most time intensive operations and will require iterations to be improved.
To test that the changes truly are improving what we're doing, it's a good idea to benchmark these with simple benchmarking scripts against a set of standard inputs, comparing with older versions.