Plug & play framework for evaluating chain-of-thought reasoning
Currently in development
Access by request