How do you evaluate this? Claude is horrible at performance analysis without data. Does it have a feedback loop here that actually moves the needle?