How broad or narrow should a test hypothesis be when comparing an old design with a redesign?

Say I am given an open brief like ‘improve the sign-up journey for product X’ or ‘improve the search experience for product Y’.
I want to run some user testing on the current experience to establish baseline metrics, and when…