Is the MOS score obtained solely through subjective evaluation of the SR images? During inference on the test dataset, is the final MOS score also obtained from the SR images alone?