i think the new version is only counting the tokens generated in the response but not the ones used inside the thinking tags.