Re: every byte mattered ...
"meaning my subroutine was just E1 E9 instead of E1 E5 C9."
And working out clock cycles for code, then agonising over whether you could spare a few extra bytes to use only low-cycle-count instructions for speed, or use some of the cleverer opcodes that saved RAM but took many more clock cycles to execute.
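For anyone who never did this by hand, here is a rough sketch of that bookkeeping for the two sequences quoted above. It assumes the bytes are Z80 opcodes and uses the usual Z80 T-state counts; the post doesn't actually name the CPU, so treat the timings as illustrative. The table and the `cost` helper are made up for this example.

```python
# Opcode -> (mnemonic, T-states). Z80 decoding and timings are an assumption;
# the quoted post only gives the raw bytes.
Z80_TSTATES = {
    0xE1: ("POP HL", 10),
    0xE5: ("PUSH HL", 11),
    0xE9: ("JP (HL)", 4),
    0xC9: ("RET", 10),
}

def cost(code):
    """Return (size in bytes, total T-states) for a run of one-byte opcodes."""
    return len(code), sum(Z80_TSTATES[op][1] for op in code)

for code in ([0xE1, 0xE9], [0xE1, 0xE5, 0xC9]):
    size, tstates = cost(code)
    listing = "; ".join(Z80_TSTATES[op][0] for op in code)
    print(f"{listing}: {size} bytes, {tstates} T-states")
```

Under those assumptions the two-byte version comes out at 14 T-states against 31 for the three-byte one, so that was the easy case: smaller and quicker. The agonising started when the two numbers pulled in opposite directions.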