Loop start takes 4 bytes (E9 XXXX YY, XXXX-address, YY-number of loops) and the end takes 1 byte (00-end of data). Label loop is just only a copy of the loop start. Not sure about the superloops.
Also I think that the length and q are redefined in every loop.
I'm sure that AMK actually optimizes the ^s, by for example changing ^16^16 into ^8 or c4^8 into c=72 (also c1 doesn't exist, it's actualy c2^2).
--------------------Check out Team Hack Fix/Translations' thread full of fixed and translated SMW hacks!
Tag (div) was not opened.