By taking STA $xxxx as example. It takes 4 cycles, right? Two cycles to fetch the address (low/high byte) and 2 more cycles to complete the instruction.
I'm asking this because I'm not "eating" CPU cycles on data read/write, so I'd like to fix this problem, that (supposely) makes the blargg's APU tests to fail here.
I'm asking this because I'm not "eating" CPU cycles on data read/write, so I'd like to fix this problem, that (supposely) makes the blargg's APU tests to fail here.