upd7725 overflow (attn: byuu)

upd7725 overflow (attn: byuu)
by AWJ on 2017-08-31 (#203580)

I noticed via Twitter that byuu acquired additional upd7725 documentation and changed the implementation of the overflow flags in higan. The upd7725 overflow flags are something I had puzzled over and discussed with Lord Nightmare once (quite a long time ago, probably when I was initially backporting the DSP LLE into bsnes-classic) because I'd noticed that he'd changed the way MAME calculated the flags from how bsnes did it. Neither the original bsnes implementation nor LN's modified MAME implementation looked quite right to me, but although I understood how the flags were meant to be used, I couldn't figure out how to calculate them so that they could be used in that way.

Thanks to the new documentation, specifically the explanation of how the S1 flag is calculated and the "1, 0, 1" (overflow, no overflow, overflow) case, I think I've figured out how everything works, and it's a good bit simpler than byuu's new implementation. In particular, I believe that there is no need for a flag "history buffer" and that the chip contains no such thing.

First of all, we have to understand what the OV0 and OV1 flags mean arithmetically. Basically, whereas OV0 indicates whether the most recent operation produced a signed overflow, OV1 indicates whether the value in the accumulator is in bounds (between -32768 and +32767) or whether it is overflowed. Let's think about how to calculate that, and build up a truth table.

First, the easy cases. If the accumulator previously contained an in-bounds value, and no overflow occurred in the last operation, then the accumulator must still contain an in-bounds value. Likewise, if the accumulator previously contained an in-bounds value and an overflow occurred, the value in the accumulator is now out of bounds.

Code:

OV1in  OV0 | OV1out
-----------+--------
  0     0  |   0
  0     1  |   1

Next, if the accumulator was previously out of bounds, and no overflow occurred in the last operation, then the accumulator is still out of bounds. This is perhaps not quite as easy to intuit as the first two cases, but think about it: the only way the accumulator can go from out of bounds back to in bounds is if a second overflow occurs, in the opposite direction of the original overflow.

Code:

OV1in  OV0 | OV1out
-----------+--------
  1     0  |   1

Finally, the hard case: what happens if the accumulator was out of bounds and another overflow occurs? Let's look at a couple of examples:

Code:

32767 + 1 + (-2) (hex: $7FFF + $0001 + $FFFE)

$7FFF + $0001 = $8000 + overflow
$8000 + $FFFE = $7FFE + overflow

Adding $7FFF to $0001 gives a result of $8000 with an overflow (positive + positive = negative). Adding $FFFE to the result gives a result of $7FFE and a second overflow (negative + negative = positive). However, despite the two overflows, the final result is correct and in bounds: 32767 + 1 + (-2) equals 32766. The two overflows have cancelled each other out. Now let's look at another example:

Code:

32767 + 32767 + 32767 + 32767 (hex: $7FFF + $7FFF + $7FFF + $7FFF)

$7FFF + $7FFF = $FFFE + overflow
$FFFE + $7FFF = $7FFD
$7FFD + $7FFF = $FFFC + overflow

On the first addition, an overflow occurs (positive + positive = negative). No overflow occurs on the second addition, but on the third addition another positive + positive = negative overflow occurs. This time, the final result is not in bounds: 32767 + 32767 + 32767 + 32767 isn't -4 or even 65532, it's 131068 (hex $1FFFC).

The difference between these two cases is that in the first case the two overflows were in opposite directions, and in the second case both overflows were in the same direction. The purpose of the S1 flag is to distinguish between these two cases. According to the datasheet, the S1 flag contains the sign of the result of the last operation that took place with the incoming OV1 flag clear; in other words, the last operation that took place while the incoming accumulator was in bounds. If the S1 flag is the same as the S0 flag produced by the current overflowing operation, then two overflows in the same direction have occurred and the accumulator is still out of bounds. If the S1 flag and the S0 flag are different, then two overflows in opposite directions have occurred, meaning the accumulator went out of bounds and then back in bounds.

So here are the complete truth tables for S1 and OV1:

Code:

OV1in | S1
------+----
  0   | S0
  1   | unchanged

OV1in  OV0 | OV1out
-----------+--------
  0     0  |   0
  0     1  |   1
  1     0  |   1
  1     1  | (S0 == S1)

byuu was wondering whether the OV1 test should use the new or old value of S1, but you can see from these truth tables that it doesn't matter. The value of S1 only changes if the previous OV1 was clear, while OV1 only depends on S1 if the previous OV1 was set.

The datasheet implies that "overflow, no overflow, overflow" is some kind of special case that the chip explicitly checks for, but in fact it's just a consequence of the math. Two consecutive operations can't both overflow in the same direction; just look at the results from adding $7FFF (the largest possible positive number) to itself. If you work out the results of repeatedly adding $8000 (the smallest possible negative number) to itself, it's the same. You can only have two overflows in the same direction if there is at least one non-overflowing operation between them.

Note that in order to make use of this overflow mechanism, it is essential that the OV1 flag be cleared before you start doing your additions, or the S1 flag won't be updated when it should be. According to the datasheet, any ALU operation other than an addition or subtraction clears the OV1 flag, and if you look at a disassembly of the DSP1 program or Lord Nightmare's prose2k DSP program, you can see that they in fact do xor a,a or and a,a prior to any sequence of calculations that use the OV1 flag or the SGN pseudo-register.

One more thing: why is this overflow mechanism only good for three operations? Let's look at what happens if you do four additions in a row with the following values:

Code:

32767 + 32767 + 32767 + 32767 + -32768 (hex: $7FFF + $7FFF + $7FFF + $7FFF + $8000)

$7FFF + $7FFF = $FFFE  S0 = 1 S1 = 1 OV0 = 1 OV1 = 1
$FFFE + $7FFF = $7FFD  S0 = 0 S1 = 1 OV0 = 0 OV1 = 1
$7FFD + $7FFF = $FFFC  S0 = 1 S1 = 1 OV0 = 1 OV1 = 1
$FFFC + $8000 = $7FFC  S0 = 0 S1 = 1 OV0 = 1 OV1 = 0

We've already gone over the first three additions, so just look at what happens with the fourth. An overflow occurs (OV0 = 1) and S1 and S0 have opposite values, so the OV1 flag is cleared. Which means the accumulator is considered to be in bounds. But this is wrong--32767 + 32767 + 32767 + 32767 + -32768 is 98300, not 32764! If you do four additions in a row, it becomes possible for two overflows in one direction to occur followed by an overflow in the opposite direction, resulting in a false negative.

Re: upd7725 overflow (attn: byuu)
by AWJ on 2017-08-31 (#203581)

Here's a python program that implements the overflow flags according to the previous post and verifies that for a representative range of input values they produce arithmetically correct results for three additions (OV1 is set if and only if the final result is out of bounds) but that they don't produce correct results for four additions.

Code:

#!/usr/bin/python3

def int16(n):
    return (n & 0x7fff) - (n & 0x8000)

class ALU(object):
    def __init__(self):
      self.a = 0
      self.s0 = 0
      self.s1 = 0
      self.ov0 = 0
      self.ov1 = 0

    def add(self, right):
      left = self.a
      result = left + right

      self.s0 = result & 0x8000
      if not self.ov1:
          self.s1 = self.s0

      # ov0 (result overflow)
      self.ov0 = (left ^ result) & (right ^ result) & 0x8000

      # ov1 (accumulator overflow)
      if self.ov1 and self.ov0:
          self.ov1 = (self.s1 == self.s0)
      else:
          self.ov1 = self.ov1 or self.ov0

      self.a = int16(result)


def main():
    alu = ALU()

    testrange = (-0x8000, -0x4000, -1, 0, 1, 0x3fff, 0x4000, 0x7fff)
    for a in testrange:
        for b in testrange:
            for c in testrange:
                for d in testrange:
                    alu.a = a
                    alu.ov1 = 0
                    alu.add(b)
                    alu.add(c)
                    alu.add(d)
                    overflow = (alu.a != a+b+c+d)
                    if overflow != bool(alu.ov1):
                        print("Uh oh! %d %d %d %d" % (a, b, c, d))

    for a in testrange:
        for b in testrange:
            for c in testrange:
                for d in testrange:
                    for e in testrange:
                        alu.a = a
                        alu.ov1 = 0
                        alu.add(b)
                        alu.add(c)
                        alu.add(d)
                        alu.add(e)
                        overflow = (alu.a != a+b+c+d+e)
                        if overflow != bool(alu.ov1):
                            print("4 is too many! %d %d %d %d %d OV1=%d" % (a, b, c, d, e, bool(alu.ov1)))

if __name__ == "__main__":
    main()

Re: upd7725 overflow (attn: byuu)
by byuu on 2017-08-31 (#203590)

... wow, hat tip to you! This also solves a third question of mine, which was whether the three OV history values were cleared on other ALU operations (strongly leaning toward yes.) If they don't exist, then the question is void.

So we can essentially boil this down to:

Code:

s0 = result&0x8000;
ov0 = (the usual xor-and overflow magic);
if(!ov1) s1 = s0;
ov1 = ov0&ov1 ? s0==s1 : ov0|ov1;

However, I do want to say ... there's no absolute guarantee the original designers of this chip were so clever, short of studying die scans. It's quite possible they implemented this with two extra boolean latches. The implementation details were simply beyond the scope of the programmer's manuals. And though I'll definitely add this, it does lose a bit of clarity in the process. Not that my article's code example was all that clear to begin with, but still. If there were ever a time for source code comments, this would be it.

By the way, my DSP LLE had a nasty flaw with SGN:

Code:

case 7: idb = 0x8000 - flags.a.s1; break;

I'm not sure why I chose to hard-code this to OVA1. Should be:

Code:

case 7: idb = 0x8000 - (!asl ? flags.a.s1 : flags.b.s1); break;

=> mov a,sgn
=> mov b,sgn

Quote:

if you look at a disassembly of the DSP1 program or Lord Nightmare's prose2k DSP program, you can see that they in fact do xor a,a or and a,a prior to any sequence of calculations that use the OV1 flag or the SGN pseudo-register.

So I did hear that on the SNES coprocessors, only the DSP1 uses SGN once. Does it actually do anything significant with the result where proper emulation of S1 would make an observable difference? If you're not sure, no need to look into it. I've been operating under the assumption that proper S1/OV1 support was mostly busywork, but ... perfectionism and all, finally got around to it thanks to much help from Cydrak.

Quote:

Thanks to the new documentation

If you'd like, send me your e-mail or let me know if you'd rather a mega link, and I can get the new documents to you. They're very low-quality scans, but they're quite thorough and explain a lot of the operations the SNES lacks in more detail: serial transfers, interrupts, etc.

Probably not very useful unless you're also a big fan of the prose2k hardware.

Quote:

The upd7725 overflow flags are something I had puzzled over and discussed with Lord Nightmare once

Heh, he went after you too, huh? ^-^
I'm sure he will be absolutely thrilled at your findings here :)

Re: upd7725 overflow (attn: byuu)
by Lord Nightmare on 2017-08-31 (#203599)

All of the docs came from https://web.archive.org/web/20170313202 ... -dsps/nec/
I ended up hand-feeding that entire directory of the site to archive.org manually page by page, because the site is hosted on a toaster or something and will go down hard for a few days if you download more than a dozen MB of data or so.

LN

Re: upd7725 overflow (attn: byuu)
by AWJ on 2017-08-31 (#203602)

byuu wrote:

By the way, my DSP LLE had a nasty flaw with SGN:

Actually, that's apparently not a flaw in your emulation. The upd7725 docs indicate that SGN is only affected by operations on accumulator A and not accumulator B. The upd7720 datasheet says that SGN is affected by either S1 flag, but the 1984-07 memo by Ted Knowlton points out that this is an error in the datasheet.

Re: upd7725 overflow (attn: byuu)
by byuu on 2017-08-31 (#203607)

> https://web.archive.org/web/20170313202 ... -dsps/nec/

Oh, forgot you did that. Awesome! Much easier than e-mail.

> The upd7720 datasheet says that SGN is affected by either S1 flag

... which is the one I was reading for these new updates. God dammit >_>

Okay, I'm definitely adding a source code comment on this one. Good catch.

> this is an error in the datasheet.

... as well as in the processor. Quite the hardware design oversight. Ruins using the three chained add/sub operations and then saturating the result if you use the B accumulator. Have to use J(N)SB1 now.

Re: upd7725 overflow (attn: byuu)
by Lord Nightmare on 2017-08-31 (#203615)

byuu wrote:

> this is an error in the datasheet.

... as well as in the processor. Quite the hardware design oversight. Ruins using the three chained add/sub operations and then saturating the result if you use the B accumulator. Have to use J(N)SB1 now.

Based on Ted's note on that site, I don't believe it affects the upd7720 itself, it is just a datasheet error, which was corrected on the upd7725 datasheet.

LN

Re: upd7725 overflow (attn: byuu)
by jbo_85 on 2017-08-31 (#203616)

SGN is used twice by the DSP1, for op02 (Projection Parameter Setting) and for op06 (Object Projection Calculation). The result doesn't seem to be discarded.

Re: upd7725 overflow (attn: byuu)
by byuu on 2017-08-31 (#203618)

> Based on Ted's note on that site, I don't believe it affects the upd7720 itself, it is just a datasheet error, which was corrected on the upd7725 datasheet.

I meant the CPU should honor the ASL bit to select between SA1 and SB1. Seems like an oversight in the design.

> SGN is used twice by the DSP1, for op02 (Projection Parameter Setting) and for op06 (Object Projection Calculation). The result doesn't seem to be discarded.

Interesting. Would think that would cause some observable emulation bugs given how wrong our S1 emulation was before ... :|

Re: upd7725 overflow (attn: byuu)
by byuu on 2017-09-01 (#203674)

Quote:

Well, on that note ...

Code:

if(!ov1) s1 = s0;
ov1 = ov0&ov1 ? s0==s1 : ov0|ov1;

Here, we can see the s0==s1 test doesn't get hit if ov1==0.

But what if we reverse this?

Code:

ov1 = ov0&ov1 ? s0==s1 : ov0|ov1;
if(!ov1) s1 = s0;

Let's say at the start, ov0 was set (from this ALU operation), but ov1 was clear (from the previous ALU operation.) ov1 will be set to ov0|ov1, or 1. And now the if(!ov1)s1=s0; test will fail, whereas if we did the s1 test before the ov1 assignment, it would have transferred s0 into s1. ov1 will be set correctly either way, but the order of operations will affect the s1 output when s0!=s1.

It seems pretty clear (the manual basically says as much), and your truth table seems to confirm, we should do the if(!ov1) s1=s0; test first, but it's always good to clarify these things in documentation. Flag assignments are not usually dependent upon the results of other flag assignments in CPU emulators.

Re: upd7725 overflow (attn: byuu)
by tepples on 2017-09-01 (#203682)

byuu wrote:

Flag assignments are not usually dependent upon the results of other flag assignments in CPU emulators.

Except carry, where ROR A or its counterpart in other instruction sets moves carry to A (and thus to sign).

Re: upd7725 overflow (attn: byuu)
by AWJ on 2017-09-01 (#203700)

byuu wrote:

Quote:

Well, on that note ...

Code:

if(!ov1) s1 = s0;
ov1 = ov0&ov1 ? s0==s1 : ov0|ov1;

Here, we can see the s0==s1 test doesn't get hit if ov1==0.

But what if we reverse this?

Code:

ov1 = ov0&ov1 ? s0==s1 : ov0|ov1;
if(!ov1) s1 = s0;

I said that it doesn't matter whether you use the new or old S1. It does matter whether you use the new or old OV1, which is why both truth tables specify "OV1in".

Quote:

I meant the CPU should honor the ASL bit to select between SA1 and SB1. Seems like an oversight in the design.

It seems that accumulator A is meant to be used as the primary accumulator, and accumulator B to hold either temporary values or the low 16 bits of a 32-bit calculation (notice that each accumulator uses the opposite one's carry flag as its incoming carry)

Re: upd7725 overflow (attn: byuu)
by nocash on 2017-09-03 (#203787)

This is the flags description from fullsnes.htm, I think it's looking a bit different than your code.

Code:

  S0  Sign Flag     (set if result.bit15)
  Z   Zero Flag     (set if result=0000h)
  C   Carry Flag    (set if carry or borrow)
  OV0 Overflow Flag (set if result>+7FFFh or result<-8000h)
  S1  Direction of Last Overflow (if OV0 then S1=S0, else S1=unchanged)
  OV1 Number of Overflows (0=even, 1=odd) (inverted when OV0 gets set)

On a 80x86 processor it's mainly needing LAHF+JO opcodes to get the 80x86 flags (and some more difficult handling when the JO jump on overflow is taken):

Code:

 lahf    ;ah=flags (bit0=cy, bit6=zf, bit7=sf)
 jo   short @@overflow
 and  ah,($upd_flg_s0+$upd_flg_c+$upd_flg_z)/100h  ;isolate new flags (and clear OV0)
 mov  byte ptr [upd7725_reg_flg&acc&+1],ah         ;apply 8bit (keep LSB = OV1,S1)
 jmp  upd7725_do_alu_done
;---
@@overflow:
 mov  al,byte ptr [upd7725_reg_flg&acc&+0]             ;get old OV1
 and  ax,$upd_flg_s0+$upd_flg_c+$upd_flg_z+$upd_flg_o1 ;new S0,C,Z + old OV1
 or   ah,($upd_flg_o0)/100h                            ;set OV0=1
 xor  al,ah                                            ;set S1=S0, toggle OV1 (by XORing it with OV0=1)
 mov  word ptr [upd7725_reg_flg&acc&],ax               ;apply whole 16bit
 jmp  upd7725_do_alu_done

Re: upd7725 overflow (attn: byuu)
by byuu on 2017-09-03 (#203789)

> S1 Direction of Last Overflow (if OV0 then S1=S0, else S1=unchanged)

This is wrong. It's set based on OV1, not OV0.

> OV1 Number of Overflows (0=even, 1=odd) (inverted when OV0 gets set)

Also wrong, see AWJ's truth tables above.

If you based your notes off my initial implementation, then my apologies.
Those notes were based off the only uPD7725 manual I had at the time, which did not explain S1/OV1 nearly as well as the newly discovered uPD7720 documents do.

The code you want is:

Code:

if(!ov1) s1 = s0;
ov1 = ov0&ov1 ? s0==s1 : ov0|ov1;

Re: upd7725 overflow (attn: byuu)
by nocash on 2017-09-03 (#203793)

There are tons of uPD77C2xxx scans on datasheetarchive.com.
I have uPD77C20 and uPD77C25 datasheets on my harddisk... downloaded back in March 2011, going by the file timestamps.
My description/code is different than AWJ's table/code, I don't know which is closer to real hardware.

PS. I think my implemention might have opposite S1 values (ie. 0=positive vs 1=positive), this works as long as SGN opcode is processing S1 accordingly.

Re: upd7725 overflow (attn: byuu)
by byuu on 2017-09-03 (#203801)

So, the real way to rule all of this out for good ...

Lord Nightmare has a pile of uPD7720 chips unflashed. If we:

1) write a uPD7720 assembler,
2) write a test program in uPD7720 syntax to evaluate all input/output combinations of flags,
3) write an emulator for the 7720 (MAME's could work but it's an insane hack) to debug the test program,

Then Lord Nightmare could try to flash the program onto one of his chips. I'm not sure if he's capable of then putting it into a circuit design and sending / receiving values from the chip, but it's possible we can find someone else capable of doing so that would be up for the task.

Barring this, we're never really going to know for sure, with 100% confidence, how all of this works.

There's still the open question of how exactly S1 is modified on other ALU operations. Manual just says indeterminate, but there are several valid possibilities.

If someone is willing to do the other parts ... I'll volunteer to do part 1 and write a uPD7720 assembler. But you might not like the syntax I come up with ... and I'll be quite saddened if I do the work for nothing >_<

...

Also, here are the relevant sections from the manuals I have:

Re: upd7725 overflow (attn: byuu)
by AWJ on 2017-09-03 (#203802)

nocash wrote:

PS. I think my implemention might have opposite S1 values (ie. 0=positive vs 1=positive), this works as long as SGN opcode is processing S1 accordingly.

The manual is clear that the chip can distinguish between two overflows in the same direction (OV1 flag remains set) and two overflows in opposite directions (OV1 flag becomes clear). It has to be able to do so in order to produce the correct overflow state after three operations, as advertised. Your implementation doesn't do that.

Re: upd7725 overflow (attn: byuu)
by nocash on 2017-09-03 (#203807)

Ah, yes, got it you are right. Two overflows in same direction won't work with my code.

Hmmm, web.archive is somehow overloading my PC (no chance to download the pdf files), but it does seem to have more/different files than datasheetarchive. Thanks for posting the relevant pages!

Re: upd7725 overflow (attn: byuu)
by AWJ on 2017-09-03 (#203810)

Just noticed the question about multiply overflow in byuu's forum thread. 0x8000 * 0x8000 -> int31 is considered an "overflow" because the sign of the result is wrong. Multiplying two negative numbers should produce a positive result, but int31(0x40000000) is negative.

Re: upd7725 overflow (attn: byuu)
by Lord Nightmare on 2017-09-03 (#203834)

I'm trying to figure out in what circumstances the " if (!flag.ov1) flag.s1 = flag.s0;" should run.
In byuu's code in higan, it runs on OP opcodes, before ov1 is recalculated(which I believe is correct), it is updated on any add/sub/adc/sbb/inc/dec opcode(which I believe is correct), it is left alone on a nop(which I believe is correct), and not on an or/and/xor/cmp/shr1/shl1/shl2/shl4/xchg(which I'm not sure about). It also isn't updated on an RT/JMP opcode, nor on an LD opcode. (also not sure about these, guessing that it shouldn't be touched on RT/JMP, unsure about LD)

In MAME, I made it run for all OP alu types except for NOP(where it is left alone), and ov1 is only updated during add/sub/adc/sbb/inc/dec opcodes (ov1 is updated after ov0 is), and is explicitly cleared(along with ov0) on or/and/xor/cmp/shr1/shl1/shl2/shl4/xchg opcodes.

I'm not 100% sure which way is correct, given that branches can happen based on s1 and ov1 status.

LN

Re: upd7725 overflow (attn: byuu)
by byuu on 2017-09-03 (#203836)

You're guessing no matter what you do with the other ALU ops.

I'm willing to match your behavior, but I worry that people might see it in the future and think that's definitely how it works, when in truth we have no idea. Needs a clear source code comment that we're guessing, even if it is the most logical conclusion to us.

Re: upd7725 overflow (attn: byuu)
by AWJ on 2017-09-03 (#203839)

Well, I wrote a program that should allow you to test any sequence of ALU operations you want, if someone wants to figure out how to assemble, burn and test it (don't forget the screwy bit order the '7720 needs for its program ROM) The command protocol is heavily based on the DSP-1, including the need to input a dummy operand for no-operand commands. It doesn't use serial I/O, DMA, or anything else fancy. It also doesn't use the data ROM at all.

Protocol:

Wait for RQM to go high, write 8 bit opcode/carry selector:

Code:

000CPPPP

000 = must be zero
C = incoming carry B flag
PPPP = opcode (same order as native instruction encoding--0001 is OR, etc.)

Wait for RQM to go high, write 16-bit right-side operand (LSB first). You have to do this even for unary opcodes like the shifts (the operand value is ignored).

Wait for RQM to go high, read 16-bit result (value in accumulator A after executing opcode).

Wait for RQM to go high, read 8-bit flags as follows:

Code:

11SsczVv

11 = always 1
S = SA1
s = SA0
c = CA
z = ZA
V = OVA1
v = OVA0

Accumulator A and its flags (should be) preserved between operations, i.e. you should be able to do a sequence of additions/subtractions and look at the flags after each step.

Code:

       : ld     0400,sr ; 8-bit
       : ld     00c0,dr
; === main loop ===
loop   : ld     00e0,b
       : jrqm   *       ; wait for (8-bit) read or write (read flags or write opcode)
       : mov    dr,tr | and dr,b   ; get opcode and carry-in
       : jnzb   loop

; === decode opcode ===
       : ld     0000,sr ; 16-bit
       : or     tr,b
       : jrqm   *       ; wait for (16-bit) write (operand)
       : mov    drnf,tr | shr1 b ; get operand
       : jcb    opxxx1
opxxx0 : shr1   b
       : jcb    opxx10
opxx00 : shr1   b
       : jcb    opx100
opx000 : shr1   b
       : jncb   afterop ; 0000 = nop
       : jcb    op1000
opx100 : shr1   b
       : jncb   op0100
       : jcb    op1100
opxx10 : shr1   b
       : jcb    opx110
opx010 : shr1   b
       : jncb   op0010
       : jcb    op1010
opx110 : shr1   b
       : jncb   op0110
       : jcb    op1110
opxxx1 : shr1   b
       : jcb    opxx11
opxx01 : shr1   b
       : jcb    opx101
opx001 : shr1   b
       : jncb   op0001
       : jcb    op1001
opx101 : shr1   b
       : jncb   op0101
       : jcb    op1101
opxx11 : shr1   b
       : jcb    opx111
opx011 : shr1   b
       : jncb   op0011
       : jcb    op1011
opx111 : shr1   b
       : jncb   op0111
       : jcb    op1111

; === execute opcode ===
op0001 : shr1   b
       : or     tr,a
       : jmp    afterop
op0010 : shr1   b
       : and    tr,a
       : jmp    afterop
op0011 : shr1   b
       : xor    tr,a
       : jmp    afterop
op0100 : shr1   b
       : sub    tr,a
       : jmp    afterop
op0101 : shr1   b
       : add    tr,a
       : jmp    afterop
op0110 : shr1   b
       : sbb    tr,a
       : jmp    afterop
op0111 : shr1   b
       : adc    tr,a
       : jmp    afterop
op1000 : shr1   b
       : dec    a
       : jmp    afterop
op1001 : shr1   b
       : inc    a
       : jmp    afterop
op1010 : shr1   b
       : cmp    a
       : jmp    afterop
op1011 : shr1   b
       : shr1   a
       : jmp    afterop
op1100 : shr1   b
       : shl1   a
       : jmp    afterop
op1101 : shr1   b
       : shl2   a
       : jmp    afterop
op1110 : shr1   b
       : shl4   a
       : jmp    afterop
op1111 : shr1   b
       : xchg   a

; === output result and flags ===
afterop: mov    a,dr    ; send result
       : ld     00c0,b
       : jnsa1  nosa1
       : ld     0020,tr
       : or     tr,b
nosa1  : jnsa0  nosa0
       : ld     0010,tr
       : or     tr,b
nosa0  : jnca   noca
       : ld     0008,tr
       : or     tr,b
noca   : jnza   noza
       : ld     0004,tr
       : or     tr,b
noza   : jnova1 noova1
       : ld     0002,tr
       : or     tr,b
noova1 : jnova0 noova0
       : ld     0001,tr
       : or     tr,b
noova0 : jrqm   *       ; wait for (16-bit) read (result)
       : ld     0400,sr ; 8-bit
       : mov    b,dr    ; send flags
       : jmp    loop    ; back to the top

Someone should eyeball my code before using up rare chips, especially the rats' nest of branches that is the command decoding (also based on the DSP1 code--and there really isn't any better way to do it as far as I can tell)

Quote:

I'm trying to figure out in what circumstances the " if (!flag.ov1) flag.s1 = flag.s0;" should run.

The docs say that S1 is "indeterminate" after any ALU operation other than addition/subtraction or NOP. LD and JMP instructions definitely shouldn't affect any flags at all (there's no accumulator/flag select field in those instructions, and somewhere one of the docs explicitly clarifies that loading a value into one of the accumulators via MOV or LD doesn't update its sign or zero flags)

Re: upd7725 overflow (attn: byuu)
by byuu on 2017-09-03 (#203840)

Any test program should be debugged thoroughly with an emulator before being burned.

These chips are exceedingly rare.

Re: upd7725 overflow (attn: byuu)
by Lord Nightmare on 2017-09-03 (#203849)

The 10 blank upd77P20s I have have windows; if I run out, I can always uv-erase them and try again, I think they're good for 100+ prog/erase cycles at least.
What I don't have (and I wish I did) are blank upd77P25D chips with windows; those are significantly rarer.

EDIT: There is actually someone selling pulled upd77P25D chips, from china, on ebay. Whether these are 'real' or remarked something else, I don't know. They're about $5 each + $6shipping even for more than one, and there's 50+ available. I ordered two of them, since for $16 its worth a shot at them being real.

LN

Re: upd7725 overflow (attn: byuu)
by AWJ on 2017-09-03 (#203853)

The 7720 and 7725 appear to be pin-compatible, except that the EPROM versions of both require power on pin 1. I wonder if you could put a programmed 7720 onto a suitable SNES cartridge for software testing, rather than have to use an Arduino or something.