Log inRegister an accountBrowse CSDbHelp & documentationFacts & StatisticsThe forumsAvailable RSS-feeds on CSDbSupport CSDb Commodore 64 Scene Database
You are not logged in - nap
CSDb User Forums


Forums > C64 Coding > Long division/modulo with byte-size divisor
2021-06-13 12:42
Krill

Registered: Apr 2002
Posts: 2982
Long division/modulo with byte-size divisor

So i needed something to divide a large integer (160 or more bits) by a small integer (5 bits) while also performing a modulo operation.

After a few rounds of optimisation, turns out the 6502 implementation is surprisingly compact and fast:
            ; dividend is in BIGNUM, little-endian
            ; divisor is in accu
            sta DIVISOR
            lda #0
            ldx #BIGNUMBYTES
longdivmod  ldy #8
            asl BIGNUM - 1,x
-           rol
            cmp DIVISOR
            bcc +
            sbc DIVISOR
+           rol BIGNUM - 1,x
            dey
            bne -
            dex
            bne longdivmod
            ; quotient is in BIGNUM, little-endian
            ; remainder is in accu
This runs in O(n) linear time. The routine can be executed continuously on the dividend/quotient to extract modulo values.

It's also possible to have a big-endian long integer argument/result by simply iterating over the byte array in reverse order.

Of course, can slightly optimise performance (2 cycles per output bit) by sacrificing a few bytes to self-modify the two DIVISOR arguments with immediate operands. Or unroll the entire thing.

This shall go to Codebase64 at some point, of course, but there might be some more optimisation opportunities.

Also i think there are some requirements on the arguments, such that their MSB must be clear or so.
 
... 13 posts hidden. Click here to view all posts....
 
2021-06-14 19:09
Quiss

Registered: Nov 2016
Posts: 43
Actually, for small divisors like 3, you could invest a full 2kb and have a (here) 10 bit lookup for div and mod, since that brings it down to

    # [init zp_{mod,div} to point to {mod,div}_lookup]
loop:
    ldy BIGNUM, x
    lda (zp_div), y
    sta RESULT, x
    lda (zp_mod), y
    ora #(>div_lookup)
    sta zp_div+1
    eor #(>div_lookup) ^ (>mod_lookup)
    sta zp_mod+1
    inx
    cpx #BIGNUMBYTES
    bne loop

(Several optimizations are possible, of course. Like having the mod_lookup table already to the ora)
2021-06-14 19:35
chatGPZ

Registered: Dec 2001
Posts: 11391
All of you should shut up and make a demo about it :)
2021-06-29 21:21
Fred

Registered: Feb 2003
Posts: 287
Here's another division routine with byte-size divisor. It has one loop less compared to Krill's routine and is a few bytes shorter:

; div16
;   input:
;   - 16-bit number in dividend, little-endian
;   - 8-bit in divisor
;   output:
;   - 16-bit number in dividend
;   - AC: remainder
;   - XR: 0
;   - YR: unchanged
div16           ldx #16
                lda #0
-               asl dividend
                rol dividend+1
                rol a
                cmp divisor
                bcc +
                sbc divisor
                inc dividend
+               dex
                bne -
                rts
divisor         .byte 0
dividend        .byte 0, 0
2021-06-29 22:39
Krill

Registered: Apr 2002
Posts: 2982
Yeah cool, but the point was having arbitrarily large dividends to divide by an 8-bit divisor. =)
2021-06-30 08:45
Fred

Registered: Feb 2003
Posts: 287
Right, the routine I provided is 16 bits dividend only.
2021-06-30 21:05
Fred

Registered: Feb 2003
Posts: 287
Here is my version of a long division, specified in bytes in DIV_IN_BYTES:

; div
;   input:
;   - n-bytes dividend, little-endian
;   - 8-bit divisor
;   output:
;   - n-bytes result stored in dividend
;   - AC: remainder
;   - XR: 0
;   - YR: 0

DIV_IN_BYTES    = 20

div             ldy #DIV_IN_BYTES * 8
                lda #0
-               clc
                ldx #-DIV_IN_BYTES & $ff
-               rol dividend + DIV_IN_BYTES - $100,x
                inx
                bmi -

                rol
                cmp divisor
                bcc +
                sbc divisor
                inc dividend
+               dey
                bne --
                rts
                
divisor         .byte $00
dividend        .byte $00, $00, $00, $00, $00, $00, $00, $00
                .byte $00, $00, $00, $00, $00, $00, $00, $00
                .byte $00, $00, $00, $00
2021-06-30 22:33
Krill

Registered: Apr 2002
Posts: 2982
At a quick glance, this seems to run in O(n^2) quadratic time rather than O(n) linear.

With a growing input dividend, both the inner and the outer loops take more iterations.

So i guess this will run slower.
2021-07-01 13:20
Monte Carlos

Registered: Jun 2004
Posts: 364
Using both cmp as well as sbc seems subject to possible optimization.
2021-07-01 14:01
Krill

Registered: Apr 2002
Posts: 2982
Quoting Monte Carlos
Using both cmp as well as sbc seems subject to possible optimization.
If the X register can be spared, replacing
            cmp DIVISOR
            bcc +
            sbc DIVISOR
+
with something like
            tax
            sbx #DIVISOR
            bcc +
            txa
+
might be possible, but this hardly looks like it would save cycles on average. =)
2021-07-01 14:03
Krill

Registered: Apr 2002
Posts: 2982
That said, the long division as in the OP is quite swift, but the long multiplication i also need in the same context turned out to be considerably slower. :) (I guess mostly because it's executed a lot more often, though.)
Previous - 1 | 2 | 3 - Next
RefreshSubscribe to this thread:

You need to be logged in to post in the forum.

Search the forum:
Search   for   in  
All times are CET.
Search CSDb
Advanced
Users Online
blitzed
Guests online: 95
Top Demos
1 Next Level  (9.7)
2 13:37  (9.7)
3 Coma Light 13  (9.6)
4 Edge of Disgrace  (9.6)
5 Mojo  (9.6)
6 Uncensored  (9.6)
7 The Demo Coder  (9.6)
8 Comaland 100%  (9.6)
9 What Is The Matrix 2  (9.6)
10 Wonderland XIV  (9.5)
Top onefile Demos
1 Layers  (9.7)
2 Cubic Dream  (9.6)
3 Party Elk 2  (9.6)
4 Copper Booze  (9.6)
5 Dawnfall V1.1  (9.5)
6 Rainbow Connection  (9.5)
7 Morph  (9.5)
8 Libertongo  (9.5)
9 Onscreen 5k  (9.5)
10 It's More Fun to Com..  (9.5)
Top Groups
1 Booze Design  (9.3)
2 Oxyron  (9.3)
3 Performers  (9.3)
4 Triad  (9.3)
5 Censor Design  (9.3)
Top Fullscreen Graphicians
1 Joe  (9.7)
2 Sulevi  (9.6)
3 Veto  (9.6)
4 The Sarge  (9.6)
5 Facet  (9.5)

Home - Disclaimer
Copyright © No Name 2001-2025
Page generated in: 0.046 sec.