Jump to content

WDC 65C816: Difference between revisions

From Wikipedia, the free encyclopedia
Content deleted Content added
Fix a typographic error
(43 intermediate revisions by 21 users not shown)
Line 8: Line 8:
|produced-end =
|produced-end =
|slowest = 1 | slow-unit = MHz
|slowest = 1 | slow-unit = MHz
|fastest = 20 | fast-unit = MHz
|fastest = 14 | fast-unit = MHz
|manuf1 = [[Western Design Center]], others
|manuf1 = [[Western Design Center]], others
|arch = [[MOS Technology 6502|6502]]
|arch = [[MOS Technology 6502|6502]]
Line 15: Line 15:
* 8 (external)
* 8 (external)
* 16 (internal)}}
* 16 (internal)}}
|address-width = 24
|address-width = 24 bits
|pack1 = 40-pin [[Dual in-line package|DIP]]
|pack1 = 40-pin [[Dual in-line package|DIP]]
|pack2 = 44-pin [[Plastic leaded chip carrier|PLCC]], others
|pack2 = 44-pin [[Plastic leaded chip carrier|PLCC]], others
|variant = {{plainlist|
* W65C802 (pin-compatible with W65C02)
* W65C265 (MCU)}}
|predecessor = {{plainlist|
|predecessor = {{plainlist|
* MOS 6502
* MOS 6502
* [[WDC 65C02]]}}
* [[WDC 65C02]]}}
|successor = WDC 65C832<ref>{{Cite web |last=Mensch |first=William D. |title=A Report on the 65c832 |url=https://mirrors.apple2.org.za/apple2.caltech.edu/miscinfo/65xxx.chronology |url-status=live |archive-url=https://web.archive.org/web/20240407125418/https://mirrors.apple2.org.za/apple2.caltech.edu/miscinfo/65xxx.chronology | archive-date=7 Apr 2024 |access-date=7 Apr 2024}}</ref><ref>{{Cite web |date=6 Sep 2010 |orig-date=Mar 1990 |title=W65C832 Information, Specification, and Data Sheet (March 1990) |url=https://downloads.reactivemicro.com/Electronics/CPU/W65C832%20CPU%20Datasheet%20v2.0.pdf |url-status=live |archive-url=https://web.archive.org/web/20240407125548/https://downloads.reactivemicro.com/Electronics/CPU/W65C832%20CPU%20Datasheet%20v2.0.pdf |archive-date= 7 Apr 2024 |access-date=7 Apr 2024 |website=ReActive Micro }}</ref><ref>{{Cite web |orig-date=Mar 1991 |title=W65C832 Information, Specification, and Data Sheet |url=http://6502.org/documents/datasheets/wdc/wdc_w65c832_preliminary_mar_1991.pdf |url-status=dead |archive-url=https://web.archive.org/web/20230630173006/http://archive.6502.org/datasheets/wdc_w65c832_preliminary_mar_1991.pdf |archive-date=30 Jun 2023 |access-date=7 Apr 2024 |website=6502.org }}</ref> (never released)
|successor =
}}
}}


The '''W65C816S''' (also '''65C816''' or '''65816''') is an 8/16-bit [[microprocessor]] (MPU) developed and sold by the [[Western Design Center]] (WDC). Introduced in 1983,<ref>[https://web.archive.org/web/20100206192831/http://processortimeline.info/proc1980.htm Chronology of Microprocessors (1980–1989)]</ref> the W65C816S is an enhanced version of the [[WDC 65C02]] [[8-bit computing|8-bit]] MPU, itself a [[CMOS]] enhancement of the venerable [[MOS Technology]] [[MOS Technology 6502|6502]] [[NMOS logic|NMOS]] MPU. The 65C816 is the CPU for the [[Apple IIGS]] and, in modified form, the [[Super Nintendo Entertainment System]].
The '''W65C816S''' (also '''65C816''' or '''65816''') is a 16-bit [[microprocessor]] (MPU) developed and sold by the [[Western Design Center]] (WDC). Introduced in 1983,<ref>[https://web.archive.org/web/20100206192831/http://processortimeline.info/proc1980.htm Chronology of Microprocessors (1980–1989)]</ref> the W65C816S is an enhanced version of the [[WDC 65C02]] [[8-bit computing|8-bit]] MPU, itself a [[CMOS]] enhancement of the venerable [[MOS Technology]] [[MOS Technology 6502|6502]] [[NMOS logic|NMOS]] MPU. The 65C816 is the CPU for the [[Apple IIGS]] and, in modified form, the [[Super Nintendo Entertainment System]].


The ''65'' in the part's designation comes from its 65C02 compatibility mode, and the ''816'' signifies that the MPU has selectable 8- and [[16-bit computing|16-bit]] [[processor register|register]] sizes. In addition to the availability of 16-bit registers, the W65C816S extends [[memory address]]ing to [[24-bit computing|24 bit]]s, supporting up to 16 [[megabyte]]s of [[random-access memory]]. It has an enhanced instruction set and a 16-bit [[Stack (data structure)|stack pointer]], as well as several new electrical signals for improved system hardware management.
The ''65'' in the part's designation comes from its 65C02 compatibility mode, and the ''816'' signifies that the MPU has selectable 8- and [[16-bit computing|16-bit]] [[processor register|register]] sizes. In addition to the availability of 16-bit registers, the W65C816S extends [[memory address]]ing to [[24-bit computing|24 bit]]s, supporting up to 16 [[megabyte]]s of [[random-access memory]]. It has an enhanced instruction set and a 16-bit [[Call stack|stack pointer]], as well as several new electrical signals for improved system hardware management.

Despite 24-bit addressing, the 65816 isn't a true 24-bit computer. Programs still operate in 64K chunks since registers are 16-bit. To access other parts of internal memory a technique similar to [[bank switching]] is needed. Apple, Atari and Commodore all opted for the [[Motorola 68000]] processor alternative that offered a fully 32-bit programming model when it came to replacing their 6502-based lines of 8-bit computers, despite the 65816 being faster cycle for cycle.<ref>"An 8-MHz 65816 is about equivalent to a 16-MHz 68000 in speed, and a 16-MHz 68000 doesn't exist.", interview with [[Steve Wozniak]] called "The Apple Story Part 2: More History And The Apple III", Volume 10, Number 1, "Through the Hourglass", [[Byte Magazine]], 1985, page 175, https://www.apple2history.org/museum-a-l/articles/byte8412/</ref><ref>[https://archive.org/details/byte-magazine-1985-01-rescan January 1985 issue of Byte Magazine</ref>


At [[reset (computing)|reset]], the W65C816S starts in "emulation mode", meaning it substantially behaves as a 65C02. Thereafter, the W65C816S may be switched to "native mode" with a two instruction sequence, causing it to enable all enhanced features, yet still maintain a substantial degree of [[backward compatibility]] with most 65C02 software. However, unlike the [[PDIP|PDIP40]] version of the 65C02, which is a [[pin-compatible]] replacement for its NMOS ancestor, the PDIP40 W65C816S is not pin-compatible with any other 6502 family MPU.
At [[reset (computing)|reset]], the W65C816S starts in "emulation mode", meaning it substantially behaves as a 65C02. Thereafter, the W65C816S may be switched to "native mode" with a two instruction sequence, causing it to enable all enhanced features, yet still maintain a substantial degree of [[backward compatibility]] with most 65C02 software. However, unlike the [[PDIP|PDIP40]] version of the 65C02, which is a [[pin-compatible]] replacement for its NMOS ancestor, the PDIP40 W65C816S is not pin-compatible with any other 6502 family MPU.


The '''W65C802''' has the same internal structure and 16-bit support, but a 40-pin layout compatible with the original 6502. This allows it to be used as a drop-in replacement in certain roles. However, the 65C802 cannot emit a full 24-bit address, limiting it to 64&nbsp;KB of memory. The 65C802 and its relatives are no longer produced.
The '''W65C802''' or '''65802''' is completely software-compatible with the 65C816, but is electrically-compatible with the 6502 and 65C02. Hence the 65C802 could be used as a drop-in replacement in most systems equipped with a 6502 or 65C02. However, the 65C802 cannot emit a 24-bit address, which limits it to a 64&nbsp;KB address space. The 65C802 is no longer produced.


==History==
==History==
[[File:65c816plcc.jpg|thumb|[[plastic leaded chip carrier|PLCC-44]] version of '''W65C816S''' microprocessor, shown mounted on a [[single-board computer]].]]
[[File:65c816plcc.jpg|thumb|[[plastic leaded chip carrier|PLCC-44]] version of '''W65C816S''' microprocessor, shown mounted on a [[single-board computer]].]]
In 1981, [[Bill Mensch]], founder and [[chief executive officer|CEO]] of WDC, began development of the 65C02 with his production partners, primarily [[Rockwell Semiconductor]] and [[Synertek]]. The primary goal of the 65C02 effort was to move from the original 6502's NMOS process to the 65C02's CMOS, which would allow it to run at much lower power levels, somewhere between {{frac|10}} and {{frac|20}} when running at the same clock speeds. A number of new [[opcode]]s and bug fixes were also worked into the design.{{sfn|Eyes|Lichty|1986|p=42}}
In 1981, [[Bill Mensch]], founder and [[chief executive officer|CEO]] of WDC, began development of the 65C02 with his production partners, primarily [[Rockwell Semiconductor]] and [[Synertek]]. The primary goal of the 65C02 effort was to move from the original 6502's NMOS process to the CMOS process, which would allow it to run at much lower power levels, somewhere between {{frac|10}} and {{frac|20}} at any given clock speed. Also desired was the ability to raise the maximum supported clock speed. The 65C02 design addressed chip errata present in the NMOS 6502 (e.g., the infamous <code>JMP (<addr>)</code> bug) and introduced new instructions and new addressing modes for some existing instructions.{{sfn|Eyes|Lichty|1986|p=42}}


Development of the W65C816S commenced in 1982 after Mensch consulted with [[Apple Computer]] on a new version of the [[Apple II]] series of [[personal computer]]s that would, among other things, have improved graphics and sound. Apple wanted an MPU that would be software compatible with the 6502 then in use in the Apple II but with the ability to address more memory, and to load and store 16 bit words. The result was the 65C816, finished in March 1984, with samples provided to both Apple and [[Atari]] in the second half of the year and full release in 1985.{{sfn|Eyes|Lichty|1986|p=44}} Mensch was aided during the design process by his sister Kathryn, who was responsible for part of the device's layout.
Development of the W65C816S commenced in 1982 after Mensch consulted with [[Apple Computer]] on a new version of the [[Apple II]] series of [[personal computer]]s that would, among other things, have improved graphics and sound. Apple wanted an MPU that would be software compatible with the 6502 then in use in the Apple II but with the ability to address more memory, and to load and store 16 bit words. The result was the 65C816, finished in March 1984, with samples provided to both Apple and [[Atari]] in the second half of the year and full release in 1985.{{sfn|Eyes|Lichty|1986|p=44}} Mensch was aided during the design process by his sister Kathryn, who was responsible for part of the device's layout.
Line 42: Line 43:
The same process also led to the 65C802, which was identical inside to the 65C816. Both were produced on the same fabrication lines and diverged only during the last metalization stages when the chip was being connected to the external pins. In the 65C802, those pins had the same layout as the original 6502, which allowed it to be used as a drop-in replacement while still allowing the 16-bit processing of the CPU to be used. However, as it used the original pinout it had only 16 addressing pins, and could therefore only access 64&nbsp;KB of external memory.{{sfn|Eyes|Lichty|1986|p=45}} Typically, when hardware manufacturers designed a project from the ground up, they used the 65C816 rather than the 65C802, resulting in the latter being withdrawn from production.
The same process also led to the 65C802, which was identical inside to the 65C816. Both were produced on the same fabrication lines and diverged only during the last metalization stages when the chip was being connected to the external pins. In the 65C802, those pins had the same layout as the original 6502, which allowed it to be used as a drop-in replacement while still allowing the 16-bit processing of the CPU to be used. However, as it used the original pinout it had only 16 addressing pins, and could therefore only access 64&nbsp;KB of external memory.{{sfn|Eyes|Lichty|1986|p=45}} Typically, when hardware manufacturers designed a project from the ground up, they used the 65C816 rather than the 65C802, resulting in the latter being withdrawn from production.


Apple subsequently integrated the 65C816 into the [[Apple IIGS]] computer. The basic 65C816 design was [[second source|second-sourced]] by [[GTE]], [[Sanyo]] and others from the mid-to-late 1980s to the early 1990s.
Apple subsequently integrated the 65C816 into the [[Apple IIGS]] computer. The basic 65C816 design was [[second source|second-sourced]] by [[VLSI Technology]],<ref name="vti_65816">{{ cite book | url=https://archive.org/details/1988vlsidatabookocrbm/page/n263/mode/2up | title=Application Specific Logic Products Data Book 1988 | publisher=VLSI Technology Inc. | date=1988 | access-date=18 March 2024 | pages=257–279 }}</ref> [[GTE]], [[Sanyo]] and others from the mid-to-late 1980s to the early 1990s.


In the 1990s, both the 65C816 and 65C02 were converted to a fully [[static core]], which made it possible to completely stop the [[clock signal|processor's Ø2 clock]] without loss of register contents. This feature, along with the use of [[static RAM|asynchronous static RAM]], made it possible to produce designs that used minimal power when in a standby state.
In the 1990s, both the 65C816 and 65C02 were converted to a fully [[static core]], which made it possible to completely stop the [[clock signal|processor's Ø2 clock]] without loss of register contents. This feature, along with the use of [[static RAM|asynchronous static RAM]], made it possible to produce designs that used minimal power when in a standby state.


{{As of|April 2024}}, the W65C816S is available from WDC in 40 pin [[PDIP]], [[plastic leaded chip carrier|PLCC44]], or 44-pin [[Quad flat package#PQFP|TQFP]] packaging, as an [[microcontroller|MCU]] through the W65C265,<ref>{{Cite web |date=5 Jan 2021 |title=W65C265S 16-bit Microcontroller |url=https://www.westerndesigncenter.com/wdc/w65c265s-chip.php |url-status=live |archive-url=https://web.archive.org/web/20240402205917/https://www.westerndesigncenter.com/wdc/w65c265s-chip.php |archive-date=2 Apr 2024 |access-date=7 Apr 2024 |website=The Western Design Center, Inc.}}</ref> and as IP cores for [[Application-specific integrated circuit|ASIC]] integration<ref>{{Cite web |date=5 Jan 2021 |title=W65C816 8/16-bit Microprocessor |url=https://www.westerndesigncenter.com/wdc/w65c816s-core.php |url-status=live |archive-url=https://web.archive.org/web/20231115050721/https://www.westerndesigncenter.com/wdc/w65c816s-core.php |archive-date=15 Nov 2023 |access-date=7 Apr 2024 |website=The Western Design Center, Inc.}}</ref><ref>{{Cite web |date=5 Jan 2021 |title=W65C265S 8/16-bit Microcontroller |url=https://www.westerndesigncenter.com/wdc/w65c265s-core.php |url-status=live |archive-url=https://web.archive.org/web/20240407131630/https://www.westerndesigncenter.com/wdc/w65c265s-core.php |archive-date=7 Apr 2024 |access-date=7 Apr 2024 |website=The Western Design Center, Inc.}}</ref> (for example [[Winbond]]'s W55V9x series of TV [[Edutainment]] [[integrated circuit|IC]]s<ref>{{Cite web |date=2 May 2006 |title=W55V92 TV-toy Controller Data Sheet |url=https://static6.arrow.com/aropdfconversion/5557b1c0d4895012527b1add354b861f938e4daa/w55v92a1.7.pdf |access-date=12 June 2024 |website=Arrow Electronics}}</ref>).
{{As of|2021}}, the W65C816S is available from WDC in 40 pin [[PDIP]], [[plastic leaded chip carrier|PLCC44]] or 44-pin [[Quad_flat_package#PQFP|TQFP]] packaging, as well as a core for [[Application-specific integrated circuit|ASIC]] integration (for example [[Winbond]]'s W55V9x series of TV [[Edutainment]] [[integrated circuit|IC]]s). WDC, itself a [[fabless semiconductor company]], works with various [[semiconductor fabrication plant|foundries]] to produce the W65C816S, as well as other compatible products. Discrete processors are available through a number of electronics distributors. For designers who wish to include W65C816S functionality into a custom [[Application-specific integrated circuit|ASIC]], WDC offers RTL ([[register-transfer level]]) code in [[Verilog]].


[[File:WDC W65C802P 4 1.jpg|thumb|W65C802P]]
[[File:WDC W65C802P 4 1.jpg|thumb|W65C802P]]
Line 109: Line 110:
| style="text-align:center; background:white"| 0
| style="text-align:center; background:white"| 0
| style="text-align:center; background:white"| 0
| style="text-align:center; background:white"| 0
| colspan="16" style="text-align:center; background:silver" | '''D'''
| colspan="16" style="text-align:center; background:silver" | '''DP'''
| style="background:white; color:black;"| [[Zero_page#Computers_with_few_CPU_registers|'''D'''irect page pointer]]
| style="background:white; color:black;"| [[Zero page#Computers with few CPU registers|'''D'''irect '''P'''age pointer]]
|- style="background:silver;color:black"
|- style="background:silver;color:black"
| style="text-align:center; background:white"| 0
| style="text-align:center; background:white"| 0
Line 120: Line 121:
| style="text-align:center; background:white"| 0
| style="text-align:center; background:white"| 0
| style="text-align:center; background:white"| 0
| style="text-align:center; background:white"| 0
| style="text-align:center;" colspan="16"| '''S'''
| style="text-align:center;" colspan="16"| '''SP'''
| style="background:white; color:black;"| [[Stack pointer|'''S'''tack pointer]]
| style="background:white; color:black;"| [[stack pointer|'''S'''tack '''P'''ointer]]
|- style="background:silver;color:black"
|- style="background:silver;color:black"
| style="text-align:center;" colspan="8"| '''DB'''
| style="text-align:center;" colspan="8"| '''DB'''
Line 140: Line 141:
| style="text-align:center; background:white"| 0
| style="text-align:center; background:white"| 0
| style="text-align:center; background:white"| 0
| style="text-align:center; background:white"| 0
| style="background:white; color:black;"| [[Bank_switching#Technique|'''D'''ata '''B'''ank register]]
| style="background:white; color:black;"| '''D'''ata '''B'''ank register
|- style="background:silver;color:black"
|- style="background:silver;color:black"
| style="text-align:center;" colspan="8"| '''PB'''
| style="text-align:center;" colspan="8"| '''PB'''
| style="text-align:center;" colspan="16"| '''PC'''
| style="text-align:center;" colspan="16"| '''PC'''
| style="background:white; color:black;"| [[Bank_switching#Technique|'''P'''rogram '''B'''ank]] ''':''' [[Program counter|'''P'''rogram '''C'''ounter]]
| style="background:white; color:black;"| '''P'''rogram '''B'''ank ''':''' [[Program counter|'''P'''rogram '''C'''ounter]]
|-
|-
|colspan="25" | '''Status register'''
|colspan="25" | '''Status register'''
Line 157: Line 158:
| style="text-align:center;"| '''z'''
| style="text-align:center;"| '''z'''
| style="text-align:center;"| '''c'''
| style="text-align:center;"| '''c'''
| style="background:white; color:black" | [[Status register|status register '''P''']]
| style="background:white; color:black" | [[Status register|'''P'''rogram status register]]
|- style="background:silver;color:black"
|- style="background:silver;color:black"
| style="text-align:center; background:white" colspan="23" |
| style="text-align:center; background:white" colspan="23" |
| style="text-align:center;"| '''e'''
| style="text-align:center;"| '''e'''
| style="background:white; color:black" | status register '''P''' [[WDC_65C816#Switching_between_modes|emulation mode flag]]
| style="background:white; color:black" | '''P'''rogram status register [[WDC 65C816#Switching between modes|mode flag]]
|}
|}
|}
|}
Line 168: Line 169:
* Fully static [[CMOS]] design offers low power consumption (300&nbsp;[[micro-|µ]][[amperes|A]] at 1&nbsp;[[megahertz|MHz]]) and increased noise immunity.
* Fully static [[CMOS]] design offers low power consumption (300&nbsp;[[micro-|µ]][[amperes|A]] at 1&nbsp;[[megahertz|MHz]]) and increased noise immunity.
* Wide operating [[voltage]] range: 1.8&nbsp;V to 5.0&nbsp;V ± 5%.
* Wide operating [[voltage]] range: 1.8&nbsp;V to 5.0&nbsp;V ± 5%.
* Wide [[clock rate|operating frequency]] range, officially 14&nbsp;MHz maximum at 5 volts (20Mhz in [[SuperCPU]]), using a single-phase clock source (hobbyists have successfully operated the 65C816 beyond 20&nbsp;MHz).
* Wide [[clock rate|operating frequency]] range, officially 14&nbsp;MHz maximum at 5 volts (20Mhz in [[SuperCPU]]), using a single-phase clock source.
* [[emulator|Emulation]] mode allows substantial software compatibility with the NMOS 6502 and CMOS 65C02, excepting undocumented [[opcode]]s. All 256 opcodes in the 65C816 are functional in both operating modes.
* Emulation mode allows substantial software compatibility with the NMOS 6502 and CMOS 65C02, excepting undocumented [[opcode]]s. All 256 opcodes in the 65C816 are functional in both operating modes.
* 24-bit memory addressing provides access to 16&nbsp;MB of [[Memory space (computational resource)|memory space]].
* 24-bit memory addressing provides access to 16&nbsp;MB of [[Memory space (computational resource)|memory space]].
* 16-bit [[arithmetic logic unit|ALU]], [[accumulator (computing)|accumulator]] (<code>C</code>), [[stack pointer]] (<code>S</code>), and [[index register]]s (<code>X</code> and <code>Y</code>).
* 16-bit [[arithmetic logic unit|ALU]], [[accumulator (computing)|accumulator]] (<code>C</code>), [[stack pointer]] (<code>SP</code>), and [[index register]]s (<code>X</code> and <code>Y</code>).
* 16-bit Direct Page (aka zero page) register (<code>D</code>).
* 16-bit direct page (aka zero page) register (<code>DP</code>).
* 8-bit data bank (<code>DB</code>) and program bank (<code>PB</code>) registers, generating bits 16&ndash;23 of 24-bit code and data addresses. Separate program and data bank registers allow program [[segmentation (memory)|segmentation]] and 16&nbsp;MB linear data addressing.
* 8-bit data bank (<code>DB</code>) and program bank (<code>PB</code>) registers, generating bits 16&ndash;23 of 24-bit code and data addresses. Separate program and data bank registers allow program [[segmentation (memory)|segmentation]] and 16&nbsp;MB linear data addressing.
* Valid data address (<code>VDA</code>) and valid program address (<code>VPA</code>) control outputs for memory qualification, dual [[CPU cache|cache]] and cycle steal [[direct memory access|DMA]] implementation.
* Valid data address (<code>VDA</code>) and valid program address (<code>VPA</code>) control outputs for memory qualification, dual [[CPU cache|cache]] and cycle steal [[direct memory access|DMA]] implementation.
* Vector pull (<code>VPB</code>) control output to indicate when an [[interrupt vector]] is being fetched.
* Vector pull (<code>VPB</code>) control output to indicate when an [[interrupt vector]] is being fetched.
* Abort (<code>ABORTB</code>) input and associated vector supports processor repairs of bus error conditions, such as [[page fault]]s and memory access violations.
* Abort (<code>ABORTB</code>) input and associated vector supports processor repairs of bus error conditions, such as [[page fault]]s and memory access violations.
* Direct Page register and stack relative addressing provides capability for [[reentrant (subroutine)|reentrant]], [[recursion|recursive]] and [[relocation (computer science)|re-locatable]] programming.
* Direct page register and stack relative addressing provides capability for [[reentrant (subroutine)|reentrant]], [[recursion|recursive]] and [[relocation (computer science)|re-locatable]] programming.
* 24 [[addressing mode]]s&mdash;13 original 6502 modes with 92 [[instruction set|instructions]] using 256 [[op code]]s, including most new opcodes implemented in the 65C02.
* 24 [[addressing mode]]s&mdash;13 original 6502 modes with 92 [[instruction set|instructions]] using 256 [[op code]]s, including most new opcodes implemented in the 65C02.
* Block-copy instructions, allowing rapid copying of data structures from one area of [[RAM]] to another with minimal code.
* Block-copy instructions (<code>MVN</code> and <code>MVP</code>), allowing rapid copying of data structures from one area of [[RAM]] to another with minimal code.
* Wait-for-Interrupt (<code>WAI</code>) and Stop-the-Clock (<code>STP</code>) instructions further [[low-power electronics|reduce power consumption]], decrease [[interrupt latency]] and allow synchronization with external events.
* Wait-for-Interrupt (<code>WAI</code>) and Stop-the-Clock (<code>STP</code>) instructions further [[low-power electronics|reduce power consumption]], decrease [[interrupt latency]] and allow synchronization with external events.
* [[Coprocessor|Co-Processor]] (<code>COP</code>) instruction with associated vector supports co-processor configurations, e.g., [[floating point unit|floating point processor]]s.
* [[Coprocessor|Co-Processor]] (<code>COP</code>) instruction with associated vector supports co-processor configurations, e.g., [[floating point unit|floating point processor]]s.
Line 185: Line 186:


==Comparison with earlier models==
==Comparison with earlier models==



===Two modes===
===Two modes===


The 65C816 has two operating modes: "emulation mode", in which the 16-bit operations are invisible—the index registers are forced to eight bits—and the chip appears to be very similar to the 6502, with the same cycle timings for the opcodes; and "native mode", which exposes all new features. The CPU automatically enters emulation mode when it is powered on or reset, which allows it to replace a 65(C)02, assuming one makes the required circuit changes to accommodate the different pin layout.{{sfn|Eyes|Lichty|1986|p=42}}
The 65C816 has two operating modes: "emulation mode", in which the 16-bit operations are invisible—the index registers are forced to eight bits—and the chip appears to be very similar to the 6502, with the same cycle timings for the opcodes; and "native mode", which exposes all new features. The CPU automatically enters emulation mode when it is powered on or reset, which allows it to replace a 65(C)02, assuming one makes the required circuit changes to accommodate the different pin layout.{{sfn|Eyes|Lichty|1986|p=42}}



===16-bit registers===
===16-bit registers===


The most obvious change to the 65C816 when running in native mode is the expansion of the various registers from 8-bit to 16-bit sizes. This enhancement affects the accumulator (<code>A</code>), the <code>X</code> and <code>Y</code> [[index register]]s, and the [[stack pointer]] (<code>S</code>). It does not affect the [[program counter]] (<code>PC</code>), which has always been 16-bit.{{sfn|Eyes|Lichty|1986|p=46}}
The most obvious change to the 65C816 when running in native mode is the expansion of the various registers from 8-bit to 16-bit sizes. This enhancement affects the accumulator (<code>A</code>), the <code>X</code> and <code>Y</code> [[index register]]s, and the [[stack pointer]] (<code>SP</code>). It does not affect the [[program counter]] (<code>PC</code>), which has always been 16-bit.{{sfn|Eyes|Lichty|1986|p=46}}

When running in native mode, two bits in the status register change their meaning. In the original 6502, bits 4 and 5 were not used, although bit 4 is referred to as the break (<code>b</code>) flag. In native mode, bit 4 becomes the <code>x</code> flag and bit 5 becomes the <code>m</code> flag. These bits control whether or not the [[index register]]s (<code>x</code>) and/or accumulator/memory (<code>m</code>) are 8-bit or 16-bit in size. Zeros in these bits set 16-bit sizes, ones set 8-bit sizes. These bits are fixed at ones when the processor is powered on or reset, but become changeable when the processor is switched to native mode.{{sfn|Eyes|Lichty|1986|p=46}}


It might not be immediately obvious why one would want to use the now 16-bit registers in 8-bit mode. Switching to 8-bit mode using the new <code>SEP</code> (SEt Processor status bit) and <code>REP</code> (REset Processor status bit) instructions means that all subsequent instructions accessing those registers will only read or write a single byte instead of two. For instance, if the <code>m</code> bit is set to 1 when an <code>LDA $1234</code> instruction is executed, only a single byte at address <code>$1234</code> will be read, thereby reducing the number of cycles needed to execute the instruction.{{sfn|Eyes|Lichty|1986|p=50}} This is particularly useful when dealing with 8-bit character data, for instance.{{sfn|Eyes|Lichty|1986|p=65}}
When running in native mode, two bits in the status register change their meaning. In the original 6502, bits 4 and 5 were not used, although bit 4 is referred to as the break (<code>b</code>) flag. In native mode, bit 4 becomes the <code>x</code> flag and bit 5 becomes the <code>m</code> flag. These bits control whether or not the [[index register]]s (<code>x</code>) and accumulator/memory (<code>m</code>) are 8-bit or 16-bit in size. Zeros in these bits set 16-bit sizes, ones set 8-bit sizes. These bits are locked at ones when the processor is powered on or reset, but become changeable when the processor is switched to native mode.{{sfn|Eyes|Lichty|1986|p=46}}


In native mode operation, the accumulator and index registers may be set to 16- or 8-bit sizes at the programmer’s discretion by using the <code>REP</code> and <code>SEP</code> instructions to manipulate the <code>m</code> and <code>x</code> status register bits. This feature gives the programmer the ability to perform operations on either word- and byte-size data. As the accumulator and index register sizes are independently settable, it is possible, for example, to have the accumulator set to eight bits and the index registers set to 16 bits, giving the programmer the ability to manipulate individual bytes over a 64KB range without having to perform pointer arithmetic.
When register sizes are set to 16 bits, memory access will access two contiguous bytes of memory, at the cost of one extra clock cycle. Furthermore, a read-modify-write instruction, such as <code>ROR <addr></code>, when used while the accumulator is set to 16 bits, will affect two contiguous bytes of memory, not one. Similarly, all arithmetic and logical operations will be 16-bit operations.{{sfn|Eyes|Lichty|1986|p=52}}


When register sizes are set to 16 bits, a memory access will fetch or store two contiguous bytes at the rate of one byte per clock cycle. Hence a read-modify-write instruction, such as <code>ROR <addr></code>, when used while the accumulator is set to 16 bits, will affect two contiguous bytes of memory, not one and will consume more clock cycles than when the accumulator is set to eight bits. Similarly, all arithmetic and logical operations will be 16-bit operations.{{sfn|Eyes|Lichty|1986|p=52}}


===24-bit addressing===
===24-bit addressing===
Line 211: Line 209:
During a data fetch or store cycle, <code>DB</code> is prepended to a 16-bit data address to form the 24-bit effective address at which data will be accessed. This processor characteristic makes it possible to sanely execute 6502 or 65c02 code that uses 16-bit addresses to reference data elements. Unlike <code>PB</code>, <code>DB</code> can be changed under program control, something that might be done to access data beyond the limits of 16-bit addressing. Also, <code>DB</code> will temporarily increment if an address is indexed beyond the limits of the bank currently in <code>DB</code>. <code>DB</code> is ignored if a 24-bit address is specified as the operand to a data fetch/store instruction, or if the effective address is on direct (zero) page or the [[hardware stack]]. In the latter case, an implied bank <code>$00</code> is used to generate the effective address.{{sfn|Eyes|Lichty|1986|p=55}}
During a data fetch or store cycle, <code>DB</code> is prepended to a 16-bit data address to form the 24-bit effective address at which data will be accessed. This processor characteristic makes it possible to sanely execute 6502 or 65c02 code that uses 16-bit addresses to reference data elements. Unlike <code>PB</code>, <code>DB</code> can be changed under program control, something that might be done to access data beyond the limits of 16-bit addressing. Also, <code>DB</code> will temporarily increment if an address is indexed beyond the limits of the bank currently in <code>DB</code>. <code>DB</code> is ignored if a 24-bit address is specified as the operand to a data fetch/store instruction, or if the effective address is on direct (zero) page or the [[hardware stack]]. In the latter case, an implied bank <code>$00</code> is used to generate the effective address.{{sfn|Eyes|Lichty|1986|p=55}}


A further addition to the register set is the 16-bit direct page register (<code>D</code>), which sets the base address for what was formerly called the '''zero page''', but now referred to as '''direct page'''. Direct page addressing uses an 8-bit address, which results in faster access than when a 16- or 24-bit address is used. Also, some addressing modes that offer indirection are only possible on direct page. In the 65(c)02, the direct page is always the first 256 bytes of memory, hence "zero page". In native mode, the 65c816 can relocate direct (zero) page anywhere in bank <code>$00</code> (the first 64&nbsp;KB of memory) by writing the 16-bit starting address into <code>D</code>. There is a one-cycle access penalty if <code>D</code> is not set to an even page boundary.{{sfn|Eyes|Lichty|1986|p=80}}
A further addition to the register set is the 16-bit direct page register (<code>DP</code>), which sets the base address for what was formerly called the '''zero page''', but now referred to as '''direct page'''. Direct page addressing uses an 8-bit address, which results in faster access than when a 16- or 24-bit address is used. Also, some addressing modes that offer indirection are only possible on direct page. In the 65(c)02, the direct page is always the first 256 bytes of memory, hence “zero page”. In native mode, the 65c816 can relocate direct (zero) page anywhere in bank <code>$00</code> (the first 64&nbsp;KB of memory) by writing the 16-bit starting address into <code>DP</code>. There is a one-cycle access penalty if <code>DP</code> is not set to an exact page boundary, that is, if the value in <code>DP</code> is not <code>$xx00</code>, where <code>xx</code> is the most-significant byte.{{sfn|Eyes|Lichty|1986|p=80}}


===Switching between modes===
===Switching between modes===


The current mode of operation is stored in the emulation (<code>e</code>) bit. Having already added the new <code>x</code> and <code>m</code> bits to the previous set of six flags in the status register (<code>P</code>), there were not enough bits left to hold the new mode bit. Instead, a unique solution was used in which the mode bit was left "invisible", unable to be directly accessed. The new <code>XCE</code> (eXchange Carry with Emulation) instruction exchanges the value of the emulation bit with the carry (<code>c</code>) bit, bit 0 in the status register. For instance, if one wants to enter native mode after the processor has started up, one would use <code>CLC</code> to clear the carry bit, and then <code>XCE</code> to write it to the emulation bit.{{sfn|Eyes|Lichty|1986|p=64}} Returning to 65c02 emulation mode uses <code>SEC</code> followed by <code>XCE</code>.{{sfn|Eyes|Lichty|1986|p=65}}
The current mode of operation is stored in the emulation (<code>e</code>) bit. Having already added the new <code>x</code> and <code>m</code> bits to the previous set of six flags in the status register (<code>SR</code>), there were not enough bits left to hold the new mode bit. Instead, a unique solution was used in which the mode bit was left "invisible", unable to be directly accessed. The <code>XCE</code> (e'''X'''change '''C'''arry with '''E'''mulation) instruction exchanges the value of the emulation bit with the carry (<code>c</code>) bit, bit 0 in <code>SR</code>. For instance, if one wants to enter native mode after the processor has started up, one would use <code>CLC</code> to clear the carry bit, and then <code>XCE</code> to write it to the emulation bit.{{sfn|Eyes|Lichty|1986|p=64}} Returning to 65c02 emulation mode uses <code>SEC</code> followed by <code>XCE</code>.{{sfn|Eyes|Lichty|1986|p=65}}


Internally, the 65c816 is a fully 16-bit design. The <code>m</code> and <code>x</code> bits in <code>P</code> determine how the user registers (accumulator and index) appear to the rest of the system. Upon reset, the 65c816 starts in 6502 emulation mode, in which <code>m</code> and <code>x</code> are set to 1 and cannot be changed. Hence the registers appear to the rest of the system as 8 bits. The most significant byte (MSB) of the accumulator (the <code>B</code>-accumulator) is not directly accessible but can be swapped with the least significant byte (LSB) of the accumulator (the <code>A</code>-accumulator) by using the <code>XBA</code> instruction. There is no corresponding operation for the index registers (<code>X</code> and <code>Y</code>).
Internally, the 65c816 is a fully 16-bit design. The <code>m</code> and <code>x</code> bits in <code>SR</code> determine how the user registers (accumulator and index) appear to the rest of the system. Upon reset, the 65c816 starts in 6502 emulation mode, in which <code>m</code> and <code>x</code> are locked to <code>1</code>. Hence the registers are locked to eight-bit size. The most significant byte (MSB) of the accumulator (the <code>B</code>-accumulator) is not directly accessible but can be swapped with the least significant byte (LSB) of the accumulator (the <code>A</code>-accumulator) by using the <code>XBA</code> instruction. There is no corresponding operation for the index registers (<code>X</code> and <code>Y</code>), whose MSBs are locked at <code>$00</code>.


Upon being switched to native mode, the MSB of <code>X</code> and <code>Y</code> will be zero, and the <code>B</code>-accumulator will be unchanged. If the <code>m</code> bit in <code>P</code> is cleared, the <code>B</code>-accumulator is "ganged" to the <code>A</code>-accumulator to form a 16-bit register (called the <code>C</code>-accumulator). A load/store or arithmetic/logical operation involving the accumulator and/or memory will be a 16-bit operation—two bus cycles are required to fetch/store a 16-bit value.
Upon being switched to native mode, the MSB of <code>X</code> and <code>Y</code> will be zero, and the <code>B</code>-accumulator will be unchanged. If the <code>m</code> bit in <code>SR</code> is cleared, the <code>B</code>-accumulator will be "ganged" to the <code>A</code>-accumulator to form a 16-bit register (called the <code>C</code>-accumulator). A load/store or arithmetic/logical operation involving the accumulator or memory will be a 16-bit operation—two bus cycles are required to fetch/store a 16-bit value.


If the <code>x</code> bit in <code>P</code> is cleared, both index registers will be set to 16 bits. If used to index an address, e.g., <code>LDA SOMEWHERE,X</code>, the 16-bit value in the index register will be added to the base address to form the effective address.
If the <code>x</code> bit in <code>SR</code> is cleared, both index registers will be set to 16 bits. If used to index an address, e.g., <code>LDA SOMEWHERE,X</code>, the 16-bit value in the index register will be added to the base address to form the effective address.


If the <code>m</code> bit in <code>P</code> is set the accumulator will return to being an 8-bit register and subsequent operations on the accumulator, with a few exceptions, will be 8-bit operations. The <code>B</code>-accumulator will retain the value it had when the accumulator was operating in 16-bit mode. The exceptions are the instructions that transfer the direct page register (<code>D</code>) and stack pointer (<code>S</code>) to/from the accumulator. These operations are always 16 bits wide in native mode, regardless of the condition of the <code>m</code> bit in the status register.
If the <code>m</code> bit in <code>SR</code> is set, the accumulator will return to being an 8-bit register and subsequent operations on the accumulator, with a few exceptions, will be 8-bit operations. The <code>B</code>-accumulator will retain the value it had when the accumulator was set to 16 bits. The exceptions are the instructions that transfer the direct page register (<code>DP</code>) and stack pointer (<code>SP</code>) to/from the accumulator. These operations are always 16 bits wide in native mode, regardless of the condition of the <code>m</code> bit in <code>SR</code>.


If the <code>x</code> bit in <code>P</code> is set, not only will the index registers return to being 8 bits, whatever was in the MSB while in 16-bit mode will be lost, something an assembly language programmer cannot afford to forget.{{sfn|Eyes|Lichty|1986|p=51}}
If the <code>x</code> bit in <code>SR</code> is set, not only will the index registers return to being 8 bits, whatever was in the MSB while they were 16 bits wide will be lost, something an assembly language programmer cannot afford to forget.{{sfn|Eyes|Lichty|1986|p=51}}


==Applications==
==Applications==
Line 236: Line 234:
* [[Super Nintendo Entertainment System]]: the [[video game console|console]]'s [[Ricoh 5A22]] CPU is based on the 65c816.
* [[Super Nintendo Entertainment System]]: the [[video game console|console]]'s [[Ricoh 5A22]] CPU is based on the 65c816.
** Additionally, 30+ Super NES games include the [[Nintendo SA-1|Nintendo SA1]], a 65c816-based co-processor chip, in each cartridge.
** Additionally, 30+ Super NES games include the [[Nintendo SA-1|Nintendo SA1]], a 65c816-based co-processor chip, in each cartridge.
* C256 Foenix's U/U+ model<ref>https://c256foenix.com/16bits-new-retro-computers/?v=7516fd43adaa</ref> and F256K model<ref>https://c256foenix.com/f256k/?v=7516fd43adaa</ref>
* Foenix Retro Systems C256 U/U+ model<ref>{{cite web |url=https://c256foenix.com/16bits-new-retro-computers/?v=7516fd43adaa |title=16bits CPU – New Retro Computers |website=Foenix Retro Systems}}</ref> and F256K model<ref>{{cite web |url=https://c256foenix.com/f256k/?v=7516fd43adaa |title=F256K |website=Foenix Retro Systems}}</ref>


==See also==
==See also==
Line 251: Line 249:
{{See also|MOS Technology 6502#Further reading|l1=List of books about 65xx microprocessor families}}
{{See also|MOS Technology 6502#Further reading|l1=List of books about 65xx microprocessor families}}
* ''[http://www.westerndesigncenter.com/wdc/documentation/w65c816s.pdf 65C816 Datasheet]''; Western Design Center; 55 pages; 2018.
* ''[http://www.westerndesigncenter.com/wdc/documentation/w65c816s.pdf 65C816 Datasheet]''; Western Design Center; 55 pages; 2018.
* Eyes, David and Ron Lichty; ''Programming the 65816: Including the 6502, 65C02, and 65802''; Brady Publishing; 636 pg; 2015
* Fischer, Michael; ''65816/65802 assembly language programming''; Osborne/McGraw-Hill; 686 pg; 1986


==External links==
==External links==
Line 257: Line 257:
* [http://www.zophar.net/tech/65816.html 65816/65C816 Technical Documents] - zophar.net
* [http://www.zophar.net/tech/65816.html 65816/65C816 Technical Documents] - zophar.net
* [http://www.defence-force.org/computing/oric/coding/annexe_2/ A 6502 Programmer's Introduction to the 65816] &ndash; A ''Commodore World'' article by Brett Tabke; includes [[Creative Micro Designs|CMD]]'s instruction set summary
* [http://www.defence-force.org/computing/oric/coding/annexe_2/ A 6502 Programmer's Introduction to the 65816] &ndash; A ''Commodore World'' article by Brett Tabke; includes [[Creative Micro Designs|CMD]]'s instruction set summary
* [http://sbc.bcstechnology.net/65c816interrupts.html Investigating 65C816 Interrupts] &ndash; An extensive discussion of interrupt processing on the 65C816
* [http://sbc.steggy.net/65c816interrupts.html Investigating 65C816 Interrupts] &ndash; An extensive discussion of interrupt processing on the 65C816
<!--- *[http://www.winbond-usa.com/mambo/content/view/286/523/#ProductSelectionGuide Winbond TV Edutainment IC Selection Guide] &ndash; ICs with 65816 CPU core --->
<!--- *[http://www.winbond-usa.com/mambo/content/view/286/523/#ProductSelectionGuide Winbond TV Edutainment IC Selection Guide] &ndash; ICs with 65816 CPU core --->



Revision as of 03:59, 16 July 2024

WDC 65C816
PDIP40 package
General information
Launched1985; 39 years ago (1985)
Common manufacturer
Performance
Max. CPU clock rate1 MHz to 14 MHz
Data width
  • 8 (external)
  • 16 (internal)
Address width24 bits
Architecture and classification
Instruction set6502
Number of instructions92
Physical specifications
Packages
Products, models, variants
Variant
    • W65C802 (pin-compatible with W65C02)
    • W65C265 (MCU)
History
Predecessors
SuccessorWDC 65C832[1][2][3] (never released)

The W65C816S (also 65C816 or 65816) is a 16-bit microprocessor (MPU) developed and sold by the Western Design Center (WDC). Introduced in 1983,[4] the W65C816S is an enhanced version of the WDC 65C02 8-bit MPU, itself a CMOS enhancement of the venerable MOS Technology 6502 NMOS MPU. The 65C816 is the CPU for the Apple IIGS and, in modified form, the Super Nintendo Entertainment System.

The 65 in the part's designation comes from its 65C02 compatibility mode, and the 816 signifies that the MPU has selectable 8- and 16-bit register sizes. In addition to the availability of 16-bit registers, the W65C816S extends memory addressing to 24 bits, supporting up to 16 megabytes of random-access memory. It has an enhanced instruction set and a 16-bit stack pointer, as well as several new electrical signals for improved system hardware management.

At reset, the W65C816S starts in "emulation mode", meaning it substantially behaves as a 65C02. Thereafter, the W65C816S may be switched to "native mode" with a two instruction sequence, causing it to enable all enhanced features, yet still maintain a substantial degree of backward compatibility with most 65C02 software. However, unlike the PDIP40 version of the 65C02, which is a pin-compatible replacement for its NMOS ancestor, the PDIP40 W65C816S is not pin-compatible with any other 6502 family MPU.

The W65C802 or 65802 is completely software-compatible with the 65C816, but is electrically-compatible with the 6502 and 65C02. Hence the 65C802 could be used as a drop-in replacement in most systems equipped with a 6502 or 65C02. However, the 65C802 cannot emit a 24-bit address, which limits it to a 64 KB address space. The 65C802 is no longer produced.

History

PLCC-44 version of W65C816S microprocessor, shown mounted on a single-board computer.

In 1981, Bill Mensch, founder and CEO of WDC, began development of the 65C02 with his production partners, primarily Rockwell Semiconductor and Synertek. The primary goal of the 65C02 effort was to move from the original 6502's NMOS process to the CMOS process, which would allow it to run at much lower power levels, somewhere between 110 and 120 at any given clock speed. Also desired was the ability to raise the maximum supported clock speed. The 65C02 design addressed chip errata present in the NMOS 6502 (e.g., the infamous JMP (<addr>) bug) and introduced new instructions and new addressing modes for some existing instructions.[5]

Development of the W65C816S commenced in 1982 after Mensch consulted with Apple Computer on a new version of the Apple II series of personal computers that would, among other things, have improved graphics and sound. Apple wanted an MPU that would be software compatible with the 6502 then in use in the Apple II but with the ability to address more memory, and to load and store 16 bit words. The result was the 65C816, finished in March 1984, with samples provided to both Apple and Atari in the second half of the year and full release in 1985.[6] Mensch was aided during the design process by his sister Kathryn, who was responsible for part of the device's layout.

The same process also led to the 65C802, which was identical inside to the 65C816. Both were produced on the same fabrication lines and diverged only during the last metalization stages when the chip was being connected to the external pins. In the 65C802, those pins had the same layout as the original 6502, which allowed it to be used as a drop-in replacement while still allowing the 16-bit processing of the CPU to be used. However, as it used the original pinout it had only 16 addressing pins, and could therefore only access 64 KB of external memory.[7] Typically, when hardware manufacturers designed a project from the ground up, they used the 65C816 rather than the 65C802, resulting in the latter being withdrawn from production.

Apple subsequently integrated the 65C816 into the Apple IIGS computer. The basic 65C816 design was second-sourced by VLSI Technology,[8] GTE, Sanyo and others from the mid-to-late 1980s to the early 1990s.

In the 1990s, both the 65C816 and 65C02 were converted to a fully static core, which made it possible to completely stop the processor's Ø2 clock without loss of register contents. This feature, along with the use of asynchronous static RAM, made it possible to produce designs that used minimal power when in a standby state.

As of April 2024, the W65C816S is available from WDC in 40 pin PDIP, PLCC44, or 44-pin TQFP packaging, as an MCU through the W65C265,[9] and as IP cores for ASIC integration[10][11] (for example Winbond's W55V9x series of TV Edutainment ICs[12]).

W65C802P

Features

WDC 65c816 registers
23 22 21 20 19 18 17 16 15 14 13 12 11 10 09 08 07 06 05 04 03 02 01 00 (bit position)
Main registers
B A accumulators (C Combined)
Index registers
X X index
Y Y index
0 0 0 0 0 0 0 0 DP Direct Page pointer
0 0 0 0 0 0 0 0 SP Stack Pointer
DB 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 Data Bank register
PB PC Program Bank : Program Counter
Status register
n v m x d i z c Program status register
e Program status register mode flag

WDC 65c816 features:

  • Fully static CMOS design offers low power consumption (300 µA at 1 MHz) and increased noise immunity.
  • Wide operating voltage range: 1.8 V to 5.0 V ± 5%.
  • Wide operating frequency range, officially 14 MHz maximum at 5 volts (20Mhz in SuperCPU), using a single-phase clock source.
  • Emulation mode allows substantial software compatibility with the NMOS 6502 and CMOS 65C02, excepting undocumented opcodes. All 256 opcodes in the 65C816 are functional in both operating modes.
  • 24-bit memory addressing provides access to 16 MB of memory space.
  • 16-bit ALU, accumulator (C), stack pointer (SP), and index registers (X and Y).
  • 16-bit direct page (aka zero page) register (DP).
  • 8-bit data bank (DB) and program bank (PB) registers, generating bits 16–23 of 24-bit code and data addresses. Separate program and data bank registers allow program segmentation and 16 MB linear data addressing.
  • Valid data address (VDA) and valid program address (VPA) control outputs for memory qualification, dual cache and cycle steal DMA implementation.
  • Vector pull (VPB) control output to indicate when an interrupt vector is being fetched.
  • Abort (ABORTB) input and associated vector supports processor repairs of bus error conditions, such as page faults and memory access violations.
  • Direct page register and stack relative addressing provides capability for reentrant, recursive and re-locatable programming.
  • 24 addressing modes—13 original 6502 modes with 92 instructions using 256 op codes, including most new opcodes implemented in the 65C02.
  • Block-copy instructions (MVN and MVP), allowing rapid copying of data structures from one area of RAM to another with minimal code.
  • Wait-for-Interrupt (WAI) and Stop-the-Clock (STP) instructions further reduce power consumption, decrease interrupt latency and allow synchronization with external events.
  • Co-Processor (COP) instruction with associated vector supports co-processor configurations, e.g., floating point processors.
  • Reserved "escape" (WDM) instruction for future two-byte opcodes and a link to future designs (WDM are the initials of W65C816S designer William D. Mensch).

Comparison with earlier models

Two modes

The 65C816 has two operating modes: "emulation mode", in which the 16-bit operations are invisible—the index registers are forced to eight bits—and the chip appears to be very similar to the 6502, with the same cycle timings for the opcodes; and "native mode", which exposes all new features. The CPU automatically enters emulation mode when it is powered on or reset, which allows it to replace a 65(C)02, assuming one makes the required circuit changes to accommodate the different pin layout.[5]

16-bit registers

The most obvious change to the 65C816 when running in native mode is the expansion of the various registers from 8-bit to 16-bit sizes. This enhancement affects the accumulator (A), the X and Y index registers, and the stack pointer (SP). It does not affect the program counter (PC), which has always been 16-bit.[13]

When running in native mode, two bits in the status register change their meaning. In the original 6502, bits 4 and 5 were not used, although bit 4 is referred to as the break (b) flag. In native mode, bit 4 becomes the x flag and bit 5 becomes the m flag. These bits control whether or not the index registers (x) and accumulator/memory (m) are 8-bit or 16-bit in size. Zeros in these bits set 16-bit sizes, ones set 8-bit sizes. These bits are locked at ones when the processor is powered on or reset, but become changeable when the processor is switched to native mode.[13]

In native mode operation, the accumulator and index registers may be set to 16- or 8-bit sizes at the programmer’s discretion by using the REP and SEP instructions to manipulate the m and x status register bits. This feature gives the programmer the ability to perform operations on either word- and byte-size data. As the accumulator and index register sizes are independently settable, it is possible, for example, to have the accumulator set to eight bits and the index registers set to 16 bits, giving the programmer the ability to manipulate individual bytes over a 64KB range without having to perform pointer arithmetic.

When register sizes are set to 16 bits, a memory access will fetch or store two contiguous bytes at the rate of one byte per clock cycle. Hence a read-modify-write instruction, such as ROR <addr>, when used while the accumulator is set to 16 bits, will affect two contiguous bytes of memory, not one and will consume more clock cycles than when the accumulator is set to eight bits. Similarly, all arithmetic and logical operations will be 16-bit operations.[14]

24-bit addressing

The other major change to the system while running in native mode is that the memory model is expanded to a 24-bit format from the original 16-bit format of the 6502. The 65c816 makes use of two 8-bit registers, the data bank register (DB) and the program bank register (PB), to set bits 16-23 of the address, effectively generating 24-bit addresses. In both cases, 'bank' refers to a contiguous 64 KB segment of memory that is bounded by the address range $xx0000-$xxFFFF, where xx is the bank address, that is, bits 16-23 of the effective address. Both DB and PB are initialized to $00 at power-on or reset.[15]

During an opcode or operand fetch cycle, PB is prepended to the program counter (PC) to form the 24-bit effective address. Should PC "wrap" (return to zero), PB will not be incremented. Hence a program is bounded by the limits of the bank in which it is executing. Implied by this memory model is that branch and subroutine targets must be in the same bank as the instruction making the branch or call, unless "long" jumps or subroutine calls are used to execute code in another bank. There is no programmatic means by which PB can be directly changed.[16]

During a data fetch or store cycle, DB is prepended to a 16-bit data address to form the 24-bit effective address at which data will be accessed. This processor characteristic makes it possible to sanely execute 6502 or 65c02 code that uses 16-bit addresses to reference data elements. Unlike PB, DB can be changed under program control, something that might be done to access data beyond the limits of 16-bit addressing. Also, DB will temporarily increment if an address is indexed beyond the limits of the bank currently in DB. DB is ignored if a 24-bit address is specified as the operand to a data fetch/store instruction, or if the effective address is on direct (zero) page or the hardware stack. In the latter case, an implied bank $00 is used to generate the effective address.[17]

A further addition to the register set is the 16-bit direct page register (DP), which sets the base address for what was formerly called the zero page, but now referred to as direct page. Direct page addressing uses an 8-bit address, which results in faster access than when a 16- or 24-bit address is used. Also, some addressing modes that offer indirection are only possible on direct page. In the 65(c)02, the direct page is always the first 256 bytes of memory, hence “zero page”. In native mode, the 65c816 can relocate direct (zero) page anywhere in bank $00 (the first 64 KB of memory) by writing the 16-bit starting address into DP. There is a one-cycle access penalty if DP is not set to an exact page boundary, that is, if the value in DP is not $xx00, where xx is the most-significant byte.[18]

Switching between modes

The current mode of operation is stored in the emulation (e) bit. Having already added the new x and m bits to the previous set of six flags in the status register (SR), there were not enough bits left to hold the new mode bit. Instead, a unique solution was used in which the mode bit was left "invisible", unable to be directly accessed. The XCE (eXchange Carry with Emulation) instruction exchanges the value of the emulation bit with the carry (c) bit, bit 0 in SR. For instance, if one wants to enter native mode after the processor has started up, one would use CLC to clear the carry bit, and then XCE to write it to the emulation bit.[19] Returning to 65c02 emulation mode uses SEC followed by XCE.[20]

Internally, the 65c816 is a fully 16-bit design. The m and x bits in SR determine how the user registers (accumulator and index) appear to the rest of the system. Upon reset, the 65c816 starts in 6502 emulation mode, in which m and x are locked to 1. Hence the registers are locked to eight-bit size. The most significant byte (MSB) of the accumulator (the B-accumulator) is not directly accessible but can be swapped with the least significant byte (LSB) of the accumulator (the A-accumulator) by using the XBA instruction. There is no corresponding operation for the index registers (X and Y), whose MSBs are locked at $00.

Upon being switched to native mode, the MSB of X and Y will be zero, and the B-accumulator will be unchanged. If the m bit in SR is cleared, the B-accumulator will be "ganged" to the A-accumulator to form a 16-bit register (called the C-accumulator). A load/store or arithmetic/logical operation involving the accumulator or memory will be a 16-bit operation—two bus cycles are required to fetch/store a 16-bit value.

If the x bit in SR is cleared, both index registers will be set to 16 bits. If used to index an address, e.g., LDA SOMEWHERE,X, the 16-bit value in the index register will be added to the base address to form the effective address.

If the m bit in SR is set, the accumulator will return to being an 8-bit register and subsequent operations on the accumulator, with a few exceptions, will be 8-bit operations. The B-accumulator will retain the value it had when the accumulator was set to 16 bits. The exceptions are the instructions that transfer the direct page register (DP) and stack pointer (SP) to/from the accumulator. These operations are always 16 bits wide in native mode, regardless of the condition of the m bit in SR.

If the x bit in SR is set, not only will the index registers return to being 8 bits, whatever was in the MSB while they were 16 bits wide will be lost, something an assembly language programmer cannot afford to forget.[21]

Applications

Systems based on 65c816 variants:

See also

References

Citations

  1. ^ Mensch, William D. "A Report on the 65c832". Archived from the original on 7 Apr 2024. Retrieved 7 Apr 2024.
  2. ^ "W65C832 Information, Specification, and Data Sheet (March 1990)" (PDF). ReActive Micro. 6 Sep 2010 [Mar 1990]. Archived (PDF) from the original on 7 Apr 2024. Retrieved 7 Apr 2024.
  3. ^ "W65C832 Information, Specification, and Data Sheet" (PDF). 6502.org. Archived from the original (PDF) on 30 Jun 2023. Retrieved 7 Apr 2024.
  4. ^ Chronology of Microprocessors (1980–1989)
  5. ^ a b Eyes & Lichty 1986, p. 42.
  6. ^ Eyes & Lichty 1986, p. 44.
  7. ^ Eyes & Lichty 1986, p. 45.
  8. ^ Application Specific Logic Products Data Book 1988. VLSI Technology Inc. 1988. pp. 257–279. Retrieved 18 March 2024.
  9. ^ "W65C265S 16-bit Microcontroller". The Western Design Center, Inc. 5 Jan 2021. Archived from the original on 2 Apr 2024. Retrieved 7 Apr 2024.
  10. ^ "W65C816 8/16-bit Microprocessor". The Western Design Center, Inc. 5 Jan 2021. Archived from the original on 15 Nov 2023. Retrieved 7 Apr 2024.
  11. ^ "W65C265S 8/16-bit Microcontroller". The Western Design Center, Inc. 5 Jan 2021. Archived from the original on 7 Apr 2024. Retrieved 7 Apr 2024.
  12. ^ "W55V92 TV-toy Controller Data Sheet" (PDF). Arrow Electronics. 2 May 2006. Retrieved 12 June 2024.
  13. ^ a b Eyes & Lichty 1986, p. 46.
  14. ^ Eyes & Lichty 1986, p. 52.
  15. ^ Eyes & Lichty 1986, p. 53.
  16. ^ Eyes & Lichty 1986, p. 54.
  17. ^ Eyes & Lichty 1986, p. 55.
  18. ^ Eyes & Lichty 1986, p. 80.
  19. ^ Eyes & Lichty 1986, p. 64.
  20. ^ Eyes & Lichty 1986, p. 65.
  21. ^ Eyes & Lichty 1986, p. 51.
  22. ^ "16bits CPU – New Retro Computers". Foenix Retro Systems.
  23. ^ "F256K". Foenix Retro Systems.

Bibliography

Further reading

  • 65C816 Datasheet; Western Design Center; 55 pages; 2018.
  • Eyes, David and Ron Lichty; Programming the 65816: Including the 6502, 65C02, and 65802; Brady Publishing; 636 pg; 2015
  • Fischer, Michael; 65816/65802 assembly language programming; Osborne/McGraw-Hill; 686 pg; 1986