Ghost6502

A MOS 6502 simulator built from JavaScript built-in functions, which makes it usable for anti-debugging.

This project is an experiment from the article "JS Obfuscation: Mining Built-in Functions and Crafting Non-debuggable VMs".

Demo1

This demo calculates the sum of numbers from 1 to 10:

import OP from './opcode.js'
import ghost6502 from './ghost6502.js'

ghost6502.mem.set([
  OP.LDA_IMM, 0,    // A = 0
  OP.LDX_IMM, 1,    // X = 1
  OP.STX_ZPG, 100,  // LOOP:
  OP.ADC_ZPG, 100,  // A += X
  OP.INX,           // X += 1
  OP.CPX_IMM, 11,   // IF X < 11 THEN GOTO LOOP
  OP.BCC,     -9,
])

debugger
ghost6502.reset()   // 👈🏻 Can you step into this function?

alert('sum(1, 10) = ' + ghost6502.reg.a)

http://etherdream.github.io/ghost6502/demo1.html

Open the debugger and you will find that you cannot step into the ghost6502.reset function!

The function actually performs many operations, but none of them has JavaScript source code for the debugger to step through.
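
The -9 operand of BCC in the listing above is a signed offset relative to the address of the instruction that follows the branch. Here is a minimal sketch of that arithmetic, assuming the program is loaded at address 0, memory is a byte array, and each OP constant occupies one byte. The branchTarget helper is hypothetical and not part of ghost6502:

```javascript
// Hypothetical helper (not part of ghost6502): resolve a 6502 relative branch.
function branchTarget(branchAddr, operandByte) {
  // Sign-extend the 8-bit operand so that 0xF7 becomes -9.
  const offset = (operandByte << 24) >> 24
  // The offset is applied to the address of the instruction after the branch.
  return (branchAddr + 2 + offset) & 0xFFFF
}

// A byte array stores -9 as its two's complement form.
const operand = Uint8Array.of(-9)[0]

// In Demo1, BCC sits at address 11 and LOOP (the STX instruction) at address 4.
const target = branchTarget(11, operand)
console.log(operand.toString(16), target)
```

This is why a plain negative number can be written in the program array: the byte that ends up in memory is the same one an assembler would emit.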

Demo2

This demo reads P and Q from input boxes, calculates their sum, and shows the result in a message box:

import OP from './opcode.js'
import ghost6502 from './ghost6502.js'

const inputP = prompt.bind(window, 'Enter the value P', 10)
const inputQ = prompt.bind(window, 'Enter the value Q', 20)
const output = alert.bind(window, ghost6502.bus.data)

ghost6502.bus.mapRead(253, inputP)
ghost6502.bus.mapRead(254, inputQ)
ghost6502.bus.mapWrite(255, output)

ghost6502.mem.set([
  OP.LDA_ZPG, 253,    // A = inputP()
  OP.LDX_ZPG, 254,    // X = inputQ()
  OP.STX_ZPG, 100,    // mem[100] = X
  OP.ADC_ZPG, 100,    // A += mem[100]
  OP.STA_ZPG, 255,    // output(A)
])

debugger
ghost6502.reset()

http://etherdream.github.io/ghost6502/demo2.html

Since the input box and message box are based on built-in functions, the ghost6502.reset function is still non-debuggable!

Demo3

This demo uses a timer to generate interrupts for simulating events:

import OP from './opcode.js'
import ghost6502 from './ghost6502.js'

const obj = document.body
const key = 'textContent'
const val = {
  toString: ''.concat.bind('count: ', ghost6502.bus.data)
}
const display = Reflect.set.bind(null, obj, key, val)
ghost6502.bus.mapWrite(255, display)


const PROGRAM_ADDRESS = 0x8000
const RESET_ADDRESS = 0xFFFC
const IRQ_ADDRESS = 0xFFFE

const program = [
  /* $8000 */ OP.LDX_IMM, 60,     // X = 60
  /* $8002 */ OP.STX_ZPG, 255,    // display(X)
  /* $8004 */ OP.BRK,             // exit reset

  /* $8005 */ OP.INX,             // X++
  /* $8006 */ OP.STX_ZPG, 255,    // display(X)
  /* $8008 */ OP.RTI,             // exit interrupt
]
ghost6502.mem.set(program, PROGRAM_ADDRESS)
ghost6502.mem.set([0x00, 0x80], RESET_ADDRESS)
ghost6502.mem.set([0x05, 0x80], IRQ_ADDRESS)

ghost6502.reset()
setInterval(ghost6502.irq, 1000)

http://etherdream.github.io/ghost6502/demo3.html

Even when the "Timer" option in "Event Listener Breakpoints" is checked, the irq function still cannot be paused by the debugger.

API

Demo3 above sets a reset vector so that the program starts at the specified address. Demo1 and Demo2 do not set one, so the default entry address 0x0000 is used.
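
The two-byte arrays written to the vector addresses in Demo3 are 16-bit addresses in little-endian order (low byte first). The sketch below shows the encoding with two hypothetical helpers, toVector and fromVector, which are not part of ghost6502. The zero-filled Uint8Array is only a stand-in for the VM memory:

```javascript
// Hypothetical helpers (not part of ghost6502) for little-endian 6502 vectors.
function toVector(addr) {
  return [addr & 0xFF, (addr >> 8) & 0xFF]   // [low byte, high byte]
}

function fromVector(mem, vectorAddr) {
  return mem[vectorAddr] | (mem[vectorAddr + 1] << 8)
}

const mem = new Uint8Array(0x10000)    // stand-in for 64 KiB of zero-filled memory
const unset = fromVector(mem, 0xFFFC)  // a vector that was never written

mem.set(toVector(0x8000), 0xFFFC)      // reset vector, as in Demo3
mem.set(toVector(0x8005), 0xFFFE)      // IRQ vector, as in Demo3

console.log(unset, fromVector(mem, 0xFFFC).toString(16))
```

Under the assumption that memory starts zero-filled, an unset vector decodes to 0x0000, which matches the default entry address used by Demo1 and Demo2.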

If you do not want to set the entry address via the reset vector, you can manually initialize the PC register and start the program using the run function:

const ENTRY_ADDR = 0x8000
const program = [
  // ...
]
ghost6502.mem.set(program, ENTRY_ADDR)
ghost6502.reg.pc.fill(ENTRY_ADDR)
ghost6502.run()

For simplicity and efficiency, this VM does not simulate clock cycles. It uses an interpreter loop instead, and each iteration executes one instruction.

The run function executes at most loop instructions, and stops early when it encounters BRK (0x00), RTI, or an illegal instruction. The reset, irq, and nmi functions also call run internally.

The default value of loop is 2³² - 1, so a single call to run is effectively unbounded. You can change it with the setLoop function to define your own time slice:

// each run() executes at most 10k instructions
ghost6502.setLoop(10000)

function onTimeSlice() {
  if (ghost6502.run() === -1) {
    return
  }
  requestIdleCallback(onTimeSlice)
}
onTimeSlice()

This splits a time-consuming task across multiple calls, so the main thread is never blocked for long. The slicing is transparent to the 6502 program, which does not need to be changed.
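
To see how the slice size affects the number of calls, here is a sketch that runs outside the browser. It uses a hypothetical stand-in object, not the real VM, and it assumes (as the snippet above suggests) that run() returns -1 once the program has stopped:

```javascript
// Hypothetical stand-in for the VM. Assumption: run() executes at most `loop`
// instructions and returns -1 once the program has stopped.
function makeFakeVm(totalInstructions) {
  let remaining = totalInstructions
  let loop = 0xFFFFFFFF
  return {
    setLoop(n) { loop = n },
    run() {
      const executed = Math.min(loop, remaining)
      remaining -= executed
      return remaining === 0 ? -1 : executed
    },
  }
}

// Synchronous version of onTimeSlice that counts how many slices were needed.
function countSlices(vm) {
  let slices = 1
  while (vm.run() !== -1) slices++
  return slices
}

const vm = makeFakeVm(25000)
vm.setLoop(10000)
console.log(countSlices(vm))
```

A program of 25,000 instructions with a 10,000-instruction slice therefore needs three calls. In the browser each of those calls would be scheduled separately by requestIdleCallback.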

Furthermore, you can use the runOp function to execute a single instruction, which allows for more flexible scheduling. See index.d.ts for details.

You can get the source code from ghost6502.ts, or install it from NPM:

npm install ghost6502

Note that this package does not include the opcode enum file. You can get it from opcode.js or opcode.ts.

Playground

http://etherdream.github.io/ghost6502/

The default program is a snake game (WASD keys for directions):

source code: snake.asm

In addition to the CPU interpreter, keyboard reading, canvas rendering, and register bar updating are also driven by built-in functions, so none of them can be debugged!

The second program is for drawing. Press the left mouse button to draw a point, the right mouse button to erase a point, and the 0~9 keys to switch colors:

source code: paint.asm

Note that this page does not include an assembler. It only extracts hex bytes from the textarea. You can generate the hex bytes with virtual 6502. For example:

.ORG $FF80
  LDX 0        ; x = 0
  BRK          ; exit reset

.ORG $FF90
  INX          ; x++
  STX $0200    ; draw(0, 0, x)
  RTI          ; exit interrupt

.ORG $FFFC
  .WORD $FF80  ; RESET
  .WORD $FF90  ; IRQ (60FPS)

Copy the above code into "src" and click the "Assembly" button, then copy the contents of "object code" into the playground:

FF80: A6 00 00 00 00 00 00 00
FF90: E8 8E 00 02 40 00 00 00
FFF8: 00 00 00 00 80 FF 90 FF

Blank lines can be omitted.

Click the "Reset" button to run this bytecode. You can see that the color of the first point will keep changing.
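
The following sketch shows one way such a hex listing could be loaded into memory. The line format (a four-digit address, a colon, then space-separated bytes) is inferred from the listing above. The loadHex function is hypothetical and is not the playground's actual code:

```javascript
// Hypothetical loader (not the playground's actual code).
function loadHex(text, mem) {
  for (const line of text.split(/\r?\n/)) {
    const match = /^\s*([0-9a-f]{4})\s*:\s*(.*)$/i.exec(line)
    if (!match) continue                       // skip blank or malformed lines
    const base = parseInt(match[1], 16)
    const bytes = match[2].trim().split(/\s+/).filter(Boolean)
    bytes.forEach((hex, i) => {
      mem[base + i] = parseInt(hex, 16)
    })
  }
  return mem
}

const listing = `
FF80: A6 00 00 00 00 00 00 00
FF90: E8 8E 00 02 40 00 00 00
FFF8: 00 00 00 00 80 FF 90 FF
`
const mem = loadHex(listing, new Uint8Array(0x10000))

// The reset vector is stored little-endian at 0xFFFC.
const resetVector = mem[0xFFFC] | (mem[0xFFFD] << 8)
console.log(resetVector.toString(16))
```

Lines that do not match the pattern are skipped, which is consistent with blank lines being optional.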

This playground provides the following APIs through memory mapping:

API	Type	Address	Description
Mouse Button	Input	0xFB	none: 0, left: 1, right: 2
Mouse X	Input	0xFC	[0, 32)
Mouse Y	Input	0xFD	[0, 32)
Random	Input	0xFE	[0, 256)
Last Key	Input	0xFF	ASCII code, 0 if keyup
Screen	Output	[0x0200, 0x0600)	32x32 (1 byte/pixel)
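
As an illustration of the Screen mapping, the sketch below converts pixel coordinates into a memory address. It assumes a row-major layout (one 32-byte row after another), which the table does not state explicitly. The pixelAddress helper is hypothetical:

```javascript
// Assumption: the 32x32 screen is stored row by row, one byte per pixel.
const SCREEN_BASE = 0x0200
const SCREEN_WIDTH = 32
const SCREEN_HEIGHT = 32

// Hypothetical helper (not part of the playground).
function pixelAddress(x, y) {
  if (x < 0 || x >= SCREEN_WIDTH || y < 0 || y >= SCREEN_HEIGHT) {
    throw new RangeError('pixel out of range')
  }
  return SCREEN_BASE + y * SCREEN_WIDTH + x
}

console.log(pixelAddress(0, 0).toString(16))    // first pixel
console.log(pixelAddress(31, 31).toString(16))  // last pixel
```

The first pixel maps to 0x0200, the address written by the STX $0200 instruction in the example above, and the last pixel maps to 0x05FF, just below the exclusive upper bound 0x0600.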

Performance

Here's a performance test using an infinite loop:

import OP from './opcode.js'
import ghost6502 from './ghost6502.js'

ghost6502.mem.set([OP.JMP_ABS, 0x00, 0x00])
ghost6502.setLoop(1e6)

const t0 = performance.now()
ghost6502.run()
const t1 = performance.now()

const ips = (1e6 / (t1 - t0) * 1000) | 0
console.log('Speed:', ips.toLocaleString() + ' IPS')

http://etherdream.github.io/ghost6502/perf.html

It reaches about 1.7 MIPS on my M1 MacBook Pro. Because each instruction maps to different underlying operations, real programs will deviate from this figure.

Although the performance is significantly worse than normal JavaScript, it is still several times faster than the original 6502 from decades ago, which ran at 1-3 MHz and only achieved a few hundred kIPS.
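
A rough back-of-the-envelope check of that comparison follows. Documented 6502 instructions take between 2 and 7 clock cycles. The average of 4 cycles per instruction used below is an assumption chosen for illustration, not a measured value:

```javascript
// Assumption: an average of 4 clock cycles per instruction.
const ASSUMED_AVG_CYCLES = 4

function estimateIps(clockHz, avgCycles = ASSUMED_AVG_CYCLES) {
  return clockHz / avgCycles
}

const original1MHz = estimateIps(1e6)   // 250,000 IPS
const original3MHz = estimateIps(3e6)   // 750,000 IPS
const measuredVm = 1.7e6                // figure quoted above

console.log('1 MHz:', original1MHz.toLocaleString('en-US'), 'IPS')
console.log('3 MHz:', original3MHz.toLocaleString('en-US'), 'IPS')
console.log('speedup vs 1 MHz:', (measuredVm / original1MHz).toFixed(1) + 'x')
```

Under that assumption the original chip lands in the range of a few hundred kIPS, so the VM is several times faster than a 1 MHz part.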

How it works

Built-in functions are implemented in native code rather than JavaScript, so there is no source for the debugger to show and it can only step over them. For example:

const set = new Set()
const add = set.add.bind(set)

const arr = [11, 22, 33, 44, 55, 66]
const run = arr.forEach.bind(arr, add)

// ƒ add() { [native code] }
console.log(add)

// ƒ forEach() { [native code] }
console.log(run)

debugger

// This step will call the `add` method 6 times,
// but the debugger cannot step into it.
run()

console.log(set)
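
The same property can be checked programmatically. The sketch below uses Function.prototype.toString to tell a bound built-in apart from an ordinary function. The isNative helper is hypothetical and only a heuristic, since a function body could itself contain that text:

```javascript
// Hypothetical heuristic: built-in and bound functions stringify
// with a "[native code]" body instead of real source text.
function isNative(fn) {
  return Function.prototype.toString.call(fn).includes('[native code]')
}

const set = new Set()
const add = set.add.bind(set)

const arr = [11, 22, 33]
const run = arr.forEach.bind(arr, add)

function plain(x) {
  return x + 1
}

console.log(isNative(add), isNative(run), isNative(plain))

run()   // calls set.add once per element
console.log(set.size)
```

Every link in the chain (add, run, and the forEach that drives them) is native, so there is no JavaScript frame for the debugger to enter.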

By mining built-in functions as raw material and crafting components such as registers, arithmetic logic units, instruction decoders, and clock signals, we can construct a Turing-complete and non-debuggable CPU.

...

For example, we can use an object with a valueOf or toString property as a trigger. The property is invoked whenever the object is converted to a number or string:

const a = {
  valueOf: prompt.bind(window, 'Enter the value A', 2)
}
const b = {
  valueOf: prompt.bind(window, 'Enter the value B', 10)
}
const pow = Math.pow.bind(null, a, b)

const cat = ''.concat.bind('A ** B = ', {
  toString: pow
})
const run = alert.bind(window, {
  toString: cat
})

debugger
run()

This allows multiple operations to be chained together.
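
The example above needs a browser because of prompt and alert. The following variant is a sketch of the same trigger idea that also runs in Node.js. It is an illustration only and not code taken from ghost6502:

```javascript
const counter = Uint32Array.of(5)

// Converting `next` to a number calls Atomics.add(counter, 0, 1),
// which increments the counter and returns its previous value.
const next = {
  valueOf: Atomics.add.bind(Atomics, counter, 0, 1)
}

// Math.pow converts its first argument to a number, firing the trigger.
const square = Math.pow.bind(Math, next, 2)

const first = square()    // uses 5, counter becomes 6
const second = square()   // uses 6, counter becomes 7

console.log(first, second, counter[0])
```

Each call to square reads and advances the counter without any user-written function body being executed.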

Similarly, we can use array getters to store multiple operations and trigger them through an iterative method:

const arr = Object.defineProperties([], [
  { get: console.log.bind(console, 'begin') },
  { get: alert.bind(window, '11')           },
  { get: alert.bind(window, '22')           },
  { get: console.log.bind(console, 'end')   },
])
const run = arr.includes.bind(arr, 0xDEADBEEF)
run()
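
Here is a Node.js-friendly sketch of the same getter technique, with alert replaced by a bound Array.prototype.push so the order of calls can be inspected afterwards. The names log and record are introduced for this illustration only:

```javascript
const log = []
const record = log.push.bind(log)   // built-in push, bound to `log`

// Each index of the array is a getter that records one step.
const arr = Object.defineProperties([], [
  { get: record.bind(null, 'begin') },
  { get: record.bind(null, 'step')  },
  { get: record.bind(null, 'end')   },
])

// push returns the new length (1, 2, 3), which never equals -1,
// so includes reads every index and fires every getter.
const run = arr.includes.bind(arr, -1)
const found = run()

console.log(found, log)
```

If one of the getters returned the searched value, includes would stop there, which gives a simple way to end a sequence early.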

We can use the Atomics API to implement arithmetic and logical operations:

const reg = Uint32Array.of(100)
Atomics.add(reg, 0, 200)
console.log(reg[0])   // 300 (100 + 200)

Atomics.xor(reg, 0, 100)
console.log(reg[0])   // 328 (300 ^ 100)
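
Since the 6502 is an 8-bit CPU, a typed array with 8-bit elements gives wraparound for free. The sketch below illustrates this. It is not necessarily how ghost6502 models its registers, and the carry is computed with ordinary arithmetic purely for illustration:

```javascript
const acc = Uint8Array.of(100)

// Atomics.add returns the value held before the addition.
const previous = Atomics.add(acc, 0, 200)

// 100 + 200 = 300, which wraps to 300 - 256 = 44 in an 8-bit element.
const wrapped = acc[0]

// Illustration only: the carry is set when the true sum exceeds 0xFF.
const carry = previous + 200 > 0xFF

console.log(previous, wrapped, carry)
```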

Furthermore, we can use array callbacks to implement loops:

const loop = Array(10)
const input = {
  toString: prompt.bind(window, 'Enter stop to exit')
}
const strcmp = {
  valueOf: ''.localeCompare.bind('stop', input)
}
const lut = [].at.bind([true, false], strcmp)
loop.find(lut)

This allows the CPU clock signal to be simulated.
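
The loop above depends on prompt. Here is a sketch of the same pattern that runs in Node.js, with a counter as the stop condition. The names counter, tick, stopTable, and step are introduced for this illustration:

```javascript
const counter = Int32Array.of(0)

// Converting `tick` to a number increments the counter
// and returns the value it held before the increment.
const tick = {
  valueOf: Atomics.add.bind(Atomics, counter, 0, 1)
}

// Lookup table indexed by the previous counter value:
// keep going for 0, 1, 2 and stop at 3.
const stopTable = [false, false, false, true]
const step = [].at.bind(stopTable, tick)

// find calls `step` for each of the 10 slots until it returns true.
Array(10).find(step)

console.log(counter[0])
```

The array length acts as an upper bound on the number of iterations, and the lookup table decides when to leave the loop early. Here the callback runs four times before the table returns true.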

Unsupported features

Clock cycles
BCD math
Software interrupt (The BRK instruction is only used to exit)
Undocumented opcodes

TODO

I'm trying to make a non-debuggable NES emulator. The first approach is based on this project. The second is to port all the logic to GLSL and run it on the GPU.
