Assembly language tutorial overview

Simple Instructions - SkullSecurity https://wiki.skullsecurity.
org/Simple_Instructions
Simple Instructions
From SkullSecurity
This section will go over some basic assembly

instructions that you will likely see frequently. Assembly Language Tutorial
Some of the functions shown here are tricky, and Please choose a tutorial page:
some have special properties (such as the registers
they use). Additionally, x86 assembly is Fundamentals -- Information about C
comprised of hundreds of different instructions. Tools
As a result, you will likely want to find a complete Registers
reference book or website to have alongside you. Simple Instructions
Example 1 -- SC CDKey Initial Verification
This page however, will give enough of an
Example 2 -- SC CDKey Shuffle
introduction to get you started. Example 2b -- SC CDKey Final Decode
The Stack
Stack Example
Functions
Contents Example 3 -- Storm.dll SStrChr
Assembly Summary
1 Pointers and Dereferencing
2 Doing Nothing
3 Moving Data Around Machine Code
3.1 mov, movsx, movzx Example 4 -- Smashing the Stack
3.2 lea Cracking a Game
4 Math and Logic Example 5 -- Cracking a game
Example 6 -- Writing a keygen
4.1 add, sub
.dll Injection and Patching
4.2 inc, dec Memory Searching
4.3 and, or, xor, neg Example 7 -- Writing a cheat for Starcraft (1.05)
4.4 mul, imul, div, idiv, cdq Example 7 Step 1 -- Displaying Messages
5 shl, shr, sal, sar Example 7 Step 1b -- Above, w/ func ptrs
6 Jumping Around Example 7 Final
6.1 jmp Example 8 -- Getting IX86.dll files
6.2 call, ret 16-bit Assembly
6.3 cmp, test Example 9 -- Keygen for a 16-bit game
6.4 jz/je, jnz/jne, jl/jb, jg, jle, jge Example 10 -- Writing a loader
7 Manipulating the Stack
7.1 push, pop
7.2 pushaw, pushad, popaw, popad
8 Questions
Pointers and Dereferencing

First, we will start with the hard stuff. If you understood the pointers section, this shouldn't be too bad. If you
didn't, you should probably go back and refresh your memory.
Recall that a pointer is a data type that stores an address as its value. Since registers are simply 32-bit values
with no actual types, any register may or may not be a pointer, depending on what is stored. It is the
1 of 9 8/10/2019, 6:08 PM
Simple Instructions - SkullSecurity https://wiki.skullsecurity.org/Simple_Instructions
responsibility of the program to treat pointers as pointers and to treat non-pointers as non-pointers.
If a value is a pointer, it can be dereferenced. Recall that dereferencing a pointer retrieves the value stored at the
address being pointed to. In assembly, this is generally done by putting square brackets ("[" and "]") around the
register. For example:
eax -- is the value stored in eax

[eax] -- is the value pointed to by eax
This will be thoroughly discussed in upcoming sections.
Doing Nothing
The nop instruction is probably the simplest instruction in assembly. nop is short for "no operation" and it does
nothing. This instruction is used for padding.
Moving Data Around

The instructions in this section deal with relocating numbers and pointers.
mov, movsx, movzx
mov is the instruction used for assignment, analogous to the "=" sign in most languages. mov can move data
between a register and memory, two registers, or a constant to a register. Here are some examples:
mov eax, 1 ; set eax to 1 (eax = 1)

mov edx, ecx ; set edx to whatever ecx is (edx = ecx)
mov eax, 18h ; set eax to 0x18
mov eax, [ebx] ; set eax to the value in memory that ebx is pointing at
mov [ebx], 3 ; move the number 3 into the memory address that ebx is pointing at
movsx and movzx are special versions of mov which are designed to be used between signed (movsx) and
unsigned (movzx) registers of different sizes.
movsx means move with sign extension. The data is moved from a smaller register into a bigger register, and the
sign is preserved by either padding with 0's (for positive values) or F's (for negative values). Here are some
examples:
0x1000 becomes 0x00001000, since it was positive

0x7FFF becomes 0x00007FFF, since it was positive
0xFFFF becomes 0xFFFFFFFF, since it was negative (note that 0xFFFF is -1 in 16-bit signed, and
0xFFFFFFFF is -1 in 32-bit signed)
0x8000 becomes 0xFFFF8000, since it was negative (note that 0x8000 is -32768 in 16-bit signed, and
0xFFFF8000 is -32768 in 32-bit signed)
movzx means move with zero extension. The data is moved from a smaller register into a bigger register, and the
sign is ignored. Here are some examples:
0x1000 becomes 0x00001000

0x7FFF becomes 0x00007FFF
0xFFFF becomes 0x0000FFFF
2 of 9 8/10/2019, 6:08 PM
0x8000 becomes 0x00008000
lea
lea is very similar to mov, except that math can be done on the original value before it is used. The "[" and "]"
characters always surround the second parameter, but in this case they do not indicate dereferencing, it is easiest
to think of them as just being part of the formula.
lea is generally used for calculating array offsets, since the address of an element of the array can be found with
[arraystart + offset*datasize]. lea can also be used for quickly doing math, often with an addition and a
multiplication. Examples of both uses are below.
Here are some examples of using lea:
lea eax, [eax+eax] ; Double the value of eax -- eax = eax * 2

lea edi, [esi+0Bh] ; Add 11 to esi and store the result in edi
lea eax, [esi+ecx*4] ; This is generally used for indexing an array of integers. esi is a
pointer to the beginning of an array, and ecx is the index of the
element that is to be retrieved. The index is multiplied by 4
because Integers are 4 bytes long. eax will end up storing the
address of the ecx'th element of the array.
lea edi, [eax+eax*2] ; Triple the value of eax -- eax = eax * 3

lea edi, [eax+ebx*2] ; This likely indicates that eax stores an array of 16-bit (2 byte)
values, and that ebx is an offset into it. Note the similarities
between this and the previous example: the same math is being done,
but for a different reason.
Math and Logic

The instructions in this section deal with math and logic. Some are simple, and others (such as multiplication
and division) are pretty tricky.
add, sub
A register can have either another register, a constant value, or a pointer added to or subtracted from it. The
syntax of addition and subtraction is fairly simple:
add eax, 3 ; Adds 3 to eax -- eax = eax + 3

add ebx, eax ; Adds the value of eax to ebx -- ebx = ebx + eax
sub ecx, 3 ; Subtracts 3 from ecx -- ecx = ecx - 3
inc, dec
These instructions simply increment and decrement a register.
inc eax ; eax++

dec ecx ; ecx--
and, or, xor, neg
3 of 9 8/10/2019, 6:08 PM
All logical instructions are bitwise. If you don't know what "bitwise arithmetic" means, you should probably
look it up. The simplest way of thinking of this is that each bit in the two operands has the operation done
between them, and the result is stored in the first one.
The instructions are pretty self-explanatory: and does a bitwise 'and', or does a bitwise 'or', xor does a bitwise
'xor', and neg does a bitwise negation.
Here are some examples:
and eax, 7 ; eax = eax & 7 -- because 7 is 000..000111, this clears all bits
except for the last three.
or eax, 16 ; eax = eax | 16 -- because 16 is 000..00010000, this sets the 5th
bit from the right to "1".
xor eax, 1 ; eax = eax ^ 1 -- this toggles the right-most bit in eax, 0=>1 or
1=>0.
xor eax, FFFFFFFFh ; eax = eax ^ 0xFFFFFFFF -- this toggles every bit in eax, which is
identical to a bitwise negation.
neg eax ; eax = ~eax -- inverts every bit in eax, same as the previous.
xor eax, eax ; eax = 0 -- this clears eax quickly, and is extremely
common.
mul, imul, div, idiv, cdq
Multiplication and division are the trickiest operations commonly used, because of how they deal with overflow
issues. Both multiplication and division make use of the 64-bit register edx:eax.
mul multiplies the unsigned value in eax with the operand, and stores the result in the 64-bit pointer edx:eax.
imul does the same thing, except the value is signed. Here are some examples of mul:
mul ecx ; edx:eax = eax * ecx (unsigned)

imul edx ; edx:eax = eax * edx (signed)
When used with two parameters, mul instead multiplies the first by the second as expected:
mul ecx, 10h ; ecx = ecx * 0x10 (unsigned)

imul ecx, 20h ; ecx = ecx * 0x20 (signed)
div divides the 64-bit value in edx:eax by the operand, and stores the quotient in eax. The remainder (modulus)
is stored in edx. In other words, div does both division and modular division, at the same time. Typically, a
program will only use one or the other, so you will have to check which instructions follow to see whether eax
or edx is saved. Here are some examples:
div ecx ; eax = edx:eax / ecx (unsigned)

; edx = edx:eax % ecx (unsigned)
idiv ecx ; eax = edx:eax / ecx (signed)

; edx = edx:eax % ecx (signed)
cdq is generally used immediately before idiv. It stands for "convert double to quad." In other words, convert
the 32-bit value in eax to the 64-bit value in edx:eax, overwriting anything in edx with either 0's (if eax is
positive) or F's (if eax is negative). This is very similar to movsx, above.
xor edx, edx is generally used immediately before div. It clears edx to ensure that no leftover data is divided.
4 of 9 8/10/2019, 6:08 PM
Here is a common use of cdq and idiv:
mov eax, 1007 ; 1007 will be divided

mov ecx, 10 ; .. by 10
cdq ; extends eax into edx
idiv ecx ; eax will be 1007/10 = 100, and edx will be 1007%10 = 7
Here is a common use of xor and div (the results are the same as the previous example):
mov eax, 1007

mov ecx, 10
xor edx, edx
div ecx
shl, shr, sal, sar

shl - shift left, shr - shift right.
sal - shift arithmetic left, sar - shift arithmetic right.
These are used to do a binary shift, equivalent to the C operations << and >>. They each take two operations:
the register to use, and the number of places to shift the value in the register. As computers operate in base 2,
these commands can be used as a faster replacement for multiplication/division operations involving powers of
2.
Divide by 2 (unsigned):
mov eax, 16 ; eax = 16

shr eax, 1 ; eax = 8
Multiply by 4 (signed):
mov eax, 5 ; eax = 5

sal eax, 2 ; eax = 20
Visualising the bits moving:
mov eax, 7 ; = 0000 0111 (7)

shl eax, 1 ; = 0000 1110 (14)
shl eax, 2 ; = 0011 1000 (56)
shr eax, 1 ; = 0001 1100 (28)
Jumping Around
Instructions in this section are used to compare values and to make jumps. These jumps are used for calls, if
statements, and every type of loop. The operand for most jump instructions is the address to jump to.
jmp
jmp, or jump, sends the program execution to the specified address no matter what. Here is an example:
5 of 9 8/10/2019, 6:08 PM
jmp 1400h ; jump to the address 0x1400
call, ret
call is similar to jump, except that in addition to sending the program to the specified address, it also saves
("pushes") the address of the executable instruction onto the stack. This will be explained more in a later
section.
ret removes ("pops") the first value off of the stack, and jumps to it. In almost all cases, this value was placed
onto the stack by the call instruction. If the stack pointer is at the wrong location, or the saved address was
overwritten, ret attempts to jump to an invalid address which usually crashes the program. In some cases, it may
jump to the wrong place where the program will almost inevitably crash.
ret can also have a parameter. This parameter is added to the stack immediately after ret executes its jump. This
addition allows the function to remove values that were pushed onto the stack. This will be discussed in a later
section.
The combination of call and ret are used to implement functions. Here is an example of a simple function:
call 4000h
...... ; any amount of code
4000h:
mov eax, 1
ret ; Because eax represents the return value, this function would return 1, and
nothing else would happen
cmp, test
cmp, or compare, compares two operands and sets or unsets flags in the flags register based on the result.
Specialized jump commands can check these flags to jump on certain conditions. One way of remembering how
cmp works is to think of it as subtracting the second parameter from the first, comparing the result to 0, and
throwing away the result.
test is very similar to cmp, except that it performs a bitwise 'and' operation between the two operands. test is
most commonly used to compare a variable to itself to check if it's zero.
jz/je, jnz/jne, jl/jb, jg, jle, jge
jz and je (which are synonyms) will jump to the address specified if and only if the 'zero' flag is set, which
indicates that the two values were equal. In other words, "jump if equal".
jnz and jne (which are also synonyms) will jump to the address specified if and only if the 'zero' flag is
not set, which indicates that the two values were not equal. In other words, "jump if different".
jl and jb (which are synonyms) jumps if the first parameter is less than the second.
jg jumps if the first parameter is greater than the second.
jle jumps if the 'less than' or the 'zero' flag is set, so "less than or equal to".
jge jumps if the first is "greater than or equal to" the second.
These jumps are all used to implement various loops and conditions. For example, here is some C code:
if(a == 3)
6 of 9 8/10/2019, 6:08 PM
b;
else
c;
And here is how it might look in assembly (not exactly assembly, but this is an example):
10 cmp a, 3
20 jne 50
30 b
40 jmp 60
50 c
60
Here is an example of a loop in C:
for(i = 0; i < 5; i++)

{
a;
b;
}
And here is the equivalent loop in assembly:
10 mov ecx, 0
20 a
30 b
40 inc ecx
50 cmp ecx, 5
60 jl 20
Manipulating the Stack

Functions in this section are used for adding and removing data from the stack. The stack will be examined in
detail in a later section; this section will simply show some commonly used commands.
push, pop
push decrements the stack pointer by the size of the operand, then saves the operand to the new address. This
line:
push ecx
Is functionally equivalent to:
sub esp, 4
mov [esp], ecx
pop sets the operand to the value on the stack, then increments the stack pointer by the size of the operand. This
assembly:
pop ecx
7 of 9 8/10/2019, 6:08 PM
Is functionally equivalent to:
mov ecx, [esp]

add esp, 4
This will be examined in detail in the Stack section of this tutorial.
pushaw, pushad, popaw, popad
pushaw and pushad save all 16-bit or 32-bit registers (respectively) onto the stack.
popaw and popad restore all 16-bit or 32-bit registers from the stack.
Questions
Feel free to edit this section and post questions, and I will do my best to answer them; however, you may need
to contact me to let me know that a question exists.
Further explain bitwise
In your example code:
10 mov ecx, 0
20 a
30 b
40 cmp ecx, 5
50 jl 20
Wont this just loop forever because ecx is never incremented? Total noob here so i may have missed something
obvious.
Yes, my mistake.
Indeed, would not this be more accurate?:
10 mov ecx, 0
20 a
30 b
40 inc ecx
50 comp ecx, 5
60 jl 20
Changed.
Also, The stack is decremented when pushed, but increased when poped? Isn't this counterintuitive?
Yes, the stack starts high and grows downwards. Welcome to x86 assembler!
Retrieved from "https://wiki.skullsecurity.org/index.php?title=Simple_Instructions&oldid=3140"
Category: Assembly Guide
8 of 9 8/10/2019, 6:08 PM
This page was last modified on 16 January 2012, at 01:01.
normal
9 of 9 8/10/2019, 6:08 PM

Assembly language tutorial overview

Uploaded by

Document Information

Original Description:

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Assembly language tutorial overview

Uploaded by

Copyright:

Available Formats

Simple Instructions - SkullSecurity https://wiki.skullsecurity.

This section will go over some basic assembly

Pointers and Dereferencing

eax -- is the value stored in eax

This will be thoroughly discussed in upcoming sections.

Moving Data Around

mov, movsx, movzx

mov eax, 1 ; set eax to 1 (eax = 1)

0x1000 becomes 0x00001000, since it was positive

0x1000 becomes 0x00001000

0x8000 becomes 0x00008000

Here are some examples of using lea:

lea eax, [eax+eax] ; Double the value of eax -- eax = eax * 2

lea edi, [eax+eax*2] ; Triple the value of eax -- eax = eax * 3

Math and Logic

add eax, 3 ; Adds 3 to eax -- eax = eax + 3

These instructions simply increment and decrement a register.

inc eax ; eax++

and, or, xor, neg

Here are some examples:

mul, imul, div, idiv, cdq

mul ecx ; edx:eax = eax * ecx (unsigned)

mul ecx, 10h ; ecx = ecx * 0x10 (unsigned)

div ecx ; eax = edx:eax / ecx (unsigned)

idiv ecx ; eax = edx:eax / ecx (signed)

Here is a common use of cdq and idiv:

mov eax, 1007 ; 1007 will be divided

mov eax, 1007

shl, shr, sal, sar

sal - shift arithmetic left, sar - shift arithmetic right.

mov eax, 16 ; eax = 16

mov eax, 5 ; eax = 5

Visualising the bits moving:

mov eax, 7 ; = 0000 0111 (7)

jmp 1400h ; jump to the address 0x1400

jz/je, jnz/jne, jl/jb, jg, jle, jge

Here is an example of a loop in C:

for(i = 0; i < 5; i++)

And here is the equivalent loop in assembly:

Manipulating the Stack

Is functionally equivalent to:

Is functionally equivalent to:

mov ecx, [esp]

This will be examined in detail in the Stack section of this tutorial.

pushaw, pushad, popaw, popad

Further explain bitwise

In your example code:

Indeed, would not this be more accurate?:

Retrieved from "https://wiki.skullsecurity.org/index.php?title=Simple_Instructions&oldid=3140"

Category: Assembly Guide

This page was last modified on 16 January 2012, at 01:01.

You might also like

lea edi, [eax+eax2] ; Triple the value of eax -- eax = eax 3