A deep dive into the inner workings of the stack

The stack is a fundamental part of how computers manage function calls, variables, and execution flow, it’s crucial to understand how the stack operates, especially when exploring vulnerabilities like stack overflow or buffer overflow attacks, in this blog article, we’ll first delve into how the stack works before touching on how it can be exploited.

What is a Stack?

The stack is a section of memory in a computer’s architecture that operates on the principle of LIFO (Last In, First Out ). It is used to store:

Function calls and their return addresses.
Local variables.
Temporary data during program execution.

The stack grows downward in memory, starting at a high address and moving to lower addresses as new data is added.

Stack frame

This LIFO structure is extremely useful. When a function is called, all the data necessary for the function’s execution, as well as for returning to the initial state, are pushed onto the stack. Once the function is complete, the program must return to the line following the function call, and this is done by popping everything that was previously pushed onto the stack, leaving the rest of the stack and any other stack frames intact, below a diagram that attempts to summarize my explanation:

Base and stack pointers

The ESP (Extended Stack Pointer) and RSP (Register Stack Pointer) serve the same purpose in different architectures: ESP is used in 32-bit systems, while RSP is used in 64-bit systems. They point to the top of the stack and are updated automatically as the stack grows or shrinks.

The EBP (Extended Base Pointer) and RBP (Register Base Pointer) also share a similar distinction: EBP is used in 32-bit systems, and RBP in 64-bit systems. These registers store the base address of the current stack frame, making it easier to access function parameters and local variables. Unlike ESP/RSP, they are typically set manually by the program or compiler.

What we’ve just seen holds true as long as we stay within the same stack frame. However, what happens when a new function is called? Once this new function is finished, how does the processor return to the previous state? That’s what we’ll explore next.

Stack on motion

To fully understand the rest of this article, basic knowledge of assembly language is helpful, let’s consider the following C program.

Copy to Clipboard

Disassembling the main function

After compilation, we disassemble the main function to view the assembly instructions it consists of.

Let’s quickly review the purpose of the commands we’ve used:

gcc (GNU Compiler Collection) command is a Linux command that was originally used to compile programs written in C. However, it has since evolved to support the compilation of programs in various languages (C, C++, Java, etc.), so now we have a binary called “fonction.binary” ready to be executed.
gdb (GNU Project Debugger) is a powerful, fully command-line debugger. Among other features, it allows you to disassemble a program, run it, pause it during execution, read memory, modify memory during execution, and much more.

Tip: While in a gdb session, you can use a wide range of commands, since some of these commands can have very long names or may be used very frequently, abbreviations are available, for example, the command to view information about registers is “info registers”, but it can also be executed using the shorter command i r, for example to disassemble a function from a program loaded into gdb, we use the command disassemble function, here, we want to disassemble the main function, so we run the command disas main, note that disas is an alias for disassemble, as we just explained.

We notice several things now, first, we see the call to the function reponse at line +23 (address 0x000055555555515c) with the call instruction, then, we observe the three preceding lines, which involve pushing the arguments onto some registers that will be stored later in the stack to perform the compute operations.

Execution of the program step by step

We will use breakpoints to pause the execution at multiple points, allowing us to check the stack status and analyze the program’s behavior

Breakpoint 1:

Initialization of the main function.

Breakpoint 2:

At this point, we executed the first instruction stored at the address 0x0000555555555145, and performed the following actions:

Save the RBP pointer value on the stack.
- The head of the stack (RSP pointer) moved from the address 0x7fffffffe078 to address 0x7fffffffe070.
  - 8 additional bytes = the size of the RBP register that will be saved in the address 0x7fffffffe070.