DSA Lab | Recursion & Backtracking

What Is Recursion?

Recursion is when a function calls itself to solve a smaller version of the same problem. That's the entire idea. Everything else — base cases, the call stack, memoization — is just managing that one idea.

The Origin of Recursion

Recursion has roots in mathematics going back centuries. The Fibonacci sequence was described by Leonardo of Pisa (Fibonacci) in 1202, though he defined it iteratively. The idea of functions defined in terms of themselves became formalized in the 1930s through the work of Kurt Gödel, Alonzo Church, and Alan Turing. Church's lambda calculus (1936) and Turing's machines both support recursive definitions — recursion turned out to be fundamental to what "computation" even means.

In programming, LISP (1958) was the first widely-used language to support recursive function calls. Before that, many languages didn't even have a call stack — they couldn't support recursion at all. Today every major language supports it, though some (like early FORTRAN and COBOL) originally didn't.

What Actually Happens Under the Hood

When you call a function recursively, the computer pushes a new stack frame onto the call stack (a region of memory that tracks active function calls). That frame holds the function's local variables, its parameters, and its return address (where to continue executing after the function returns). When the function returns, the frame pops off and execution resumes at the return address.

If you recurse too deeply (say, 10,000 calls deep in Python, or a few thousand in Java), you run out of stack space and get a stack overflow — the call stack has hit its size limit. Python's default recursion limit is 1,000 frames (you can increase it with sys.setrecursionlimit(), but don't go crazy — the OS stack has hard limits too).

The Two Essential Parts

Every recursive solution has two parts:

Base case — the simplest input where you just return an answer directly, without calling yourself again. Without this, your recursion never stops. It just keeps pushing frames until the stack overflows. Fibonacci's base cases: fib(0) = 0, fib(1) = 1.
Recursive case — break the problem into smaller subproblems and call yourself on those smaller inputs. Fibonacci: fib(n) = fib(n-1) + fib(n-2). You trust that the recursive calls return correct answers — this is called the "recursive leap of faith." Don't try to trace every call in your head. Define the recurrence, trust it works for smaller inputs, and verify the base case handles the bottom.

What About Backtracking?

Backtracking is recursion with a specific strategy: explore a possibility, and if it leads to a dead end, undo it and try the next option. The three steps repeat at every decision point: make a choice, recurse, undo the choice.

Think of it like navigating a maze. At each fork, you pick a direction. If you hit a dead end, you walk back to the last fork and try a different direction. You're systematically exploring every path without getting lost because you always undo your steps.

Backtracking solves constraint satisfaction problems — problems where you need to find an arrangement that satisfies multiple rules. N-Queens (place queens so none attack each other), Sudoku (fill a grid so every row, column, and box has 1-9), and graph coloring (color nodes so no adjacent nodes share a color) are all constraint satisfaction problems.

Pruning — skipping branches you know can't lead to valid solutions — is what makes backtracking practical. Without pruning, you'd explore the entire search tree (often exponentially large). With good pruning, you can eliminate vast swaths of the tree and make what looks like an intractable problem run in seconds.

How It Works

Recursion: The Call Stack in Action

When you call factorial(4), here's what actually happens in memory:

factorial(4) calls factorial(3) — pushes a frame onto the stack
factorial(3) calls factorial(2) — another frame
factorial(2) calls factorial(1) — another frame
factorial(1) hits the base case, returns 1 — frame pops
factorial(2) gets 1 back, returns 2×1=2 — frame pops
factorial(3) gets 2 back, returns 3×2=6 — frame pops
factorial(4) gets 6 back, returns 4×6=24 — frame pops

The stack grew 4 frames deep, then unwound. That's O(n) space just for the stack, even though the "work" at each level is O(1). Every recursive algorithm has this implicit space cost.

Tail Recursion: A Space Optimization

Tail recursion is when the recursive call is the very last thing the function does — there's no work left after the call returns. In that case, the current stack frame is unnecessary (it has nothing left to do), so a smart compiler can reuse it. This converts the recursion into a loop internally, using O(1) stack space.

Unfortunately, Python and Java do not optimize tail recursion. Scheme, Haskell, and some C compilers do. In Python, you'll want to convert deep tail-recursive functions to iterative ones manually if stack depth is a concern.

Fibonacci: Why Naive Recursion Is Terrible

Naive fib(n) calls fib(n-1) and fib(n-2). Each of those makes two calls, and so on. The result is a binary tree of calls with O(2ⁿ) nodes. fib(40) makes over a billion calls. fib(50) would take minutes.

The problem: overlapping subproblems. fib(5) computes fib(3) twice, fib(2) three times, fib(1) five times. Massive redundancy.

The fix: memoization (not memorization — it comes from "memo," as in a note to yourself). Cache each result so you never compute the same subproblem twice. With memoization, fib(n) runs in O(n) time and O(n) space. This is the bridge between recursion and dynamic programming — memoized recursion IS top-down DP.

Backtracking: N-Queens Step by Step

Place N queens on an N×N chessboard so no two queens threaten each other. Queens attack along rows, columns, and diagonals.

Work row by row. In row 0, try placing a queen in column 0.
Move to row 1. Try each column — check if it conflicts with any queen already placed (same column? same diagonal?). A diagonal conflict exists when the absolute difference in rows equals the absolute difference in columns: |row1 - row2| == |col1 - col2|.
If no column works in the current row, backtrack: remove the queen from the previous row and try the next column there.
If you successfully place a queen in every row, you've found a solution. Record it, then backtrack to find more solutions.

The "undo" step — removing the queen — is what makes it backtracking rather than plain recursion. You're exploring a decision tree: each node is a partially-filled board, each branch is a column choice, and leaves are either complete solutions or dead ends.

Pruning in action: Without pruning, you'd try all N^N possible placements (every column in every row). With the constraint checks, you prune branches as soon as a conflict is detected. For N=8, the full tree has 8^8 = 16.7 million leaves, but pruning reduces the search to about 15,000 nodes. That's a 1000x reduction.

Interactive Visualization

Switch between two visualizations: a Fibonacci recursion tree (watch redundant calls pile up) and an N-Queens backtracking solver.

Recursion & Backtracking

Mode: Fibonacci | Calls: 0 | Stack Depth: 0

Call Stack

Empty

Common Mistakes

⚠️ Mistake #1: Missing or wrong base case The most common recursion bug. If you forget the base case, you get infinite recursion and a stack overflow. If the base case is wrong (e.g., returning 1 instead of 0 for fib(0)), every single result in the chain will be wrong. Always verify your base cases with the smallest inputs first.

⚠️ Mistake #2: Not making the problem smaller Each recursive call must work on a strictly smaller input than the current one. If you call f(n) with f(n) or f(n+1), you'll loop forever. Common trap: in backtracking, forgetting to advance the index or position, so you keep trying the same element over and over.

⚠️ Mistake #3: Mutating shared state without undoing it In backtracking, if you modify a shared data structure (like a board or a visited set) during the "choose" step, you must undo the modification in the "unchoose" step. Forgetting the undo corrupts state for sibling branches. The pattern is always: modify → recurse → un-modify.

⚠️ Mistake #4: Trying to trace every recursive call mentally Don't do this. For anything beyond trivial inputs, it's impossible and counterproductive. Instead, trust the "recursive leap of faith": define what the function should do, verify the base case, and trust that recursive calls on smaller inputs return correct results. Your job is the recurrence relation and the base case — not simulating the entire call tree.

⚠️ Mistake #5: Ignoring stack depth limits Python's default recursion limit is 1,000. If your recursion depth could reach 10,000 (e.g., a linked list of 10,000 nodes), you'll crash. Either increase the limit with sys.setrecursionlimit() or convert to an iterative approach with an explicit stack. In interviews, mention this trade-off even if you write the recursive solution.

Complexity Analysis

Recursive complexity depends on two things: the branching factor (how many recursive calls per level) and the depth (how deep the recursion goes). The number of nodes in the recursion tree is roughly branchingFactor^depth.

For space complexity, remember that each active frame uses memory. The maximum number of simultaneous frames is the depth of the deepest branch — that's your stack space. Memoization adds additional space for the cache.

Algorithm	Time	Space	Notes
Factorial	O(n)	O(n)	Linear recursion, n stack frames
Fibonacci (naive)	O(2ⁿ)	O(n)	Exponential time, tree depth = n
Fibonacci (memo)	O(n)	O(n)	Each subproblem computed once
Binary search (recursive)	O(log n)	O(log n)	Stack depth = log n
Merge sort	O(n log n)	O(n)	Divides in half, merges linearly
Permutations	O(n × n!)	O(n)	n! leaves, O(n) work per leaf
N-Queens	O(n!)	O(n)	Pruning helps enormously in practice
Subsets/power set	O(n × 2ⁿ)	O(n)	2ⁿ subsets, O(n) to copy each

💡 The Master Theorem (Quick Reference) For recurrences of the form T(n) = a·T(n/b) + O(nᶜ): compare log_b(a) to c. If log_b(a) > c, time is O(n^(log_b a)). If equal, O(nᶜ log n). If less, O(nᶜ). Merge sort: T(n) = 2T(n/2) + O(n) → a=2, b=2, c=1 → log₂(2) = 1 = c → O(n log n). Binary search: T(n) = T(n/2) + O(1) → O(log n).

Implementation

Fibonacci with Memoization

Python


from functools import lru_cache

@lru_cache(maxsize=None)  # Python's built-in memoization decorator
def fib(n):
    """Fibonacci with automatic caching. First call computes;
    subsequent calls with same n return cached result in O(1)."""
    if n <= 1:
        return n
    # Trust that fib(n-1) and fib(n-2) return correct answers
    return fib(n - 1) + fib(n - 2)

# Manual memoization — same idea, explicit cache
def fib_memo(n, memo={}):
    if n in memo:
        return memo[n]
    if n <= 1:
        return n
    memo[n] = fib_memo(n - 1, memo) + fib_memo(n - 2, memo)
    return memo[n]

# Iterative conversion — O(1) space, no stack risk
def fib_iterative(n):
    if n <= 1:
        return n
    a, b = 0, 1
    for _ in range(2, n + 1):
        a, b = b, a + b
    return b

JavaScript


function fib(n, memo = {}) {
  if (n in memo) return memo[n];
  if (n <= 1) return n;
  // Cache the result before returning
  memo[n] = fib(n - 1, memo) + fib(n - 2, memo);
  return memo[n];
}

N-Queens (Backtracking)

Python


def solve_n_queens(n):
    """Find all valid placements of n queens on an n×n board.
    Uses sets for O(1) conflict checking on columns and diagonals."""
    solutions = []
    board = [-1] * n  # board[row] = column where queen is placed

    # Track occupied columns and diagonals for O(1) lookup
    cols = set()           # Occupied columns
    diag1 = set()          # Occupied "\" diagonals (row - col is constant)
    diag2 = set()          # Occupied "/" diagonals (row + col is constant)

    def backtrack(row):
        if row == n:
            # All queens placed — record this solution
            solutions.append(board[:])
            return

        for col in range(n):
            # Check all three constraints in O(1)
            if col in cols or (row - col) in diag1 or (row + col) in diag2:
                continue  # Pruning: skip invalid placements

            # Make the choice
            board[row] = col
            cols.add(col)
            diag1.add(row - col)
            diag2.add(row + col)

            backtrack(row + 1)     # Recurse to next row

            # Undo the choice (backtrack)
            board[row] = -1
            cols.remove(col)
            diag1.remove(row - col)
            diag2.remove(row + col)

    backtrack(0)
    return solutions

Generate All Subsets (Include/Exclude Pattern)

Python


def subsets(nums):
    """Generate all 2^n subsets. At each element, two choices:
    include it or skip it. This creates a binary decision tree."""
    result = []

    def backtrack(index, current):
        if index == len(nums):
            result.append(current[:])  # Reached a leaf — record subset
            return

        # Choice 1: skip this element
        backtrack(index + 1, current)

        # Choice 2: include this element
        current.append(nums[index])
        backtrack(index + 1, current)
        current.pop()  # Undo (backtrack)

    backtrack(0, [])
    return result

# Alternative: iterative bit manipulation
def subsets_bits(nums):
    """Each integer from 0 to 2^n - 1 represents a subset.
    Bit i is 1 → include nums[i]. Bit i is 0 → exclude."""
    n = len(nums)
    result = []
    for mask in range(1 << n):  # 0 to 2^n - 1
        subset = [nums[i] for i in range(n) if mask & (1 << i)]
        result.append(subset)
    return result

Generate All Permutations

Python


def permutations(nums):
    """Generate all n! orderings. At each position, try every
    remaining element. Backtrack by removing the choice."""
    result = []

    def backtrack(path, remaining):
        if not remaining:
            result.append(path[:])  # Complete permutation
            return

        for i in range(len(remaining)):
            path.append(remaining[i])          # Choose
            # Explore: recurse without element i
            backtrack(path, remaining[:i] + remaining[i+1:])
            path.pop()                          # Un-choose

    backtrack([], nums)
    return result

Real-World Example: File System Traversal

File systems are trees. Directories contain files and other directories. Listing all files under a directory is a naturally recursive operation — you process the current directory's contents, and for each subdirectory, you recursively process its contents too.

Python


import os

def find_files(directory, extension=".py"):
    """Recursively find all files with a given extension.
    This is what tools like 'find' and 'grep -r' do internally.
    
    The directory tree IS the recursion tree:
    - Base case: a file (leaf node) — check if it matches
    - Recursive case: a directory — process its children"""
    matches = []

    for entry in os.listdir(directory):
        full_path = os.path.join(directory, entry)

        if os.path.isfile(full_path):
            # Base case: it's a file
            if full_path.endswith(extension):
                matches.append(full_path)
        elif os.path.isdir(full_path):
            # Recursive case: it's a directory — go deeper
            matches.extend(find_files(full_path, extension))

    return matches

# Usage
python_files = find_files("/home/user/project", ".py")
for f in python_files:
    print(f)

# This is equivalent to:  find /home/user/project -name "*.py"
# The recursion depth equals the deepest directory nesting level.
# For typical projects, that's 5-15 levels — well within stack limits.

Every time you run ls -R, find, du, or tree, you're running a recursive directory traversal. The file system is one of the most natural examples of recursion in everyday computing — the data structure itself is recursive (a directory contains directories), so the algorithm to process it is recursive too.

Interview Patterns

💡 The backtracking template Almost every backtracking problem follows one pattern: for each choice: make it → recurse → undo it. The variations are just what "choice" means: which column for a queen, which number for a Sudoku cell, which letter for a phone number digit. Learn the template, then adapt. Write it on the whiteboard first, then fill in the problem-specific logic.

💡 Recursion → DP pipeline If your recursive solution has overlapping subproblems (same function called with same arguments multiple times), add memoization — you've just invented dynamic programming. Start with recursion to get the structure right, then optimize. Interviewers love seeing this thought process: "I'll start with the recursive solution, then memoize it."

💡 "Can you do it iteratively?" Most recursive solutions can be converted to iterative ones using an explicit stack (a list you push/pop from manually). DFS is naturally recursive, but you can use a stack instead of the call stack. This matters for deep recursion where stack overflow is a risk. The conversion is mechanical: push the initial state, loop while the stack isn't empty, pop and process.

💡 Pruning is everything Backtracking without pruning explores the entire search tree — which is usually exponential. Good pruning can eliminate huge branches early. In N-Queens, checking diagonals before placing cuts the tree dramatically. In Sudoku, keeping track of which numbers are available per row/column/box prunes most branches. Always think: "can I detect a dead end early?"

💡 Subsets vs Permutations vs Combinations These three problems look similar but have different structures. Subsets: for each element, include or exclude (2^n results). Permutations: choose an ordering of all elements (n! results). Combinations: choose k elements from n without regard to order (C(n,k) results). Know which template to use for each — the recursion trees are shaped differently.

💡 Handling duplicates in backtracking When the input has duplicates and you need unique results (like "permutations of [1,1,2]"), sort the input first, then skip elements that equal their predecessor at the same recursion level: if i > start and nums[i] == nums[i-1]: continue. This avoids generating the same subset/permutation twice without needing a set for deduplication.

Practice Problems

509. Fibonacci Number — Easy. Write it recursively, then add memoization, then convert to iterative. Three solutions for one problem.
78. Subsets — Medium. Generate all 2ⁿ subsets. The "include or exclude" choice at each element.
46. Permutations — Medium. Classic backtracking. Make sure you can handle the "used" tracking.
39. Combination Sum — Medium. Backtracking with unlimited reuse of elements. Sort first to enable pruning.
17. Letter Combinations of Phone Number — Medium. Map digits to letters, then backtrack through all combinations.
51. N-Queens — Hard. The classic backtracking problem. Place queens row by row, prune with column/diagonal checks.
37. Sudoku Solver — Hard. Constraint propagation + backtracking. A satisfying problem when your solver fills in the entire board.
212. Word Search II — Hard. Backtracking on a grid with Trie-based pruning. Combines multiple techniques.

Recursion & Backtracking

On This Page

What Is Recursion?

The Origin of Recursion

What Actually Happens Under the Hood

The Two Essential Parts

What About Backtracking?

How It Works

Recursion: The Call Stack in Action

Tail Recursion: A Space Optimization

Fibonacci: Why Naive Recursion Is Terrible

Backtracking: N-Queens Step by Step

Interactive Visualization

Common Mistakes

Complexity Analysis

Implementation

Fibonacci with Memoization

N-Queens (Backtracking)

Generate All Subsets (Include/Exclude Pattern)

Generate All Permutations

Real-World Example: File System Traversal

Interview Patterns

Practice Problems