DSA Lab | Depth-First Search

What Is DFS?

Depth-First Search (DFS) is a graph traversal algorithm — a systematic way to visit every node in a graph (a collection of nodes connected by edges). DFS picks a direction and follows it as far as possible before backing up. Think of exploring a maze by always turning right until you hit a dead end, then retracing your steps to the last fork and trying a different path.

The idea is ancient. French mathematician Charles Pierre Trémaux described a version of it in the 19th century as a strategy for solving mazes. You mark where you've been, go as deep as you can, and backtrack when stuck. Computer scientists formalized it in the 1970s — Tarjan and Hopcroft built some of the most elegant graph algorithms on top of DFS, including bridge-finding and strongly connected components.

DFS uses a stack — either the call stack (via recursion, where the programming language tracks function calls automatically) or an explicit stack data structure you manage yourself. The stack remembers where to go back to when you hit a dead end. This is the fundamental difference from BFS (Breadth-First Search), which uses a queue and explores level by level instead of diving deep.

Two Important Timestamps

DFS assigns each node two timestamps that encode a lot of structural information:

Discovery time (d): When DFS first visits the node — the moment it "discovers" it
Finish time (f): When DFS is done with the node and all its descendants (nodes reachable by following edges forward)

These timestamps tell you about the graph's structure. If node A's interval [d_A, f_A] completely contains node B's interval [d_B, f_B], then B is a descendant of A in the DFS tree. This is called the parenthesis theorem — the intervals either nest perfectly or don't overlap at all, like properly matched parentheses.

Why DFS Instead of BFS?

BFS and DFS both visit every node. The difference is how they explore, and that changes what problems each one solves naturally:

DFS is natural for problems with recursive structure. Tree height, path existence, backtracking — these all decompose into "solve for children, combine results." DFS maps directly to recursion.
DFS uses less memory on wide graphs. BFS stores an entire frontier level in its queue. On a tree with branching factor b and depth d, BFS needs O(b^d) space. DFS only needs O(d) — it just stores the current path.
DFS finds cycles and ordering. The timestamps and edge classification (covered below) give you cycle detection and topological sort essentially for free.

Where DFS Shows Up

Topological sort: Order tasks so that dependencies come first (build systems, course prerequisites). This requires a DAG — a Directed Acyclic Graph, meaning no cycles.
Cycle detection: Finding back edges (edges pointing to an ancestor in the DFS tree) in directed graphs reveals cycles
Connected components: In undirected graphs, each DFS run from an unvisited node discovers one connected component — a maximal group of nodes where every node is reachable from every other
Strongly connected components: Tarjan's and Kosaraju's algorithms both use DFS to find SCCs in directed graphs — maximal groups where every node can reach every other node following directed edges
Maze generation: Randomized DFS creates mazes with long, winding paths
Backtracking: N-Queens, Sudoku solvers, permutation generation — all DFS at heart
Bridge and articulation point finding: Tarjan's bridge algorithm uses DFS discovery times to find edges and nodes whose removal disconnects the graph

How It Works

Recursive DFS

Visit the current node. Mark it as visited. Record its discovery time.
Recurse on each unvisited neighbor. The function calls itself for every adjacent node it hasn't seen.
Backtrack when all neighbors are visited. Record the finish time. The function returns, and the call stack unwinds to the previous node.

Simple and clean. The call stack handles backtracking automatically — when a recursive call returns, you're back at the parent. The downside: deep graphs can overflow the stack. Python's default recursion limit is 1000 (you can check with sys.getrecursionlimit()). A chain graph with 10,000 nodes will crash.

Iterative DFS

Push the start node onto an explicit stack (a data structure you create, not the call stack).
Pop the top node. If not visited, mark it visited and process it.
Push all unvisited neighbors onto the stack.
Repeat until the stack is empty.

No recursion limit issues, but computing discovery/finish times is trickier — you need to push sentinel markers onto the stack to know when to record finish times. Use the iterative version when the graph might be deep (10,000+ nodes) or when you want explicit control over the traversal.

💡 Subtle difference: neighbor ordering Recursive DFS visits neighbors in the order your adjacency list stores them. Iterative DFS with a stack reverses this order (last pushed = first popped). To get identical traversal order, push neighbors in reverse. This matters if the problem requires a specific visit order.

Edge Classification

DFS classifies every edge in the graph into one of four types. This classification is central to cycle detection and topological sort:

Tree edge: Leads to an unvisited node. These edges form the DFS tree (or DFS forest if the graph is disconnected).
Back edge: Points to an ancestor (a node earlier on the current DFS path) in the DFS tree. A back edge means there's a cycle. This is the key insight for cycle detection.
Forward edge: Points to a descendant that was already discovered via a different path. Only shows up in directed graphs.
Cross edge: Points to a node in a different subtree — one that was fully processed before the current subtree. Only in directed graphs.

You can classify edges using the node colors/states during DFS. When you encounter a neighbor that's:

Unvisited → tree edge
In-progress (discovered but not finished) → back edge
Completed (finished) → forward or cross edge (compare discovery times to distinguish)

💡 Cycle Detection Shortcut In an undirected graph, any edge to an already-visited node (that isn't the parent you came from) is a cycle. In a directed graph, you need to check if the neighbor is currently "in progress" (discovered but not finished) — that's a back edge, which means a cycle. Tracking three states (white/gray/black or unvisited/in-progress/done) is the standard approach.

DFS on Trees vs Graphs

On trees, DFS is simpler because there are no cycles and no "visited" check needed (just don't revisit the parent). Most binary tree problems — height, diameter, path sum, lowest common ancestor — are DFS by nature. You recurse on children, combine results, return up.

On general graphs, you need the visited set. Without it, a cycle sends DFS into an infinite loop. This is a mistake beginners make constantly.

Interactive Visualization

Click any node to start DFS. Watch the algorithm dive deep along one branch before backtracking. Nodes show discovery/finish times (d/f). The stack contents update on the right. Backtracking edges are shown in red.

DFS Traversal

Common Mistakes

⚠️ Forgetting the visited set on graphs On trees, you can skip the visited set because there are no cycles. On general graphs, forgetting it means infinite loops on cyclic graphs and redundant work on DAGs. Always use a visited set (or equivalent coloring scheme) for graph DFS. Even if the problem says "DAG," defensive coding includes the check.

⚠️ Confusing "visited" with "in-progress" for cycle detection In directed graphs, visiting an already-completed node is NOT a cycle — it's a forward or cross edge. Only visiting an in-progress node (one that's on the current DFS path, still on the call stack) is a back edge indicating a cycle. Using a simple boolean visited set misses this distinction. You need three states: unvisited, in-progress, done.

⚠️ Stack overflow on deep graphs Python's default recursion limit is 1000. Java's stack is typically a few thousand frames. A linked-list-shaped graph with 50,000 nodes will crash recursive DFS. Either increase the recursion limit (sys.setrecursionlimit(200000) in Python) or switch to iterative DFS. In interviews, always mention this tradeoff.

⚠️ Not handling disconnected graphs A single DFS call from one start node only visits nodes reachable from that node. If the graph has multiple connected components (separate groups of nodes with no edges between them), you need to loop over all nodes and start a new DFS from each unvisited one. Forgetting this means silently skipping entire parts of the graph.

⚠️ Modifying the graph during traversal Some problems tempt you to mark visited cells by changing them in-place (e.g., setting grid cells to '#'). This works but can cause bugs if you need to restore the grid later (like in backtracking). Use a separate visited set when the grid state matters for future computations.

Complexity Analysis

Representation	Time	Space	Notes
Adjacency List	O(V + E)	O(V)	Same as BFS. Each vertex/edge visited once.
Adjacency Matrix	O(V²)	O(V)	Must scan entire row per vertex to find neighbors
Tree (n nodes)	O(n)	O(h)	h = height. O(n) for skewed, O(log n) balanced.
Grid (R × C)	O(R × C)	O(R × C)	Each cell visited once. Stack depth = path length.

Time is identical to BFS: O(V + E) with adjacency lists. Here V is the number of vertices (nodes) and E is the number of edges. You visit each vertex once and scan its adjacency list once, giving V + E total work.

The space difference between DFS and BFS is in what they store. DFS keeps only the current path on the stack — O(V) worst case for a chain graph, but often much less. BFS stores an entire frontier level in the queue, which can be O(V) on the last level of a complete binary tree. For trees specifically, DFS space is O(h) where h is the height. On a balanced tree that's O(log n). On a completely skewed tree (basically a linked list), it's O(n).

Implementation

Recursive DFS

Python

def dfs(graph, node, visited=None):
    """Basic recursive DFS on an adjacency list.
    
    graph: dict mapping each node to a list of its neighbors
           e.g. {0: [1, 2], 1: [3], 2: [3], 3: []}
    node:  starting node
    visited: set of already-visited nodes (created on first call)
    """
    if visited is None:
        visited = set()

    visited.add(node)
    print(node)  # Process the node — replace with whatever work you need

    for neighbor in graph[node]:
        if neighbor not in visited:
            dfs(graph, neighbor, visited)

    return visited

DFS with Discovery/Finish Times

Python

def dfs_timestamps(graph):
    """Compute discovery and finish times for all nodes.
    
    These timestamps power edge classification, topological sort,
    and SCC algorithms. The time counter increments globally across
    all DFS trees in the forest.
    """
    visited = set()
    discovery = {}
    finish = {}
    time = [0]  # Mutable container so the nested function can modify it
                # (Python closures can read but not assign to enclosing variables)

    def visit(node):
        time[0] += 1
        discovery[node] = time[0]
        visited.add(node)

        for neighbor in graph[node]:
            if neighbor not in visited:
                visit(neighbor)

        time[0] += 1
        finish[node] = time[0]

    # Handle disconnected graphs: try starting DFS from every node
    for node in graph:
        if node not in visited:
            visit(node)

    return discovery, finish

Iterative DFS

JavaScript

function dfsIterative(graph, start) {
  const visited = new Set();
  const stack = [start];
  const order = [];

  while (stack.length > 0) {
    const node = stack.pop();

    if (visited.has(node)) continue;
    visited.add(node);
    order.push(node);

    // Push neighbors in reverse order so we visit them
    // in the same order as recursive DFS (stack is LIFO,
    // so last pushed = first popped)
    const neighbors = graph[node] || [];
    for (let i = neighbors.length - 1; i >= 0; i--) {
      if (!visited.has(neighbors[i])) {
        stack.push(neighbors[i]);
      }
    }
  }
  return order;
}

Cycle Detection in Directed Graphs (Three-Color)

Python

def has_cycle(graph):
    """Detect cycles in a directed graph using three-state DFS.
    
    WHITE (0) = unvisited
    GRAY (1)  = in-progress (on the current DFS path)
    BLACK (2) = finished (all descendants explored)
    
    A back edge (to a GRAY node) means a cycle exists.
    """
    WHITE, GRAY, BLACK = 0, 1, 2
    color = {node: WHITE for node in graph}

    def dfs(u):
        color[u] = GRAY
        for v in graph[u]:
            if color[v] == GRAY:
                return True   # Back edge found — cycle!
            if color[v] == WHITE and dfs(v):
                return True
        color[u] = BLACK
        return False

    # Check every node (graph might be disconnected)
    return any(color[node] == WHITE and dfs(node) for node in graph)

DFS on a Grid (Flood Fill)

Python

def count_islands(grid):
    """Count connected components of '1's in a 2D grid.
    
    Classic 'Number of Islands' problem. Each island is a connected
    component of land cells ('1'). DFS from each unvisited '1' marks
    the whole island as visited.
    """
    if not grid:
        return 0
    
    rows, cols = len(grid), len(grid[0])
    count = 0

    def flood_fill(r, c):
        # Out of bounds or not land? Stop.
        if r < 0 or r >= rows or c < 0 or c >= cols or grid[r][c] != '1':
            return
        grid[r][c] = '#'  # Mark visited by modifying in-place
        # Recurse in all 4 directions
        flood_fill(r + 1, c)
        flood_fill(r - 1, c)
        flood_fill(r, c + 1)
        flood_fill(r, c - 1)

    for r in range(rows):
        for c in range(cols):
            if grid[r][c] == '1':
                flood_fill(r, c)
                count += 1

    return count

Real-World Example: Web Crawler

Search engines like Google use crawlers to discover web pages. At its core, a web crawler is DFS (or BFS) over the graph of hyperlinks. You start at a seed URL, fetch the page, extract all links, and follow each one — going deeper and deeper until you've found everything reachable.

In practice, crawlers use BFS more often (to prioritize pages close to well-known sites), but the DFS approach works well for focused crawls — say, discovering all pages within a single domain.

Python

import requests
from urllib.parse import urljoin, urlparse
from html.parser import HTMLParser

class LinkExtractor(HTMLParser):
    """Pulls all href links out of an HTML page."""
    def __init__(self):
        super().__init__()
        self.links = []
    
    def handle_starttag(self, tag, attrs):
        if tag == 'a':
            for name, value in attrs:
                if name == 'href':
                    self.links.append(value)

def crawl_domain(start_url, max_pages=100):
    """DFS crawl of a single domain. Discovers all reachable pages.
    
    This mirrors how DFS works on any graph:
    - Nodes = web pages (URLs)
    - Edges = hyperlinks from one page to another
    - visited set = pages we've already fetched
    - Stack = pages to visit next (using the call stack here)
    """
    domain = urlparse(start_url).netloc
    visited = set()
    pages_found = []

    def visit(url):
        if len(visited) >= max_pages:
            return
        if url in visited:
            return

        # Normalize URL and ensure we stay on the same domain
        parsed = urlparse(url)
        if parsed.netloc != domain:
            return

        visited.add(url)
        pages_found.append(url)

        try:
            response = requests.get(url, timeout=5)
            parser = LinkExtractor()
            parser.feed(response.text)

            for link in parser.links:
                # Convert relative URLs to absolute
                absolute = urljoin(url, link)
                visit(absolute)  # DFS: go deep before coming back
        except Exception:
            pass  # Skip pages that fail to load

    visit(start_url)
    return pages_found

# Usage: crawl_domain("https://example.com")

Notice the structure: it's the exact same pattern as basic DFS — visit node, mark visited, recurse on unvisited neighbors. The "graph" is implicit (defined by hyperlinks rather than an adjacency list), but the algorithm is identical.

Interview Patterns

💡 DFS = Recursion = Backtracking Most backtracking problems (N-Queens, generate parentheses, subsets, permutations) are DFS over a decision tree. At each node you make a choice, recurse, then undo the choice. If you can solve DFS, you can solve backtracking. The template is always: choose → explore → un-choose.

💡 Cycle Detection in Directed Graphs Track three states per node: unvisited, in-progress, and completed. If you visit a neighbor that's "in-progress," that's a back edge — you found a cycle. This is the standard approach for course schedule problems (can you take all courses given prerequisites?).

💡 "Number of Islands" and Grid DFS Any problem that says "count connected regions" or "flood fill" on a 2D grid is DFS. Iterate over every cell. When you find an unvisited cell of interest, DFS to mark the entire connected region. Count how many times you start a new DFS. This pattern handles Number of Islands, Max Area of Island, Surrounded Regions, and dozens of variants.

💡 Tree Diameter and Path Problems Finding the longest path in a tree (the diameter) uses two DFS calls: first from any node to find the farthest node A, then from A to find the farthest node B. The distance A→B is the diameter. Alternatively, DFS from every node tracking depth of left/right subtrees. Most "path sum" or "longest path" tree problems decompose into DFS that returns values upward.

💡 Recursion Depth Warning Python's default recursion limit is 1000. For large graphs (10K+ nodes), either increase it with sys.setrecursionlimit() or use iterative DFS. In interviews, mention this tradeoff — it shows you think about practical constraints, not just theory.

💡 Clone Graph = DFS + Hash Map When asked to deep-copy a graph, DFS through the original while building the copy. Use a hash map from original→clone to handle nodes you've already copied (avoiding infinite loops on cycles). This is a direct application of DFS with a visited map that also stores the mapping.

Common DFS Applications

Flood fill: Mark connected regions (Number of Islands). DFS from each unvisited land cell.
Path existence: "Can you reach B from A?" DFS is simpler than BFS here — just follow edges.
Topological sort: Reverse post-order of DFS gives a valid ordering. Used in build systems, course scheduling.
Tree problems: Most binary tree questions (height, diameter, path sum, serialize/deserialize) are naturally recursive DFS.
Graph coloring: DFS + backtracking for map coloring, Sudoku, constraint satisfaction.
Bridges and articulation points: Tarjan's algorithm uses DFS with low-link values to find critical edges/nodes.
Bipartite check: DFS with two-coloring — alternate colors on each level. If you ever color-conflict, it's not bipartite.

Practice Problems

Problem	Difficulty	Pattern	Link
Number of Islands	Medium	Grid DFS / flood fill	LC #200
Clone Graph	Medium	DFS + hash map	LC #133
Course Schedule	Medium	Cycle detection (directed)	LC #207
Course Schedule II	Medium	Topological sort	LC #210
Path Sum	Easy	Tree DFS	LC #112
Surrounded Regions	Medium	Border DFS	LC #130
Word Search	Medium	Backtracking DFS	LC #79
Max Area of Island	Medium	Grid DFS + tracking size	LC #695
Critical Connections in a Network	Hard	Tarjan's bridges	LC #1192

Depth-First Search

On This Page

What Is DFS?

Two Important Timestamps

Why DFS Instead of BFS?

Where DFS Shows Up

How It Works

Recursive DFS

Iterative DFS

Edge Classification

DFS on Trees vs Graphs

Interactive Visualization

Common Mistakes

Complexity Analysis

Implementation

Recursive DFS

DFS with Discovery/Finish Times

Iterative DFS

Cycle Detection in Directed Graphs (Three-Color)

DFS on a Grid (Flood Fill)

Real-World Example: Web Crawler

Interview Patterns

Common DFS Applications

Practice Problems