72. Edit Distance
Problem Description
The task is to find the minimum number of operations required to convert one string (word1
) into another (word2
). The only operations allowed are:
- Inserting a character.
- Deleting a character.
- Replacing one character with another.
These operations can be applied in any order and any number of times, and the goal is to achieve this transformation with the least number of them.
Intuition
To solve this problem, we use a technique in computer science known as dynamic programming. The core idea is to break down the big problem into smaller subproblems and solve each of them just once, storing their solutions - often in a table - so that the next time the same subproblem occurs, instead of recomputing its solution, one simply looks it up in the table. This is particularly effective for problems where the same subproblems recur many times.
For our specific case, we construct a matrix f
where each cell f[i][j]
represents the minimum number of operations to convert the first i
characters of word1
into the first j
characters of word2
. The first row (f[0][j]
) is initialized with the sequence 0, 1, 2, ..., n
because if word1
is empty, the only option is to insert characters into it, and the number of operations equals the number of characters in word2
. Similarly, the first column (f[i][0]
) is 0, 1, 2, ..., m
because if word2
is empty, the only option is to delete characters from word1
.
The intuition for the recursive step is as follows:
- If the current characters in
word1
andword2
are equal (word1[i - 1] == word2[j - 1]
), no operation is needed, and the number of operations will be the same as it was fori - 1
andj - 1
. - If they are not equal, we need to consider three possible operations:
- Inserting (
f[i][j - 1] + 1
): We have matched up toj - 1
ofword2
, and then by adding thej
-th character ofword2
, we will matchj
characters. The number of operations is one more than it took to matchj - 1
characters. - Deleting (
f[i - 1][j] + 1
): If we remove thei
-th character fromword1
, we fall back to the subproblem of matchingi - 1
characters ofword1
withj
characters ofword2
, and again, this is one more operation than that subproblem. - Replacing (
f[i - 1][j - 1] + 1
): Here, we change thei
-th character ofword1
to match thej
-th character ofword2
. So, the number of operations is one more than the operations needed fori - 1
andj - 1
.
- Inserting (
We take the minimum of these three options at each step, and the last cell f[m][n]
will give us the minimum number of operations required to transform word1
into word2
.
Learn more about Dynamic Programming patterns.
Solution Approach
The approach to this problem is a classic example of Dynamic Programming (DP), which uses a 2D table to store solutions to subproblems. This memory storage is critical because many subproblems are solved multiple times, and storing their solutions significantly reduces computation time. This technique is known as memoization.
To implement this:
-
We initialize a 2D array
f
withm+1
rows andn+1
columns, wherem
is the length ofword1
andn
is the length ofword2
. Each elementf[i][j]
represents the minimum number of operations needed to convert the firsti
characters ofword1
to the firstj
characters ofword2
. -
We fill in the base cases:
- The first row represents converting an empty
word1
into the firstj
characters ofword2
, which obviously requiresj
insertions. So, we setf[0][j] = j
for allj
. - The first column represents converting the first
i
characters ofword1
into an emptyword2
, which requiresi
deletions. Hence,f[i][0] = i
for alli
.
- The first row represents converting an empty
-
We iterate over the array starting from
f[1][1]
to fill in the remaining cells. At each cellf[i][j]
, we decide the best option (minimum operations) based on whether the characters at positionsi-1
inword1
andj-1
inword2
are the same. -
The choice at each step is between:
- Keeping the character if it's the same (
f[i-1][j-1]
) or - Performing one operation (delete, insert, or replace) to make the strings match up to that point.
- Keeping the character if it's the same (
-
We apply the following state transition equation:
f[i][j] = { f[i-1][j-1] if word1[i-1] == word2[j-1] min(f[i-1][j], f[i][j-1], f[i-1][j-1]) + 1 otherwise }
f[i-1][j] + 1
represents deleting thei
-th character fromword1
.f[i][j-1] + 1
means inserting thej
-th character intoword1
.f[i-1][j-1] + 1
indicates replacing thei
-th character ofword1
with thej
-th character ofword2
.
-
After filling the DP table, the value at
f[m][n]
gives us the minimum number of operations required to convertword1
toword2
.
This algorithm's runtime complexity is O(m * n)
because we have to fill a table with m * n
cells, and the work for each cell is constant. The space complexity is also O(m * n)
for the DP table.
Ready to land your dream job?
Unlock your dream job with a 2-minute evaluator for a personalized learning plan!
Start EvaluatorExample Walkthrough
Let's consider a small example where we want to convert word1 = "intention"
to word2 = "execution"
. We will walk through the Dynamic Programming approach to illustrate the solution:
-
Initialize a 2D array
f
with 10 rows (sinceword1
has 9 characters plus 1 for the empty prefix) and 10 columns (sinceword2
has 9 characters plus 1 for the empty prefix). -
Fill in the base cases:
- The first row
f[0][j]
fromf[0][0]
tof[0][9]
will be0, 1, 2, ..., 9
as it takesj
insertions to convert an emptyword1
intoword2[0..j-1]
. - The first column
f[i][0]
fromf[0][0]
tof[9][0]
will be0, 1, 2, ..., 9
as it takesi
deletions to convertword1[0..i-1]
into an emptyword2
.
- The first row
-
Now, we iterate over the remaining cells starting from
f[1][1]
. We compare characters ofword1
andword2
starting from first (i-1
forword1
andj-1
forword2
) and fillf[i][j]
considering three cases:- If
word1[i-1] == word2[j-1]
, we copy the value fromf[i-1][j-1]
tof[i][j]
(since no operation is needed). - Otherwise, we find the minimum of:
f[i-1][j]
+ 1 (delete case),f[i][j-1]
+ 1 (insert case),f[i-1][j-1]
+ 1 (replace case).
- If
For the first non-base cell f[1][1]
, since word1[0]
is 'i' and word2[0]
is 'e', they're not the same, so we compute:
f[1][1] = min(f[0][1], f[1][0], f[0][0]) + 1 = min(1, 1, 0) + 1 = 1
-
We continue this process for each cell. For instance:
f[3][2] (to convert "int" to "ex"): - word1[2] is "t" and word2[1] is "x", so they are different. - f[3][2] = min(f[2][2], f[3][1], f[2][1]) + 1 = min(2, 3, 2) + 1 = 3
-
After we have populated the entire array, we look at the last cell
f[9][9]
to find the minimum number of operations required to convertword1
intoword2
. In this example, let's say the last cell value is 5 (as your specific DP table may vary during actual execution).
Thus, the answer is that it requires a minimum of 5 operations to transform "intention" into "execution" using the allowed operations.
Solution Implementation
1class Solution:
2 def minDistance(self, word1: str, word2: str) -> int:
3 # Get the lengths of both words
4 len_word1, len_word2 = len(word1), len(word2)
5
6 # Initialize a table to store the edit distances
7 # The table size will be (len_word1+1) x (len_word2+1)
8 dp_table = [[0] * (len_word2 + 1) for _ in range(len_word1 + 1)]
9
10 # Set up the initial state where converting an empty string to word2
11 # requires adding all letters of word2
12 for j in range(1, len_word2 + 1):
13 dp_table[0][j] = j
14
15 # Set up the state where converting word1 to an empty string
16 # requires removing all letters of word1
17 for i in range(1, len_word1 + 1):
18 dp_table[i][0] = i
19
20 for j in range(1, len_word2 + 1):
21 # If the current characters match, take the previous best without these characters
22 if word1[i - 1] == word2[j - 1]:
23 dp_table[i][j] = dp_table[i - 1][j - 1]
24 else:
25 # If the characters don't match, consider all possible operations
26 # 1. Add a character (dp_table[i][j - 1])
27 # 2. Remove a character (dp_table[i - 1][j])
28 # 3. Replace a character (dp_table[i - 1][j - 1])
29 # Take the minimum of these possibilities and add 1 to represent the cost of the operation
30 dp_table[i][j] = min(
31 dp_table[i - 1][j], # Deletion
32 dp_table[i][j - 1], # Insertion
33 dp_table[i - 1][j - 1] # Substitution
34 ) + 1
35
36 # The answer is in the bottom-right cell of the table
37 # It represents the minimum edit distance between the two full words
38 return dp_table[len_word1][len_word2]
39
1class Solution {
2 public int minDistance(String word1, String word2) {
3 // Lengths of the input strings
4 int lenWord1 = word1.length();
5 int lenWord2 = word2.length();
6
7 // Create a 2D array to store the subproblem results
8 int[][] dpTable = new int[lenWord1 + 1][lenWord2 + 1];
9
10 // Initialize the first column, representing insertions needed to transform an empty string into word2
11 for (int indexWord2 = 1; indexWord2 <= lenWord2; ++indexWord2) {
12 dpTable[0][indexWord2] = indexWord2;
13 }
14
15 // Fill out the dpTable for all subproblems
16 for (int indexWord1 = 1; indexWord1 <= lenWord1; ++indexWord1) {
17 // First row represents deletions needed to transform word1 into an empty string
18 dpTable[indexWord1][0] = indexWord1;
19
20 for (int indexWord2 = 1; indexWord2 <= lenWord2; ++indexWord2) {
21 // If the characters are the same, take the value from the diagonal (no operation needed)
22 if (word1.charAt(indexWord1 - 1) == word2.charAt(indexWord2 - 1)) {
23 dpTable[indexWord1][indexWord2] = dpTable[indexWord1 - 1][indexWord2 - 1];
24 } else {
25 // If the characters are different, take the minimum operations from left (insert), top (delete), or diagonal (replace) and add 1
26 int insertOps = dpTable[indexWord1][indexWord2 - 1];
27 int deleteOps = dpTable[indexWord1 - 1][indexWord2];
28 int replaceOps = dpTable[indexWord1 - 1][indexWord2 - 1];
29
30 dpTable[indexWord1][indexWord2] = Math.min(insertOps, Math.min(deleteOps, replaceOps)) + 1;
31 }
32 }
33 }
34
35 // The bottom-right cell gives the final result
36 return dpTable[lenWord1][lenWord2];
37 }
38}
39
1#include <vector>
2#include <string>
3#include <algorithm>
4
5class Solution {
6public:
7 int minDistance(string word1, string word2) {
8 int lengthWord1 = word1.size(), lengthWord2 = word2.size();
9
10 // Create a DP table with dimensions (lengthWord1+1) x (lengthWord2+1)
11 std::vector<std::vector<int>> dpTable(lengthWord1 + 1, std::vector<int>(lengthWord2 + 1));
12
13 // Initialize the first column of the DP table which represents
14 // the number of operations required to convert an empty string to word2
15 for (int j = 0; j <= lengthWord2; ++j) {
16 dpTable[0][j] = j;
17 }
18
19 // Fill out the DP table
20 for (int i = 1; i <= lengthWord1; ++i) {
21 // The first row of the DP table represents the number of operations
22 // required to convert word1 to an empty string
23 dpTable[i][0] = i;
24
25 for (int j = 1; j <= lengthWord2; ++j) {
26 // If characters at current position in both words are equal,
27 // take the value from the previous top-left diagonal cell,
28 // as no operation is required
29 if (word1[i - 1] == word2[j - 1]) {
30 dpTable[i][j] = dpTable[i - 1][j - 1];
31 } else {
32 // Otherwise, use the minimum value from the cell to the left (insert),
33 // above (delete) or top-left diagonal (replace), plus one for the current operation
34 dpTable[i][j] = std::min({
35 dpTable[i - 1][j], // Deletion
36 dpTable[i][j - 1], // Insertion
37 dpTable[i - 1][j - 1] // Replacement
38 }) + 1;
39 }
40 }
41 }
42
43 // The bottom-right cell of the DP table contains the final answer
44 return dpTable[lengthWord1][lengthWord2];
45 }
46};
47
1function minDistance(word1: string, word2: string): number {
2 const lenWord1 = word1.length;
3 const lenWord2 = word2.length;
4 // Create a 2D array to hold the minimum edit distances.
5 const dp: number[][] = Array.from(Array(lenWord1 + 1), () => new Array(lenWord2 + 1).fill(0));
6
7 // Initialize the first row of the matrix.
8 for (let col = 1; col <= lenWord2; ++col) {
9 dp[0][col] = col;
10 }
11 // Initialize the first column of the matrix.
12 for (let row = 1; row <= lenWord1; ++row) {
13 dp[row][0] = row;
14 for (let col = 1; col <= lenWord2; ++col) {
15 // Check if the current characters are the same.
16 if (word1[row - 1] === word2[col - 1]) {
17 dp[row][col] = dp[row - 1][col - 1];
18 } else {
19 // If not, find the minimum cost among deletion, insertion, and replacement.
20 dp[row][col] = Math.min(
21 dp[row - 1][col], // Deletion (from word1 to word2).
22 dp[row][col - 1], // Insertion (from word1 to word2).
23 dp[row - 1][col - 1] // Replacement (from word1 to word2).
24 ) + 1;
25 }
26 }
27 }
28 // The bottom-right cell contains the final minimum edit distance.
29 return dp[lenWord1][lenWord2];
30}
31
Time and Space Complexity
The provided code snippet is an implementation of the dynamic programming approach to solve the problem of finding the minimum number of operations required to convert one word into another, where operations can be insertion, deletion, or substitution of a single character.
Time Complexity: The time complexity of this algorithm is O(m * n)
where m
is the length of word1
and n
is the length of word2
. This time complexity arises because the algorithm iterates through all characters of word1
using the variable i
and all characters of word2
using the variable j
. For each pair of characters (i, j)
, a constant amount of work is done to compute f[i][j]
. Since the two for-loops are nested, each of the m * n
pairs is considered exactly once, leading to the overall time complexity of O(m * n)
.
Space Complexity: The space complexity of the algorithm is also O(m * n)
due to the utilization of a two-dimensional array f
that has (m + 1) * (n + 1)
elements. Each element in f
represents the minimum number of operations required to convert the first i
characters of word1
to the first j
characters of word2
. Since the array f
has a size proportional to the product of m
and n
, the space complexity is O(m * n)
.
Learn more about how to find time and space complexity quickly using problem constraints.
Which type of traversal does breadth first search do?
Recommended Readings
What is Dynamic Programming Prerequisite DFS problems dfs_intro Backtracking problems backtracking Memoization problems memoization_intro Pruning problems backtracking_pruning Dynamic programming is an algorithmic optimization technique that breaks down a complicated problem into smaller overlapping sub problems in a recursive manner and uses solutions to the sub problems to construct a solution
LeetCode Patterns Your Personal Dijkstra's Algorithm to Landing Your Dream Job The goal of AlgoMonster is to help you get a job in the shortest amount of time possible in a data driven way We compiled datasets of tech interview problems and broke them down by patterns This way we
Recursion Recursion is one of the most important concepts in computer science Simply speaking recursion is the process of a function calling itself Using a real life analogy imagine a scenario where you invite your friends to lunch https algomonster s3 us east 2 amazonaws com recursion jpg You first
Want a Structured Path to Master System Design Too? Don’t Miss This!