44. Wildcard Matching
Problem Description
The problem is a classic example of pattern matching where we are given a string s
and a pattern p
that includes wildcard characters. We need to determine if the pattern p
matches the entire string s
. The wildcard characters are defined as follows:
- A question mark (
'?'
) matches any single character. - An asterisk (
'*'
) matches any sequence of characters, including an empty sequence.
The goal is to check if there is a complete match between the entire string and the pattern, not just a partial match.
Intuition
For solving this problem, dynamic programming is a common approach because it allows us to break down the complex problem into smaller subproblems and then build up the solution from these.
The key insight is to realize that we can make a decision at each character in the string based on two conditions:
-
If the current character in the pattern is a
?
or matches the current character in the string, we refer to previous states where the match has been progressing without the current character and pattern. -
If the current character in the pattern is a
*
, it can be complex because*
can match an empty sequence or any sequence of characters. We need to consider multiple cases: either we use the*
to match zero characters in the string (which means we look at the state of the match without the*
), or we use the*
to match at least one character in the string (which means we look at the state of the match without the current character in the string, but keep the*
).
This method of breaking down the problem helps us to derive a solution using a 2D matrix dp
where dp[i][j]
represents whether the first i
characters of the string s
can be matched with the first j
characters of the pattern p
. The final answer at dp[m][n]
(where m
and n
are the lengths of the string and pattern respectively) gives us the answer to whether the entire string matches the pattern.
By filling up the matrix by iterating over the string and the pattern, we use the previously solved subproblems to inform the solution of the current subproblem. Eventually, we derive the solution to the entire problem.
Learn more about Greedy, Recursion and Dynamic Programming patterns.
Solution Approach
The solution uses dynamic programming, a method where complex problems are broken into simpler subproblems and solved individually, with the solutions to the subproblems stored to avoid redundant calculations.
Here is a breakdown of the implementation steps:
-
Initialize a 2D matrix
dp
with dimensions(m + 1) x (n + 1)
, wherem
is the length of the strings
andn
is the length of the patternp
. Each elementdp[i][j]
in this matrix will store a boolean value indicating ifs[0..i-1]
matchesp[0..j-1]
. The+1
offset allows us to easily handle the empty string and pattern cases. -
Set the first element
dp[0][0]
toTrue
to represent that an empty string matches an empty pattern. -
Pre-fill the first row of the matrix by setting
dp[0][j]
toTrue
ifp[j-1]
is*
anddp[0][j-1]
is alsoTrue
. This loop accounts for the situation where the pattern starts with one or multiple*
characters, which can match the empty string.for j in range(1, n + 1): if p[j - 1] == '*': dp[0][j] = dp[0][j - 1]
-
Iterate through the matrix starting from
i = 1
andj = 1
, and calculate thedp[i][j]
value based on the following rules:-
If the current character of
s[i - 1]
matches the current character ofp[j - 1]
orp[j - 1]
is a?
(which can match any single character), then setdp[i][j]
to the value ofdp[i - 1][j - 1]
since we can carry forward the match status from previous characters.if s[i - 1] == p[j - 1] or p[j - 1] == '?': dp[i][j] = dp[i - 1][j - 1]
-
If the current character of the pattern is
*
, there are two possibilities:- The
*
matches zero characters: carry forward the status fromdp[i][j - 1]
. - The
*
matches one or more characters: carry forward the status fromdp[i - 1][j]
.
- The
Hence:
```python elif p[j - 1] == '*': dp[i][j] = dp[i - 1][j] or dp[i][j - 1] ```
- For all other cases where characters don't match and there is no wildcard,
dp[i][j]
will remain False, which is the default value after initialization.
-
-
After filling the entire matrix, the value at
dp[m][n]
will indicate whether the entire strings
matches the patternp
.
This approach ensures that each subproblem is only calculated once and then reused, dramatically reducing the time complexity from exponential (which would be the case with a naive recursive approach) to polynomial time.
Ready to land your dream job?
Unlock your dream job with a 2-minute evaluator for a personalized learning plan!
Start EvaluatorExample Walkthrough
Let's illustrate the solution approach using a small example:
Assume s = "xaabyc"
and p = "*a?b*"
.
-
Initialize the 2D matrix
dp
: Sinces
has a length of 6 andp
has a length of 5, our matrixdp
will be a 7x6 matrix (including the extra row and column for the empty string and pattern case). Initially, all values indp
are set toFalse
. -
First element
dp[0][0]
: We setdp[0][0]
toTrue
to denote that an empty pattern matches an empty string. -
First row pre-fill: We then iterate over the first row of
dp
to account for a pattern that starts with*
. In this case,p[0]
is*
, sodp[0][1]
should be set toTrue
. Following that logic, here's how the first row will look after pre-filling:*
a
?
b
*
T T F F F F -
Iterate and fill the matrix:
a. For
i = 1
andj = 1
, the pattern has*
which matches zero characters froms
. So,dp[1][1]
isTrue
becausedp[0][0]
was true.b. When
j = 2
andi = 1
, the pattern isa
but our string isx
. Since they don't match and the pattern is not a?
,dp[1][2]
staysFalse
.*
a
?
b
*
T T F F F F x
F T F F F F c. Eventually, by following the iteration rules while filling out the matrix, we will have:
*
a
?
b
*
T T F F F F x
F T F F F F a
F T T F F F a
F T T T F F b
F T F T T F y
F T F F T T c
F T F F F T -
Final Check: Our final cell
dp[6][5]
containsTrue
. Therefore, we can deduce that with the given example, the strings
matches the patternp
.
By processing the dp
matrix according to the dynamic programming approach, we've avoided redundant calculations and determined the match between s
and p
efficiently.
Solution Implementation
1class Solution:
2 def isMatch(self, string: str, pattern: str) -> bool:
3 # Get the lengths of the input string and the pattern
4 length_s, length_p = len(string), len(pattern)
5
6 # Create a DP table with default values False
7 dp = [[False] * (length_p + 1) for _ in range(length_s + 1)]
8
9 # The empty pattern matches the empty string
10 dp[0][0] = True
11
12 # Initialize first row of the DP table, considering the pattern starting with '*'
13 for j in range(1, length_p + 1):
14 if pattern[j - 1] == '*':
15 dp[0][j] = dp[0][j - 1]
16
17 # Fill the DP table
18 for i in range(1, length_s + 1):
19 for j in range(1, length_p + 1):
20 # If characters match or pattern has '?', we can move back diagonally in the table (match found)
21 if string[i - 1] == pattern[j - 1] or pattern[j - 1] == '?':
22 dp[i][j] = dp[i - 1][j - 1]
23 # If pattern has '*', we check two cases:
24 # 1. '*' matches no character: move left in the table
25 # 2. '*' matches at least one character: move up in the table
26 elif pattern[j - 1] == '*':
27 dp[i][j] = dp[i - 1][j] or dp[i][j - 1]
28
29 # Return the value at the bottom-right corner of the DP table
30 return dp[length_s][length_p]
31
1class Solution {
2
3 public boolean isMatch(String str, String pattern) {
4 // Lengths of the input string and the pattern
5 int strLen = str.length(), patternLen = pattern.length();
6
7 // dp[i][j] will be true if the first i characters in given string
8 // match the first j characters of the pattern
9 boolean[][] dp = new boolean[strLen + 1][patternLen + 1];
10
11 // Empty string and empty pattern are a match
12 dp[0][0] = true;
13
14 // Initialize the first row for the cases where pattern contains *
15 // as they can match the empty string
16 for (int j = 1; j <= patternLen; ++j) {
17 if (pattern.charAt(j - 1) == '*') {
18 dp[0][j] = dp[0][j - 1];
19 }
20 }
21
22 // Build the dp matrix in bottom-up manner
23 for (int i = 1; i <= strLen; ++i) {
24 for (int j = 1; j <= patternLen; ++j) {
25 // If the current characters match or pattern has '?',
26 // we can propagate the diagonal value
27 if (str.charAt(i - 1) == pattern.charAt(j - 1) || pattern.charAt(j - 1) == '?') {
28 dp[i][j] = dp[i - 1][j - 1];
29 }
30 // If pattern contains '*', it can either match zero characters
31 // in the string or it can match one character in the string
32 // and continue matching
33 else if (pattern.charAt(j - 1) == '*') {
34 dp[i][j] = dp[i - 1][j] || dp[i][j - 1];
35 }
36 // If the current pattern character is not a wildcard and the characters
37 // don't match, dp[i][j] remains false, which is the default value.
38 }
39 }
40
41 // The value in the bottom right corner will be our answer
42 return dp[strLen][patternLen];
43 }
44}
45
1#include <vector>
2#include <string>
3
4class Solution {
5public:
6 bool isMatch(const std::string& s, const std::string& p) {
7 int strSize = s.size(), patternSize = p.size();
8 // Create a DP table with dimensions (m+1) x (n+1) initialized to false.
9 // dp[i][j] will be true if the first i characters of s match the first j
10 // characters of p.
11 std::vector<std::vector<bool>> dp(strSize + 1, std::vector<bool>(patternSize + 1, false));
12
13 // The empty pattern matches the empty string.
14 dp[0][0] = true;
15
16 // Initialize the first row of the DP table. If we find '*', it can match
17 // an empty string, which is the state of dp[0][j-1].
18 for (int j = 1; j <= patternSize; ++j) {
19 if (p[j - 1] == '*') {
20 dp[0][j] = dp[0][j - 1];
21 }
22 }
23
24 // Fill the DP table.
25 for (int i = 1; i <= strSize; ++i) {
26 for (int j = 1; j <= patternSize; ++j) {
27 // If the characters match or the pattern character is '?',
28 // we can transition from the state dp[i-1][j-1].
29 if (s[i - 1] == p[j - 1] || p[j - 1] == '?') {
30 dp[i][j] = dp[i - 1][j - 1];
31 }
32 // If the pattern character is '*', it can either match zero characters,
33 // meaning we transition from dp[i][j-1], or it can match one character,
34 // meaning we transition from dp[i-1][j].
35 else if (p[j - 1] == '*') {
36 dp[i][j] = dp[i][j - 1] || dp[i - 1][j];
37 }
38 }
39 }
40
41 // The final state dp[m][n] gives us the answer to whether the entire
42 // strings s and p match with each other.
43 return dp[strSize][patternSize];
44 }
45};
46
1function isMatch(s: string, p: string): boolean {
2 let strLen: number = s.length;
3 let patternLen: number = p.length;
4
5 // Create a DP table with dimensions (strLen+1) x (patternLen+1) initialized to false.
6 // dp[i][j] will be true if the first i characters of s match the first j characters of p.
7 let dp: boolean[][] = Array.from({ length: strLen + 1 },
8 () => Array<boolean>(patternLen + 1).fill(false));
9
10 // The empty pattern matches the empty string.
11 dp[0][0] = true;
12
13 // Initialize the first row of the DP table. If we find '*', it can match
14 // an empty string, thereby adopting the value from dp[0][j-1].
15 for (let j = 1; j <= patternLen; j++) {
16 if (p[j - 1] === '*') {
17 dp[0][j] = dp[0][j - 1];
18 }
19 }
20
21 // Fill the DP table.
22 for (let i = 1; i <= strLen; i++) {
23 for (let j = 1; j <= patternLen; j++) {
24 // If the characters match or the pattern character is '?',
25 // we can transition from the state dp[i-1][j-1].
26 if (s[i - 1] === p[j - 1] || p[j - 1] === '?') {
27 dp[i][j] = dp[i - 1][j - 1];
28 }
29 // If the pattern character is '*', it can either match zero characters,
30 // meaning we transition from dp[i][j-1], or it can match one or more characters,
31 // meaning we transition from dp[i-1][j].
32 else if (p[j - 1] === '*') {
33 dp[i][j] = dp[i][j - 1] || dp[i - 1][j];
34 }
35 }
36 }
37
38 // The final state dp[strLen][patternLen] gives us the answer to whether the entire
39 // strings s and p match with each other.
40 return dp[strLen][patternLen];
41}
42
Time and Space Complexity
The time complexity of the provided code is O(m * n)
, where m
is the length of the string s
and n
is the length of the pattern p
. This is because the code involves a nested loop structure that iterates through the lengths of s
and p
. Each cell in the DP table dp[i][j]
computes whether the first i
characters of s
match the first j
characters of p
, and the computation of each cell is constant time.
The space complexity of the code is also O(m * n)
. This is due to the use of a two-dimensional list dp
, which has (m + 1) * (n + 1)
elements, to store the states of substring matches. Each element of this list represents a subproblem, with extra rows and columns to handle empty strings.
Learn more about how to find time and space complexity quickly using problem constraints.
Which of the following uses divide and conquer strategy?
Recommended Readings
Greedy Introduction div class responsive iframe iframe src https www youtube com embed WTslqPbj7I title YouTube video player frameborder 0 allow accelerometer autoplay clipboard write encrypted media gyroscope picture in picture web share allowfullscreen iframe div When do we use greedy Greedy algorithms tend to solve optimization problems Typically they will ask you to calculate the max min of some value Commonly you may see this phrased in the problem as max min longest shortest largest smallest etc These keywords can be identified by just scanning
Recursion Recursion is one of the most important concepts in computer science Simply speaking recursion is the process of a function calling itself Using a real life analogy imagine a scenario where you invite your friends to lunch https algomonster s3 us east 2 amazonaws com recursion jpg You first
What is Dynamic Programming Prerequisite DFS problems dfs_intro Backtracking problems backtracking Memoization problems memoization_intro Pruning problems backtracking_pruning Dynamic programming is an algorithmic optimization technique that breaks down a complicated problem into smaller overlapping sub problems in a recursive manner and uses solutions to the sub problems to construct a solution
Want a Structured Path to Master System Design Too? Don’t Miss This!