2746. Decremental String Concatenation
Problem Description
In this problem, we are given an array called words
, which contains 'n' strings, each indexed from 0
to n-1
. We need to combine these strings using a special 'join' operation which merges two strings together. But there's a twist: if the last character of the first string is the same as the first character of the second string, we have to remove one of these matching characters while joining them. The goal is to perform n - 1
join operations to generate a single string in such a way that the final string's length is as small as possible.
Here's what we are allowed to do during each join operation:
- We can append the next string in the array
words
to the current stringstr_i
. - Alternatively, we can prepend the next string in the array
words
to the current stringstr_i
.
Intuition
To arrive at the solution approach, consider that at every step of joining, we have two choices: either append or prepend the next word. The impact of this choice on the final string length can vary depending on the matching characters at the edges of the strings being joined.
We aim to minimize the overall length each time we make a join. To do this for n - 1
operations, an intuition may lead us to dynamic programming or recursion to check every possibility, remembering the results of sub-problems to avoid redundant calculations.
Since the problem is about making choices at each step and optimizing for the 'smallest length', and considering that each choice might impact the subsequent ones, we can visualize the decision-making process as a tree. This is a common characteristic of problems where backtracking or dynamic programming is a fit.
The solution uses recursion with memoization (notice the use of the @cache
decorator) to explore all possible concatenations and record the results for sub-problems, thus avoiding the recalculation of the same state. The dfs
function takes into account the current position i
, as well as the potential characters at the beginning and end of the current concatenated string (a
and b
) to make decisions leading towards the minimum length of the final string.
Each call to the dfs
function considers the effect of performing a join operation via appending or prepending the current word to the string being built so far and tracks which operation gives the shorter result. The use of a recursive function with memoization allows the efficient solving of this optimization problem by making the complexity feasible for larger input sizes.
Learn more about Dynamic Programming patterns.
Solution Approach
The implementation relies on Depth-First Search (DFS) recursion with memoization to efficiently explore all possible concatenation sequences. The @cache
decorator is used to memoize the results. Let's examine how the solution works by breaking down the key components of the implementation:
-
The function
dfs
is the core of this solution. It takes three parameters: the current indexi
in thewords
array, a charactera
representing the beginning character of the string up to this point, and a characterb
representing the ending character of the string up to this point. -
The base case for the recursion is when
i
exceeds the number of words we have. At that point, there are no more words to join, and the function returns0
because there is no extra length to add from additional words. -
When inside
dfs
, we consider the next words
to be joined with the current string. We have two choices resembling a binary tree's branches:- Append: Where we consider adding
s
to the end of the current string, so we calldfs(i + 1, a, s[-1])
, passings[-1]
as the new ending character since we're appending. We subtract1
from the length if the current ending characterb
matches the starting character ofs
. - Prepend: Where we consider adding
s
at the beginning, so we calldfs(i + 1, s[0], b)
, passings[0]
as the new starting character. We subtract1
from the length if the ending characters[-1]
matches the current beginning charactera
.
- Append: Where we consider adding
-
The recursive calls return the sum of the length of the current word
s
and the minimum length from the two choices (append or prepend). The subtraction of1
accounts for the removed character when there is a match. -
The final return statement
return len(words[0]) + dfs(1, words[0][0], words[0][-1])
initiates the process with the first word, taking its length and appending the result of the recursive call starting from the second word with the first word's beginning and ending characters.
The @cache
decorator on dfs
ensures that for each unique set of inputs to the function (combinations of i
, a
, and b
), the result is stored. If the same inputs are ever called again, the stored result is returned immediately, which significantly reduces the number of recursive calls, especially for large arrays with many possible combinations.
Overall, this solution showcases a classic recursive problem-solving strategy enhanced with memoization, a powerful way to optimize recursive depth-first search algorithms.
Ready to land your dream job?
Unlock your dream job with a 2-minute evaluator for a personalized learning plan!
Start EvaluatorExample Walkthrough
Let's take an example array of words
: ["abc", "cda", "dac"]
. We will walk through the process to understand how we might join these strings to minimize the overall length.
Firstly, starting with the first word "abc", we initiate the recursive process. There are no characters on either side, so a
and b
are 'a' and 'c' respectively, which represent the first and last characters of the current string.
Now, we look at the next word "cda". We have two choices:
-
Append: If we append "cda" to "abc", the resulting string would be "abccda" because the last character of "abc" is the same as the first of "cda", we can combine them by removing one 'c'. So, the string becomes "abcdac". We then call
dfs(2, 'a', 'a')
since we have moved on to the next word and the end character has been updated to 'a' from "cda". -
Prepend: If we prepend "cda" to "abc", the resulting string would be "cdabc". There is no matching character in this case, so no character is removed. We then call
dfs(2, 'c', 'c')
, updating the start character to 'c'.
The return value for each of these choices would be the length of "cda" (which is 3) plus the minimum of the lengths obtained from appending or prepending the next word "dac".
Continuing with the recursion, we now have the third word "dac" and the choices described above for the second word:
-
If we chose to append during the previous operation, we now have "abcdac":
- Append: Append "dac" leading to "abcdacdac", and after removing the duplicate 'a', we have "abcdacdc". The recursive call would be
dfs(3, 'a', 'c')
. - Prepend: Prepend "dac" leading to "dacabcdac", and after removing the duplicate 'd', it remains the same as prepending will not benefit us this time. The recursive call would be
dfs(3, 'd', 'c')
.
- Append: Append "dac" leading to "abcdacdac", and after removing the duplicate 'a', we have "abcdacdc". The recursive call would be
-
If we chose to prepend in the previous operation, our string would be "cdabc":
- Append: Append "dac" leading to "cdabcdac", with no character removed. The recursive call would be
dfs(3, 'c', 'c')
. - Prepend: Prepend "dac" to "cdabc" leading to "daccdabc" and after removing the duplicate 'c', we have "dacdabc". The recursive call would be
dfs(3, 'd', 'c')
.
- Append: Append "dac" leading to "cdabcdac", with no character removed. The recursive call would be
For each of these calls, we consider the removal of a duplicate character if there's a match at any end.
When i
becomes equal to n
, the recursion has considered all words, and we return 0
since there are no more words to add length from.
The memoization stores the results of each recursive call like dfs(2, 'a', 'a')
, dfs(2, 'c', 'c')
, dfs(3, 'a', 'c')
, etc., so if the function with the same set of parameters is called again, it doesn't recalculate but retrieves the value from the cache, saving time.
In our example, assuming optimal choices were made, the solution might deduce that calling dfs(3, 'a', 'c')
results in the smallest final string length. Ending with an efficient concatenated string "abcdacdc" or "dacdabc", depending on the sequence of operations chosen by the DFS algorithm.
Solution Implementation
1class Solution:
2 def minimize_concatenated_length(self, words: List[str]) -> int:
3 # Introduce caching to avoid recomputing overlapping subproblems
4 @functools.lru_cache(None)
5 def search(i: int, first_char: str, last_char: str) -> int:
6 # Base case: all words have been considered
7 if i >= len(words):
8 return 0 # No extra length is added
9
10 current_word = words[i] # Current word at index i
11 # Recursive case for appending the current word to the first string
12 # Subtract 1 if the first character of current word matches the last character of the second string
13 append_to_first = search(i + 1, first_char, current_word[-1]) - int(current_word[0] == last_char)
14 # Recursive case for appending the current word to the second string
15 # Subtract 1 if the last character of current word matches the first character of the first string
16 append_to_second = search(i + 1, current_word[0], last_char) - int(current_word[-1] == first_char)
17
18 # Add the length of the current word to the minimum of both scenarios
19 return len(current_word) + min(append_to_first, append_to_second)
20
21 # Start the search with the first word and its first and last characters
22 return len(words[0]) + search(1, words[0][0], words[0][-1])
23
24# Example usage:
25# solution = Solution()
26# result = solution.minimize_concatenated_length(["abc", "bcd", "cde"])
27```
28
29Lines added include `functools.lru_cache(None)` to cache results of subproblems, explanatory comments, and better-named variables for readability. The `List` type also should be imported from `typing` at the beginning of the file, as follows:
30
31```python
32from typing import List
33import functools
34
1class Solution {
2 private Integer[][][] memoization; // 3D array for memoization to save calculated values
3 private String[] words; // array of words
4 private int wordCount; // number of words
5
6 // The entry method that initiates the process
7 public int minimizeConcatenatedLength(String[] words) {
8 this.wordCount = words.length; // store the number of words
9 this.words = words; // store the array of words
10 memoization = new Integer[wordCount][26][26]; // initialize the 3D array for memoization
11 // Start the recursion with the first word, and use the first and last characters to track the state
12 return words[0].length() + dfs(1, words[0].charAt(0) - 'a',
13 words[0].charAt(words[0].length() - 1) - 'a');
14 }
15
16 // Recursive method to find the minimum concatenated length of words
17 private int dfs(int currentIndex, int firstCharIndex, int lastCharIndex) {
18 // Base case: if we have processed all words, return 0
19 if (currentIndex >= wordCount) {
20 return 0;
21 }
22 // If this state has already been computed, return the stored value
23 if (memoization[currentIndex][firstCharIndex][lastCharIndex] != null) {
24 return memoization[currentIndex][firstCharIndex][lastCharIndex];
25 }
26
27 String currentWord = words[currentIndex]; // get the current word
28 int currentWordLength = currentWord.length(); // length of the current word
29 int newLastCharIndex = currentWord.charAt(currentWordLength - 1) - 'a'; // index of the last character
30
31 // Choose the current word to be at the start of remaining part and compute the remaining minimum
32 int choiceAtStart = dfs(currentIndex + 1, firstCharIndex, newLastCharIndex) -
33 (currentWord.charAt(0) - 'a' == lastCharIndex ? 1 : 0);
34
35 // Choose the current word to be at the end of previously considered words and compute the remaining minimum
36 int choiceAtEnd = dfs(currentIndex + 1, currentWord.charAt(0) - 'a', lastCharIndex) -
37 (newLastCharIndex == firstCharIndex ? 1 : 0);
38
39 // Store the computed value in memoization table and return it,
40 // The total minimum length for this state is the length of the current word plus
41 // the minimum of choosing the word at the start or the end
42 memoization[currentIndex][firstCharIndex][lastCharIndex]
43 = currentWordLength + Math.min(choiceAtStart, choiceAtEnd);
44
45 return memoization[currentIndex][firstCharIndex][lastCharIndex];
46 }
47}
48
1#include <vector>
2#include <string>
3#include <cstring>
4#include <algorithm>
5#include <functional>
6
7using namespace std;
8
9class Solution {
10public:
11 // Function to minimize concatenated length of the given words sequence
12 int minimizeConcatenatedLength(vector<string>& words) {
13 int numWords = words.size(); // Number of words in the vector
14 int dp[numWords][26][26]; // Dynamic Programming (DP) table
15 memset(dp, 0, sizeof(dp)); // Initialize the DP table with zeros
16
17 // Recursive depth-first search function to find the
18 // minimized concatenation length for the given words
19 // starting from index i with a and b as the last characters
20 function<int(int, int, int)> dfs = [&](int index, int lastCharOfPrev, int firstCharOfNext) {
21 if (index >= numWords) { // Base case: reached the end of the words vector
22 return 0;
23 }
24
25 if (dp[index][lastCharOfPrev][firstCharOfNext]) { // Already calculated this subproblem
26 return dp[index][lastCharOfPrev][firstCharOfNext];
27 }
28
29 string currentWord = words[index]; // Current word to process
30 int wordLength = currentWord.size(); // Length of the current word
31 int option1 = dfs(index + 1, lastCharOfPrev, currentWord[wordLength - 1] - 'a') - (currentWord[0] - 'a' == firstCharOfNext);
32 int option2 = dfs(index + 1, currentWord[0] - 'a', firstCharOfNext) - (currentWord[wordLength - 1] - 'a' == lastCharOfPrev);
33
34 // Store the result in the DP table for memoization
35 // Adding the length of the current word and taking the minimum option
36 return dp[index][lastCharOfPrev][firstCharOfNext] = wordLength + min(option1, option2);
37 };
38
39 // Start DFS with the first word and the last characters of the first word
40 return words[0].size() + dfs(1, words[0].front() - 'a', words[0].back() - 'a');
41 }
42};
43
1// Define the function to minimize the concatenated length of strings in an array
2function minimizeConcatenatedLength(words: string[]): number {
3 const numWords = words.length;
4
5 // Create a 3D array to store intermediate results for memoization
6 // f[i][a][b] represents the minimum length from words[i], ending
7 // with char a and before words[i] starts with char b.
8 const memo: number[][][] = Array.from({ length: numWords }, () =>
9 Array.from({ length: 26 }, () => Array(26).fill(0))
10 );
11
12 // Define a helper method for depth-first search
13 const dfs = (index: number, prevLastCharIndex: number, nextFirstCharIndex: number): number => {
14 // Base case: when all words have been considered, return 0
15 if (index >= numWords) {
16 return 0;
17 }
18 // If the result is already computed, return it to avoid redundant calculations
19 if (memo[index][prevLastCharIndex][nextFirstCharIndex] > 0) {
20 return memo[index][prevLastCharIndex][nextFirstCharIndex];
21 }
22 // Get the current word and its length
23 const currentWord = words[index];
24 const currentWordLength = currentWord.length;
25
26 // Calculate the next value recursively while reducing 1 if the first or last character matches
27 // the adjacent character index from the previous or next word, respectively
28 // The two options represent attaching the current word at the beginning or the end
29 const option1 =
30 dfs(index + 1, prevLastCharIndex, currentWord.charCodeAt(currentWordLength - 1) - 97) -
31 (currentWord.charCodeAt(0) - 97 === nextFirstCharIndex ? 1 : 0);
32 const option2 =
33 dfs(index + 1, currentWord.charCodeAt(0) - 97, nextFirstCharIndex) -
34 (currentWord.charCodeAt(currentWordLength - 1) - 97 === prevLastCharIndex ? 1 : 0);
35
36 // Store the result (minimum of the two options) in the memo array and return it
37 return (memo[index][prevLastCharIndex][nextFirstCharIndex] = Math.min(option1, option2) + currentWordLength);
38 };
39
40 // Call the helper function with the first word and initial indices for previous last character and next first character
41 return (
42 words[0].length +
43 dfs(1, words[0].charCodeAt(0) - 97, words[0].charCodeAt(words[0].length - 1) - 97)
44 );
45}
46
Time and Space Complexity
The given Python code implements a solution to minimize the concatenated length of a list of words by exploiting memoization (using the @cache
decorator, which caches the results of the dfs
function calls). Analyzing the provided code, we can deduce the complexity as follows:
Time Complexity
The function dfs(i: int, a: str, b: str)
is recursively called with different parameters, which correspond to the current index (i
), the first character of the last added word to string a
(a
), and the last character of the last added word to string b
(b
). Since memoization is used, each state will be computed at most once. There are n
possible indices (n
being the size of words
), and each word can provide 26 possible characters for a
and b
(assuming the input is lower-case English letters). Therefore, we have:
- Number of states:
n * 26 * 26
For each state, operations are performed in constant time (O(1)
) considering there is no loop within dfs
, just two recursive calls and a few constant-time conditions and assignments:
- Operation per state:
O(1)
Therefore, the total time complexity is the number of states times the operations per state, which is:
- Total Time Complexity:
O(n * 26^2)
Space Complexity
The space complexity is mainly affected by the call stack during the recursive calls, and the space used for memoization. In the worst case, the recursion depth can go up to n
:
- Recursion stack:
O(n)
The memoization will store results for every possible state:
- Memoization storage:
O(n * 26^2)
Hence, the total space complexity can be seen as the maximum of the above two concerns. Since n * 26^2
is typically greater than just n
:
- Total Space Complexity:
O(n * 26^2)
Note: The exact constants for the number of characters would depend on the character set allowed in the input.
Learn more about how to find time and space complexity quickly using problem constraints.
Which of the following is a good use case for backtracking?
Recommended Readings
What is Dynamic Programming Prerequisite DFS problems dfs_intro Backtracking problems backtracking Memoization problems memoization_intro Pruning problems backtracking_pruning Dynamic programming is an algorithmic optimization technique that breaks down a complicated problem into smaller overlapping sub problems in a recursive manner and uses solutions to the sub problems to construct a solution
Patterns The Shortest Path Algorithm for Coding Interviews The goal of AlgoMonster is to help you get a job in the shortest amount of time possible in a data driven way We compiled datasets of tech interview problems and broke them down by patterns This way we can determine the
Recursion Recursion is one of the most important concepts in computer science Simply speaking recursion is the process of a function calling itself Using a real life analogy imagine a scenario where you invite your friends to lunch https algomonster s3 us east 2 amazonaws com recursion jpg You first
Got a question?ย Ask the Monster Assistantย anything you don't understand.
Still not clear? ย Submitย the part you don't understand to our editors. Or join ourย Discord and ask the community.