1977. Number of Ways to Separate Numbers

HardStringDynamic ProgrammingSuffix Array

Problem Description

In this problem, we have a string num which consists of concatenated positive integers. We're told that the original list of integers was written in a non-decreasing order and none of the integers had leading zeros. The challenge is to figure out how many different lists of positive integers could have been combined to form the initial string num.

Since the answer could be very large, we need to return it modulo 10^9 + 7.

To illustrate with an example, if the input string num is "327", there are two possible lists of non-decreasing integers: [3, 27] and [327]. Hence the function would return 2.

Quick Interview Experience

Help others by sharing your interview experience

Have you seen this problem before?

Intuition

Approaching this problem requires that we solve it piece by piece. We need to determine under what conditions we can split the string num into different integers such that they maintain a non-decreasing order. We can start from the first character and continue appending the following characters until we get a valid number, then process the remainder of the string in a similar manner.

Here's how we might think about the solution:

Dynamic Programming (DP): We can consider this problem as one that can be solved using dynamic programming. We try to build up the number of ways to form the string num from smaller subproblems based on the length of the last number used.
Longest Common Prefix (LCP): To maintain the non-decreasing order of numbers, we need a way to compare the numbers in num efficiently. The LCP (Longest Common Prefix) array can be used to store the length of the longest common prefix between two substrings of num. This helps us compare numbers quickly without generating all substrings, which would be inefficient.
Construction of DP Table: We construct a 2D DP table with dp[i][j] meaning the number of ways to form the first i digits of num where the last number used has j digits.
Transition: We consider two cases for the transition. Either the last two numbers used are the same length - in which case we can form a new list of integers by using the same length of the number only if the two numbers are in non-decreasing order. Or, the last number used is shorter than the previous - in which case you can always append it because it won’t violate the non-decrease constraint.
Base Case: We initiate dp[0][0] as 1 because there is one way to form an empty string with an empty list.

This solution iteratively builds upon smaller subproblems and combines their results to find the total number of combinations for larger lengths of the string num. At each step, it ensures that numbers are added in a non-decreasing order and without leading zeros.

Learn more about Dynamic Programming patterns.

Solution Approach

The solution utilizes a nested function cmp to compare substrings of num, and a Dynamic Programming (DP) table to keep track of the number of valid combinations that can be formed up to certain points in the string num.

Let's walk through the components of the implementation:

Longest Common Prefix (LCP) Calculation: Before the main DP logic, the function computes the LCP array using a nested loop. For each pair of indices i and j in the string num, lcp[i][j] is calculated to store the length of the longest common prefix between substrings starting from i and j. This pre-computation is essential for the cmp function.
Comparison function (cmp): The function cmp(i, j, k) is used to compare two numbers within the string num of length k starting from i and j respectively. This function will indirectly utilize the precomputed LCP values to efficiently determine if the first number starting at index i is greater than or equal to the second number starting at index j.
Dynamic Programming: The 2D DP table dp is created to keep track of valid combinations. dp[i][j] represents the number of ways to partition num[0:i] such that the last number used has j digits.
DP Initialization: The dp table is initialized such that dp[0][0] is 1, since there is exactly one way to create an empty partition.
DP Iteration: The code iterates through each end index i of the substring and considers all possible lengths j for the last number. Inside this nested loop, the algorithm decides whether the substring of length j ending at i can form a valid number (it cannot start with '0') and if it maintains the non-decreasing property compared to the previous number.
DP Transition: The crucial part of this solution is how the DP values transition from one state to another. For each i and j, the implementation considers the number of combinations if the last two numbers are of equal length (and the second is greater or equal) or if the second number may length less.
- If the last two numbers are of equal length and the comparison via cmp is true, it takes the value from dp[i-j][j] (which represents the count up to the first number's end index and the subsequent number also ending at i-j and also has length j).
- Otherwise, it uses the value from dp[i-j][min(j-1,i-j)] to represent scenarios wherein the most recent number is of smaller size and doesn't threaten the non-decreasing order.
DP State Updates: After ascertaining which previous DP state can be transitioned from, dp[i][j] is updated with the count of combinations by considering the previous number lengths into account. It sums the count from the previously calculated number of ways and the new count, keeping in mind that we must perform this addition modulo 10^9 + 7.
Result Extraction: Lastly, after populating the DP table, the result is the total combinations possible which is dp[n][n], meaning the number of combinations to form the entire string with all possible last number lengths considered.

The solution is a well-crafted example of combining DP with string comparisons optimized by LCP to ensure polynomial time complexity and manage the large answer space creatively with modular arithmetic.

Ready to land your dream job?

Unlock your dream job with a 2-minute evaluator for a personalized learning plan!

Start Evaluator

Example Walkthrough

Let's walk through a small example to illustrate the solution approach. Suppose our input string num is "1123". We want to find out in how many different ways we can split this string into a list of non-decreasing positive integers.

Step 1: Longest Common Prefix (LCP) Calculation: We calculate the LCP for every pair of starting positions in the string num. For simplicity, let's assume we have a function that does this efficiently.

Step 2: Comparison function (cmp): This function will compare two numbers within the string num based on their starting indices and length using the LCP information.

Step 3: Dynamic Programming (DP): We initialize a DP table dp where dp[i][j] will represent the number of ways to split the substring num[0:i] such that the last number has length j.

Step 4: DP Initialization: We set dp[0][0] to 1 because there's one way to form an empty string which is an empty list of integers.

Step 5: DP Iteration:

For i = 1, the substring "1" can form one list, [1].
For i = 2, the substring "11" can form two lists, [1, 1] or [11].
For i = 3, the substring "112" can form three lists, [1, 1, 2], [1, 12], or [11, 2].
For i = 4, the substring "1123" can form the following lists: [1, 1, 2, 3], [1, 1, 23], [1, 12, 3], [1, 123], [11, 2, 3], [11, 23]. That's six valid lists.

Step 6: DP Transition:

We check if we can append a number to the existing one to make sure they are in non-decreasing order.
For instance, at i = 4, j = 2, we compare "23" with "12" (previous number of length 2). Since "23" >= "12", we can consider the count from dp[2][2].

Step 7: DP State Updates: We update the dp table by adding the number of ways to the current count, ensuring we consider each possible length of the last number.

Step 8: Result Extraction: We find the total count of combinations to form the string "1123", which will be dp[4][4] + dp[4][3] + dp[4][2] + ... + dp[4][1] after iterating through all possible lengths for the last number.

For this example, by walking through the DP table, we find that there are six valid lists that can be formed from "1123". This approach ensures we explore all possibilities iteratively while keeping the problem manageable using the principles of dynamic programming.

Solution Implementation

1class Solution:
2    def numberOfCombinations(self, num: str) -> int:
3        # Helper function to compare the lexicographical order of two numbers within 'num'
4        # representing the start indices and their length 'length'.
5        def is_greater_or_equal(start1, start2, length):
6            common_prefix_length = longest_common_prefix[start1][start2]
7            return common_prefix_length >= length or num[start1 + common_prefix_length] >= num[start2 + common_prefix_length]
8
9        MOD = 10**9 + 7
10        num_length = len(num)
11
12        # Pre-compute the longest common prefix (LCP) array.
13        longest_common_prefix = [[0] * (num_length + 1) for _ in range(num_length + 1)]
14        for i in range(num_length - 1, -1, -1):
15            for j in range(num_length - 1, -1, -1):
16                if num[i] == num[j]:
17                    longest_common_prefix[i][j] = 1 + longest_common_prefix[i + 1][j + 1]
18
19        # Dynamic programming table where dp[i][j] represents the number of combinations
20        # of valid integers with length 'j' that end at index 'i - 1'.
21        dp = [[0] * (num_length + 1) for _ in range(num_length + 1)]
22        dp[0][0] = 1
23
24        # Build up the DP table.
25        for i in range(1, num_length + 1):
26            for j in range(1, i + 1):
27                current_value = 0
28                if num[i - j] != '0':  # Skip numbers with leading zero.
29                    if i - j - j >= 0 and is_greater_or_equal(i - j, i - j - j, j):
30                        # The substring is greater or equal to the previous,
31                        # so we can safely append to the previous.
32                        current_value = dp[i - j][j]
33                    else:
34                        # Take the combination counts from the smaller number if present.
35                        current_value = dp[i - j][min(j - 1, i - j)]
36
37                # Update the current dp value including the current value with MOD.
38                # Accumulate with the previous j-1 combinations.
39                dp[i][j] = (dp[i][j - 1] + current_value) % MOD
40
41        # Total number of combinations is the last value in the dp table.
42        return dp[num_length][num_length]
43

1class Solution {
2    private static final int MOD = (int) 1e9 + 7;
3
4    public int numberOfCombinations(String num) {
5        int length = num.length();
6        // Initialize the longest common prefix (LCP) array.
7        int[][] longestCommonPrefix = new int[length + 1][length + 1];
8
9        // Calculate the LCP for each substring pair.
10        for (int i = length - 1; i >= 0; --i) {
11            for (int j = length - 1; j >= 0; --j) {
12                if (num.charAt(i) == num.charAt(j)) {
13                    longestCommonPrefix[i][j] = 1 + longestCommonPrefix[i + 1][j + 1];
14                }
15            }
16        }
17      
18        // dp[i][j] will hold the number of ways to partition the string ending at i using a number ending at j.
19        int[][] dp = new int[length + 1][length + 1];
20        dp[0][0] = 1; // Base case: one way to partition an empty string.
21
22        // Build the dp array from the bottom up.
23        for (int i = 1; i <= length; ++i) {
24            for (int j = 1; j <= i; ++j) {
25                int currentVal = 0;
26              
27                // We should not start with a leading zero.
28                if (num.charAt(i - j) != '0') {
29                    // There should be enough characters for the previous number.
30                    if (i >= 2 * j) {
31                        int commonPrefixLength = longestCommonPrefix[i - j][i - 2 * j];
32                      
33                        // Check if the current number is greater than or equal to the previous number.
34                        if (commonPrefixLength >= j || num.charAt(i - j + commonPrefixLength) >= num.charAt(i - 2 * j + commonPrefixLength)) {
35                            currentVal = dp[i - j][j];
36                        }
37                    }
38                    // If the number is not valid, use the number of ways of the smaller previous number.
39                    if (currentVal == 0) {
40                        currentVal = dp[i - j][Math.min(j - 1, i - j)];
41                    }
42                }
43
44                // Sum the ways, and do mod to avoid overflow.
45                dp[i][j] = (dp[i][j - 1] + currentVal) % MOD;
46            }
47        }
48        // The answer is the number of ways to form the number sequence using the entire string.
49        return dp[length][length];
50    }
51}
52

1class Solution {
2public:
3    const int MOD = 1e9 + 7; // Define modulo value for the results to prevent overflow
4
5    // This function counts the number of non-empty increasing sequences of integers
6    // that can be formed by the digits of the given string 'num'
7    int numberOfCombinations(string num) {
8        int n = num.size(); // Size of the input string
9        // 2D vector to store the longest common prefix (LCP) lengths between suffixes of 'num'
10        vector<vector<int>> longestCommonPrefix(n + 1, vector<int>(n + 1));
11      
12        // Preprocess to fill in the longestCommonPrefix array
13        for (int i = n - 1; i >= 0; --i) {
14            for (int j = n - 1; j >= 0; --j) {
15                if (num[i] == num[j]) {
16                    longestCommonPrefix[i][j] = 1 + longestCommonPrefix[i + 1][j + 1];
17                }
18            }
19        }
20
21        // Comparator function to compare two substrings in 'num' when one is a prefix of the other
22        auto canPlace = [&](int start, int prevStart, int len) {
23            int commonPrefixLen = longestCommonPrefix[start][prevStart];
24            return commonPrefixLen >= len || num[start + commonPrefixLen] >= num[prevStart + commonPrefixLen];
25        };
26
27        // 2D vector for dynamic programming (dp), where dp[i][j] represents the number of ways
28        // to form an increasing sequence upto index i, using a last number of length j
29        vector<vector<int>> dp(n + 1, vector<int>(n + 1));
30        dp[0][0] = 1; // Base case: 1 way to form a sequence of length 0
31
32        // Populate the dp array
33        for (int i = 1; i <= n; ++i) {
34            for (int j = 1; j <= i; ++j) {
35                int count = 0;
36                // Check if the current number doesn't start with '0' and if it can be placed in the sequence
37                if (num[i - j] != '0') {
38                    if (i - j - j >= 0 && canPlace(i - j, i - j - j, j)) {
39                        count = dp[i - j][j];
40                    } else {
41                        count = dp[i - j][min(j - 1, i - j)];
42                    }
43                }
44                // Update the dp table taking the modulo to prevent overflow
45                dp[i][j] = (dp[i][j - 1] + count) % MOD;
46            }
47        }
48        // Final answer: sum of all sequences ending at the last digit 'n'
49        return dp[n][n];
50    }
51};
52

1const MOD: number = 1e9 + 7; // Define modulo value for the results to prevent overflow
2
3function numberOfCombinations(num: string): number {
4    const n: number = num.length; // Size of the input string
5    // 2D array to store the longest common prefix (LCP) lengths between suffixes of 'num'
6    let longestCommonPrefix: number[][] = Array.from({length: n + 1}, () => Array(n + 1).fill(0));
7  
8    // Preprocess to fill in the longestCommonPrefix array
9    for (let i = n - 1; i >= 0; --i) {
10        for (let j = n - 1; j >= 0; --j) {
11            if (num[i] === num[j]) {
12                longestCommonPrefix[i][j] = 1 + longestCommonPrefix[i + 1][j + 1];
13            }
14        }
15    }
16
17    // Comparator function to compare two substrings in 'num' when one is a prefix of the other
18    const canPlace: (start: number, prevStart: number, len: number) => boolean = (start, prevStart, len) => {
19        const commonPrefixLen: number = longestCommonPrefix[start][prevStart];
20        return commonPrefixLen >= len || num[start + commonPrefixLen] >= num[prevStart + commonPrefixLen];
21    };
22
23    // 2D array for dynamic programming (dp), where dp[i][j] represents the number of ways
24    // to form an increasing sequence up to index i, using a last number of length j
25    let dp: number[][] = Array.from({length: n + 1}, () => Array(n + 1).fill(0));
26    dp[0][0] = 1; // Base case: 1 way to form a sequence of length 0
27
28    // Populate the dp array
29    for (let i = 1; i <= n; ++i) {
30        for (let j = 1; j <= i; ++j) {
31            let count: number = 0;
32            // Check if the current number doesn't start with '0' and if it can be placed in the sequence
33            if (num[i - j] !== '0') {
34                if (i - j - j >= 0 && canPlace(i - j, i - j - j, j)) {
35                    count = dp[i - j][j];
36                } else {
37                    count = dp[i - j][Math.min(j - 1, i - j)];
38                }
39            }
40            // Update the dp table taking the modulo to prevent overflow
41            dp[i][j] = (dp[i][j - 1] + count) % MOD;
42        }
43    }
44    // Final answer: sum of all sequences ending at the last digit 'n'
45    return dp[n][n];
46}
47

Time and Space Complexity

The time complexity of the provided algorithm is primarily determined by two nested loops that iterate over the length of the input string num and the computation of the longest common prefix (LCP) for all substrings. Calculating the LCP takes O(n^2) time since it requires two nested loops, each running for n iterations, where n is the length of the string num. Afterward, the main part of the algorithm uses dynamic programming where there are two nested loops that also run for n iterations each, thus also taking O(n^2) time. In the innermost part of these loops, the cmp function is called, which takes constant time due to the precomputed LCP values. Therefore, the dominant part of the time complexity is O(n^2).

The space complexity of the algorithm comes from the storage of the dynamic programming table dp, and the longest common prefix table lcp, each of size n x n. Therefore, the space complexity is O(n^2) to store these two tables.

Learn more about how to find time and space complexity quickly using problem constraints.

Discover Your Strengths and Weaknesses: Take Our 2-Minute Quiz to Tailor Your Study Plan:

Consider the classic dynamic programming of longest increasing subsequence:

Find the length of the longest subsequence of a given sequence such that all elements of the subsequence are sorted in increasing order.

For example, the length of LIS for [50, 3, 10, 7, 40, 80] is 4 and LIS is [3, 7, 40, 80].

What is the recurrence relation?

dp[i] = dp[i] + 1

dp[i] = dp[i] + dp[i - 1]

dp[i] = (dp[i] + dp[j] + 1) for j in 0 to i

dp[i] = max(dp[i], dp[j] + 1) for j in 0 to i

I don't know