2464. Minimum Subarrays in a Valid Split


Problem Description

In this problem, we must take an integer array nums and divide it into subarrays according to specific rules. A subarray here is defined as a contiguous part of the array. A splitting is considered valid if every subarray meets these conditions:

  • The greatest common divisor (GCD) of the first and last elements of the subarray is greater than 1.
  • Every element in nums must be included in exactly one subarray.

Our objective is to find the minimum number of such subarrays into which the original array can be split while maintaining the validity criteria. If we can't split the array to meet these conditions, the function should return -1.

To give an example, suppose nums = [2, 3, 5, 2, 4]. In this array, we could split it into two subarrays. One subarray could be [2, 3], as the GCD of 2 and 3 is 1; hence we cannot stop here. We consider the next element, which is 5, and since the GCD of 2 and 5 is also 1, we continue to the next element and check 2 with 2. Now, the GCD of 2 and 2 is 2, which is greater than 1, so we can make a valid subarray [2, 3, 5, 2]. The remaining single element, 4, by itself forms a valid subarray because the first and last elements are the same, and obviously, the GCD of a number with itself is greater than 1. Thus, we have split nums into 2 subarrays and met all the criteria, so the answer is 2.

Intuition

The solution approach requires dynamic programming to avoid recalculating subproblems multiple times. The idea is to use a function dfs which, through recursion, iteratively tries to split the array from a starting index, say i, incrementing one element at a time until we find a valid split (where the GCD of the first and last is greater than 1) and then moves to find the next split from the next index, say j+1.

We utilize the concept of memoization with the @cache decorator to store the results of already computed states. This enhancement prevents unnecessary recomputations and allows us to find the minimum number of subarrays efficiently.

Each recursive call of the dfs function will return the minimum number of subarrays starting from the index i. We loop from i to the end of the array (n), each time checking if we can get a valid subarray with nums[i] as the first element and nums[j] as the last element. If we find a valid split (GCD > 1), we compute the number of splits from j+1 onwards and add 1 to it (as 1 split is done up to j) and keep track of the least number of splits needed.

At the end of the recursion, we clear the cache for efficiency and return the answer if it's finite; otherwise, we return -1. This approach ensures that we explore all possible splits and always move towards the solution with the minimum splits required.

Learn more about Math and Dynamic Programming patterns.

Solution Approach

The provided solution takes advantage of recursive depth-first search (DFS) with memoization, which is a common technique in dynamic programming. The goal is to explore all possible ways to split the array and use memoization to store the results of subproblems to avoid recalculating them. Let's walk through the critical components of the algorithm:

  • Recursion: The core of our solution is a recursive function named dfs. This function accepts an index i and returns the minimum number of valid subarrays the array can be split into starting from that index. By calling dfs(0), we start the process from the beginning of the array.

  • Memoization: We use Python's @cache decorator from the functools module on our dfs function, which automatically handles memoization for us. The dfs.cache_clear() is called at the end before returning the final answer to clear the cache, ensuring that no memory is wasted after the computation.

  • Base Condition: The base case for our recursive function is when the index i is greater than or equal to n, the length of the array. In this case, the function returns 0, indicating that no further splits are needed.

  • Iteration within Recursion: Inside the dfs function, we loop through the array starting from the current index i to the end index n. In each iteration, we are checking whether we can create a valid subarray starting at index i and ending at index j. We use the built-in [math](/problems/math-basics).gcd function to find the greatest common divisor between the first and last elements of the current subarray being considered.

  • Validation: If the GCD of nums[i] and nums[j] is greater than 1, it indicates that a valid subarray is found. We then recursively call dfs(j + 1) to determine how many subarrays we can get starting from the next index after the current valid subarray. To ensure we find the minimum number of needed splits, we keep track of the smallest result returned from these recursive calls.

  • Identifying the Minimum Splits: We initiate an ans variable for each recursive call with a value of infinity (inf). Whenever a valid subarray is found, we update ans to be the minimum between the existing ans and the result of 1 + dfs(j + 1) - the 1 added represents the current valid subarray we have just found.

  • Returning the Result: After the recursion terminates, if ans is still infinity, it means no valid subarray was created, and we return -1. If ans is finite, it means a valid split was found, and we return the ans as the minimum number of splits needed.

In summary, the solution recursively explores all possible subarray splits and intelligently caches results to minimize runtime. This dynamic programming approach, combining recursion and memoization, enables us to solve the problem optimally.

Ready to land your dream job?

Unlock your dream job with a 2-minute evaluator for a personalized learning plan!

Start Evaluator

Example Walkthrough

Let's illustrate the solution approach with a small example. Suppose we have an array nums = [2, 6, 3, 4].

  1. We start by calling dfs(0) because we need to examine the array starting from the first element.
  2. The recursive call dfs(i) loop is initiated, and i = 0. We start looking for valid subarrays beginning with the first element nums[0] = 2.
  3. The iteration within the recursion checks pairs of elements starting at i = 0 and ending at j = 0, 1, 2, ... until the end looking for valid splits.
  4. When j = 0, our subarray is [2]. The GCD of 2 and 2 is 2, which is greater than 1, so we found a valid subarray. We then call dfs(j + 1), which is dfs(1).
  5. The dfs(1) will perform its loop. For j = 1, we get the subarray [6], and the GCD of 6 and 6 is 6, so another valid subarray is found. We call dfs(2).
  6. The dfs(2) will similarly check for valid subarrays starting from nums[2] = 3. However, we soon realize when we reach j = 3 that [3, 4] is a valid subarray because the GCD of 3 and 4 is 1 (not valid), so we must include both in the subarray where the GCD of 3 and 4 is still 1. Finally, we must continue with the single element subarray [4], with a GCD of 4 with itself which is valid.
  7. As the base condition is met (no further elements left to process), the dfs function would return 0.
  8. Adding up the splits, we see that dfs(2) returns 1 (it found one valid subarray), dfs(1) returns 2 (it found a valid subarray plus the one found by dfs(2)), and dfs(0) returns 3 since it identifies the valid subarray [2] and then relies on dfs(1).

At the end of the recursion, we find that the minimum number of subarrays is 3. This is the optimal solution as no fewer subarrays can satisfy the conditions.

Throughout the process, memoization saves the result of each recursive call, preventing the re-computation of dfs(j) where j is any index greater than 0, thus optimizing the solution significantly. If we were to reach an index i that had been previously explored, the saved result would be used instead of re-calculating it.

After exploring all possibilities, since ans is not infinity, we do not return -1. Instead, we return the minimum number of splits found, which is 3 in this case. The cache is cleared after returning the final answer to free up memory.

Solution Implementation

1from functools import lru_cache
2from math import gcd
3from typing import List
4
5class Solution:
6    def valid_subarray_split(self, nums: List[int]) -> int:
7        # Define a Depth-First Search (DFS) function with memoization to find the minimum number of valid splits
8        @lru_cache(maxsize=None)
9        def dfs(start_index):
10            # Base case: if we have gone past the end of the array, no more splits are needed
11            if start_index >= length:
12                return 0
13          
14            # Start with a large number assuming no valid split is possible initially
15            min_splits = float('inf')
16          
17            # Try splitting the array at different positions, updating the minimum splits if a valid split is found
18            for end_index in range(start_index, length):
19                # Check if the gcd of the numbers at the current segment (start_index to end_index) is greater than 1
20                if gcd(nums[start_index], nums[end_index]) > 1:
21                    # If so, include this segment and add 1 for the split to the result of the recursive call for the next segment
22                    min_splits = min(min_splits, 1 + dfs(end_index + 1))
23          
24            # Return the minimum number of splits found
25            return min_splits
26
27        # Get the length of the input list
28        length = len(nums)
29      
30        # Call the DFS function starting with index 0
31        min_splits_result = dfs(0)
32      
33        # Clear the cache of the DFS function
34        dfs.cache_clear()
35      
36        # Return the final result: if min_splits_result is still inf, return -1 to indicate no valid splits; otherwise, return min_splits_result
37        return min_splits_result if min_splits_result < float('inf') else -1
38
1class Solution {
2    private int arrayLength;       // Represents the length of the input array
3    private int[] memo;            // Memoization array to store results of sub-problems
4    private int[] numbers;         // The input array of numbers
5    private final int INFINITY = Integer.MAX_VALUE; // A value to represent infinity
6
7    // Method to find the minimum number of valid subarrays with GCD greater than 1
8    public int validSubarraySplit(int[] nums) {
9        arrayLength = nums.length;  // Initialize the length of the array
10        memo = new int[arrayLength]; // Initialize the memo array
11        numbers = nums;             // Assign the input array to the instance variable numbers
12        int answer = depthFirstSearch(0); // Begin the DFS from the first index
13        return answer < INFINITY ? answer : -1; // If no valid split found, return -1, else return answer
14    }
15
16    // Performs a depth-first search to find the minimum number of valid subarrays
17    private int depthFirstSearch(int index) {
18        // If the index is beyond the last element, return 0 as no further split needed
19        if (index >= arrayLength) {
20            return 0;
21        }
22        // If we already computed the result for this index, return the stored value
23        if (memo[index] > 0) {
24            return memo[index];
25        }
26        int answer = INFINITY; // Start with an infinite answer
27        // Iterate over the array elements starting from current index
28        for (int j = index; j < arrayLength; ++j) {
29            // If the GCD of the starting element and the current element is greater than 1
30            if (greatestCommonDivisor(numbers[index], numbers[j]) > 1) {
31                // We have a valid subarray. Recursively solve for the remaining subarray,
32                // counting 1 for the current valid subarray and attempting to minimize the answer
33                answer = Math.min(answer, 1 + depthFirstSearch(j + 1));
34            }
35        }
36        // Store the result for the current index in the memoization array
37        memo[index] = answer;
38        return answer;
39    }
40
41    // Helper method to compute the Greatest Common Divisor of two numbers.
42    private int greatestCommonDivisor(int a, int b) {
43        return b == 0 ? a : greatestCommonDivisor(b, a % b);
44    }
45}
46
1#include <vector>
2#include <functional>
3#include <algorithm>
4#include <numeric> // For std::gcd
5
6class Solution {
7public:
8    // Define a constant for infinity to compare against during computation
9    static constexpr int INF = 0x3f3f3f3f;
10
11    // Function to calculate the number of valid subarray splits
12    int validSubarraySplit(std::vector<int>& nums) {
13        int n = nums.size(); // Get the size of the input array
14        std::vector<int> memo(n, 0); // Create a memoization array initialized to 0
15      
16        // Define the recursive Depth-First Search (DFS) function using std::function
17        std::function<int(int)> dfs = [&](int i) -> int {
18            if (i >= n) return 0; // Base case: beyond array boundaries
19            if (memo[i] > 0) return memo[i]; // Return memoized result, if available
20          
21            int result = INF; // Initialize result to infinity
22          
23            // Check each subarray starting from index i
24            for (int j = i; j < n; ++j) {
25                // Only consider the subarray if gcd of start and current elements is greater than 1
26                if (std::gcd(nums[i], nums[j]) > 1) {
27                    // Choose the smallest count of splits for subarrays
28                    result = std::min(result, 1 + dfs(j + 1));
29                }
30            }
31          
32            // Memoize the answer for the current index i
33            memo[i] = result;
34            return result; // Return the result for the current subarray
35        };
36
37        // Kick-start the DFS from the beginning of the array
38        int answer = dfs(0);
39
40        // Return the total number of splits if answer is less than infinity, otherwise -1
41        return answer < INF ? answer : -1;
42    }
43};
44
1const INF: number = Infinity; // Define a constant for infinity
2
3// Array to store the memoized results
4let memo: number[];
5
6// Function to calculate the number of valid subarray splits
7function validSubarraySplit(nums: number[]): number {
8  let n: number = nums.length; // Get the size of the input array
9  memo = new Array(n).fill(0); // Initialize the memoization array with 0
10
11  // Recursive Depth-First Search (DFS) function
12  const dfs = (i: number): number => {
13    if (i >= n) return 0; // Base case: beyond array boundaries
14    if (memo[i] > 0) return memo[i]; // Return memoized result if available
15  
16    let result: number = INF; // Initialize result to infinity
17  
18    // Check each subarray starting from index i
19    for (let j: number = i; j < n; ++j) {
20      // Only consider the subarray if gcd of start and current elements is greater than 1
21      if (gcd(nums[i], nums[j]) > 1) {
22        // Choose the smallest count of splits for subarrays
23        result = Math.min(result, 1 + dfs(j + 1));
24      }
25    }
26  
27    // Memoize the answer for the current index i
28    memo[i] = result;
29    return result; // Return the result for the current subarray
30  };
31
32  // Kick-start the DFS from the beginning of the array
33  let answer: number = dfs(0);
34
35  // Return the total number of splits if answer is less than infinity, otherwise return -1
36  return answer < INF ? answer : -1;
37}
38
39// Function to compute the greatest common divisor (gcd) of two numbers
40function gcd(a: number, b: number): number {
41  while (b !== 0) {
42    let t: number = b;
43    b = a % b;
44    a = t;
45  }
46  return a;
47}
48

Time and Space Complexity

The time complexity of the given code is O(n^3), where n is the length of the nums list. This is because the dfs function is called recursively with the starting index i, and for each call to dfs, there's a loop running from i to n. Within this loop, we are making a recursive call to dfs(j + 1), and we also calculate the greatest common divisor (gcd) for each pair of elements between i and j. The gcd operation itself can be considered as O(log(min(nums[i], nums[j]))) on average, but in the worst case, the gcd can be considered O(n) for very large numbers or certain sequences. Simplifying our analysis by considering gcd as O(1) for an average case, the total operations are in the order of the sum of sequence from 1 to n, which is n(n + 1)/2, thus O(n^2). This O(n^2) combined with the O(n) for the iteration from i to n gives a complexity of O(n^3).

The space complexity is primarily determined by the size of the cache used in the memoization and the depth of recursion. Memoization could potentially store a result for every starting index i, hence it can take O(n) space. The maximum depth of the recursion determines the stack space, which is also O(n) because the function could be recursively called starting from each index in nums. Therefore, the space complexity can be considered as O(n) for the caching and O(n) for the recursive call stack, which simplifies to O(n).

Learn more about how to find time and space complexity quickly using problem constraints.


Discover Your Strengths and Weaknesses: Take Our 2-Minute Quiz to Tailor Your Study Plan:
Question 1 out of 10

Which of these pictures shows the visit order of a depth-first search?


Recommended Readings

Want a Structured Path to Master System Design Too? Don’t Miss This!