2464. Minimum Subarrays in a Valid Split
Problem Description
In this problem, we must take an integer array nums
and divide it into subarrays according to specific rules. A subarray here is defined as a contiguous part of the array. A splitting is considered valid if every subarray meets these conditions:
- The greatest common divisor (GCD) of the first and last elements of the subarray is greater than
1
. - Every element in
nums
must be included in exactly one subarray.
Our objective is to find the minimum number of such subarrays into which the original array can be split while maintaining the validity criteria. If we can't split the array to meet these conditions, the function should return -1
.
To give an example, suppose nums = [2, 3, 5, 2, 4]
. In this array, we could split it into two subarrays. One subarray could be [2, 3]
, as the GCD of 2 and 3 is 1; hence we cannot stop here. We consider the next element, which is 5
, and since the GCD of 2 and 5 is also 1, we continue to the next element and check 2
with 2
. Now, the GCD of 2 and 2 is 2, which is greater than 1, so we can make a valid subarray [2, 3, 5, 2]
. The remaining single element, 4
, by itself forms a valid subarray because the first and last elements are the same, and obviously, the GCD of a number with itself is greater than 1. Thus, we have split nums into 2 subarrays and met all the criteria, so the answer is 2
.
Intuition
The solution approach requires dynamic programming to avoid recalculating subproblems multiple times. The idea is to use a function dfs
which, through recursion, iteratively tries to split the array from a starting index, say i
, incrementing one element at a time until we find a valid split (where the GCD of the first and last is greater than 1) and then moves to find the next split from the next index, say j+1
.
We utilize the concept of memoization with the @cache
decorator to store the results of already computed states. This enhancement prevents unnecessary recomputations and allows us to find the minimum number of subarrays efficiently.
Each recursive call of the dfs
function will return the minimum number of subarrays starting from the index i
. We loop from i
to the end of the array (n
), each time checking if we can get a valid subarray with nums[i]
as the first element and nums[j]
as the last element. If we find a valid split (GCD > 1), we compute the number of splits from j+1
onwards and add 1 to it (as 1 split is done up to j
) and keep track of the least number of splits needed.
At the end of the recursion, we clear the cache for efficiency and return the answer if it's finite; otherwise, we return -1
. This approach ensures that we explore all possible splits and always move towards the solution with the minimum splits required.
Learn more about Math and Dynamic Programming patterns.
Solution Approach
The provided solution takes advantage of recursive depth-first search (DFS) with memoization, which is a common technique in dynamic programming. The goal is to explore all possible ways to split the array and use memoization to store the results of subproblems to avoid recalculating them. Let's walk through the critical components of the algorithm:
-
Recursion: The core of our solution is a recursive function named
dfs
. This function accepts an indexi
and returns the minimum number of valid subarrays the array can be split into starting from that index. By callingdfs(0)
, we start the process from the beginning of the array. -
Memoization: We use Python's
@cache
decorator from thefunctools
module on ourdfs
function, which automatically handles memoization for us. Thedfs.cache_clear()
is called at the end before returning the final answer to clear the cache, ensuring that no memory is wasted after the computation. -
Base Condition: The base case for our recursive function is when the index
i
is greater than or equal ton
, the length of the array. In this case, the function returns0
, indicating that no further splits are needed. -
Iteration within Recursion: Inside the
dfs
function, we loop through the array starting from the current indexi
to the end indexn
. In each iteration, we are checking whether we can create a valid subarray starting at indexi
and ending at indexj
. We use the built-in[math](/problems/math-basics).gcd
function to find the greatest common divisor between the first and last elements of the current subarray being considered. -
Validation: If the GCD of
nums[i]
andnums[j]
is greater than1
, it indicates that a valid subarray is found. We then recursively calldfs(j + 1)
to determine how many subarrays we can get starting from the next index after the current valid subarray. To ensure we find the minimum number of needed splits, we keep track of the smallest result returned from these recursive calls. -
Identifying the Minimum Splits: We initiate an
ans
variable for each recursive call with a value of infinity (inf
). Whenever a valid subarray is found, we updateans
to be the minimum between the existingans
and the result of1 + dfs(j + 1)
- the1
added represents the current valid subarray we have just found. -
Returning the Result: After the recursion terminates, if
ans
is still infinity, it means no valid subarray was created, and we return-1
. Ifans
is finite, it means a valid split was found, and we return theans
as the minimum number of splits needed.
In summary, the solution recursively explores all possible subarray splits and intelligently caches results to minimize runtime. This dynamic programming approach, combining recursion and memoization, enables us to solve the problem optimally.
Ready to land your dream job?
Unlock your dream job with a 2-minute evaluator for a personalized learning plan!
Start EvaluatorExample Walkthrough
Let's illustrate the solution approach with a small example. Suppose we have an array nums = [2, 6, 3, 4]
.
- We start by calling
dfs(0)
because we need to examine the array starting from the first element. - The recursive call
dfs(i)
loop is initiated, andi = 0
. We start looking for valid subarrays beginning with the first elementnums[0] = 2
. - The iteration within the recursion checks pairs of elements starting at
i = 0
and ending atj = 0, 1, 2, ...
until the end looking for valid splits. - When
j = 0
, our subarray is[2]
. The GCD of 2 and 2 is 2, which is greater than 1, so we found a valid subarray. We then calldfs(j + 1)
, which isdfs(1)
. - The
dfs(1)
will perform its loop. Forj = 1
, we get the subarray[6]
, and the GCD of 6 and 6 is 6, so another valid subarray is found. We calldfs(2)
. - The
dfs(2)
will similarly check for valid subarrays starting fromnums[2] = 3
. However, we soon realize when we reachj = 3
that[3, 4]
is a valid subarray because the GCD of 3 and 4 is 1 (not valid), so we must include both in the subarray where the GCD of 3 and 4 is still 1. Finally, we must continue with the single element subarray[4]
, with a GCD of 4 with itself which is valid. - As the base condition is met (no further elements left to process), the
dfs
function would return0
. - Adding up the splits, we see that
dfs(2)
returns1
(it found one valid subarray),dfs(1)
returns2
(it found a valid subarray plus the one found bydfs(2)
), anddfs(0)
returns3
since it identifies the valid subarray[2]
and then relies ondfs(1)
.
At the end of the recursion, we find that the minimum number of subarrays is 3
. This is the optimal solution as no fewer subarrays can satisfy the conditions.
Throughout the process, memoization saves the result of each recursive call, preventing the re-computation of dfs(j)
where j
is any index greater than 0
, thus optimizing the solution significantly. If we were to reach an index i
that had been previously explored, the saved result would be used instead of re-calculating it.
After exploring all possibilities, since ans
is not infinity, we do not return -1
. Instead, we return the minimum number of splits found, which is 3
in this case. The cache is cleared after returning the final answer to free up memory.
Solution Implementation
1from functools import lru_cache
2from math import gcd
3from typing import List
4
5class Solution:
6 def valid_subarray_split(self, nums: List[int]) -> int:
7 # Define a Depth-First Search (DFS) function with memoization to find the minimum number of valid splits
8 @lru_cache(maxsize=None)
9 def dfs(start_index):
10 # Base case: if we have gone past the end of the array, no more splits are needed
11 if start_index >= length:
12 return 0
13
14 # Start with a large number assuming no valid split is possible initially
15 min_splits = float('inf')
16
17 # Try splitting the array at different positions, updating the minimum splits if a valid split is found
18 for end_index in range(start_index, length):
19 # Check if the gcd of the numbers at the current segment (start_index to end_index) is greater than 1
20 if gcd(nums[start_index], nums[end_index]) > 1:
21 # If so, include this segment and add 1 for the split to the result of the recursive call for the next segment
22 min_splits = min(min_splits, 1 + dfs(end_index + 1))
23
24 # Return the minimum number of splits found
25 return min_splits
26
27 # Get the length of the input list
28 length = len(nums)
29
30 # Call the DFS function starting with index 0
31 min_splits_result = dfs(0)
32
33 # Clear the cache of the DFS function
34 dfs.cache_clear()
35
36 # Return the final result: if min_splits_result is still inf, return -1 to indicate no valid splits; otherwise, return min_splits_result
37 return min_splits_result if min_splits_result < float('inf') else -1
38
1class Solution {
2 private int arrayLength; // Represents the length of the input array
3 private int[] memo; // Memoization array to store results of sub-problems
4 private int[] numbers; // The input array of numbers
5 private final int INFINITY = Integer.MAX_VALUE; // A value to represent infinity
6
7 // Method to find the minimum number of valid subarrays with GCD greater than 1
8 public int validSubarraySplit(int[] nums) {
9 arrayLength = nums.length; // Initialize the length of the array
10 memo = new int[arrayLength]; // Initialize the memo array
11 numbers = nums; // Assign the input array to the instance variable numbers
12 int answer = depthFirstSearch(0); // Begin the DFS from the first index
13 return answer < INFINITY ? answer : -1; // If no valid split found, return -1, else return answer
14 }
15
16 // Performs a depth-first search to find the minimum number of valid subarrays
17 private int depthFirstSearch(int index) {
18 // If the index is beyond the last element, return 0 as no further split needed
19 if (index >= arrayLength) {
20 return 0;
21 }
22 // If we already computed the result for this index, return the stored value
23 if (memo[index] > 0) {
24 return memo[index];
25 }
26 int answer = INFINITY; // Start with an infinite answer
27 // Iterate over the array elements starting from current index
28 for (int j = index; j < arrayLength; ++j) {
29 // If the GCD of the starting element and the current element is greater than 1
30 if (greatestCommonDivisor(numbers[index], numbers[j]) > 1) {
31 // We have a valid subarray. Recursively solve for the remaining subarray,
32 // counting 1 for the current valid subarray and attempting to minimize the answer
33 answer = Math.min(answer, 1 + depthFirstSearch(j + 1));
34 }
35 }
36 // Store the result for the current index in the memoization array
37 memo[index] = answer;
38 return answer;
39 }
40
41 // Helper method to compute the Greatest Common Divisor of two numbers.
42 private int greatestCommonDivisor(int a, int b) {
43 return b == 0 ? a : greatestCommonDivisor(b, a % b);
44 }
45}
46
1#include <vector>
2#include <functional>
3#include <algorithm>
4#include <numeric> // For std::gcd
5
6class Solution {
7public:
8 // Define a constant for infinity to compare against during computation
9 static constexpr int INF = 0x3f3f3f3f;
10
11 // Function to calculate the number of valid subarray splits
12 int validSubarraySplit(std::vector<int>& nums) {
13 int n = nums.size(); // Get the size of the input array
14 std::vector<int> memo(n, 0); // Create a memoization array initialized to 0
15
16 // Define the recursive Depth-First Search (DFS) function using std::function
17 std::function<int(int)> dfs = [&](int i) -> int {
18 if (i >= n) return 0; // Base case: beyond array boundaries
19 if (memo[i] > 0) return memo[i]; // Return memoized result, if available
20
21 int result = INF; // Initialize result to infinity
22
23 // Check each subarray starting from index i
24 for (int j = i; j < n; ++j) {
25 // Only consider the subarray if gcd of start and current elements is greater than 1
26 if (std::gcd(nums[i], nums[j]) > 1) {
27 // Choose the smallest count of splits for subarrays
28 result = std::min(result, 1 + dfs(j + 1));
29 }
30 }
31
32 // Memoize the answer for the current index i
33 memo[i] = result;
34 return result; // Return the result for the current subarray
35 };
36
37 // Kick-start the DFS from the beginning of the array
38 int answer = dfs(0);
39
40 // Return the total number of splits if answer is less than infinity, otherwise -1
41 return answer < INF ? answer : -1;
42 }
43};
44
1const INF: number = Infinity; // Define a constant for infinity
2
3// Array to store the memoized results
4let memo: number[];
5
6// Function to calculate the number of valid subarray splits
7function validSubarraySplit(nums: number[]): number {
8 let n: number = nums.length; // Get the size of the input array
9 memo = new Array(n).fill(0); // Initialize the memoization array with 0
10
11 // Recursive Depth-First Search (DFS) function
12 const dfs = (i: number): number => {
13 if (i >= n) return 0; // Base case: beyond array boundaries
14 if (memo[i] > 0) return memo[i]; // Return memoized result if available
15
16 let result: number = INF; // Initialize result to infinity
17
18 // Check each subarray starting from index i
19 for (let j: number = i; j < n; ++j) {
20 // Only consider the subarray if gcd of start and current elements is greater than 1
21 if (gcd(nums[i], nums[j]) > 1) {
22 // Choose the smallest count of splits for subarrays
23 result = Math.min(result, 1 + dfs(j + 1));
24 }
25 }
26
27 // Memoize the answer for the current index i
28 memo[i] = result;
29 return result; // Return the result for the current subarray
30 };
31
32 // Kick-start the DFS from the beginning of the array
33 let answer: number = dfs(0);
34
35 // Return the total number of splits if answer is less than infinity, otherwise return -1
36 return answer < INF ? answer : -1;
37}
38
39// Function to compute the greatest common divisor (gcd) of two numbers
40function gcd(a: number, b: number): number {
41 while (b !== 0) {
42 let t: number = b;
43 b = a % b;
44 a = t;
45 }
46 return a;
47}
48
Time and Space Complexity
The time complexity of the given code is O(n^3)
, where n
is the length of the nums
list. This is because the dfs
function is called recursively with the starting index i
, and for each call to dfs
, there's a loop running from i
to n
. Within this loop, we are making a recursive call to dfs(j + 1)
, and we also calculate the greatest common divisor (gcd
) for each pair of elements between i
and j
. The gcd
operation itself can be considered as O(log(min(nums[i], nums[j])))
on average, but in the worst case, the gcd
can be considered O(n)
for very large numbers or certain sequences. Simplifying our analysis by considering gcd
as O(1)
for an average case, the total operations are in the order of the sum of sequence from 1
to n
, which is n(n + 1)/2
, thus O(n^2)
. This O(n^2)
combined with the O(n)
for the iteration from i
to n
gives a complexity of O(n^3)
.
The space complexity is primarily determined by the size of the cache used in the memoization and the depth of recursion. Memoization could potentially store a result for every starting index i
, hence it can take O(n)
space. The maximum depth of the recursion determines the stack space, which is also O(n)
because the function could be recursively called starting from each index in nums
. Therefore, the space complexity can be considered as O(n)
for the caching and O(n)
for the recursive call stack, which simplifies to O(n)
.
Learn more about how to find time and space complexity quickly using problem constraints.
Which of these pictures shows the visit order of a depth-first search?
Recommended Readings
Math for Technical Interviews How much math do I need to know for technical interviews The short answer is about high school level math Computer science is often associated with math and some universities even place their computer science department under the math faculty However the reality is that you
What is Dynamic Programming Prerequisite DFS problems dfs_intro Backtracking problems backtracking Memoization problems memoization_intro Pruning problems backtracking_pruning Dynamic programming is an algorithmic optimization technique that breaks down a complicated problem into smaller overlapping sub problems in a recursive manner and uses solutions to the sub problems to construct a solution
LeetCode Patterns Your Personal Dijkstra's Algorithm to Landing Your Dream Job The goal of AlgoMonster is to help you get a job in the shortest amount of time possible in a data driven way We compiled datasets of tech interview problems and broke them down by patterns This way we
Want a Structured Path to Master System Design Too? Don’t Miss This!