2730. Find the Longest Semi-Repetitive Substring
Problem Description
The problem tasks us with finding the longest semi-repetitive substring within a given numeric string s
. A string is considered semi-repetitive if it contains at most one consecutive pair of the same digit. Any additional pairs of identical consecutive digits would disqualify the substring from being semi-repetitive. Our goal is to determine the maximum length of any such semi-repetitive substrings within s
. For example, in the string "122344", the longest semi-repetitive substring would be "1223" with the length of 4, as it contains only one pair of consecutive '2's.
Intuition
To solve the problem, the intuition is to use a sliding window approach. A sliding window is a common technique used in problems involving substrings or subarrays. The idea is to keep track of a window (or a range) within the string, which can expand or contract depending on the conditions we are looking for.
In this particular problem, our window expands whenever we add characters that do not form more than one consecutive pair with the same digit. When we encounter a situation where our window includes more than one consecutive pair, we need to contract the window from the left side.
Imagine running through the string from left to right, keeping track of the longest window encountered thus far. Initially, the window can simply grow as characters are added. If we encounter a second consecutive pair of the same digit, we need to move the left edge of the window to the right until we are left with at most one consecutive pair within our window. At each step, we compare the size of the current window to the maximum size found so far and update if necessary.
The solution code follows this reasoning by setting up a loop through the characters of the string while keeping track of two pointers denoting the window's start and end and a count to record instances of consecutive pairs. As long as the count does not exceed one, the window can grow. But if an excess is detected, the left edge of the window will move to reduce the count, ensuring the substring always remains semi-repetitive. The maximum window length is tracked all along and continuously updated as the loop proceeds. This approach effectively parses the entire string and yields the correct maximum size of the semi-repetitive substring.
Learn more about Sliding Window patterns.
Solution Approach
To implement the solution, the algorithm utilizes two pointers and a counter as central aspects of the sliding window technique.
Here is the breakdown of the algorithm with these components:
-
Two Pointers (
j
andi
):j
is initialized to0
and represents the start of the sliding window.i
is used to iterate over the string, effectively defining the end of the sliding window.
-
Counter (
cnt
):- The counter keeps track of consecutive pairs within the current window.
With these components, the algorithm proceeds as follows:
-
Initialize an accumulator
ans
to store the maximum length encountered, a countercnt
to count consecutive pairs, a start pointerj
at position0
, and the end pointeri
which will iterate over the string. -
Iterate through the string with index
i
from0
ton-1
(inclusive), wheren
is the length of the string. -
If the current character at
i
is the same as the previous character (s[i] == s[i - 1]
), increment the countercnt
because we've found a consecutive pair. However, this should be done ifi
is not the first character to avoid checking before the beginning of the string. -
While the counter
cnt
is greater than 1, which means there are more than one consecutive pairs in our current window, incrementj
to move the start of the window to the right. This effectively reduces the size of the window from the beginning side. If the character at the new startj
and the next characterj + 1
form a consecutive pair, decrementcnt
as we've removed one such pair from the window. -
After adjusting the window to ensure it is semi-repetitive, calculate the length of the current window (
i - j + 1
) and updateans
if the current window is larger than the previous maximum. -
Once all possible windows have been considered (post loop completion), return the maximum length
ans
found during the iteration.
Here's the code enclosed with backticks for proper markdown display:
1class Solution:
2 def longestSemiRepetitiveSubstring(self, s: str) -> int:
3 n = len(s)
4 ans = cnt = j = 0
5 for i in range(n):
6 if i and s[i] == s[i - 1]:
7 cnt += 1
8 while cnt > 1:
9 if s[j] == s[j + 1]:
10 cnt -= 1
11 j += 1
12 ans = max(ans, i - j + 1)
13 return ans
This algorithm makes only a single pass through the string, therefore having a time complexity of O(n)
where n
is the length of the string, and a space complexity of O(1)
as it uses a fixed number of extra variables.
Ready to land your dream job?
Unlock your dream job with a 2-minute evaluator for a personalized learning plan!
Start EvaluatorExample Walkthrough
Let's walk through an example to illustrate how the algorithm works with the following input string s = "11231445"
.
-
Initialize the answer variable
ans
to store the maximum length found (ans = 0
). Initialize the count variablecnt
to keep track of consecutive pairs (cnt = 0
). Set the start of the sliding windowj = 0
. -
Start iterating with
i = 0
.- Since the first character doesn't have a previous character, just continue to next iteration.
-
At
i = 1
,s[i] = "1"
(the second '1'),s[i-1] = "1"
(the first '1').- Since
s[i] == s[i-1]
, incrementcnt
by 1 (cnt = 1
).
- Since
-
At
i = 2
,s[i] = "2"
, no consecutive pair is found.- The window size is now 3 (from index
j = 0
toi = 2
), updateans
(ans = 3
).
- The window size is now 3 (from index
-
At
i = 3
,s[i] = "3"
, continue to next because no consecutive pair is found.- The window size is now 4, update
ans
(ans = 4
).
- The window size is now 4, update
-
At
i = 4
,s[i] = "1"
which does not form a consecutive pair.- The window size is now 5, update
ans
(ans = 5
).
- The window size is now 5, update
-
At
i = 5
,s[i] = "4"
, proceed to next iteration. -
At
i = 6
,s[i] = "4"
, which makes a consecutive pair withs[i-1]
.- Increment
cnt
(cnt = 2
) because now there are two consecutive '4's.
- Increment
-
Now
cnt > 1
, initiate while loop to shrink the window from the left.- Since
s[j] == s[j+1]
(both are '1'), decrementcnt
by 1 (cnt = 1
).
- Since
-
Increment
j
so the new start of the window is atj = 1
. -
The window size is now 6 (from
j = 1
toi = 6
), updateans
(ans = 6
). -
At
i = 7
,s[i] = "5"
, move to the next iteration. -
The loop has now considered all possible windows within the string.
-
Return the maximum length found, which is
ans = 6
.
The longest semi-repetitive substring in s = "11231445"
is then "123144", with a maximum length of 6. This is consistent with the problem's requirementsโa substring with at most one consecutive pair (in this case, the pair of '4's).
The algorithm effectively finds the longest substring satisfying the semi-repetitive condition with a single pass over the input string.
Solution Implementation
1class Solution:
2 def longest_semi_repetitive_substring(self, s: str) -> int:
3 # Initialize the length of the string
4 string_length = len(s)
5
6 # Initialize variables; ans for storing the maximum length,
7 # count for counting consecutive repetitions, and j as the left pointer
8 max_length = count = left_pointer = 0
9
10 # Loop through the string using the right pointer i
11 for right_pointer in range(string_length):
12 # If we are not at the first character and the current character is the same
13 # as the previous one, increment the count of repetitions
14 if right_pointer > 0 and s[right_pointer] == s[right_pointer - 1]:
15 count += 1
16
17 # If there are more than one consecutive repetition, move the left pointer
18 while count > 1:
19 # If the character at the left pointer is followed by the same character,
20 # decrement the count because we are moving past this repetition
21 if s[left_pointer] == s[left_pointer + 1]:
22 count -= 1
23
24 # Move the left pointer to the right
25 left_pointer += 1
26
27 # Update max_length with the length of the current semi-repetitive substring
28 current_length = right_pointer - left_pointer + 1
29 max_length = max(max_length, current_length)
30
31 # Return the maximum length found
32 return max_length
33
1class Solution {
2 public int longestSemiRepetitiveSubstring(String s) {
3 int stringLength = s.length(); // Length of the input string
4 int maxLength = 0; // Initialize the maximum length to zero
5
6 // Start with two pointers i and j at the beginning of the string s and a count 'repeatedCharsCount' to record consecutive characters
7 for (int i = 0, j = 0, repeatedCharsCount = 0; i < stringLength; ++i) {
8 // Check if the current character is the same as the previous character, increase repeatedCharsCount
9 if (i > 0 && s.charAt(i) == s.charAt(i - 1)) {
10 ++repeatedCharsCount;
11 }
12
13 // If the repeatedCharsCount is more than one (more than two repeated characters),
14 // move the start pointer 'j' forward until repeatedCharsCount is at most one
15 while (repeatedCharsCount > 1) {
16 if (s.charAt(j) == s.charAt(j + 1)) {
17 --repeatedCharsCount;
18 }
19 ++j;
20 }
21
22 // Update the maximum length of the substring encountered so far
23 maxLength = Math.max(maxLength, i - j + 1);
24 }
25
26 return maxLength; // Return the maximum length found
27 }
28}
29
1class Solution {
2public:
3 // Function to find the length of longest semi-repetitive substring.
4 // A semi-repetitive substring is a substring where no character appears
5 // more than twice in a row.
6 int longestSemiRepetitiveSubstring(string s) {
7 // Initialize the size of the string and variable to keep track of the
8 // maximum length of the semi-repetitive substring found so far.
9 int stringSize = s.size();
10 int longestLength = 0;
11
12 // Initialize pointers for substring window and a counter to track consecutive
13 // character repetition count.
14 for (int left = 0, right = 0, repeatCount = 0; right < stringSize; ++right) {
15 // If the character is the same as the previous one, increase repetition count.
16 if (right > 0 && s[right] == s[right - 1]) {
17 ++repeatCount;
18 }
19
20 // If repeatCount is more than 1, it means the character has appeared
21 // more than twice. Shift the left pointer to the right to reduce the count.
22 while (repeatCount > 1) {
23 if (s[left] == s[left + 1]) {
24 --repeatCount;
25 }
26 ++left;
27 }
28
29 // Update longestLength with the maximum length found so far.
30 longestLength = max(longestLength, right - left + 1);
31 }
32
33 // Return the length of the longest semi-repetitive substring.
34 return longestLength;
35 }
36};
37
1function longestSemiRepetitiveSubstring(s: string): number {
2 const length = s.length; // Store the length of the input string s
3 let longestLength = 0; // This will keep track of the longest semi-repetitive substring
4 let startIndex = 0; // Starting index of the current substring
5 let repeatCount = 0; // Count of consecutive repeating characters
6
7 // Iterate through each character in the string
8 for (let index = 0; index < length; index++) {
9 // If the current and previous characters are the same, increment the repeat count
10 if (index > 0 && s[index] === s[index - 1]) {
11 repeatCount++;
12 }
13
14 // If there are more than one consecutive repeating characters, move the start index forward
15 while (repeatCount > 1) {
16 // If the character at the start index is the same as its next character, decrement the repeat count
17 if (s[startIndex] === s[startIndex + 1]) {
18 repeatCount--;
19 }
20 // Move the start index forward
21 startIndex++;
22 }
23
24 // Store the maximum length between the current longest and the length of the current semi-repetitive substring
25 longestLength = Math.max(longestLength, index - startIndex + 1);
26 }
27
28 // Return the length of the longest semi-repetitive substring
29 return longestLength;
30}
31
Time and Space Complexity
Time Complexity
The function longestSemiRepetitiveSubstring
iterates through each character in the input string s
exactly once using a single loop, which gives us a complexity of O(n)
where n
is the length of the string s
. Within this loop, it handles a while loop that only decrements the count and moves a second pointer, j
, forward, also at most n
times over the course of the entire function. Each character is considered at most twice (once when i
passes over it and once when j
passes over it), so the while loop does not increase the overall time complexity beyond O(n)
. Thus, the overall time complexity of the algorithm remains O(n)
.
Space Complexity
The space complexity of the function is O(1)
. It only uses a fixed number of additional variables (ans
, cnt
, j
, i
, and n
) that are not dependent on the size of the input. No additional data structures that scale with input size are used.
Learn more about how to find time and space complexity quickly using problem constraints.
How does quick sort divide the problem into subproblems?
Recommended Readings
https algomonster s3 us east 2 amazonaws com cover_photos stack svg Sliding Window Maximum Monotonic Stack We have an array and a sliding window defined by a start index and an end index The sliding window moves from left of the array to right There are always k elements in
Patterns The Shortest Path Algorithm for Coding Interviews The goal of AlgoMonster is to help you get a job in the shortest amount of time possible in a data driven way We compiled datasets of tech interview problems and broke them down by patterns This way we can determine the
Recursion Recursion is one of the most important concepts in computer science Simply speaking recursion is the process of a function calling itself Using a real life analogy imagine a scenario where you invite your friends to lunch https algomonster s3 us east 2 amazonaws com recursion jpg You first
Got a question?ย Ask the Monster Assistantย anything you don't understand.
Still not clear? ย Submitย the part you don't understand to our editors. Or join ourย Discord and ask the community.