1624. Largest Substring Between Two Equal Characters
Problem Description
The problem presents a string s
and requires finding the length of the longest substring that is present between two identical characters in the string, excluding the characters themselves. If no such substring exists (i.e., there aren't two identical characters in the string with anything between them), the function should return -1
. The term "substring" indicates a consecutive sequence of characters within the string s
.
For example, in the string "abca", the longest substring between two equal characters 'a' is "bc", which has the length of 2. In a case where the string is "abc", there are no two equal characters with something between them, so the result would be -1
.
Intuition
To solve this problem, we must find characters in the string that appear more than once and calculate the distance between their first and last occurrences, since that will determine the length of the substring between them. To efficiently do this, we should remember the index of the first occurrence of each character when we see it for the first time.
With these considerations in mind, we use a dictionary d
to store characters as keys and their first encountered index positions as values. We iterate over the string s
and for each character c
:
- If
c
is already in the dictionaryd
, this means we have previously seen it, and hence we calculate the distance between this occurrence and its first occurrence, which isi - d[c] - 1
, and update the answerans
if this distance is larger. - If
c
is not in the dictionary, we record its indexi
in the dictionaryd
, marking its first occurrence.
The variable ans
is used to keep track of the maximum length found so far. If no such length is found (meaning ans
never gets updated), it remains -1
, which is also the default value we return when there are no two identical characters with substrings between them.
Thus, by using a single pass over the string and a dictionary to track the indices of characters, we arrive at an efficient solution to the problem.
Solution Approach
The implementation uses a dictionary to store the first occurrence of each character in the given string. A dictionary is an excellent choice for this problem due to its fast lookups and insertions, which operate on average in O(1) time complexity.
Let's walk through the implementation step by step:
- A dictionary
d
is initialized to keep a record of the first index where each character appears. - A variable
ans
is initiated with a value of-1
. This will eventually hold the maximum length of any substring found between two equal characters, excluding the characters themselves. - We iterate over the characters of the string
s
using afor
loop, wherei
is the index andc
is the character at that index. - Inside the loop, we check if the character
c
is already in the dictionaryd
:- If
c
is ind
, it means we have encounteredc
before. The substring length between the twoc
characters isi - d[c] - 1
.- We compare this length with the current maximum stored in
ans
. - If it's larger, we update
ans
with this new value.
- We compare this length with the current maximum stored in
- If
c
is not ind
, we addc
to the dictionary with the current indexi
as its value. This marks the location of the first occurrence ofc
.
- If
- Once the loop is complete,
ans
will have the maximum length found or remain-1
if no such substring exists.
This approach is efficient because we maintain a sliding window between two occurrences of the same character by remembering only the first occurrence and using the current index to calculate the length of the in-between substring. As a result, the time complexity is O(n), where n is the length of the string, because we are making a single pass through the string.
With this understanding, the provided Python code realizes the approach effectively and delivers the correct result for the problem at hand.
Ready to land your dream job?
Unlock your dream job with a 2-minute evaluator for a personalized learning plan!
Start EvaluatorExample Walkthrough
Let's consider a string s
with the value "character"
. We are trying to find the length of the longest substring present between two identical characters (excluding those characters).
Following the solution approach, let's break it down step by step:
-
Initialize a dictionary
d
for keeping the first index occurrence of each character. -
Initialize
ans
as-1
. -
Start iterating through the string:
- For index
0
, characterc
: it's not ind
, so we addc: 0
to the dictionary. - For index
1
, characterh
: it's also not ind
, addh: 1
. - For index
2
, charactera
: it's not ind
, adda: 2
. - For index
3
, characterr
: not ind
, addr: 3
. - For index
4
, charactera
: a is already ind
at index2
. Calculate the distance:4 - 2 - 1 = 1
. Updateans
to1
. - For index
5
, characterc
: c is ind
with index0
. The distance is5 - 0 - 1 = 4
. Since4
is greater than the currentans
value of1
, updateans
to4
. - For index
6
, charactert
: not ind
, addt: 6
. - For index
7
, charactere
: not ind
, adde: 7
. - For index
8
, characterr
: r is ind
with index3
. The distance is8 - 3 - 1 = 4
.ans
is already4
, so no update is needed.
- For index
-
End of iteration. The final
ans
is4
, which is the length of the longest substring ("aract") between two identical characters without including them ("c...c" and "r...r").
The code returns 4
as the length of the longest substring between two repeating characters, excluding the characters themselves.
Solution Implementation
1class Solution:
2 def maxLengthBetweenEqualCharacters(self, s: str) -> int:
3 # Dictionary to store the first occurrence of each character
4 first_occurrence = {}
5
6 # Initialize the answer with -1 as per problem constraints
7 max_length = -1
8
9 # Iterate over the string to find the max length between equal characters
10 for index, char in enumerate(s):
11 # If character is already seen, calculate the length between the current and first occurrence
12 if char in first_occurrence:
13 length_between = index - first_occurrence[char] - 1
14 max_length = max(max_length, length_between)
15 else:
16 # Store the first occurrence of the character
17 first_occurrence[char] = index
18
19 # Return the maximum length found
20 return max_length
21
1class Solution {
2 public int maxLengthBetweenEqualCharacters(String s) {
3 // Array to store the first occurrence index of each character.
4 // Initialized to -1 indicating that the character hasn't been seen yet.
5 int[] firstOccurrence = new int[26];
6 Arrays.fill(firstOccurrence, -1);
7
8 // Variable to store the maximum length found.
9 int maxLength = -1;
10
11 // Loop through the string to find the maximum length.
12 for (int i = 0; i < s.length(); ++i) {
13 // Calculate the index for the character 'a' as 0, 'b' as 1, etc.
14 int charIndex = s.charAt(i) - 'a';
15
16 // If the character has not been seen before,
17 // store its index in firstOccurrence.
18 if (firstOccurrence[charIndex] == -1) {
19 firstOccurrence[charIndex] = i;
20 } else {
21 // If we have seen the character before,
22 // calculate the length between the current and first index.
23 // Then, update the maxLength if the calculated length is larger.
24 maxLength = Math.max(maxLength, i - firstOccurrence[charIndex] - 1);
25 }
26 }
27
28 // Return the maximum length found.
29 return maxLength;
30 }
31}
32
1class Solution {
2public:
3 int maxLengthBetweenEqualCharacters(string s) {
4 // Create a vector 'firstIndex' to store the first occurrence index of each lowercase letter.
5 // Initialized with -1, indicating we have not seen the character yet.
6 vector<int> firstIndex(26, -1);
7
8 // Variable to store the maximum length found between two equal characters.
9 int maxLen = -1;
10
11 // Iterate through the string to check each character.
12 for (int i = 0; i < s.size(); ++i) {
13 // Converting the current character to its corresponding index (0-25 for 'a'-'z').
14 int index = s[i] - 'a';
15
16 // If we have not seen this character before, store its index.
17 if (firstIndex[index] == -1) {
18 firstIndex[index] = i;
19 } else {
20 // If we've seen this character before, calculate the length between the two equal characters.
21 // Update 'maxLen' if we find a longer length.
22 maxLen = max(maxLen, i - firstIndex[index] - 1);
23 }
24 }
25
26 // Return the maximum length found.
27 return maxLen;
28 }
29};
30
1/**
2 * Finds the maximum length of a substring between two identical characters
3 * in the input string 's'.
4 *
5 * @param s The string to be evaluated.
6 * @return The maximum length of a substring.
7 */
8function maxLengthBetweenEqualCharacters(s: string): number {
9 // 'n' holds the length of the string 's'.
10 const n: number = s.length;
11 // 'positionIndex' is an array to track the first occurrence position of each alphabet character.
12 const positionIndex: number[] = new Array(26).fill(-1);
13 // 'maxLength' will keep track of the maximum length found.
14 let maxLength: number = -1;
15
16 // Iterate over each character in the string.
17 for (let i = 0; i < n; i++) {
18 // Compute the zero-based alphabet index of the current character.
19 const charIndex: number = s[i].charCodeAt(0) - 'a'.charCodeAt(0);
20 // If it is the first occurrence of the character, store its position.
21 if (positionIndex[charIndex] === -1) {
22 positionIndex[charIndex] = i;
23 } else {
24 // If a pair is found, update the 'maxLength', if necessary.
25 maxLength = Math.max(maxLength, i - positionIndex[charIndex] - 1);
26 }
27 }
28
29 // Return the maximum length found, or -1 if no such length exists.
30 return maxLength;
31}
32
Time and Space Complexity
Time Complexity
The given Python code iterates over the string s
only once with a for loop. For each character in the string, it performs a constant-time operation to check if the character is in the dictionary d
, update the maximum length using max()
, and set the value in the dictionary. None of these operations depend on the size of the input string inside the loop, so each iteration of the loop runs in constant time, O(1)
.
The time complexity of the entire function is thus proportional to the number of characters n
in the string s
, as each character is processed just once. This results in an overall time complexity of:
O(n)
where n
is the length of the string s
.
Space Complexity
The space complexity of the code depends on the number of unique characters in the string s
, as a dictionary d
stores the first index at which each character appears. In the worst case, all characters of the string s
are unique, which would require storing each character in the dictionary d
.
Assuming the input string s
contains n
characters and taking into account the possibility of n
unique characters, the space complexity of maintaining the dictionary is:
O(n)
where n
is the length of the string s
.
However, if we consider the constraint of the problem that the input string s
consists of only lowercase English letters, then there is a constant maximum of 26 unique characters that can be stored in the dictionary d
. This would imply a constant space complexity:
O(1)
Depending on the input constraints specified in the problem statement, you should use the appropriate analysis for the space complexity.
Learn more about how to find time and space complexity quickly using problem constraints.
Which of the following problems can be solved with backtracking (select multiple)
Recommended Readings
LeetCode Patterns Your Personal Dijkstra's Algorithm to Landing Your Dream Job The goal of AlgoMonster is to help you get a job in the shortest amount of time possible in a data driven way We compiled datasets of tech interview problems and broke them down by patterns This way we
Recursion Recursion is one of the most important concepts in computer science Simply speaking recursion is the process of a function calling itself Using a real life analogy imagine a scenario where you invite your friends to lunch https algomonster s3 us east 2 amazonaws com recursion jpg You first
Runtime Overview When learning about algorithms and data structures you'll frequently encounter the term time complexity This concept is fundamental in computer science and offers insights into how long an algorithm takes to complete given a certain input size What is Time Complexity Time complexity represents the amount of time
Want a Structured Path to Master System Design Too? Don’t Miss This!