49. Group Anagrams
Problem Description
The task is to write a function that takes an array (list) of strings called strs
and groups the anagrams together. An anagram is a word obtained by rearranging the letters of another word to form a new word using all the original characters exactly once. For example, "listen" and "silent" are anagrams of each other. The function can return the groups in any order, meaning the sequence of the groups is not important, and within each group, the sequence of strings is not important either.
Intuition
The intuition behind the solution is that if two strings are anagrams of each other, when sorted alphabetically, they will result in the same string. For instance, 'ate' and 'eat' when sorted will both result in 'aet'. This provides us with a simple way to identify groups of anagrams: by using the sorted string as a key in a dictionary (or hash map).
We can iterate through each string in our input list, sort the string alphabetically, and use the sorted string as a key in a dictionary. The value for each key will be a list that contains all the strings from the input that, when sorted, match this key. We use a defaultdict for convenience because it allows us to append to the list without needing to check if the key already exists -- if it doesn't, defaultdict automatically creates a new list by default.
At the end of the iteration, all anagrams will be grouped together in the dictionary under the same key, and we can then return the values of the dictionary, which are our required anagram groups.
Learn more about Sorting patterns.
Solution Approach
The solution provided uses a hash table (or dictionary in Python) to group the strings that are anagrams of each other. To implement this approach, the following steps are taken during the execution of the groupAnagrams
function:
-
A
defaultdict
is created where each value is initialized as a list. This allows appending new items to the list without first checking if the key exists. -
The function iterates through each string
s
in the provided liststrs
. -
Inside the loop, the string
s
is sorted alphabetically usingsorted(s)
which returns a list of characters, and then''.join(sorted(s))
is used to convert the list of characters back into a string. This sorted string serves as the key in our dictionary. -
The original unsorted string
s
is appended to the list in the dictionary that corresponds to the sorted key (k
). This means all anagrams ofs
will be grouped under the same key. -
After processing all strings in
strs
, the function returns the values of the dictionary as a list of lists, where each sublist contains all the anagrams of a particular word from the input list.
Here is the relevant part of the code that demonstrates the algorithm:
d = defaultdict(list)
for s in strs:
k = "".join(sorted(s))
d[k].append(s)
return list(d.values())
Each key in the dictionary (k
) represents a group of anagrams, and the corresponding value is a list containing all the strings that are anagrams of one another. Since the defaultdict
automatically initializes missing keys with an empty list, no additional checks are necessary before appending to the list for a particular key. This makes the code clean and efficient.
The groupAnagrams
function effectively utilizes sorting and hash tables to categorize strings by their anagram groups, ensuring that the full process is clear, intuitive and efficient, making it a standard approach to solve anagram grouping problems.
Ready to land your dream job?
Unlock your dream job with a 2-minute evaluator for a personalized learning plan!
Start EvaluatorExample Walkthrough
Let's walk through a small example to illustrate the solution approach:
Suppose we have the following array of strings: ["bat", "tab", "eat", "tea", "tan", "nat"]
.
Here is how we would group the anagrams together using the provided solution approach:
-
Create a
defaultdict
of lists:d = defaultdict(list)
. This will store our anagrams. -
Iterate through each string in the array:
"bat"
: When sorted, becomes"abt"
. We append"bat"
tod["abt"]
."tab"
: When sorted, becomes"abt"
. We append"tab"
tod["abt"]
."eat"
: When sorted, becomes"aet"
. We append"eat"
tod["aet"]
."tea"
: When sorted, becomes"aet"
. We append"tea"
tod["aet"]
."tan"
: When sorted, becomes"ant"
. We append"tan"
tod["ant"]
."nat"
: When sorted, becomes"ant"
. We append"nat"
tod["ant"]
.
-
Now,
d
looks like this:{ "abt": ["bat", "tab"], "aet": ["eat", "tea"], "ant": ["tan", "nat"] }
Each key corresponds to a sorted string, and the values are the original strings that are anagrams.
-
Finally, we return the dictionary values as a list of lists:
[ ["bat", "tab"], ["eat", "tea"], ["tan", "nat"] ]
Each sublist contains anagrams of each other.
The order of the sublists and the order of the strings within each sublist does not matter. Those are our anagram groups, as requested. The function provides a clear and efficient way of splitting our input array into these anagram groups using sorting and hashing.
Solution Implementation
1from collections import defaultdict
2
3class Solution:
4 def group_anagrams(self, strs):
5 # A dictionary that automatically creates a new list entry for each new key
6 anagrams = defaultdict(list)
7
8 # Iterate through each string in the provided list
9 for s in strs:
10 # Sort the string to create a key for the anagrams
11 # Since sorted returns a list of characters, join them to form a string
12 key = "".join(sorted(s))
13
14 # Append the original string to the list of anagrams with the same key
15 anagrams[key].append(s)
16
17 # Convert the dictionary values to a list and return it
18 return list(anagrams.values())
19
1import java.util.*;
2
3class Solution {
4 public List<List<String>> groupAnagrams(String[] strs) {
5 // Create a map to group the anagrams, where the key is the sorted string, and the value is a list of original strings.
6 Map<String, List<String>> anagramsMap = new HashMap<>();
7
8 // Iterate over each string in the array.
9 for (String str : strs) {
10 // Convert the string to a character array and sort it.
11 char[] charArray = str.toCharArray();
12 Arrays.sort(charArray);
13
14 // Create a new string from the sorted character array.
15 String sortedStr = String.valueOf(charArray);
16
17 // If the sorted string key is not present in the map, initialize the list.
18 // Then add the original string to the list associated with the sorted string key.
19 anagramsMap.computeIfAbsent(sortedStr, key -> new ArrayList<>()).add(str);
20 }
21
22 // Return a new list containing all values from the map's lists,
23 // effectively grouping all anagrams together.
24 return new ArrayList<>(anagramsMap.values());
25 }
26}
27
1#include <vector>
2#include <string>
3#include <unordered_map>
4#include <algorithm> // needed for std::sort()
5
6class Solution {
7public:
8 // This function takes a list of strings and groups anagrams together.
9 vector<vector<string>> groupAnagrams(vector<string>& strs) {
10 // Create a map to keep track of the groups of anagrams.
11 unordered_map<string, vector<string>> anagramGroups;
12
13 // Iterate over each string in the input list.
14 for (auto& str : strs) {
15 // Create a key by sorting the characters in the string.
16 string key = str; // Copy the original string to "key".
17 sort(key.begin(), key.end()); // Sort the characters to form the key.
18
19 // Add the original string to the vector corresponding to the sorted key.
20 anagramGroups[key].emplace_back(str);
21 }
22
23 // Initialize a vector to hold the final grouped anagrams.
24 vector<vector<string>> groupedAnagrams;
25
26 // Iterate over the map to extract the groups of anagrams.
27 for (auto& pair : anagramGroups) {
28 // The second element of the pair is our group of anagrams.
29 // Add this group to our final result.
30 groupedAnagrams.emplace_back(pair.second);
31 }
32
33 // Return the grouped anagrams.
34 return groupedAnagrams;
35 }
36};
37
1// Function to group anagrams in an array of strings.
2// Each group contains strings that are anagrams of each other.
3function groupAnagrams(strs: string[]): string[][] {
4 // Define a map to hold the sorted string as key and an array of its anagrams as value.
5 const anagramMap: Map<string, string[]> = new Map();
6
7 // Iterate through each string in the input array.
8 for (const str of strs) {
9 // Sort the characters of the string to form the key.
10 const key = str.split('').sort().join('');
11
12 // If the key is not present in the map, initialize it with an empty array.
13 if (!anagramMap.has(key)) {
14 anagramMap.set(key, []);
15 }
16
17 // Add the original string to the corresponding key's list of anagrams.
18 anagramMap.get(key)!.push(str);
19 }
20
21 // Convert the values of the map to an array and return.
22 return Array.from(anagramMap.values());
23}
24
Time and Space Complexity
Time Complexity
The time complexity of the code is primarily determined by two operations: the sorting operation for each string and the insertion into the dictionary.
-
Sorting Each String: Assuming the average length of the strings is
n
and there arem
strings to sort, the time complexity of sorting each string using the TimSort algorithm (Python's default sort algorithm) isO(n log n)
. Since we need to sortm
such strings, the total time complexity for this part isO(m * n log n)
. -
Inserting into Dictionary: The insertion of
m
strings into the dictionary has a time complexity ofO(m)
, since each insertion operation into a dictionary isO(1)
on average.
Combining these two, we get the total time complexity as O(m * n log n)
.
Space Complexity
The space complexity is due to the storage of m
entries in the dictionary and the lists of anagrams.
-
Dictionary Storage (HashMap): The dictionary stores lists of anagrams. In the worst case, all strings are anagrams of each other, requiring
O(m * n)
space (since each of them
strings of lengthn
can be in the same list). -
Sorted String Keys: Additionally, for each string, we create a sorted copy to use as a key. Since we sort each string in place and only have
m
keys at any point in time, the space for the keys would beO(m * n)
.
Therefore, the total space complexity of the algorithm is O(m * n)
, as it dominates over the constant factors and smaller terms.
Learn more about how to find time and space complexity quickly using problem constraints.
Which of the two traversal algorithms (BFS and DFS) can be used to find whether two nodes are connected?
Recommended Readings
Sorting Summary Comparisons We presented quite a few sorting algorithms and it is essential to know the advantages and disadvantages of each one The basic algorithms are easy to visualize and easy to learn for beginner programmers because of their simplicity As such they will suffice if you don't know any advanced
LeetCode Patterns Your Personal Dijkstra's Algorithm to Landing Your Dream Job The goal of AlgoMonster is to help you get a job in the shortest amount of time possible in a data driven way We compiled datasets of tech interview problems and broke them down by patterns This way we
Recursion Recursion is one of the most important concepts in computer science Simply speaking recursion is the process of a function calling itself Using a real life analogy imagine a scenario where you invite your friends to lunch https algomonster s3 us east 2 amazonaws com recursion jpg You first
Want a Structured Path to Master System Design Too? Don’t Miss This!