3. Longest Substring Without Repeating Characters 无重复字符的最长子串

负雪明烛2022年3月7日

LeetCode
算法题解

Medium
哈希表
字符串
滑动窗口

大约 4 分钟

作者：负雪明烛 id： fuxuemingzhu 个人博客： http://fuxuemingzhu.cn/ 公众号：负雪明烛本文关键词：无重复字符，最长子串，题解，leetcode, 力扣，python, c++, java

@TOC

题目地址：https://leetcode.com/problems/longest-substring-without-repeating-characters/description/

题目描述

Given a string, find the length of the longest substring without repeating characters.

Example 1:

Input: "abcabcbb"
Output: 3 
Explanation: The answer is "abc", which the length is 3.

Example 2:

Input: "bbbbb"
Output: 1
Explanation: The answer is "b", with the length of 1.

Example 3:

Input: "pwwkew"
Output: 3
Explanation: The answer is "wke", with the length of 3. 
             Note that the answer must be a substring, "pwke" is a subsequence and not a substring.

题目大意

找出字符串中最长的不含有重复字符的子串长度。

解题方法

看见题目求长度的，一般时间复杂度都不会太高。

解法一：虫取法+set

所谓虫取法，就是根据某个条件交替移动前后指针，使得在双指针之内的这部分是满足题意要求的。

具体思路比较简单易懂，使用双指针，[left, right]双闭区间来保存子串的左右区间，对应着这个区间我们维护一个set，这个set里面全部是不重复的字符。

使用while循环，如果right字符不在set中，就让它进去；如果right在，就把left对应的字符给remove出去。

所以，当我们得到一个right位置的字符时，通过移动left和修改[left,right]区间内对应的的set，来保持了一个最小的不重复字符区间。这里需要注意的是，移动left的次数不一定就是1次，因为我们要保证left和right之间没有重复字符，而新添加的right字符出现的位置不一定刚刚就是left指向的位置。

比如：

a b c b b c b b 0 1 2 3 4 5 6 7

当right移动到3的时候字符时b，此时，set = {a, b, c}中，left=0，字符b在set中。

所以在while循环中反复移动left，当left移动到2的位置时，此时set = {c},字符b已经不在set中。

按照这个方式移动，set的个数最多的值即为最长子串。

一定注意：[left, right]区间和set是对应的，要同时维护。

下面的python代码是根据right指向的字符是否出现在set中而反复的进行循环，代码如下：

class Solution(object):
    def lengthOfLongestSubstring(self, s):
        """
        :type s: str
        :rtype: int
        """
        left, right = 0, 0
        chars = set()
        res = 0
        while left < len(s) and right < len(s):
            if s[right] in chars:
                if s[left] in chars:
                    chars.remove(s[left])
                left += 1
            else:
                chars.add(s[right])
                right += 1
                res = max(res, len(chars))
        return res

下面的C++代码的思路是如果right刚移动到某个位置，而这个位置的字符在set中出现过，那么就内循环left使得right指向的元素不在set中为止。本质上和上面的代码一致。这里的代码是每次都要把right指向的元素放入到set中的。

class Solution {
public:
    int lengthOfLongestSubstring(string s) {
        const int N = s.size();
        if (N <= 1) return N;
        unordered_set<char> set;
        int res = 0;
        int l = 0, r = 0;
        while (r < N) {
            while (set.count(s[r])) {
                set.erase(s[l]);
                ++l;
            }
            set.insert(s[r]);
            res = max(res, int(set.size()));
            ++r;
        }
        return res;
    }
};

方法二：一次遍历+字典

一次遍历时，使用字典保存每个字符第一次出现的位置。这个方法我一直不知道叫什么名字，就勉强叫做prefix方法吧，因为需要维护已经遍历到的前缀部分。

当right向后遍历的过程中，如果这个字符在字典中，说明这个字符在前面出现过，即这个区间已经不是题目要求的不含重复字符的区间了，因此，需要移动left。

移动left到哪里呢？有个快速的方法，那就是移动到right字符在字典中出现的位置（即s[right]在前面的位置）的下一个位置。

无论如何都会使用right更新字典，另外记录最大区间长度即为所求。

注意，left更新的时候需要保留最大（最右）的位置。举例说明：

对于abba，当right指向最后的a的时候，left指向的是字典中保留的有第一个位置的a，如果不对此进行判断的话，left会移动到第一个字符b。

left一定是向右移动的，不可能撤回到已经移动过的位置。

class Solution(object):
    def lengthOfLongestSubstring(self, s):
        """
        :type s: str
        :rtype: int
        """
        left, right = 0, 0
        res = 0
        chars = dict()
        for right in range(len(s)):
            if s[right] in chars:
                left = max(left, chars[s[right]] + 1)
            chars[s[right]] = right
            res = max(res, right - left + 1)
        return res

C++代码如下，注意C++的变量务必需要初始化，否则将不确定，比如这里的l和res，如果不初始化会产生莫名其妙的结果：

class Solution {
public:
    int lengthOfLongestSubstring(string s) {
        const int N = s.size();
        unordered_map<char, int> pos;
        int l = 0;
        int res = 0;
        for (int r = 0; r < N; ++r) {
            if (pos.count(s[r])) {
                l = max(l, pos[s[r]] + 1);
            }
            pos[s[r]] = r;
            res = max(res, r - l + 1);
        }
        return res;
    }
};

参考资料：https://www.youtube.com/watch?v=hw0zHamgaks

另外有个文章不错：http://www.cnblogs.com/grandyang/p/4480780.html

日期

2018 年 8 月 24 日 —— Keep fighting! 2019 年 1 月 19 日 —— 有好几天没有更新文章了

3. Longest Substring Without Repeating Characters 无重复字符的最长子串

# 题目描述

# 题目大意

# 解题方法

# 解法一：虫取法+set

# 方法二：一次遍历+字典

# 日期

详情

题目描述

题目大意

解题方法

解法一：虫取法+set

方法二：一次遍历+字典

日期