algorithm - Find longest substring s.t the count of each character does not exceed the number of unique characters - Stack Overf

IT技术

更新时间：2025-04-103

admin管理员组
文章数量:1400681

Given a string, I want to find the longest substring such that the count of each character does not exceed the number of unique characters in that substring.

For example:

Input: 'aaabb'
Output: 'aabb'

I understand how to do this brute-force: Generate all substrings while tracking the count of each character and the number of distinct characters in each substring. If there's a substring where the condition is violated, we move the start of the window forward. Else, we move the end of the window forward.

The brute-force approach is in O(n^2). Is there a more efficient way to do this?

Given a string, I want to find the longest substring such that the count of each character does not exceed the number of unique characters in that substring.

For example:

Input: 'aaabb'
Output: 'aabb'

The brute-force approach is in O(n^2). Is there a more efficient way to do this?

Share Improve this question asked Mar 24 at 22:59 Mark Carothers 311 bronze badge

I don't think the brute force algorithm described works. e.g. "aaaabcdef" we would check "aa" and decide it violates the condition and move the start of window forward, even though if we continue moving the tail forward we'll eventually reach a substring that meets the condition, right? – Kaia Commented Mar 24 at 23:38
The most brute-force possible is "choose a start and end. then iterate through that substring in O(n) time and see if it works" for a O(n^3) performance, right? – Kaia Commented Mar 24 at 23:40
And in particular, it's not just a matter of choosing the right starting point (as it might seem from aaaabcdef, where if we start from the right side with a greedy approach we're golden). I don't think you can pick a starting pivot and then add items to the left or right until you have a maximal thing, since e.g. AAABCDAAAAAEFGHAAA the substring BCDAAAAA is invalid but becomes valid again once you reach BCDAAAAAEFGH – Kaia Commented Mar 25 at 0:03

Add a comment |

2 Answers 2

Sorted by: Reset to default 4

Here's an O(n^2) algorithm, I think. (note that I don't think the O(n^2) solution given in the question works)

O(n^2) in the `n >>> # of unique characters` case:

Let S be the input string.

Scan S and produce C the set of distinct characters in S. Let |C| denote the length of C.
For each character c in C, produce a cumulative-occurrences array O_c, where each O_c[i] is the number of c in the substring S[0:i]. This is O(n * |C|).
Given a start and endpoint, we can get the number of characters c in the substring via O_c[end] - O_c[start]. Obtain the maximum occurrences and the number of non-zero occurrences and use these to check validity.
Using 3, check each start and endpoint in O(n^2 * |C|). If we can assume that the number of characters is small compared to n, that's O(n^2).

It seems like if there is an improvement over O(n^2), it'd be some way to avoid checking every start and endpoint in 4.

O(n) in the `n >>> (# of unique characters)^2` case:

If for some substring S[i:j] there are more occurrences of some character c than |C|, then any S[i:k] with k > j is an invalid substring (because all the unique characters in the string cannot make up for the current occurrences of c). Thus, we can immediately increment i once we find this condition.
For substring of length l, at least one character will have at least ceil(l / |C|) occurrences (pigeonhole argument).
So the maximal length of a valid substring is |C|^2.

If we have few distinct characters m (say, if string consists of small letters only) we can check substrings with exactly m distinct characters each of them has frequency m or less, if there are no such substrings we can check for m - 1 distinct characters etc.

In the worst case we have O(n * m) time

C# Code:

public static string LongestSubstring(string value) {
  if (string.IsNullOrEmpty(value))
    return "";

  string result = value.Substring(0, 1);

  int distinct = value.Distinct().Count();

  for (int count = distinct; count > 1; --count) {
    var freqs = new Dictionary<char, int>();
    var found = false;

    for (int left = 0, right = 0; right < value.Length; ++right) {
      var letter = value[right];

      if (freqs.TryGetValue(letter, out var freq))
        freqs[letter] += 1;
      else
        freqs.Add(letter, 1);

      while (freqs.Count > count || freqs[letter] > count) {
        var leftLetter = value[left++];

        if (freqs[leftLetter] == 1)
          freqs.Remove(leftLetter);
        else
          freqs[leftLetter] -= 1;
      }

      if (freqs.Count == count && result.Length < right - left + 1) {
        found = true;
        result = value.Substring(left, right - left + 1);
      }
    }

    if (found)
      return result;
  }

  return result;
}

Demo:

var tests = new string[] {
  "aaaa",
  "aaabb",
  "aaabbc",
  "aaaabbaaaaaaccaaaaa",
  "aaaabbaaadaaaccaaaaa",
};
      
var report = string.Join(Environment.NewLine, tests
  .Select(test => $"{test,30} -> {LongestSubstring(test)}"));
      
Console.WriteLine(report);

Output:

                          aaaa -> a
                         aaabb -> aabb
                        aaabbc -> aaabbc
           aaaabbaaaaaaccaaaaa -> aabb
          aaaabbaaadaaaccaaaaa -> bbaaad

Fiddle

本文标签：

版权声明：本文标题：algorithm - Find longest substring s.t the count of each character does not exceed the number of unique characters - Stack Overf 内容由网友自发贡献，该文观点仅代表作者本人，转载请联系作者并注明出处：http://www.betaflare.com/web/1744224782a2596036.html，本站仅提供信息存储空间服务，不拥有所有权，不承担相关法律责任。如发现本站有涉嫌抄袭侵权/违法违规的内容，一经查实，本站将立刻删除。

发表评论

全部评论 0

暂无评论

编程频道|软件玩家 - 软件改变生活！

algorithm - Find longest substring s.t the count of each character does not exceed the number of unique characters - Stack Overf

2 Answers 2

O(n^2) in the n >>> # of unique characters case:

O(n) in the n >>> (# of unique characters)^2 case:

更多相关文章

java - HikariPool-1 - Failed to validate connection in Spring Boot Data Jpa with mysql - Stack Overflow

xml - Javascript open content type - Stack Overflow

javascript - ReactJS Pass key with an onClick function - Stack Overflow

javascript - How to show back and forward button on popup opened thru window.open or self.open for Chrome? - Stack Overflow

javascript - html how to send user field input as a json object - Stack Overflow

Unable to set form action via javascript (error:object doesn&#39;t support this property or method) - Stack Overflow

plugin wp seo yoast - Remove an action from an external Class

javascript - Retrieving a value from a node child process - Stack Overflow

How can I ask Git for the difference between my current working tree and the commit from which I based my current branch? - Stac

r - How to run about 100 templates (training data) through test data using monitoR for bioacoustics - Stack Overflow

html - calling javascript function using onkeyup - Stack Overflow

reactjs - Javascript react-jss hover not changing color - Stack Overflow

javascript - setInterval stop working on alert - Stack Overflow

dataframe - Performance of BigQuery API Client vs BigQuery BigFrames? - Stack Overflow

javascript - Canvas rotation doesn&#39;t work properly - Stack Overflow

Add a javascript value to a Freemarker list - Stack Overflow

javascript - What is the NS_ERROR_INVALID_POINTER error in Firefox? - Stack Overflow

r - Week start on Mondays - Stack Overflow

php - Yahoo Contact API - Stack Overflow

javascript - Using JSON instead of GeoJSON in Leaflet with AJAX - Stack Overflow

发表评论

推荐文章

javascript - jQuery autocomplete doesn&#39;t work with key value pair array - Stack Overflow

How to get Author ID outside the loop

javascript - How to Pass in Props When Using CSS Modules - Stack Overflow

JavaScript array `push` with square brackets instead of parentheses - no error? - Stack Overflow

javascript - Pretty Photo Set linking Issues - Stack Overflow

热门文章

reactjs - Next.js DisplayName not accessible for component - Stack Overflow

installation - Multiple Multisite networks on the same domain?

categories - Create category post shortcode

javascript - To what extent should I enforce a DOM element&#39;s ID&#39;s uniqueness? - Stack Overflow

plugin development - Turn off Admin Bar (Toolbar) in backend - no easy way

Join two tables using period and interval between two dates in MySQL 5 - Stack Overflow

javascript - How to run JS function only one time? - Stack Overflow

javascript - how to add editdelete buttons in each row of datatable - Stack Overflow

custom post types - Troubles with acfsave_post and WP_Query

javascript - Auto-fill input form with PHP, MySQL and jQuery - Stack Overflow

最新文章

windows设置断电重启开机后自动输入锁屏密码登录

Windows系统设置开机默认开启数字小键盘

Windows11 开机自动同步时间（开机时间不更新问题）

windows配置开机自启动软件或脚本

【Redis】Windows设置Redis为开机自启动

javascript - Using JSON instead of GeoJSON in Leaflet with AJAX - Stack Overflow

javascript - How can I run very old Chrome apps? - Stack Overflow

javascript - Angular 2 http post null Web Api Core - Stack Overflow

plugins - Auto trigger of popup

javascript - Split mocha API test in multiple files - Stack Overflow

惠普OMEN 15-CE001TX 2EF91PA参数报价

苹果新款MacBook Pro 15英寸 i732GB1TBVega Pro 20参数报价

联想Y330A-PSE L参数报价

神舟战神Z7 D6 i7-12650H16GB512GBRTX4050旗舰版参数报价

神舟战神Z7 D6 i7-12650H16GB1TBRTX4050参数报价

O(n^2) in the `n >>> # of unique characters` case:

O(n) in the `n >>> (# of unique characters)^2` case:

Unable to set form action via javascript (error:object doesn't support this property or method) - Stack Overflow

javascript - Canvas rotation doesn't work properly - Stack Overflow

javascript - jQuery autocomplete doesn't work with key value pair array - Stack Overflow

javascript - To what extent should I enforce a DOM element's ID's uniqueness? - Stack Overflow