Spaces: yym68686/uni-api
yym68686 committed on Nov 5, 2024

Commit d30f0dd

1 Parent(s): 782dde1

💻 Code: Change API key cooldown to be off by default. A channel with only one API key never cools down under any circumstances. When API_KEY_COOLDOWN_PERIOD is 0, cooldown is disabled. Once cooldown is enabled, any error with an API key triggers a cooldown for that key.
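The rule in the commit message can be sketched as a small predicate. This is a minimal illustration rather than the project's code: `should_cool_down` is a hypothetical name, and its two parameters stand in for the configured API_KEY_COOLDOWN_PERIOD and the number of API keys in the channel.

```python
def should_cool_down(cooling_time: float, api_key_count: int) -> bool:
    """Hypothetical helper mirroring the rule in the commit message.

    Cooldown applies only when a positive period is configured and the
    channel has more than one API key to fall back on.
    """
    return cooling_time > 0 and api_key_count > 1


# Period 0 (the new default): cooldown is off, however many keys exist.
print(should_cool_down(0, 3))   # False
# Single-key channel: never cooled down, even with a period configured.
print(should_cool_down(60, 1))  # False
# Period configured and several keys: an error cools the key down.
print(should_cool_down(60, 3))  # True
```

Note that the HTTP status code is not an input: once cooldown is enabled, the commit no longer limits it to 429 responses.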

Files changed (3)

README.md CHANGED

@@ -92,7 +92,7 @@ providers:
     preferences:
       API_KEY_RATE_LIMIT: 15/min # Each API Key can request up to 15 times per minute, optional. The default is 999999/min.
       # API_KEY_RATE_LIMIT: 15/min,10/day # Supports multiple frequency constraints
-      API_KEY_COOLDOWN_PERIOD: 60 # Each API Key will be cooled down for 60 seconds after encountering a 429 error. Optional, the default is 60 seconds.
+      API_KEY_COOLDOWN_PERIOD: 60 # Each API Key will be cooled down for 60 seconds after encountering a 429 error. Optional, the default is 0 seconds. When set to 0, the cooling mechanism is not enabled.
   - provider: vertex
     project_id: gen-lang-client-xxxxxxxxxxxxxx # Description: Your Google Cloud project ID. Format: String, usually composed of lowercase letters, numbers, and hyphens. How to obtain: You can find your project ID in the project selector of the Google Cloud Console.

README_CN.md CHANGED

@@ -92,7 +92,7 @@ providers:
     preferences:
       API_KEY_RATE_LIMIT: 15/min # Maximum number of requests per minute for each API Key, optional. The default is 999999/min
       # API_KEY_RATE_LIMIT: 15/min,10/day # Supports multiple frequency constraints
-      API_KEY_COOLDOWN_PERIOD: 60 # Cooldown time for each API Key after it encounters a 429 error, in seconds, optional. The default is 60 seconds
+      API_KEY_COOLDOWN_PERIOD: 60 # Cooldown time for each API Key after it encounters a 429 error, in seconds, optional. The default is 0 seconds. When set to 0 seconds, the cooldown mechanism is not enabled.
   - provider: vertex
     project_id: gen-lang-client-xxxxxxxxxxxxxx # Description: Your Google Cloud project ID. Format: String, usually composed of lowercase letters, numbers, and hyphens. How to obtain: You can find your project ID in the project selector of the Google Cloud Console.

main.py CHANGED

@@ -1013,9 +1013,11 @@ class ModelRequestHandler:
                     num_matching_providers = len(matching_providers)
                     index = 0
-                if status_code == 429:
+                cooling_time = safe_get(provider, "preferences", "API_KEY_COOLDOWN_PERIOD", default=0)
+                api_key_count = provider_api_circular_list[channel_id].get_items_count()
+                if cooling_time > 0 and api_key_count > 1:
                     current_api = await provider_api_circular_list[channel_id].after_next_current()
-                    await provider_api_circular_list[channel_id].set_cooling(current_api, cooling_time=safe_get(provider, "preferences", "API_KEY_COOLDOWN_PERIOD", default=60))
+                    await provider_api_circular_list[channel_id].set_cooling(current_api, cooling_time=cooling_time)
                 logger.error(f"Error {status_code} with provider {channel_id}: {error_message}")
                 if is_debug:
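
To show how the guard in this hunk interacts with key rotation, here is a minimal, synchronous sketch. It is not the project's implementation: `KeyRing`, `next`, and `on_error` are hypothetical names, the injectable `clock` exists only to make the behaviour easy to test, and the real circular list in main.py is asynchronous and may behave differently.

```python
import time
from typing import Callable, Dict, List, Optional


class KeyRing:
    """Hypothetical round-robin key list with per-key cooldown."""

    def __init__(self, keys: List[str], clock: Callable[[], float] = time.monotonic):
        self.keys = list(keys)
        self.clock = clock
        self.cooling_until: Dict[str, float] = {}
        self.index = 0

    def set_cooling(self, key: str, cooling_time: float) -> None:
        # Mark the key unusable until the cooldown period has elapsed.
        self.cooling_until[key] = self.clock() + cooling_time

    def next(self) -> Optional[str]:
        # Try each key at most once, skipping those still cooling down.
        for _ in range(len(self.keys)):
            key = self.keys[self.index]
            self.index = (self.index + 1) % len(self.keys)
            if self.cooling_until.get(key, float("-inf")) <= self.clock():
                return key
        return None  # every key is currently cooling down


def on_error(ring: KeyRing, failed_key: str, cooling_time: float) -> None:
    # Same guard as the diff: a positive period and more than one key.
    if cooling_time > 0 and len(ring.keys) > 1:
        ring.set_cooling(failed_key, cooling_time)
```

With two keys and a 60-second period, a failed key is skipped until the period has passed. With a single key, or with a period of 0, `on_error` does nothing, so the channel is never left without a usable key.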