[AI gateway] Request timeouts for fallback providers #19391

kodster28 · 2025-01-23T19:48:16Z

Summary

New feature, request timeouts for fallback providers.

Updated several pages in AI gateway docs + changelog entry.

hyperlint-ai

2 files reviewed, 1 total issue(s) found.

src/content/docs/ai-gateway/providers/universal.mdx

kodster28 · 2025-01-23T19:48:59Z

src/content/docs/ai-gateway/providers/universal.mdx

 You can use the Universal endpoint to contact every provider. The payload is expecting an array of message, and each message is an object with the following parameters:

 - `provider` : the name of the provider you would like to direct this message to. Can be OpenAI, workers-ai, or any of our supported providers.
 - `endpoint`: the pathname of the provider API you’re trying to reach. For example, on OpenAI it can be `chat/completions`, and for Workers AI this might be [`@cf/meta/llama-3.1-8b-instruct`](/workers-ai/models/llama-3.1-8b-instruct/). See more in the sections that are specific to [each provider](/ai-gateway/providers/).
- `authorization`: the content of the Authorization HTTP Header that should be used when contacting this provider. This usually starts with “Token” or “Bearer”.
+- `headers`:


@kathayl, I think this is accurate, but fact check me here :)

kodster28 · 2025-01-23T19:49:14Z

src/content/docs/ai-gateway/providers/universal.mdx

 - `query`: the payload as the provider expects it in their official API.

 ## cURL example

+The following example shows a simple setup with a primary model and a [fallback](/ai-gateway/configuration/fallbacks/) option.


Wanted to add more cross-links over to fallbacks page

kodster28 · 2025-01-23T19:49:25Z

src/content/glossary/ai-gateway.yaml

@@ -41,6 +41,10 @@ entries:
    general_definition: |-
      Header to [bypass caching for a specific request](/ai-gateway/configuration/caching/#skip-cache-cf-aig-skip-cache).

+  - term: cf-aig-request-timeout


Adds automatically to headers glossary page

kodster28 · 2025-01-23T19:50:08Z

src/content/docs/ai-gateway/configuration/fallbacks.mdx

+
+If that fails, then the gateway will timeout and move to the fallback `@cf/meta/llama-3.1-8b-instruct-fast` model. This model has 3000 milliseconds - determined by the request-level `cf-aig-request-timeout` value - to complete the request and provide an answer.
+
+```bash title="Request" collapse={36-50} {2,11,13-15}


If the collapsible bit here is distracting, happy to remove.

Also, switched the example a bit so it made sense to me... but I also could just be hallucinating stuff. Happy to flip it back to the original.

Co-authored-by: hyperlint-ai[bot] <154288675+hyperlint-ai[bot]@users.noreply.github.com>

cloudflare-workers-and-pages · 2025-01-23T20:29:25Z

Deploying cloudflare-docs with Cloudflare Pages

Latest commit:	`771fedb`
Status:	✅ Deploy successful!
Preview URL:	https://1647671e.cloudflare-docs-7ou.pages.dev
Branch Preview URL:	https://aig-request-timeout.cloudflare-docs-7ou.pages.dev

View logs

github-actions · 2025-01-23T20:29:41Z

Files with changes (up to 15)

Original Link	Updated Link
https://developers.cloudflare.com/ai-gateway/configuration/fallbacks/	https://aig-request-timeout.cloudflare-docs-7ou.pages.dev/ai-gateway/configuration/fallbacks/
https://developers.cloudflare.com/ai-gateway/providers/universal/	https://aig-request-timeout.cloudflare-docs-7ou.pages.dev/ai-gateway/providers/universal/

[AI gateway] Request timeouts for fallback providers

1ac75cb

kodster28 requested review from kathayl, G4brym, mchenco, daisyfaithauma and a team as code owners January 23, 2025 19:48

hyperlint-ai bot reviewed Jan 23, 2025

View reviewed changes

src/content/docs/ai-gateway/providers/universal.mdx Outdated Show resolved Hide resolved

kodster28 commented Jan 23, 2025

View reviewed changes

kodster28 and others added 2 commits January 23, 2025 13:50

bearer token inconsistency

2e0d6ba

Update src/content/docs/ai-gateway/providers/universal.mdx

771fedb

Co-authored-by: hyperlint-ai[bot] <154288675+hyperlint-ai[bot]@users.noreply.github.com>

github-actions bot assigned daisyfaithauma, G4brym, kathayl and mchenco Jan 23, 2025

github-actions bot added product:ai-gateway AI Gateway: https://developers.cloudflare.com/ai-gateway/ size/m labels Jan 23, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[AI gateway] Request timeouts for fallback providers #19391

[AI gateway] Request timeouts for fallback providers #19391

kodster28 commented Jan 23, 2025

hyperlint-ai bot left a comment

kodster28 Jan 23, 2025

kodster28 Jan 23, 2025

kodster28 Jan 23, 2025

kodster28 Jan 23, 2025

cloudflare-workers-and-pages bot commented Jan 23, 2025

github-actions bot commented Jan 23, 2025


		If that fails, then the gateway will timeout and move to the fallback `@cf/meta/llama-3.1-8b-instruct-fast` model. This model has 3000 milliseconds - determined by the request-level `cf-aig-request-timeout` value - to complete the request and provide an answer.

		```bash title="Request" collapse={36-50} {2,11,13-15}

[AI gateway] Request timeouts for fallback providers #19391

Are you sure you want to change the base?

[AI gateway] Request timeouts for fallback providers #19391

Conversation

kodster28 commented Jan 23, 2025

Summary

hyperlint-ai bot left a comment

Choose a reason for hiding this comment

kodster28 Jan 23, 2025

Choose a reason for hiding this comment

kodster28 Jan 23, 2025

Choose a reason for hiding this comment

kodster28 Jan 23, 2025

Choose a reason for hiding this comment

kodster28 Jan 23, 2025

Choose a reason for hiding this comment

cloudflare-workers-and-pages bot commented Jan 23, 2025

Deploying cloudflare-docs with Cloudflare Pages

github-actions bot commented Jan 23, 2025