Can I reduce API latency?

To find the most comprehensive and up-to-date information about reducing latency, we recommend reading our latency optimization best practices.

Through the API, you also have the option to optimize the generative process of the AI using the optimize_streaming_latency parameter, but this is deprecated, and we no longer recommend using it. To find out more, please see our API documentation.