API
Discussion about using the OpenAI API, including authentication, rate limits, SDKs, and integration.
Our company network requires all outbound traffic to go through a corporate proxy. I'm struggling to configure the OpenAI Python SDK to work with it.
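One common answer for the proxy question: the OpenAI Python SDK (v1+) sits on top of httpx, which honors the standard proxy environment variables, so setting them before the client is constructed is often enough. A minimal sketch, where `proxy.corp.example:8080` is a placeholder for the real proxy host and port:

```python
import os

# The SDK's underlying httpx client reads HTTPS_PROXY / HTTP_PROXY.
# These values are placeholders for your corporate proxy address.
os.environ["HTTPS_PROXY"] = "http://proxy.corp.example:8080"
os.environ["HTTP_PROXY"] = "http://proxy.corp.example:8080"

# With the real SDK it would then be simply:
# from openai import OpenAI
# client = OpenAI()  # picks up the proxy via httpx

print(os.environ["HTTPS_PROXY"])
```

For proxies that need custom CA bundles or per-client settings, passing an explicitly configured `httpx.Client` via the SDK's `http_client` parameter is the other documented route.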
I'm trying to understand how prompt caching affects my token usage and billing. My system prompt is ~4000 tokens and I'm making thousands of calls per …
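For a snippet like the one above, the savings can be estimated with simple arithmetic, assuming the documented caching behavior: prompt prefixes of 1024+ tokens are cached automatically and cached input tokens are billed at a discount (50% on the gpt-4o family). The price and call count below are illustrative placeholders, not current list prices:

```python
# Back-of-envelope estimate of prompt-caching savings.
# Assumptions (hypothetical figures, check current pricing):
#   - 4000-token system prompt reused across every call
#   - cached input tokens billed at a 50% discount
SYSTEM_PROMPT_TOKENS = 4000
CALLS = 10_000
PRICE_PER_1M_INPUT = 2.50   # hypothetical $ per 1M input tokens
CACHED_DISCOUNT = 0.5       # cached tokens billed at half price

full_cost = SYSTEM_PROMPT_TOKENS * CALLS / 1e6 * PRICE_PER_1M_INPUT

# First call pays full price and warms the cache; later calls hit it.
cached_cost = (SYSTEM_PROMPT_TOKENS / 1e6 * PRICE_PER_1M_INPUT
               + SYSTEM_PROMPT_TOKENS * (CALLS - 1) / 1e6
               * PRICE_PER_1M_INPUT * CACHED_DISCOUNT)

print(f"uncached: ${full_cost:.2f}, cached: ${cached_cost:.2f}")
```

The estimate only covers the shared prefix; per-call user content is billed at the full input rate either way.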
We're running OpenAI API calls in a production microservices architecture and need to implement key rotation. Currently we have a single API key hardcoded …
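A rotation-friendly pattern (sketch, not a full solution): never hardcode the key; resolve it at call time from the environment or a secrets manager, so a rotated key takes effect without a redeploy. The usual rollout is a two-key overlap window: issue the new key, update the secret everywhere, then revoke the old one.

```python
import os

def current_api_key():
    # OPENAI_API_KEY is the variable the official SDK reads by default.
    # Resolving it per call (rather than at import time) means a rotated
    # secret is picked up by running services without a restart, provided
    # the process environment / secret mount is refreshed.
    key = os.environ.get("OPENAI_API_KEY")
    if not key:
        raise RuntimeError("OPENAI_API_KEY is not set")
    return key

os.environ["OPENAI_API_KEY"] = "sk-placeholder"  # stand-in for a real key
print(current_api_key())
```

In practice the environment lookup would be swapped for a secrets-manager client; the shape stays the same.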
I'm using streaming with the Chat Completions API and noticing that some chunks are being dropped, resulting in incomplete responses. This happens about …
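Worth checking before blaming the network: streamed chunks legitimately arrive with a delta that has no `content` field (the role-only first chunk, tool-call deltas, the empty final delta), and accumulation code that assumes every delta carries text will look like it is "dropping" chunks. A defensive accumulation sketch over mock chunks in the wire shape:

```python
# Mock chunks mimicking the streaming wire format; note that not every
# delta carries a "content" field.
chunks = [
    {"choices": [{"delta": {"role": "assistant"}}]},  # role-only first chunk
    {"choices": [{"delta": {"content": "Hello"}}]},
    {"choices": [{"delta": {"content": ", world"}}]},
    {"choices": [{"delta": {}}]},                     # empty final delta
]

parts = []
for chunk in chunks:
    delta = chunk["choices"][0]["delta"]
    content = delta.get("content")  # may legitimately be absent or None
    if content:
        parts.append(content)

text = "".join(parts)
print(text)
```

If the accumulator is correct and responses are still incomplete, the next suspects are read timeouts and intermediaries (proxies, load balancers) buffering or closing long-lived SSE connections.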
With the new Responses API, I wanted to document my migration experience from Chat Completions. Key differences: 1. Input format is simplified, no …
Tip for anyone running OpenAI API calls in production: always log the request ID from response headers: `response = client.chat.completions.wit…`
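The snippet above is cut off; the pattern it refers to is the SDK's `.with_raw_response` accessor, which exposes the HTTP headers alongside the parsed body. OpenAI returns the request ID in the `x-request-id` header. A sketch of the extraction logic only (no network call):

```python
def extract_request_id(headers):
    # Header names are case-insensitive; normalize before the lookup.
    return {k.lower(): v for k, v in headers.items()}.get("x-request-id")

# With the real SDK it looks roughly like:
# raw = client.chat.completions.with_raw_response.create(model=..., messages=...)
# request_id = raw.headers.get("x-request-id")
# completion = raw.parse()  # the usual parsed completion object

print(extract_request_id({"X-Request-Id": "req_abc123"}))
```

Logging that ID next to your own correlation IDs makes support tickets and latency investigations much faster.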
I just finished processing 1.2 million legal documents through the Batch API and wanted to share some lessons learned. What worked: the Batch API's 50% …
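For anyone starting a similar run: the Batch API takes a JSONL file where each line is one request with a unique `custom_id`, and results come back keyed by that ID. A sketch of building the input file for a document batch (model and prompt are placeholders):

```python
import json

# Placeholder corpus; in the real run each value is a document's text.
docs = {
    "doc-001": "First contract text...",
    "doc-002": "Second contract text...",
}

lines = []
for doc_id, text in docs.items():
    lines.append(json.dumps({
        "custom_id": doc_id,           # used to match results back to inputs
        "method": "POST",
        "url": "/v1/chat/completions",
        "body": {
            "model": "gpt-4o-mini",    # placeholder model choice
            "messages": [
                {"role": "user", "content": f"Summarize:\n{text}"},
            ],
        },
    }))

batch_jsonl = "\n".join(lines)
print(len(lines), "requests prepared")
```

The resulting JSONL is uploaded as a file and referenced when creating the batch; stable `custom_id`s are what make re-driving failed lines painless at the 1M+ scale the post describes.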
Proper retry logic is essential for production OpenAI API usage. Here's my battle-tested implementation:

```python
import time
import random
from openai …
```
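The post's code is truncated, but the shape it describes is exponential backoff with full jitter on retryable errors. A self-contained sketch of that shape, with a stand-in exception and a flaky function replacing the real `openai.RateLimitError` and API call:

```python
import time
import random

class RateLimitError(Exception):
    """Stand-in for openai.RateLimitError in this sketch."""

def with_retries(fn, max_attempts=5, base_delay=0.01):
    for attempt in range(max_attempts):
        try:
            return fn()
        except RateLimitError:
            if attempt == max_attempts - 1:
                raise  # out of attempts: surface the error
            # Full jitter: sleep a random amount up to the exponential cap.
            time.sleep(random.uniform(0, base_delay * 2 ** attempt))

calls = {"n": 0}
def flaky():
    # Simulates an endpoint that 429s twice, then succeeds.
    calls["n"] += 1
    if calls["n"] < 3:
        raise RateLimitError("429")
    return "ok"

result = with_retries(flaky)
print(result)
```

In production you would also honor the `Retry-After` header when present and cap the total retry budget so a saturated upstream does not pile up work.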
After migrating from gpt-3.5-turbo to gpt-4o-mini, I'm seeing a massive spike in 429 rate limit errors even though my request volume hasn't changed. My s…
When using function calling with GPT-4o, the model sometimes returns multiple tool_calls in a single response for parallel execution. I'm struggling with …
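The key constraint with parallel function calling: one assistant message can carry several `tool_calls`, and each needs its own `"tool"`-role reply whose `tool_call_id` matches the originating call. A dispatch sketch over mock tool calls in the wire format (`get_weather` is a hypothetical tool):

```python
import json

def get_weather(city):
    return f"20C in {city}"  # placeholder implementation

TOOLS = {"get_weather": get_weather}  # name -> callable registry

# Mock tool_calls as they appear on an assistant message.
tool_calls = [
    {"id": "call_1", "function": {"name": "get_weather",
                                  "arguments": '{"city": "Paris"}'}},
    {"id": "call_2", "function": {"name": "get_weather",
                                  "arguments": '{"city": "Oslo"}'}},
]

tool_messages = []
for call in tool_calls:
    fn = TOOLS[call["function"]["name"]]
    args = json.loads(call["function"]["arguments"])  # arguments arrive as a JSON string
    tool_messages.append({
        "role": "tool",
        "tool_call_id": call["id"],  # must match the originating call
        "content": fn(**args),
    })

print([m["tool_call_id"] for m in tool_messages])
```

All of the tool messages are then appended after the assistant message (which must itself be echoed back with its `tool_calls`) before the next model request; omitting any one of them is the usual cause of API errors here.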