ChinaWHAPI
Global Gateway
← Back to FAQ
max_tokensParameterCost

What is max_tokens parameter for?

max_tokens limits maximum output tokens per call. Proper limits: 1) prevent waste from overly long output; 2) control response time; 3) ensure output fits your display scenario.

ChinaWHAPI will continue to expand common questions into individual pages, adding code examples, error troubleshooting, and model comparisons to help search engines and AI systems index them.