Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.laozhang.ai/llms.txt

Use this file to discover all available pages before exploring further.

Current Model Recommendations (Updated May 2026)

LaoZhang API supports 200+ mainstream AI models. This page provides detailed model information, pricing, and usage guidance. Covering the latest models including GPT-5.5, Claude Opus 4.7, Claude Sonnet 4.6, Gemini 3.1, and Veo 3.1.
Enterprise-Grade Professional AI Model API Gateway All models are sourced directly from official providers with competitive pricing, pay-as-you-go billing, and long-term reliable service.
Below are the currently available popular models. For the complete model list and real-time pricing, visit LaoZhang API Console Pricing Page.

Model Categories

🤖 OpenAI Series

GPT-5.5 Series (Latest) 🔥

Model NameModel IDContextFeaturesRecommended Use
GPT-5.5 ⭐⭐gpt-5.51MOpenAI’s latest flagship modelProfessional tasks, complex coding, enterprise

GPT-5.1 Series (November 2025) 🔥

Model NameModel IDContextFeaturesRecommended Use
GPT-5.1 ⭐⭐gpt-5.1128KStrong performance, balancedGeneral advanced tasks
GPT-5.1-Codex ⭐⭐gpt-5.1-codex128KCoding specializedProgramming development
GPT-5.1-Codex Highgpt-5.1-codex-high128KHigh performance codingComplex coding tasks
GPT-5.1-Codex Minigpt-5.1-codex-mini128KLightweight codingQuick coding, completion

GPT-5 Series (August 2025)

Model NameModel IDContextFeaturesRecommended Use
GPT-5gpt-5128KFirst gen GPT-5General tasks
GPT-5 Progpt-5-pro128KProfessional versionEnterprise applications
GPT-5 Minigpt-5-mini128KLightweight efficientCost-sensitive scenarios
GPT-5 Nanogpt-5-nano128KUltra-lightweightBatch processing

Reasoning Models

Model NameModel IDContextFeaturesRecommended Use
o3-pro ⭐⭐o3-pro200KStrongest reasoningTop-tier reasoning
o3o3200KReasoning modelComplex reasoning
o4-mini ⭐⭐o4-mini200KLightweight reasoningProgramming tasks

GPT-4 Series

Model NameModel IDContextFeaturesRecommended Use
GPT-4.1gpt-4.1128KFast speedGeneral applications
GPT-4.1 Minigpt-4.1-mini128KAffordable lightweightCost-sensitive
GPT-4.1 Nanogpt-4.1-nano128KUltra-low-costHigh-volume
GPT-4ogpt-4o128KBalanced multimodalGeneral scenarios
GPT-4o Minigpt-4o-mini128KLightweight fastQuick responses
GPT-4o Minigpt-4o-mini128KLightweight, fast, compatibleDaily conversations, batch tasks

Image Generation

Model NameModel IDFeaturesRecommended Use
GPT-Image-1.5 ⭐⭐gpt-image-1.5Latest version, higher qualityProfessional design
GPT-Image-1gpt-image-1High cost-performanceGeneral image generation
GPT-Image-1 Minigpt-image-1-miniLightweight fastQuick generation
Flux Kontext Max ⭐⭐flux-kontext-maxHighest qualityProfessional design
Flux Kontext Proflux-kontext-proProfessional qualityCommercial design
DALL·E 3dall-e-3Classic generationStandard tasks

Video Generation

Model NameModel IDFeaturesRecommended Use
Sora 2 Prosora-2-pro (Async API only)HD video ($0.8/call)High-quality video
Sora 2 Charactersora-2-characterCharacter generationCharacter animation
Veo 3.1 ⭐⭐veo-3.1Latest version, improvedStandard video
Veo 3.1 Fastveo-3.1-fastQuick versionFast prototyping
Veo 3.1 FLveo-3.1-flFrame-level controlFine control
Veo 3 Proveo3-proHighest qualityProfessional video

Image Generation Models

Model NameModel IDSupported SizesFeaturesPricing
GPT-Image-1gpt-image-11024×1024 etc.High-value image generationSee documentation
Sora Imagesora_imageMultiple sizesReverse-engineered modelSee documentation
GPT-4o Imagegpt-4o-imageMultiple sizesConversational image generationSee documentation
DALL·E 3dall-e-31024×1024 etc.Classic image generationBilled by size
Image Generation Testing Tool Visit imagen.laozhang.ai to experience various image generation models.Detailed documentation:

🎭 Claude Series (Anthropic)

Claude Opus 4.7 (Most Intelligent) 🔥

Model NameModel IDContextFeaturesRecommended Use
Claude Opus 4.7 ⭐⭐claude-opus-4-71MAnthropic’s most capable current modelTop tasks, enterprise, coding agents
Claude Opus 4.7 Thinking ⭐⭐claude-opus-4-7-thinking1MDeep reasoning modeComplex reasoning and long workflows

Claude Sonnet / Haiku Series

Model NameModel IDContextFeaturesRecommended Use
Claude Sonnet 4.6 ⭐⭐claude-sonnet-4-61MBalanced speed, cost, and intelligenceCode generation, analysis, long text
Claude Sonnet 4.6 Thinkingclaude-sonnet-4-6-thinking1MReasoning modeComplex reasoning
Claude Haiku 4.5claude-haiku-4-5200KLightweight fastQuick response

Claude 4.5 Series (Classic High Performance)

Model NameModel IDContextFeaturesRecommended Use
Claude Opus 4.5claude-opus-4-5200KClassic high-performance versionHigh-quality analysis and coding
Claude Opus 4.5 Thinkingclaude-opus-4-5-thinking200KChain-of-thought modeDeep analysis
Claude Sonnet 4.5claude-sonnet-4-5200KStable coding versionDaily development

Claude 4 Series (May 2025)

Model NameModel IDContextFeaturesRecommended Use
Claude 4 Sonnetclaude-sonnet-4200KStable versionCode generation
Claude 4.1 Opusclaude-opus-4-1200KEnhanced versionHigh-demand tasks

Claude 3.7 Series (February 2025)

Model NameModel IDContextFeaturesRecommended Use
Claude 3.7 Sonnetclaude-3-7-sonnet-latest200KLegacy compatibilityGeneral scenarios

Claude 3.5 Series (Classic)

Model NameModel IDContextFeaturesRecommended Use
Claude 3.5 Sonnetclaude-3-5-sonnet-latest200KBalanced performanceGeneral scenarios
Claude 3.5 Haikuclaude-3-5-haiku-latest200KLightweight fastDaily tasks

🌟 Google Gemini Series

Gemini 3.1 / 3 Series (Latest) 🔥

Model NameModel IDContextFeaturesRecommended Use
Gemini 3.1 Pro Preview ⭐⭐gemini-3.1-pro-preview1MLatest Pro preview with strong tool and agent capabilitiesAdvanced tasks, long text, coding agents
Gemini 3.1 Pro Preview Custom Toolsgemini-3.1-pro-preview-customtools1MOptimized for custom tools and bash workflowsAgent tool use
Gemini 3 Flash Preview ⭐⭐gemini-3-flash-preview1MFast multimodal modelQuick response
Gemini 3.1 Flash-Lite Previewgemini-3.1-flash-lite-preview1MLighter fast modelHigh-volume, lower-cost tasks
Gemini 3 Pro Image Previewgemini-3-pro-image-preview-Image generationImage tasks

Gemini 2.5 Series (2025)

Model NameModel IDContextFeaturesRecommended Use
Gemini 2.5 Progemini-2.5-pro2MCoding advantage, multimodalProduction env
Gemini 2.5 Flashgemini-2.5-flash1MFast speed, low costQuick response
Gemini 2.5 Flash Imagegemini-2.5-flash-image-Image generationImage tasks
Gemini 2.5 Computer Usegemini-2.5-computer-use-preview-10-2025-Computer use previewAI agents, automation

Gemini 2.0 Series

Model NameModel IDContextFeaturesRecommended Use
Gemini 2.0 Flashgemini-2.0-flash-0011MExperimentalEarly access

Gemini 1.5 Series (Historical Compatibility)

Gemini 1.5 is no longer recommended for new integrations. Prefer gemini-3.1-pro-preview, gemini-3-flash-preview, or gemini-2.5-pro.

🚀 xAI Grok Series

Model NameModel IDFeaturesRecommended Use
Grok 4grok-4 / grok-4-0709Latest official versionGeneral tasks
Grok 4 Fast Reasoninggrok-4-fast-reasoningFast reasoningQuick reasoning
Grok 4 Fastgrok-4-fastSpeed optimizedQuick response
Grok 3grok-3-latestStable versionDaily use
Grok 3 DeepSearchgrok-3-deepsearchDeep search, per-callWeb search
Grok 3 Minigrok-3-mini-latestSmall with reasoningLightweight tasks

🔍 DeepSeek Series

Model NameModel IDContextFeaturesRecommended Use
DeepSeek V3.2 Expdeepseek-v3.2-exp128KExperimental latestTest new features
DeepSeek V3.1deepseek-v3-1-250821128KThink/Non-Think dual modeReasoning, programming
DeepSeek V3deepseek-v3128KStrong capabilityGeneral scenarios
DeepSeek R1deepseek-r1 / deepseek-r1-052864KReasoning modelMath, reasoning

🐘 Chinese Models

Alibaba Qwen QwQ Series (Reasoning) 🔥

Model NameModel IDContextFeaturesRecommended Use
QwQ Plus ⭐⭐qwq-plus / qwq-plus-latest32KLatest reasoning modelComplex reasoning
QwQ 72B Previewqwq-72b-preview32KPreview versionTest features

Alibaba Qwen Coder 3 Series (Coding) 🔥

Model NameModel IDParametersFeaturesRecommended Use
Qwen3 Coder 480B ⭐⭐qwen3-coder-480b-a35b-instruct480B (35B active)Large coding modelComplex coding
Qwen3 Coder Plusqwen3-coder-plus-Enhanced codingStandard coding

Other Chinese Models

Model NameModel IDContextFeatures
Kimi K2 Officialkimi-k2-250711200KStable, reliable
Llama 4 Maverickllama-4-maverick-Latest open source
SeeKDream 4.5 ⭐⭐seedream-4-5-251128-Video/image generation
Doubao 1.5 Vision ProDoubao-1.5-vision-pro-32k32KMultimodal
Gemma 3 12Bgemma-3-12b-Google open source
DeepSeek V3deepseek-v3128KStrong overall capabilitiesGeneral scenarios
DeepSeek Chatdeepseek-chat128KChat-optimized versionChat applications
DeepSeek Coderdeepseek-coder128KCode-specialized modelProgramming tasks

🐘 Chinese Language Models

Alibaba Qwen Series

Model NameModel IDContext LengthFeatures
Qwen Maxqwen-max32KStrongest version
Qwen Plusqwen-plus32KEnhanced version
Qwen Turboqwen-turbo32KFast version
Qwen 2.5qwen-2.5-72b128KOpen-source large model

Moonshot Kimi Series

Model NameModel IDContext LengthFeatures
Kimi K2 Officialkimi-k2-250711200KOfficial partnership, strong stability

Other Chinese Models

Model NameModel IDFeatures
ERNIE 4.0ernie-4.0Baidu’s latest model
GLM-4glm-4Tsinghua-based model
Spark 3.5spark-3.5iFlytek’s latest version
MiniMaxminimax-abab6.5Strong overall capabilities

💰 Pricing Information

Billing Method

  • Pay-as-you-go: Charged based on actual token usage
  • No minimum charge: Use what you pay for, balance never expires
  • Real-time deduction: Fees deducted immediately after each call

Pricing Advantage

  • Direct from official sources with competitive rates
  • Volume discounts available - contact support for bulk pricing
  • New users receive $0.5 free trial credit

View Real-time Pricing

Visit LaoZhang API Console Pricing Page to view the latest prices for all models.

🛠️ Usage Recommendations

Model Selection Guide

Programming Development
  • Primary: GPT-5.5, Claude Opus 4.7, Claude Sonnet 4.6, Qwen3 Coder 480B
  • Alternatives: GPT-5.1-Codex High, DeepSeek V3.1, Gemini 3.1 Pro Preview, o4-mini
Content Creation
  • Primary: GPT-5.5, Claude Opus 4.7, Claude Sonnet 4.6
  • Alternatives: GPT-5.1, Gemini 3.1 Pro Preview, Claude Opus 4.7, Kimi K2 Official
Quick Response
  • Primary: Gemini 3 Flash Preview, Claude Haiku 4.5, Grok 4 Fast
  • Alternatives: Gemini 2.5 Flash, GPT-4.1 Nano, GPT-4o Mini
Image Generation
  • Stability priority: GPT-Image-1
  • Quality priority: DALL·E 3
  • Cost-effective: Sora Image, GPT-4o Image
Long Text Processing
  • Primary: Gemini 3.1 Pro Preview, Claude Opus 4.7
  • Alternatives: Claude Sonnet 4.6, GPT-5.5, Gemini 2.5 Pro

Cost Optimization Tips

  1. Tiered Usage: Use cheaper models for simple tasks, premium models for complex ones
  2. Test and Optimize: Test with smaller models first, then scale up as needed
  3. Batch Processing: Choose Nano or Mini versions for large volumes of similar tasks
  4. Cache and Reuse: Cache results for repeated queries
Model list is continuously updated. We promptly add newly released excellent models. For specific model needs or bulk requirements, please contact support.