Introduction
Veo-3.1 is Google’s latest generation video generation model series, supporting high-quality video content generation through text descriptions or image inputs. The entire series is fully compatible with OpenAI Chat Completions API format, making it easy for developers to integrate quickly.Core Features
Text-to-Video
Generate videos from pure text descriptions, supporting complex scenes and action descriptions
Image-to-Video
Generate transition videos or animation effects based on 1-2 reference images
OpenAI Compatible
Uses standard Chat Completions API format, no need to learn new interfaces
Streaming Response
Supports streaming returns for real-time generation progress and results
Model Series
Veo-3.1 provides multiple model variants to meet different scenario requirements:Standard Models
veo-3.1 - Standard
veo-3.1 - Standard
Price: $0.25 / requestFeatures:
- Excellent overall performance
- Suitable for most scenarios
- Stable generation quality
veo-3.1-fast - Fast
veo-3.1-fast - Fast
Price: $0.15 / requestFeatures:
- Faster generation speed
- High cost-effectiveness
- Suitable for rapid iteration
Image-to-Video Models (Support Image Input)
veo-3.1-fl - Image-to-Video Standard
veo-3.1-fl - Image-to-Video Standard
Price: $0.25 / requestFeatures:
- Supports 1-2 image inputs
- Can generate image transition animations
- Bring static images to life
veo-3.1-fast-fl - Image-to-Video Fast
veo-3.1-fast-fl - Image-to-Video Fast
Price: $0.15 / requestFeatures:
- Fast image-to-video generation
- Supports image input
- High cost-effectiveness
Landscape Models
veo-3.1-landscape - Landscape Standard
veo-3.1-landscape - Landscape Standard
Price: $0.25 / requestFeatures:
- Optimized for landscape scenarios
- 16:9 or wider aspect ratio
- Suitable for professional production
veo-3.1-landscape-fast - Landscape Fast
veo-3.1-landscape-fast - Landscape Fast
Price: $0.15 / requestFeatures:
- Fast landscape video generation
- Maintains picture quality
- Efficient batch processing
veo-3.1-landscape-fl - Landscape Image-to-Video
veo-3.1-landscape-fl - Landscape Image-to-Video
Price: $0.25 / requestFeatures:
- Landscape + image input support
- Professional-grade output
- Excellent detail preservation
veo-3.1-landscape-fast-fl - Landscape Image-to-Video Fast
veo-3.1-landscape-fast-fl - Landscape Image-to-Video Fast
Price: $0.15 / requestFeatures:
- Fast + image input + landscape
- Triple feature combination
- Extremely high cost-effectiveness
Price Comparison
| Model Name | Price | Speed | Aspect Ratio | Image Support | Recommended Use |
|---|---|---|---|---|---|
| veo-3.1 | $0.25 | Standard | General | ❌ | Text-to-video general scenarios |
| veo-3.1-fast | $0.15 | Fast | General | ❌ | Text-to-video quick testing |
| veo-3.1-fl | $0.25 | Standard | General | ✅ | Image-to-video standard quality |
| veo-3.1-fast-fl | $0.15 | Fast | General | ✅ | Image-to-video fast processing |
| veo-3.1-landscape | $0.25 | Standard | Landscape | ❌ | Landscape text-to-video |
| veo-3.1-landscape-fast | $0.15 | Fast | Landscape | ❌ | Landscape text-to-video fast |
| veo-3.1-landscape-fl | $0.25 | Standard | Landscape | ✅ | Landscape image-to-video |
| veo-3.1-landscape-fast-fl | $0.15 | Fast | Landscape | ✅ | Landscape image-to-video fast |
Generation Methods
Text-to-Video
Generate video content through natural language descriptions:Image-to-Video
Generate video content based on reference images:Core Parameters
Select the Veo-3.1 model variant to use
Message array containing user input, supporting text and image types
Whether to enable streaming response, recommended to set to
true for real-time progressNumber of videos to generate, range 1-4, can generate multiple different results simultaneously
Billing Information
Important: Veo-3.1 models require pay-per-use tokens, not pay-as-you-go tokens. Please create a “pay-per-use” type token in the Console.
Pay-per-request: Each successful API call is charged at the selected model’s unit price, failed requests are not charged
Billing Rules
- Single Request: Charged at model price (0.25)
- Multiple Results: Setting
n=2generates 2 videos, charged for 2 requests - Failed Retries: Failed generations are not charged, can retry for free
- Streaming Response: No additional charge
Technical Advantages
High-Quality Output
Advanced video generation algorithms ensure smooth and natural footage
Fast Response
Fast series models can return results in seconds
Flexible Control
Supports text and image combination input for precise content control
Standard Interface
Fully compatible with OpenAI API, zero learning curve
Batch Processing
Supports generating multiple results simultaneously through n parameter
Real-time Progress
Streaming response provides real-time generation status feedback
Quick Start
Quick Start Guide
View complete quick start tutorial with detailed code examples
Use Cases
- Content Creation
- Professional Production
- Batch Generation
- Social media short videos
- Marketing clips
- Product showcase animations
- Educational training content