Method

client.batches.create(
    model: str,
    src: BatchJobSource,
    config: Optional[CreateBatchJobConfig] = None
) -> BatchJob
Creates a batch prediction job for processing multiple requests asynchronously. Batch jobs are ideal for high-volume, non-latency-sensitive workloads.
model
string
required
The model to use for the batch job. Example: gemini-2.0-flash-001
src
BatchJobSource
required
The source of the batch job data. Set one of the following fields: gcs_uri or bigquery_uri (Vertex AI), file_name or inlined_requests (Gemini API). See Usage for examples of each.
config
CreateBatchJobConfig
Configuration options for the batch job

Response

name
string
The resource name of the batch job
state
JobState
Current state of the batch job:
  • JOB_STATE_QUEUED: Job is queued
  • JOB_STATE_PENDING: Job is pending
  • JOB_STATE_RUNNING: Job is running
  • JOB_STATE_SUCCEEDED: Job completed successfully
  • JOB_STATE_FAILED: Job failed
  • JOB_STATE_CANCELLED: Job was cancelled
create_time
string
Timestamp when the job was created
start_time
string
Timestamp when the job started processing
end_time
string
Timestamp when the job completed
model
string
The model being used for the batch job
dest
BatchJobDestination
Destination information for the output
completion_stats
object
Statistics about completed requests (Vertex AI only)
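The state values above split into in-progress and terminal groups; a minimal sketch of that classification, using the raw string names (the SDK exposes the same values as `types.JobState` enum members):

```python
# In-progress vs. terminal job states, as listed above.
# Raw string names are used for illustration.
IN_PROGRESS = {'JOB_STATE_QUEUED', 'JOB_STATE_PENDING', 'JOB_STATE_RUNNING'}
TERMINAL = {'JOB_STATE_SUCCEEDED', 'JOB_STATE_FAILED', 'JOB_STATE_CANCELLED'}

def is_terminal(state: str) -> bool:
    """Return True once a batch job has finished, successfully or not."""
    return state in TERMINAL
```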

Usage

Create Batch Job from GCS (Vertex AI)

from google import genai
from google.genai import types

client = genai.Client(
    vertexai=True,
    project='my-project',
    location='us-central1'
)

# Create batch job with GCS input/output
batch_job = client.batches.create(
    model='gemini-2.0-flash-001',
    src=types.BatchJobSource(
        gcs_uri='gs://my-bucket/input/requests.jsonl'
    ),
    config=types.CreateBatchJobConfig(
        display_name='My Batch Job',
        dest=types.BatchJobDestination(
            gcs_uri='gs://my-bucket/output/',
            format='jsonl'
        )
    )
)

print(f"Batch job created: {batch_job.name}")
print(f"State: {batch_job.state}")

Create Batch Job from File (Gemini API)

from google import genai
from google.genai import types

client = genai.Client(api_key='your-api-key')

# Upload file first
with open('requests.jsonl', 'rb') as f:
    uploaded_file = client.files.upload(file=f)

# Create batch job
batch_job = client.batches.create(
    model='gemini-2.0-flash-001',
    src=types.BatchJobSource(
        file_name=uploaded_file.name
    ),
    config=types.CreateBatchJobConfig(
        display_name='Gemini Batch Job'
    )
)

print(f"Job created: {batch_job.name}")

With Inline Requests (Gemini API)

from google.genai import types

# Create batch job with inline requests
# ('client' is the Client initialized in the earlier examples)
batch_job = client.batches.create(
    model='gemini-2.0-flash-001',
    src=types.BatchJobSource(
        inlined_requests=[
            types.InlinedRequest(
                contents='What is the capital of France?',
                config=types.GenerateContentConfig(
                    temperature=0.7,
                    max_output_tokens=100
                )
            ),
            types.InlinedRequest(
                contents='Explain quantum computing',
                config=types.GenerateContentConfig(
                    temperature=0.5
                )
            )
        ]
    )
)

print(f"Batch job with inline requests: {batch_job.name}")

Monitor Job Progress

import time
from google.genai import types

# Create batch job
batch_job = client.batches.create(
    model='gemini-2.0-flash-001',
    src='gs://my-bucket/input.jsonl',
    config=types.CreateBatchJobConfig(
        dest=types.BatchJobDestination(
            gcs_uri='gs://my-bucket/output/'
        )
    )
)

print(f"Job created: {batch_job.name}")

# Poll until completion
while batch_job.state in [
    types.JobState.JOB_STATE_QUEUED,
    types.JobState.JOB_STATE_PENDING,
    types.JobState.JOB_STATE_RUNNING
]:
    print(f"Status: {batch_job.state}")
    time.sleep(60)
    batch_job = client.batches.get(name=batch_job.name)

if batch_job.state == types.JobState.JOB_STATE_SUCCEEDED:
    print("Batch job completed successfully!")
    print(f"Output location: {batch_job.dest.gcs_uri}")
    if batch_job.completion_stats:
        print(f"Stats: {batch_job.completion_stats}")
else:
    print(f"Job failed with state: {batch_job.state}")

BigQuery Input/Output (Vertex AI)

from google.genai import types

# Create batch job with BigQuery
batch_job = client.batches.create(
    model='gemini-2.0-flash-001',
    src=types.BatchJobSource(
        bigquery_uri='bq://my-project.my_dataset.input_table'
    ),
    config=types.CreateBatchJobConfig(
        display_name='BigQuery Batch',
        dest=types.BatchJobDestination(
            bigquery_uri='bq://my-project.my_dataset.output_table',
            format='bigquery'
        )
    )
)

print(f"BigQuery batch job: {batch_job.name}")

Input Format

For file-based inputs (GCS or Files API), use JSONL format with one request per line:
{"request": {"contents": [{"parts": [{"text": "What is AI?"}]}]}}
{"request": {"contents": [{"parts": [{"text": "Explain machine learning"}]}]}}
{"request": {"contents": [{"parts": [{"text": "What is deep learning?"}]}]}}

Notes

  • Batch jobs process requests asynchronously and may take minutes to hours
  • Gemini API has lower quotas than Vertex AI for batch processing
  • Results are written to the specified destination upon completion
  • Monitor job status using batches.get
  • You can cancel running jobs using batches.cancel
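The last two notes can be combined into a small helper. A sketch, assuming `client` is an initialized `genai.Client` as in the examples above; the helper name is illustrative, and states are compared by their string names (matching the SDK's string-valued JobState members):

```python
def cancel_if_in_progress(client, job_name: str) -> bool:
    """Refresh a batch job via batches.get and cancel it if unfinished.

    Returns True if a cancel request was issued, False if the job had
    already reached a terminal state.
    """
    job = client.batches.get(name=job_name)
    if job.state in ('JOB_STATE_QUEUED', 'JOB_STATE_PENDING', 'JOB_STATE_RUNNING'):
        client.batches.cancel(name=job_name)
        return True
    return False
```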
