Prices are listed in US Dollars (USD). If you pay in a currency other than USD, the prices listed in your currency on Cloud Platform SKUs apply.
The costs for Vertex AI remain the same as they are for the legacy AI Platform and AutoML products that Vertex AI supersedes, with the following exceptions:
Vertex AI also offers more ways to optimize costs, such as the following:
For Generative AI on Vertex AI pricing information, see Pricing for Generative AI on Vertex AI.
For Vertex AI AutoML models, you pay for three main activities:
Vertex AI uses predefined machine configurations for Vertex AutoML models, and the hourly rate for these activities reflects the resource usage.
The time required to train your model depends on the size and complexity of your training data. Models must be deployed before they can provide online predictions or online explanations.
You pay for each model deployed to an endpoint, even if no prediction is made. You must undeploy your model to stop incurring further charges. Models that are not deployed or have failed to deploy are not charged.
You pay only for compute hours used; if training fails for any reason other than a user-initiated cancellation, you are not billed for the time. You are charged for training time if you cancel the operation.
Select a model type below for pricing information.
Image data
Operation | Price (classification) (USD) | Price (object detection) (USD) |
|---|---|---|
Training | $3.465 / 1 hour | $3.465 / 1 hour |
Training (Edge on-device model) | $18.00 / 1 hour | $18.00 / 1 hour |
Deployment and online prediction | $1.375 / 1 hour | $2.002 / 1 hour |
Batch prediction | $2.222 / 1 hour | $2.222 / 1 hour |
Tabular data
Operation | Price per node hour for classification/regression | Price for forecasting |
|---|---|---|
Training | $21.252 / 1 hour | Refer to Vertex AI Forecast |
Inference | Same price as inference for custom-trained models. Vertex AI performs batch inference using 40 n1-highmem-8 machines. | Refer to Vertex AI Forecast |
Inference charges for Vertex Explainable AI
Compute associated with Vertex Explainable AI is charged at same rate as inference. However, explanations take longer to process than normal inferences, so heavy usage of Vertex Explainable AI along with auto-scaling could result in more nodes being started, which would increase inference charges.
AutoML
Stage | Pricing |
|---|---|
Prediction | 0 count to 1,000,000 count $0.20 / 1,000 count, per 1 month / account 1,000,000 count to 50,000,000 count $0.10 / 1,000 count, per 1 month / account 50,000,000 count and above $0.02 / 1,000 count, per 1 month / account |
Training | $21.252 / 1 hour |
Explainable AI | Explainability using Shapley values. Refer to Vertex AI Inference and Explanation pricing page. |
* A prediction data point is one time point in the forecast horizon. For example, with daily granularity a 7-day horizon is 7 points per each time series.
ARIMA+
Stage | Pricing |
|---|---|
Prediction | $5.00 / 1,000 count |
Training | $250.00 per TB x Number of Candidate Models x Number of Backtesting Windows* |
Explainable AI | Explainability with time series decomposition does not add any additional cost. Explainability using Shapley values is not supported. |
Refer to the BigQuery ML pricing page for additional details. Each training and prediction job incurs the cost of 1 managed pipeline run, as described in Vertex AI pricing.
* A backtesting window is created for each period in the test set. The AUTO_ARIMA_MAX_ORDER used determines the number of candidate models. It ranges from 6-42 for models with multiple time series.
Training
The tables below provide the approximate price per hour of various training configurations. You can choose a custom configuration of selected machine types. To calculate pricing, sum the costs of the virtual machines you use.
If you use Compute Engine machine types and attach accelerators, the cost of the accelerators is separate. To calculate this cost, multiply the prices in the table of accelerators below by how many machine hours of each type of accelerator you use.
Machine types
You can use Spot VMs with Vertex AI custom training. Spot VMs are billed according to Compute Engine Spot VMs pricing. There are Vertex AI custom training management fees in addition to your infrastructure usage, captured in the following tables.
You can use Compute Engine reservations with Vertex AI custom training. When using Compute Engine reservations, you're billed according to Compute Engine Pricing, including any applicable committed use discounts (CUDs). There are Vertex AI custom training management fees in addition to your infrastructure usage, captured in the following tables.
Machine type | Price (USD) |
|---|---|
g4-standard-48 | $5.1749195 / 1 hour |
g4-standard-96 | $10.349839 / 1 hour |
g4-standard-192 | $20.699678 / 1 hour |
g4-standard-384 | $41.399356 / 1 hour |
n1-standard-4 | $0.21849885 / 1 hour |
n1-standard-8 | $0.4369977 / 1 hour |
n1-standard-16 | $0.8739954 / 1 hour |
n1-standard-32 | $1.7479908 / 1 hour |
n1-standard-64 | $3.4959816 / 1 hour |
n1-standard-96 | $5.2439724 / 1 hour |
n1-highmem-2 | $0.13604845 / 1 hour |
n1-highmem-4 | $0.2720969 / 1 hour |
n1-highmem-8 | $0.5441938 / 1 hour |
n1-highmem-16 | $1.0883876 / 1 hour |
n1-highmem-32 | $2.1767752 / 1 hour |
n1-highmem-64 | $4.3535504 / 1 hour |
n1-highmem-96 | $6.5303256 / 1 hour |
n1-highcpu-16 | $0.65180712 / 1 hour |
n1-highcpu-32 | $1.30361424 / 1 hour |
n1-highcpu-64 | $2.60722848 / 1 hour |
n1-highcpu-96 | $3.91084272 / 1 hour |
a2-highgpu-1g* | $4.425248914 / 1 hour |
a2-highgpu-2g* | $8.850497829 / 1 hour |
a2-highgpu-4g* | $17.700995658 / 1 hour |
a2-highgpu-8g* | $35.401991315 / 1 hour |
a2-megagpu-16g* | $65.707278915 / 1 hour |
a3-highgpu-8g* | $101.007352832 / 1 hour |
a3-megagpu-8g* | $106.046424032 / 1 hour |
a3-ultragpu-8g* | $99.773930496 / 1 hour |
a4-highgpu-8g* | $148.212 / 1 hour |
e2-standard-4 | $0.154126276 / 1 hour |
e2-standard-8 | $0.308252552 / 1 hour |
e2-standard-16 | $0.616505104 / 1 hour |
e2-standard-32 | $1.233010208 / 1 hour |
e2-highmem-2 | $0.103959618 / 1 hour |
e2-highmem-4 | $0.207919236 / 1 hour |
e2-highmem-8 | $0.415838472 / 1 hour |
e2-highmem-16 | $0.831676944 / 1 hour |
e2-highcpu-16 | $0.455126224 / 1 hour |
e2-highcpu-32 | $0.910252448 / 1 hour |
n2-standard-4 | $0.2233714 / 1 hour |
n2-standard-8 | $0.4467428 / 1 hour |
n2-standard-16 | $0.8934856 / 1 hour |
n2-standard-32 | $1.7869712 / 1 hour |
n2-standard-48 | $2.6804568 / 1 hour |
n2-standard-64 | $3.5739424 / 1 hour |
n2-standard-80 | $4.467428 / 1 hour |
n2-highmem-2 | $0.1506661 / 1 hour |
n2-highmem-4 | $0.3013322 / 1 hour |
cloud-tpu | Pricing is determined by the accelerator type. See 'Accelerators'. |
n2-highmem-8 | $0.6026644 / 1 hour |
n2-highmem-16 | $1.2053288 / 1 hour |
n2-highmem-32 | $2.4106576 / 1 hour |
n2-highmem-48 | $3.6159864 / 1 hour |
n2-highmem-64 | $4.8213152 / 1 hour |
n2-highmem-80 | $6.026644 / 1 hour |
n2-highcpu-16 | $0.6596032 / 1 hour |
n2-highcpu-32 | $1.3192064 / 1 hour |
n2-highcpu-48 | $1.9788096 / 1 hour |
n2-highcpu-64 | $2.6384128 / 1 hour |
n2-highcpu-80 | $3.298016 / 1 hour |
c2-standard-4 | $0.2401292 / 1 hour |
c2-standard-8 | $0.4802584 / 1 hour |
c2-standard-16 | $0.9605168 / 1 hour |
c2-standard-30 | $1.800969 / 1 hour |
c2-standard-60 | $3.601938 / 1 hour |
m1-ultramem-40 | $7.237065 / 1 hour |
m1-ultramem-80 | $14.47413 / 1 hour |
m1-ultramem-160 | $28.94826 / 1 hour |
m1-megamem-96 | $12.249984 / 1 hour |
*This amount includes GPU price, since this instance type always requires a fixed number of GPU accelerators.
If you pay in a currency other than USD, the prices listed in your currency on Cloud Platform SKUs apply.
Accelerators
Machine type | Price (USD) | Vertex Management Fee |
|---|---|---|
NVIDIA_TESLA_A100 | $2.933908 / 1 hour | $0.4400862 / 1 hour |
NVIDIA_TESLA_A100_80GB | $3.92808 / 1 hour | $0.589212 / 1 hour |
NVIDIA_H100_80GB | $9.79655057 / 1 hour | $1.4694826 / 1 hour |
NVIDIA_H200_141GB | $10.708501 / 1 hour | Unavailable |
NVIDIA_H100_MEGA_80GB | $11.8959171 / 1 hour | Unavailable |
NVIDIA_TESLA_L4 | $0.644046276 / 1 hour | Unavailable |
NVIDIA_TESLA_P4 | $0.69 / 1 hour | Unavailable |
NVIDIA_TESLA_P100 | $1.679 / 1 hour | Unavailable |
NVIDIA_TESLA_T4 | $0.4025 / 1 hour | Unavailable |
NVIDIA_TESLA_V100 | $2.852 / 1 hour | Unavailable |
TPU_V2 Single (8 cores) | $5.175 / 1 hour | Unavailable |
TPU_V2 Pod (32 cores)* | $27.60 / 1 hour | Unavailable |
TPU_V3 Single (8 cores) | $9.20 / 1 hour | Unavailable |
TPU_V3 Pod (32 cores)* | $36.80 / 1 hour | Unavailable |
tpu7x-standard-4t (1 chip) | $13.80 / 1 hour | Unavailable |
If you pay in a currency other than USD, the prices listed in your currency on Cloud Platform SKUs apply.
* The price for training using a Cloud TPU Pod is based on the number of cores in the Pod. The number of cores in a pod is always a multiple of 32. To determine the price of training on a Pod that has more than 32 cores, take the price for a 32-core Pod, and multiply it by the number of cores, divided by 32. For example, for a 128-core Pod, the price is (32-core Pod price) * (128/32). For information about which Cloud TPU Pods are available for a specific region, see System Architecture in the Cloud TPU documentation.
Disks
Machine type | Price (USD) |
|---|---|
pd-standard | $0.000063014 / 1 gibibyte hour |
pd-ssd | $0.000267808 / 1 gibibyte hour |
If you pay in a currency other than USD, the prices listed in your currency on Cloud Platform SKUs apply.
You are charged for training your models from the moment when resources are provisioned for a job until the job finishes.
Warning: Your training jobs are limited by the Vertex AI quota policy. If you choose a very powerful processing cluster for your first training jobs, it's likely you will exceed your quota.
Scale tiers for predefined configurations (AI Platform Training)
You can control the type of processing cluster to use when training your model. The simplest way is to choose from one of the predefined configurations called scale tiers. Read more about scale tiers.
Machine types for custom configurations
If you use Vertex AI or select CUSTOM as your scale tier for AI Platform Training, you have control over the number and type of virtual machines to use for the cluster's master, worker and parameter servers. Read more about machine types for Vertex AI and machine types for AI Platform Training.
The cost of training with a custom processing cluster is the sum of all the machines you specify. You are charged for the total time of the job, not for the active processing time of individual machines.
For model-based metrics, charges are applied only for the prediction costs associated with the underlying autorater model. They are billed based on the input tokens that you provide in your evaluation dataset and the autorater output.
Gen AI Evaluation Service is generally available (GA). Pricing change took effect on April 14, 2025.
Metric | Pricing |
|---|---|
Pointwise | Default autorater model Gemini 2.0 Flash |
Pairwise | Default autorater model Gemini 2.0 Flash |
Computation-based metrics are charged at $0.00003 per 1k characters for input and $0.00009 per 1k characters for output. They are referred to as Automatic Metric in SKU.
Metric Name | Type |
|---|---|
Exact Match | Computation-based |
Bleu | Computation-based |
Rouge | Computation-based |
Tool Call Valid | Computation-based |
Tool Name Match | Computation-based |
Tool Parameter Key Match | Computation-based |
Tool Parameter KV Match | Computation-based |
Prices are listed in US Dollars (USD). If you pay in a currency other than USD, the prices listed in your currency on Cloud Platform SKUs apply.
Legacy model-based metrics are charged at $0.005 per 1k characters for input and $0.015 per 1k characters for output.
Metric Name | Type |
|---|---|
Coherence | Pointwise |
Fluency | Pointwise |
Fulfillment | Pointwise |
Safety | Pointwise |
Groundedness | Pointwise |
Summarization Quality | Pointwise |
Summarization Helpfulness | Pointwise |
Summarization Verbosity | Pointwise |
Question Answering Quality | Pointwise |
Question Answering Relevance | Pointwise |
Question Answering Helpfulness | Pointwise |
Question Answering Correctness | Pointwise |
Pairwise Summarization Quality | Pairwise |
Pairwise Question Answering Quality | Pairwise |
Prices are listed in US Dollars (USD). If you pay in a currency other than USD, the prices listed in your currency on Cloud Platform SKUs apply.
Vertex AI Agent Engine is a set of services for developers to scale agents in production. Services can be used together or a la carte. You only pay for what you use. Today, you pay for the Agent Engine runtime.
Starting February 11, 2026, billing will begin for Code Execution, Sessions, and Memory Bank.
Pricing is based on compute (vCPU hours) and memory (GiB hours) resources used by agents that are deployed to the Agent Engine runtime. Billing is rounded to the nearest second of usage. Idle time for an agent is not billed.
To help you get started with the runtime, we offer a monthly free tier.
Once your monthly usage exceeds the free tier, billing begins per the rates below.
Resource | Price (USD) |
|---|---|
vCPU | 0 hour to 50 hour $0.00 (Free) / 3,600 second, per 1 month / project 50 hour and above $0.0864 / 3,600 second, per 1 month / project |
RAM | 0 gibibyte hour to 100 gibibyte hour $0.00 (Free) / 3,600 gibibyte second, per 1 month / project 100 gibibyte hour and above $0.009 / 3,600 gibibyte second, per 1 month / project |
Similar to the runtime, you pay for the compute and memory required to run a sandbox. Billing is rounded to the nearest second of usage. Idle time is not billed.
You pay based on the number of events stored in the session service. We bill for stored session events that include content. This includes the initial user request, model responses, function calls, and function responses. We do not bill for system control events (such as checkpoints) that are stored in the session service.
Pay based on the number of memories stored and returned.
Pricing scenarios
To help you understand the cost of using Agent Engine services, we offer two hypothetical agents: a Lightweight Agent and a Standard Agent. For both scenarios, we make the following assumptions:
Additional notes:
Hypothetical Scenarios
This scenario represents agents handling low-volume, sporadic traffic.
Service | Calculation | Monthly Cost |
|---|---|---|
Runtime | (432,000 requests × 3 sec/req ÷ 3600 sec/hr) = 360 hours vCPU: (360 hrs × 1 vCPU × $0.0864/hr) = $31.10 RAM: (360 hrs × 1 GiB × $0.0090/hr) = $3.24 | $34.34 |
Code Execution | (360 runtime hours × 30% usage) = 108 hrs vCPU: (108 hrs × 1 vCPU × $0.0864/hr) = $9.33 RAM: (108 hrs × 1 GiB × $0.0090/hr) = $0.97 | $10.30 |
Sessions | 432,000 requests x 3 events ÷ 1,000 × $0.25 | $324 |
Memory Bank | Stored: (432,000 reqs ÷ 10 reqs/session × 1 memory/session ÷ 1,000) × $0.25 = $10.80 Retrieval: (432,000 reqs × 1 returned memory ÷ 1,000) × $0.50 = $216.00 | $226.80 |
Total Estimated Monthly Cost | $595.44 |
This scenario represents a production agent integrated into a business application, handling consistent user traffic.
Service | Calculation | Monthly Cost |
|---|---|---|
Runtime | (25,920,000 requests × 5 sec/req ÷ 3600 sec/hr) = 36,000 hours vCPU: (36,000 hrs × 2 vCPU × $0.0864/hr) = $6,220.80 RAM: (36,000 hrs × 5 GiB × $0.0090/hr) = $1,620.00 | $7,840.80 |
Code Execution | (36,000 runtime hours × 30% usage) = 10,800 hours vCPU: (10,800 hrs × 2 vCPU × $0.0864/hr) = $1,866.24 RAM: (10,800 hrs × 5 GiB × $0.0090/hr) = $486 | $2,352.24 |
Sessions | 25,920,000 requests * 3 events ÷ 1,000 × $0.25 | $19,440 |
Memory Bank | Stored: (25,920,000 reqs ÷ 10 reqs/session × 1 memory/session ÷ 1,000) × $0.25 = $648.00 Retrieval: (25,920,000 reqs × 1 returned memory ÷ 1,000) × $0.50 = $12,960.00 | $13,608 |
Total Estimated Monthly Cost | $43,241.04 |
Training
The tables below provide the approximate price per hour of various training configurations. You can choose a custom configuration of selected machine types. To calculate pricing, sum the costs of the virtual machines you use.
If you use Compute Engine machine types and attach accelerators, the cost of the accelerators is separate. To calculate this cost, multiply the prices in the table of accelerators below by how many machine hours of each type of accelerator you use.
Machine types
Machine type | Price (USD) |
|---|---|
n1-standard-4 | $0.2279988 / 1 hour |
n1-standard-8 | $0.4559976 / 1 hour |
n1-standard-16 | $0.9119952 / 1 hour |
n1-standard-32 | $1.8239904 / 1 hour |
n1-standard-64 | $3.6479808 / 1 hour |
n1-standard-96 | $5.4719712 / 1 hour |
n1-highmem-2 | $0.1419636 / 1 hour |
n1-highmem-4 | $0.2839272 / 1 hour |
n1-highmem-8 | $0.5678544 / 1 hour |
n1-highmem-16 | $1.1357088 / 1 hour |
n1-highmem-32 | $2.2714176 / 1 hour |
n1-highmem-64 | $4.5428352 / 1 hour |
n1-highmem-96 | $6.8142528 / 1 hour |
n1-highcpu-16 | $0.68014656 / 1 hour |
n1-highcpu-32 | $1.36029312 / 1 hour |
n1-highcpu-64 | $2.72058624 / 1 hour |
n1-highcpu-96 | $4.08087936 / 1 hour |
a2-highgpu-1g* | $4.408062 / 1 hour |
a2-highgpu-2g* | $8.816124 / 1 hour |
a2-highgpu-4g* | $17.632248 / 1 hour |
a2-highgpu-8g* | $35.264496 / 1 hour |
a2-highgpu-16g* | $70.528992 / 1 hour |
a3-highgpu-8g* | $105.39898088 / 1 hour |
a3-megagpu-8g* | $110.65714224 / 1 hour |
a4-highgpu-8g* | $148.212 / 1 hour |
e2-standard-4 | $0.16082748 / 1 hour |
e2-standard-4 | $0.32165496 / 1 hour |
e2-standard-16 | $0.64330992 / 1 hour |
e2-standard-32 | $1.28661984 / 1 hour |
e2-highmem-2 | $0.10847966 / 1 hour |
e2-highmem-4 | $0.21695932 / 1 hour |
e2-highmem-8 | $0.43391864 / 1 hour |
e2-highmem-16 | $0.86783728 / 1 hour |
e2-highcpu-16 | $0.4749144 / 1 hour |
e2-highcpu-32 | $0.9498288 / 1 hour |
n2-standard-4 | $0.2330832 / 1 hour |
n2-standard-8 | $0.4661664 / 1 hour |
n2-standard-16 | $0.9323328 / 1 hour |
n2-standard-32 | $1.8646656 / 1 hour |
n2-standard-48 | $2.7969984 / 1 hour |
n2-standard-64 | $3.7293312 / 1 hour |
n2-standard-80 | $4.661664 / 1 hour |
n2-highmem-2 | $0.1572168 / 1 hour |
n2-highmem-4 | $0.3144336 / 1 hour |
n2-highmem-8 | $0.6288672 / 1 hour |
n2-highmem-16 | $1.2577344 / 1 hour |
n2-highmem-32 | $2.5154688 / 1 hour |
n2-highmem-48 | $3.7732032 / 1 hour |
n2-highmem-64 | $5.0309376 / 1 hour |
n2-highmem-80 | $6.288672 / 1 hour |
n2-highcpu-16 | $0.6882816 / 1 hour |
n2-highcpu-32 | $1.3765632 / 1 hour |
n2-highcpu-48 | $2.0648448 / 1 hour |
n2-highcpu-64 | $2.7531264 / 1 hour |
n2-highcpu-80 | $3.441408 / 1 hour |
c2-standard-4 | $0.2505696 / 1 hour |
c2-standard-8 | $0.5011392 / 1 hour |
c2-standard-16 | $1.0022784 / 1 hour |
c2-standard-30 | $1.879272 / 1 hour |
c2-standard-60 | $3.758544 / 1 hour |
m1-ultramem-40 | $7.55172 / 1 hour |
m1-ultramem-80 | $15.10344 / 1 hour |
m1-ultramem-160 | $30.20688 / 1 hour |
m1-megamem-96 | $12.782592 / 1 hour |
cloud-tpu | Pricing is determined by the accelerator type. See 'Accelerators'. |
If you pay in a currency other than USD, the prices listed in your currency on Cloud Platform SKUs apply.
Accelerators
GPU type | Price (USD) |
|---|---|
NVIDIA_TESLA_A100 | $3.5206896 / 1 hour |
NVIDIA_TESLA_A100_80GB | $4.517292 / 1 hour |
NVIDIA_H100_80GB | $11.75586073 / 1 hour |
NVIDIA_TESLA_P4 | $0.72 / 1 hour |
NVIDIA_TESLA_P100 | $1.752 / 1 hour |
NVIDIA_TESLA_T4 | $0.42 / 1 hour |
NVIDIA_TESLA_V100 | $2.976 / 1 hour |
TPU_V2 Single (8 cores) | $5.40 / 1 hour |
TPU_V2 Pod (32 cores)* | $28.80 / 1 hour |
TPU_V3 Single (8 cores) | $9.60 / 1 hour |
TPU_V3 Pod (32 cores)* | $38.40 / 1 hour |
If you pay in a currency other than USD, the prices listed in your currency on Cloud Platform SKUs apply.
* The price for training using a Cloud TPU Pod is based on the number of cores in the Pod. The number of cores in a pod is always a multiple of 32. To determine the price of training on a Pod that has more than 32 cores, take the price for a 32-core Pod, and multiply it by the number of cores, divided by 32. For example, for a 128-core Pod, the price is (32-core Pod price) * (128/32). For information about which Cloud TPU Pods are available for a specific region, see System Architecture in the Cloud TPU documentation.
Disks
Disk type | Price (USD) |
|---|---|
pd-standard | $0.000065753 / 1 gibibyte hour |
pd-ssd | $0.000279452 / 1 gibibyte hour |
If you pay in a currency other than USD, the prices listed in your currency on Cloud Platform SKUs apply.
You are charged for training your models from the moment when resources are provisioned for a job until the job finishes.
Warning: Your training jobs are limited by the Vertex AI quota policy. If you choose a very powerful processing cluster for your first training jobs, it's likely you will exceed your quota.
The following tables provide the prices of batch prediction, online prediction, and online explanation per node hour. A node hour represents the time a virtual machine spends running your prediction job or waiting in an active state (an endpoint with one or more models deployed) to handle prediction or explanation requests.
You can use Spot VMs with Vertex AI Inference. Spot VMs are billed according to Compute Engine Spot VMs pricing. There are Vertex AI Inference management fees in addition to your infrastructure usage, captured in the following tables.
You can use Compute Engine reservations with Vertex AI Inference. When using Compute Engine reservations, you're billed according to Compute Engine Pricing, including any applicable committed use discounts (CUDs). There are Vertex AI Inference management fees in addition to your infrastructure usage, captured in the following tables.
E2 Series
Machine type | Price (USD) |
|---|---|
e2-standard-2 | $0.0770564 / 1 hour |
e2-standard-4 | $0.1541128 / 1 hour |
e2-standard-8 | $0.3082256 / 1 hour |
e2-standard-16 | $0.6164512 / 1 hour |
e2-standard-32 | $1.2329024 / 1 hour |
e2-highmem-2 | $0.1039476 / 1 hour |
e2-highmem-4 | $0.2078952 / 1 hour |
e2-highmem-8 | $0.4157904 / 1 hour |
e2-highmem-16 | $0.8315808 / 1 hour |
e2-highcpu-2 | $0.056888 / 1 hour |
e2-highcpu-4 | $0.113776 / 1 hour |
e2-highcpu-8 | $0.227552 / 1 hour |
e2-highcpu-16 | $0.455104 / 1 hour |
e2-highcpu-32 | $0.910208 / 1 hour |
N1 Series
Machine type | Price (USD) |
|---|---|
n1-standard-2 | $0.1095 / 1 hour |
n1-standard-4 | $0.219 / 1 hour |
n1-standard-8 | $0.438 / 1 hour |
n1-standard-16 | $0.876 / 1 hour |
n1-standard-32 | $1.752 / 1 hour |
n1-highmem-2 | $0.137 / 1 hour |
n1-highmem-4 | $0.274 / 1 hour |
n1-highmem-8 | $0.548 / 1 hour |
n1-highmem-16 | $1.096 / 1 hour |
n1-highcpu-2 | $0.081 / 1 hour |
n1-highcpu-4 | $0.162 / 1 hour |
n1-highcpu-8 | $0.324 / 1 hour |
n1-highcpu-16 | $0.648 / 1 hour |
n1-highcpu-32 | $1.296 / 1 hour |
N2 Series
Machine type | Price (USD) |
|---|---|
n2-standard-2 | $0.1116854 / 1 hour |
n2-standard-4 | $0.2233708 / 1 hour |
n2-standard-8 | $0.4467416 / 1 hour |
n2-standard-16 | $0.8934832 / 1 hour |
n2-standard-32 | $1.7869664 / 1 hour |
n2-highmem-2 | $0.1506654 / 1 hour |
n2-highmem-4 | $0.3013308 / 1 hour |
n2-highmem-8 | $0.6026616 / 1 hour |
n2-highmem-16 | $1.2053232 / 1 hour |
n2-highcpu-2 | $0.0824504 / 1 hour |
n2-highcpu-4 | $0.1649008 / 1 hour |
n2-highcpu-8 | $0.3298016 / 1 hour |
n2-highcpu-16 | $0.6596032 / 1 hour |
n2-highcpu-32 | $1.3192064 / 1 hour |
N2D Series
Machine type | Price (USD) |
|---|---|
n2d-standard-2 | $0.0971658 / 1 hour |
n2d-standard-4 | $0.1943316 / 1 hour |
n2d-standard-8 | $0.3886632 / 1 hour |
n2d-standard-16 | $0.7773264 / 1 hour |
n2d-standard-32 | $1.5546528 / 1 hour |
n2d-highmem-2 | $0.131077 / 1 hour |
n2d-highmem-4 | $0.262154 / 1 hour |
n2d-highmem-8 | $0.524308 / 1 hour |
n2d-highmem-16 | $1.048616 / 1 hour |
n2d-highcpu-2 | $0.0717324 / 1 hour |
n2d-highcpu-4 | $0.1434648 / 1 hour |
n2d-highcpu-8 | $0.2869296 / 1 hour |
n2d-highcpu-16 | $0.5738592 / 1 hour |
n2d-highcpu-32 | $1.1477184 / 1 hour |
C2 Series
Machine type | Price (USD) |
|---|---|
c2-standard-4 | $0.240028 / 1 hour |
c2-standard-8 | $0.480056 / 1 hour |
c2-standard-16 | $0.960112 / 1 hour |
c2-standard-30 | $1.80021 / 1 hour |
c2-standard-60 | $3.60042 / 1 hour |
C2D Series
Machine type | Price (USD) |
|---|---|
c2d-standard-2 | $0.1044172 / 1 hour |
c2d-standard-4 | $0.2088344 / 1 hour |
c2d-standard-8 | $0.4176688 / 1 hour |
c2d-standard-16 | $0.8353376 / 1 hour |
c2d-standard-32 | $1.6706752 / 1 hour |
c2d-standard-56 | $2.9236816 / 1 hour |
c2d-standard-112 | $5.8473632 / 1 hour |
c2d-highmem-2 | $0.1408396 / 1 hour |
c2d-highmem-4 | $0.2816792 / 1 hour |
c2d-highmem-8 | $0.5633584 / 1 hour |
c2d-highmem-16 | $1.1267168 / 1 hour |
c2d-highmem-32 | $2.2534336 / 1 hour |
c2d-highmem-56 | $3.9435088 / 1 hour |
c2d-highmem-112 | $7.8870176 / 1 hour |
c2d-highcpu-2 | $0.086206 / 1 hour |
c2d-highcpu-4 | $0.172412 / 1 hour |
c2d-highcpu-8 | $0.344824 / 1 hour |
c2d-highcpu-16 | $0.689648 / 1 hour |
c2d-highcpu-32 | $1.379296 / 1 hour |
c2d-highcpu-56 | $2.413768 / 1 hour |
c2d-highcpu-112 | $4.827536 / 1 hour |
C3 Series
Machine type | Price (USD) |
|---|---|
c3-highcpu-4 | $0.19824 / 1 hour |
c3-highcpu-8 | $0.39648 / 1 hour |
c3-highcpu-22 | $1.09032 / 1 hour |
c3-highcpu-44 | $2.18064 / 1 hour |
c3-highcpu-88 | $4.36128 / 1 hour |
c3-highcpu-176 | $8.72256 / 1 hour |
A2 Series
Machine type | Price (USD) |
|---|---|
a2-highgpu-1g | $4.2244949 / 1 hour |
a2-highgpu-2g | $8.4489898 / 1 hour |
a2-highgpu-4g | $16.8979796 / 1 hour |
a2-highgpu-8g | $33.7959592 / 1 hour |
a2-megagpu-16g | $64.1020592 / 1 hour |
a2-ultragpu-1g | $5.7818474 / 1 hour |
a2-ultragpu-2g | $11.5636948 / 1 hour |
a2-ultragpu-4g | $23.1273896 / 1 hour |
a2-ultragpu-8g | $46.2547792 / 1 hour |
When consuming from a reservation or spot capacity, billing is spread across two SKUs: the GCE SKU with the label 'vertex-ai-online-prediction' and the Vertex AI Management Fee SKU. This enables you to use your Committed Use Discounts (CUDs) in Vertex AI.
A3 Series
Machine type | Price (USD) |
|---|---|
a3-ultragpu-8g | $96.015616 / 1 hour |
a3-megagpu-8g | $106.65474 / 1 hour |
When consuming from a reservation or spot capacity, billing is spread across two SKUs: the GCE SKU with the label 'vertex-ai-online-prediction' and the Vertex AI Management Fee SKU. This enables you to use your Committed Use Discounts (CUDs) in Vertex AI.
A4 Series
Machine type | Price (USD) |
|---|---|
a4-highgpu-8g | $148.212 / 1 hour |
When consuming from a reservation or spot capacity, billing is spread across two SKUs: the GCE SKU with the label 'vertex-ai-online-prediction' and the Vertex AI Management Fee SKU. This enables you to use your Committed Use Discounts (CUDs) in Vertex AI.
A4X Series
Machine type | Price (USD) |
|---|---|
a4x-highgpu-4g | $74.75 / 1 hour |
When consuming from a reservation or spot capacity, billing is spread across two SKUs: the GCE SKU with the label 'vertex-ai-online-prediction' and the Vertex AI Management Fee SKU. This enables you to use your Committed Use Discounts (CUDs) in Vertex AI.
a4x-highgpu-4g requires at least 18 VMs.
G2 Series
Machine type | Price (USD) |
|---|---|
g2-standard-4 | $0.81293 / 1 hour |
g2-standard-8 | $0.98181 / 1 hour |
g2-standard-12 | $1.15069 / 1 hour |
g2-standard-16 | $1.31957 / 1 hour |
g2-standard-24 | $2.30138 / 1 hour |
g2-standard-32 | $1.99509 / 1 hour |
g2-standard-48 | $4.60276 / 1 hour |
g2-standard-96 | $9.20552 / 1 hour |
When consuming from a reservation or spot capacity, billing is spread across two SKUs: the GCE SKU with the label 'vertex-ai-online-prediction' and the Vertex AI Management Fee SKU. This enables you to use your Committed Use Discounts (CUDs) in Vertex AI.
G4 Series
Machine type | Price (USD) |
|---|---|
g4-standard-48 | $5.1749195 / 1 hour |
g4-standard-96 | $10.349839 / 1 hour |
g4-standard-192 | $20.699678 / 1 hour |
g4-standard-384 | $41.399356 / 1 hour |
TPU v5e pricing
Machine type | Price (USD) |
|---|---|
ct5lp-hightpu-1t | $1.38 / 1 hour |
ct5lp-hightpu-4t | $5.52 / 1 hour |
ct5lp-hightpu-8t | $5.52 / 1 hour |
TPU v6e pricing
Machine type | Price (USD) |
|---|---|
ct6e-standard-1t | $3.105 / 1 hour |
ct6e-standard-4t | $12.42 / 1 hour |
ct6e-standard-8t | $24.84 / 1 hour |
Each machine type is charged as the following SKUs on your Google Cloud bill:
The prices for machine types are used to approximate the total hourly cost for each prediction node of a model version using that machine type.
For example, a machine type of n1-highcpu-32 includes 32 vCPUs and 32 GB of RAM. Therefore, the hourly pricing equals 32 vCPU hours + 32 GB hours.
E2 Series
Item | Price (USD) |
|---|---|
vCPU | $0.0250826 / 1 hour |
RAM | $0.0033614 / 1 gibibyte hour |
N1 Series
Item | Price (USD) |
|---|---|
vCPU | $0.036 / 1 hour |
RAM | $0.005 / 1 gibibyte hour |
N2 Series
Item | Price (USD) |
|---|---|
vCPU | $0.0363527 / 1 hour |
RAM | $0.0048725 / 1 gibibyte hour |
N2D Series
Item | Price (USD) |
|---|---|