vertex-agent-builder

@jeremylongshore/vertex-agent-builder

jeremylongshore

692

70 forks

Updated 1/6/2026

View on GitHub

Build and deploy production-ready generative AI agents using Vertex AI, Gemini models, and Google Cloud infrastructure with RAG, function calling, and multi-modal capabilities

Installation

$skills install @jeremylongshore/vertex-agent-builder

Claude Code

Cursor

Copilot

Codex

Antigravity

Details

Repositoryjeremylongshore/claude-code-plugins-plus

Pathplugins/jeremy-vertex-ai/skills/vertex-agent-builder/SKILL.md

Branchmain

Scoped Name@jeremylongshore/vertex-agent-builder

Usage

After installing, this skill will be available to your AI coding assistant.

Verify installation:

skills list

Skill Instructions

name: vertex-agent-builder description: | Build and deploy production-ready generative AI agents using Vertex AI, Gemini models, and Google Cloud infrastructure with RAG, function calling, and multi-modal capabilities. Use when appropriate context detected. Trigger with relevant phrases based on skill purpose. allowed-tools: Read, Write, Edit, Grep, Bash(cmd:*) version: 1.0.0 author: Jeremy Longshore jeremy@intentsolutions.io license: MIT

Vertex AI Agent Builder

Build and deploy production-ready agents on Vertex AI with Gemini models, retrieval (RAG), function calling, and operational guardrails (validation, monitoring, cost controls).

Overview

Produces an agent scaffold aligned with Vertex AI Agent Engine deployment patterns.
Helps choose models/regions, design tool/function interfaces, and wire up retrieval.
Includes an evaluation + smoke-test checklist so deployments don’t regress.

Prerequisites

Google Cloud project with Vertex AI API enabled
Permissions to deploy/operate Agent Engine runtimes (or a local-only build target)
If using RAG: a document source (GCS/BigQuery/Firestore/etc) and an embeddings/index strategy
Secrets handled via env vars or Secret Manager (never committed)

Instructions

Clarify the agent’s job (user intents, inputs/outputs, latency and cost constraints).
Choose model + region and define tool/function interfaces (schemas, error contracts).
Implement retrieval (if needed): chunking, embeddings, index, and a “citation-first” response format.
Add evaluation: golden prompts, offline checks, and a minimal online smoke test.
Deploy (optional): provide the exact deployment command/config and verify endpoints + permissions.
Add ops: logs/metrics, alerting, quota/cost guardrails, and rollback steps.

Output

A Vertex AI agent scaffold (code/config) with clear extension points
A retrieval plan (when applicable) and a validation/evaluation checklist
Optional: deployment commands and post-deploy health checks

Error Handling

Quota/region issues: detect the failing service/quota and propose a scoped fix.
Auth failures: identify the principal and missing role; prefer least-privilege remediation.
Retrieval failures: validate indexing/embedding dimensions and add fallback behavior.
Tool/function errors: enforce structured error responses and add regression tests.

Examples

Example: RAG support agent

Request: “Deploy a support bot that answers from our docs with citations.”
Result: ingestion plan, retrieval wiring, evaluation prompts, and a smoke test that verifies citations.

Example: Multimodal intake agent

Request: “Build an agent that extracts structured fields from PDFs/images and routes tasks.”
Result: schema-first extraction prompts, tool interface contracts, and validation examples.

Resources

Full detailed guide (kept for reference): {baseDir}/references/SKILL.full.md
Repo standards (source of truth):
- 000-docs/6767-a-SPEC-DR-STND-claude-code-plugins-standard.md
- 000-docs/6767-b-SPEC-DR-STND-claude-skills-standard.md
Vertex AI docs: https://cloud.google.com/vertex-ai/docs
Agent Engine docs: https://cloud.google.com/vertex-ai/docs/agent-engine

More by jeremylongshore

View all

rabbitmq-queue-setup

1,004

Rabbitmq Queue Setup - Auto-activating skill for Backend Development. Triggers on: rabbitmq queue setup, rabbitmq queue setup Part of the Backend Development skill category.

model-evaluation-suite

1,004

evaluating-machine-learning-models: This skill allows Claude to evaluate machine learning models using a comprehensive suite of metrics. It should be used when the user requests model performance analysis, validation, or testing. Claude can use this skill to assess model accuracy, precision, recall, F1-score, and other relevant metrics. Trigger this skill when the user mentions "evaluate model", "model performance", "testing metrics", "validation results", or requests a comprehensive "model evaluation".

neural-network-builder

1,004

building-neural-networks: This skill allows Claude to construct and configure neural network architectures using the neural-network-builder plugin. It should be used when the user requests the creation of a new neural network, modification of an existing one, or assistance with defining the layers, parameters, and training process. The skill is triggered by requests involving terms like "build a neural network," "define network architecture," "configure layers," or specific mentions of neural network types (e.g., "CNN," "RNN," "transformer").

oauth-callback-handler

1,004

Oauth Callback Handler - Auto-activating skill for API Integration. Triggers on: oauth callback handler, oauth callback handler Part of the API Integration skill category.