jeremylongshore

vertex-agent-builder

@jeremylongshore/vertex-agent-builder
jeremylongshore
692
70 forks
Updated 1/6/2026
View on GitHub

Build and deploy production-ready generative AI agents using Vertex AI, Gemini models, and Google Cloud infrastructure with RAG, function calling, and multi-modal capabilities

Installation

$skills install @jeremylongshore/vertex-agent-builder
Claude Code
Cursor
Copilot
Codex
Antigravity

Details

Pathplugins/jeremy-vertex-ai/skills/vertex-agent-builder/SKILL.md
Branchmain
Scoped Name@jeremylongshore/vertex-agent-builder

Usage

After installing, this skill will be available to your AI coding assistant.

Verify installation:

skills list

Skill Instructions


name: vertex-agent-builder description: | Build and deploy production-ready generative AI agents using Vertex AI, Gemini models, and Google Cloud infrastructure with RAG, function calling, and multi-modal capabilities. Use when appropriate context detected. Trigger with relevant phrases based on skill purpose. allowed-tools: Read, Write, Edit, Grep, Bash(cmd:*) version: 1.0.0 author: Jeremy Longshore jeremy@intentsolutions.io license: MIT

Vertex AI Agent Builder

Build and deploy production-ready agents on Vertex AI with Gemini models, retrieval (RAG), function calling, and operational guardrails (validation, monitoring, cost controls).

Overview

  • Produces an agent scaffold aligned with Vertex AI Agent Engine deployment patterns.
  • Helps choose models/regions, design tool/function interfaces, and wire up retrieval.
  • Includes an evaluation + smoke-test checklist so deployments don’t regress.

Prerequisites

  • Google Cloud project with Vertex AI API enabled
  • Permissions to deploy/operate Agent Engine runtimes (or a local-only build target)
  • If using RAG: a document source (GCS/BigQuery/Firestore/etc) and an embeddings/index strategy
  • Secrets handled via env vars or Secret Manager (never committed)

Instructions

  1. Clarify the agent’s job (user intents, inputs/outputs, latency and cost constraints).
  2. Choose model + region and define tool/function interfaces (schemas, error contracts).
  3. Implement retrieval (if needed): chunking, embeddings, index, and a “citation-first” response format.
  4. Add evaluation: golden prompts, offline checks, and a minimal online smoke test.
  5. Deploy (optional): provide the exact deployment command/config and verify endpoints + permissions.
  6. Add ops: logs/metrics, alerting, quota/cost guardrails, and rollback steps.

Output

  • A Vertex AI agent scaffold (code/config) with clear extension points
  • A retrieval plan (when applicable) and a validation/evaluation checklist
  • Optional: deployment commands and post-deploy health checks

Error Handling

  • Quota/region issues: detect the failing service/quota and propose a scoped fix.
  • Auth failures: identify the principal and missing role; prefer least-privilege remediation.
  • Retrieval failures: validate indexing/embedding dimensions and add fallback behavior.
  • Tool/function errors: enforce structured error responses and add regression tests.

Examples

Example: RAG support agent

  • Request: “Deploy a support bot that answers from our docs with citations.”
  • Result: ingestion plan, retrieval wiring, evaluation prompts, and a smoke test that verifies citations.

Example: Multimodal intake agent

  • Request: “Build an agent that extracts structured fields from PDFs/images and routes tasks.”
  • Result: schema-first extraction prompts, tool interface contracts, and validation examples.

Resources

More by jeremylongshore

View all
rabbitmq-queue-setup
1,004

Rabbitmq Queue Setup - Auto-activating skill for Backend Development. Triggers on: rabbitmq queue setup, rabbitmq queue setup Part of the Backend Development skill category.

model-evaluation-suite
1,004

evaluating-machine-learning-models: This skill allows Claude to evaluate machine learning models using a comprehensive suite of metrics. It should be used when the user requests model performance analysis, validation, or testing. Claude can use this skill to assess model accuracy, precision, recall, F1-score, and other relevant metrics. Trigger this skill when the user mentions "evaluate model", "model performance", "testing metrics", "validation results", or requests a comprehensive "model evaluation".

neural-network-builder
1,004

building-neural-networks: This skill allows Claude to construct and configure neural network architectures using the neural-network-builder plugin. It should be used when the user requests the creation of a new neural network, modification of an existing one, or assistance with defining the layers, parameters, and training process. The skill is triggered by requests involving terms like "build a neural network," "define network architecture," "configure layers," or specific mentions of neural network types (e.g., "CNN," "RNN," "transformer").

oauth-callback-handler
1,004

Oauth Callback Handler - Auto-activating skill for API Integration. Triggers on: oauth callback handler, oauth callback handler Part of the API Integration skill category.