tabzartist

@GGPrompts/tabzartist

GGPrompts

116

10 forks

Updated 1/6/2026

View on GitHub

Generate images via DALL-E and videos via Sora using browser automation. This skill should be used when a project needs visual assets like hero images, team photos, feature icons, or product demo videos. Spawnable as a conductor agent during bd-swarm-auto for parallel asset generation.

Installation

$skills install @GGPrompts/tabzartist

Claude Code

Cursor

Copilot

Codex

Antigravity

Details

RepositoryGGPrompts/TabzChrome

Pathplugins/conductor/skills/tabz-artist/SKILL.md

Branchmain

Scoped Name@GGPrompts/tabzartist

Usage

After installing, this skill will be available to your AI coding assistant.

Verify installation:

skills list

Skill Instructions

name: TabzArtist description: Generate images via DALL-E and videos via Sora using browser automation. This skill should be used when a project needs visual assets like hero images, team photos, feature icons, or product demo videos. Spawnable as a conductor agent during bd-swarm-auto for parallel asset generation.

TabzArtist - AI Asset Generation via Browser Automation

Generate images (DALL-E) and videos (Sora) for projects using TabzChrome MCP tools.

When to Use

Building landing pages, apps, or websites that need visual assets
Creating hero images, team photos, feature icons, product screenshots
Generating product demo videos or background ambient videos
Running as parallel asset generation during bd-swarm-auto builds

Quick Start

Generate assets for a [industry] landing page:
- Hero image: [description]
- Team photos: [description]
- Feature icons: [description]
- Product video: [description]

Workflow Overview

1. Plan assets based on project description
2. Generate images via DALL-E (batch if possible)
3. Generate videos via Sora
4. Wait for generation (poll with screenshots)
5. Download all assets to project folder
6. Return list of downloaded files

DALL-E Image Generation

URLs

Page	URL
DALL-E 3 GPT	`https://chatgpt.com/g/g-iLoR8U3iA-dall-e3`
Images Gallery	`https://chatgpt.com/images`

Workflow

# 1. Open DALL-E
mcp-cli call tabz/tabz_open_url '{"url": "https://chatgpt.com/g/g-iLoR8U3iA-dall-e3"}'

# 2. Fill prompt (wait 2-3s for page load)
mcp-cli call tabz/tabz_fill '{"selector": "#prompt-textarea", "value": "PROMPT_HERE"}'

# 3. Submit
mcp-cli call tabz/tabz_click '{"selector": "#composer-submit-button"}'

# 4. Wait 15-30s, poll with screenshots until image appears
mcp-cli call tabz/tabz_screenshot '{}'

# 5. Extract image URL
mcp-cli call tabz/tabz_get_element '{"selector": "img[alt=\"Generated image\"]", "includeStyles": false, "response_format": "json"}'
# Look for: attributes.src

# 6. Download
mcp-cli call tabz/tabz_download_file '{"url": "IMAGE_SRC_URL", "filename": "public/images/hero.png"}'

Batch Download from /images Page

# Go to gallery
mcp-cli call tabz/tabz_open_url '{"url": "https://chatgpt.com/images"}'

# Click download button (hover bottom-left of each image)
# nth-of-type(1) = first in grid, nth-of-type(2) = second, etc.
mcp-cli call tabz/tabz_click '{"selector": "div.flex.w-full > div.flex:nth-of-type(1) > span > button.flex.items-center"}'

Key Selectors

Element	Selector
Prompt input	`#prompt-textarea`
Submit button	`#composer-submit-button`
Generated image	`img[alt="Generated image"]`
Download btn (/images)	`div.flex.w-full > div.flex:nth-of-type(n) > span > button.flex.items-center`

Sora Video Generation

URL

Page	URL
Drafts	`https://sora.chatgpt.com/drafts`

Workflow

# 1. Open Sora drafts
mcp-cli call tabz/tabz_open_url '{"url": "https://sora.chatgpt.com/drafts"}'

# 2. Fill prompt (wait 2-3s for page load)
mcp-cli call tabz/tabz_fill '{"selector": "textarea", "value": "PROMPT_HERE"}'

# 3. Submit
mcp-cli call tabz/tabz_click '{"selector": "div.flex.items-center.justify-between > div:last-child > button:last-child"}'

# 4. Wait 60-120s, poll with screenshots until video thumbnail appears
mcp-cli call tabz/tabz_screenshot '{}'

# 5. Click video to open detail view
mcp-cli call tabz/tabz_click '{"selector": "video"}'

# 6. Extract video URL
mcp-cli call tabz/tabz_get_element '{"selector": "video", "includeStyles": false, "response_format": "json"}'
# Look for: attributes.src

# 7. Download
mcp-cli call tabz/tabz_download_file '{"url": "VIDEO_SRC_URL", "filename": "public/videos/demo.mp4"}'

Key Selectors

Element	Selector
Prompt textarea	`textarea`
Create video button	`div.flex.items-center.justify-between > div:last-child > button:last-child`
Video thumbnail	`video`
Video src	`attributes.src` via tabz_get_element

Asset Planning

When given a project description, plan assets like:

Asset Type	DALL-E Prompt Structure
Hero image	"[Subject] in [setting], [style], [mood lighting], [composition]"
Team photos	"Professional headshot grid, [n] diverse people, modern office, candid warm lighting"
Feature icons	"Minimal line icon set, [n] icons for: [list], consistent stroke weight, [color]"
Product shot	"[Product] on [surface], [lighting setup], commercial photography style"

Asset Type	Sora Prompt Structure
Product demo	"[Style]. [Device/screen] showing [UI]. [Camera movement]. [Specific actions with timing]. [Lighting + palette]."
Background	"[Aesthetic]. [Scene description]. [Camera movement]. [Duration]. [Palette]."

Output Structure

Save assets to project directories:

public/
├── images/
│   ├── hero.png
│   ├── team.png
│   └── icons/
│       ├── feature-1.png
│       └── feature-2.png
└── videos/
    └── demo.mp4

Completion

Return a summary of generated assets:

## Generated Assets

### Images (DALL-E)
- public/images/hero.png - Hero image for landing page
- public/images/team.png - Team photo grid

### Videos (Sora)
- public/videos/demo.mp4 - Product demo video (10s)

All assets downloaded and ready for use.

Tips

Batch images - Submit multiple DALL-E prompts, then download from /images page
Poll patiently - DALL-E: 15-30s, Sora: 60-120s
Screenshot first - Verify generation complete before extracting URLs
Use tabz_download_file - Not tabz_download_image (service worker limitation)
Concrete prompts - Specific details > vague descriptions

Reference Prompts

For detailed prompt guidance, see:

.prompts/images/dalle3.prompty - DALL-E prompt structure and examples
.prompts/video/sora.prompty - Sora prompt structure and examples

More by GGPrompts

View all

tabz-terminals

121

Spawn and manage terminal tabs via TabzChrome REST API. Use when spawning workers, creating terminals programmatically, setting up worktrees for parallel work, or crafting prompts for Claude workers.

tabz-development

121

Patterns for building and debugging TabzChrome itself. Use when working on Terminal.tsx, xterm.js integration, WebSocket I/O, resize handling, or any TabzChrome extension/backend code.

tabz-browser

121

Browser automation via 70 tabz MCP tools. Use when taking screenshots, filling forms, debugging network requests, testing responsive design, or using text-to-speech notifications.

cc-setup

121

This skill should be used when the user asks to "diagnose Claude Code", "fix MCP errors", "manage plugins", "configure MCP servers", "check /doctor errors", "debug plugin issues", "add MCP server", "remove plugin", "list plugins", "list MCP servers", "fix schema errors", or needs help with Claude Code CLI configuration and troubleshooting.