Vision Analyze (Google)
Analyze images using **Google Cloud Vision API**.
Best use case
Vision Analyze (Google) is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
Teams using Vision Analyze (Google) should expect more consistent output, faster repeated execution, and less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in .claude/skills/vision-analyze/SKILL.md inside your project
- Restart your AI agent; it will auto-discover the skill
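The placement step can also be scripted. The following is a minimal sketch, assuming SKILL.md has already been downloaded to a local path; `install_skill` is a hypothetical helper name, not part of the skill itself.

```python
import shutil
from pathlib import Path


def install_skill(downloaded: Path, project_root: Path) -> Path:
    """Copy a downloaded SKILL.md into the project's skill directory.

    Hypothetical helper: the destination path follows the manual
    installation steps; everything else is an illustrative assumption.
    """
    dest = project_root / ".claude" / "skills" / "vision-analyze" / "SKILL.md"
    # Create .claude/skills/vision-analyze/ if it does not exist yet.
    dest.parent.mkdir(parents=True, exist_ok=True)
    shutil.copyfile(downloaded, dest)
    return dest
```

For example, `install_skill(Path("SKILL.md"), Path("."))` would place the file under the current project. The agent still needs a restart afterwards to discover the skill.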
How Vision Analyze (Google) Compares
| Feature / Agent | Vision Analyze (Google) | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
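It analyzes images using the **Google Cloud Vision API**, performing image label detection and optical character recognition (OCR) on a local image file or a publicly accessible image URL.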
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
SKILL.md Source
# Vision Analyze (Google)

Analyze images using **Google Cloud Vision API**.

This skill performs:

- Image label detection
- Optical Character Recognition (OCR)

## Usage

Provide either:

- A local image file path
- A publicly accessible image URL

## Example

    vision_analyze /tmp/image.png
    vision_analyze https://example.com/image.jpg
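The SKILL.md source does not show how the skill calls the API. As an illustration only, the sketch below builds the JSON body that the Cloud Vision REST endpoint `images:annotate` accepts for the two listed features, label detection and OCR. It handles both input types the skill accepts. The helper name `build_annotate_request` is hypothetical, and whether the skill uses the REST endpoint or a client library is an assumption.

```python
import base64
from pathlib import Path
from urllib.parse import urlparse

# Cloud Vision v1 REST endpoint for image annotation.
ANNOTATE_URL = "https://vision.googleapis.com/v1/images:annotate"


def build_annotate_request(target: str) -> dict:
    """Build an images:annotate request body for a local path or a URL.

    Hypothetical helper, not taken from the skill's source.
    """
    if urlparse(target).scheme in ("http", "https"):
        # Publicly accessible URL: let the API fetch the image itself.
        image = {"source": {"imageUri": target}}
    else:
        # Local file: send the bytes inline, base64-encoded.
        data = Path(target).read_bytes()
        image = {"content": base64.b64encode(data).decode("ascii")}
    return {
        "requests": [
            {
                "image": image,
                "features": [
                    {"type": "LABEL_DETECTION"},  # image labels
                    {"type": "TEXT_DETECTION"},   # OCR
                ],
            }
        ]
    }
```

The returned dictionary would be sent as the JSON body of a POST to `ANNOTATE_URL`, authenticated with an API key or OAuth token. Credentials and the HTTP call are omitted here because the source does not describe how the skill authenticates.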