get-image-file

Returns the local file path of an image sent by the user. When a user sends an image, the system downloads it automatically; use this skill when you need to process that image or analyze its content.

1,592 stars

by openakita


Best use case

get-image-file is best used when you need a repeatable AI agent workflow instead of a one-off prompt.


Teams using get-image-file should expect more consistent output, faster repeated execution, and less prompt rewriting.

When to use this skill

You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

You only need a quick one-off answer and do not need a reusable workflow.
You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$ curl -o ~/.claude/skills/get-image-file/SKILL.md --create-dirs "https://raw.githubusercontent.com/openakita/openakita/main/skills/system/get-image-file/SKILL.md"

Manual Installation

Download SKILL.md from GitHub
Place it in .claude/skills/get-image-file/SKILL.md inside your project
Restart your AI agent — it will auto-discover the skill
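
If the agent does not pick up the skill after a restart, it can help to confirm that SKILL.md sits where the steps above put it. The following is a minimal Python sketch for that check. The helper names `skill_path` and `is_installed` are illustrative only and are not part of the skill or of any agent's API; the directory layout is taken from the installation instructions above.

```python
from pathlib import Path

SKILL_NAME = "get-image-file"


def skill_path(root: Path) -> Path:
    """Return where SKILL.md is expected under the given root directory."""
    return root / ".claude" / "skills" / SKILL_NAME / "SKILL.md"


def is_installed(root: Path) -> bool:
    """Return True if SKILL.md exists at the expected location under root."""
    return skill_path(root).is_file()


if __name__ == "__main__":
    # Check the project-level location (manual install) and the
    # user-level location (the curl command writes under the home directory).
    for label, root in (("project", Path.cwd()), ("user", Path.home())):
        status = "found" if is_installed(root) else "missing"
        print(f"{label}: {skill_path(root)} -> {status}")
```

Run it from the project root; a "missing" result for both locations suggests the file was saved somewhere the agent does not look.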

How get-image-file Compares

Feature / Agent	get-image-file	Standard Approach
Platform Support	Not specified	Limited / Varies
Context Awareness	High	Baseline
Installation Complexity	Unknown	N/A

Frequently Asked Questions

What does this skill do?

It returns the local file path of an image the user has sent. The system downloads incoming images automatically, so the skill's job is to report where the file was saved. Use it when you need to process the image or analyze its content.
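
The behavior described above (image arrives, system downloads it, a parameterless call returns the path) can be modeled with a small Python sketch. Everything here is hypothetical: the `ImageInbox` class, the `on_image_downloaded` hook, and the choice to return the most recent image are assumptions made for illustration, not the openakita implementation.

```python
from dataclasses import dataclass, field
from pathlib import Path
from typing import List, Optional


@dataclass
class ImageInbox:
    """Hypothetical stand-in for the host system's record of downloaded images."""

    paths: List[Path] = field(default_factory=list)

    def on_image_downloaded(self, path: Path) -> None:
        # Assumed hook: called once the system has auto-downloaded an image.
        self.paths.append(path)

    def get_image_file(self) -> Optional[Path]:
        # Takes no parameters, mirroring the skill.
        # Assumption: the most recently downloaded image is the one wanted.
        return self.paths[-1] if self.paths else None


if __name__ == "__main__":
    inbox = ImageInbox()
    inbox.on_image_downloaded(Path("/tmp/uploads/photo.png"))  # example path only
    path = inbox.get_image_file()
    if path is not None:
        print(f"image saved at {path} (type {path.suffix})")
```

Returning `None` when no image has arrived keeps the caller's error handling explicit; the real tool may signal that case differently.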

Where can I find the source code?

The source code is in the openakita/openakita repository on GitHub, under skills/system/get-image-file/.

SKILL.md Source

# Get Image File

Get the local file path of an image sent by the user.

## Parameters

No parameters.

## Workflow

1. The user sends an image.
2. The system automatically downloads it to local storage.
3. Use this tool to get the file path.

## Related Skills

- `get-voice-file`: Get a voice file
- `deliver-artifacts`: Send files to the user

Related Skills

write-file

1592

from openakita/openakita

Write content to a file, creating a new file or overwriting an existing one. Use it when you need to create new files, update file content, or save generated code or data.

update-user-profile

1592

from openakita/openakita

Update user profile information when the user shares preferences, habits, or work details. Use it when you need to save user preferences, remember the user's work domain, or provide personalized service.

skip-profile-question

1592

from openakita/openakita

Skip a profile question when the user explicitly refuses to answer. When the user says 'I don't want to answer' or 'skip this question', use this tool to stop asking about that item.

read-file

1592

from openakita/openakita

Read the content of text files. Use it when you need to check file content, analyze code or data, or get configuration values.

get-voice-file

1592

from openakita/openakita

Get the local file path of a voice message sent by the user. When a user sends a voice message, the system downloads it automatically; use this skill when you need to process the message or transcribe it to text.

get-user-profile

1592

from openakita/openakita

Get the current user profile summary to understand the user's preferences and context. Use it when you need to check known user info or personalize responses.

generate-image

1592

from openakita/openakita

Generate images from text prompts using Qwen-Image (Dashscope). Saves output as local PNG files. Requires DASHSCOPE_API_KEY. Use deliver_artifacts to send generated images to IM chat.

edit-file

1592

from openakita/openakita

Edit file by exact string replacement. Finds old_string and replaces with new_string. Safer and more token-efficient than write_file for modifying existing files. Auto-handles Windows CRLF line endings.

delete-file

1592

from openakita/openakita

Delete a file or empty directory. Non-empty directories are rejected for safety. Use run_shell for recursive deletion.

openakita/skills@image-understanding

1592

from openakita/openakita

Analyze images using Dashscope (Qwen) Vision models for detailed description, OCR text extraction, object recognition, and visual Q&A. Use when the user needs to understand image content via Alibaba Cloud Dashscope API, especially for Chinese-language image analysis and documents.

openakita/skills@image-understander

1592

from openakita/openakita

Analyze images using GPT-4 Vision for detailed description, OCR text extraction, object recognition, and visual Q&A. Use when the user needs to understand image content, extract text from screenshots, identify objects in photos, or ask questions about images via OpenAI GPT-4 Vision API.

openakita/skills@file-manager

1592

from openakita/openakita

File and directory management tool for creating, reading, writing, deleting, moving, copying, and searching files. Use this skill when the user needs to perform file operations, list directories, search by pattern or content, or get file metadata like size and modification time.