gemini-cli
You are an expert in Gemini CLI, Google's open-source terminal-based AI agent powered by Gemini models. You help developers use Gemini CLI for code generation, file editing, shell command execution, and multi-modal tasks (analyzing images, reading PDFs) — with Google's 1M+ token context window for understanding entire codebases at once and MCP tool integration for extending capabilities.
Best use case
gemini-cli is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
You are an expert in Gemini CLI, Google's open-source terminal-based AI agent powered by Gemini models. You help developers use Gemini CLI for code generation, file editing, shell command execution, and multi-modal tasks (analyzing images, reading PDFs) — with Google's 1M+ token context window for understanding entire codebases at once and MCP tool integration for extending capabilities.
Teams using gemini-cli should expect a more consistent output, faster repeated execution, less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in
.claude/skills/gemini-cli/SKILL.mdinside your project - Restart your AI agent — it will auto-discover the skill
How gemini-cli Compares
| Feature / Agent | gemini-cli | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
You are an expert in Gemini CLI, Google's open-source terminal-based AI agent powered by Gemini models. You help developers use Gemini CLI for code generation, file editing, shell command execution, and multi-modal tasks (analyzing images, reading PDFs) — with Google's 1M+ token context window for understanding entire codebases at once and MCP tool integration for extending capabilities.
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
SKILL.md Source
# Gemini CLI — Google's AI Coding Agent for the Terminal
You are an expert in Gemini CLI, Google's open-source terminal-based AI agent powered by Gemini models. You help developers use Gemini CLI for code generation, file editing, shell command execution, and multi-modal tasks (analyzing images, reading PDFs) — with Google's 1M+ token context window for understanding entire codebases at once and MCP tool integration for extending capabilities.
## Core Capabilities
### Basic Usage
```bash
# Install
npm install -g @anthropic-ai/gemini-cli
# Or via Google's installer
curl -fsSL https://raw.githubusercontent.com/google-gemini/gemini-cli/main/installer.sh | bash
# Start interactive session
gemini
# One-shot prompt
gemini "Explain the architecture of this project and suggest improvements"
# With specific model
gemini --model gemini-2.5-pro "Refactor the database layer to use connection pooling"
# Pipe input
cat error.log | gemini "Analyze these errors and suggest fixes"
git diff HEAD~5 | gemini "Write a summary of these changes for the changelog"
```
### Configuration
```markdown
# GEMINI.md — Project instructions (auto-loaded)
## Project
TypeScript monorepo using Turborepo. Apps: web (Next.js), api (Hono), mobile (Expo).
## Coding Standards
- Strict TypeScript, no `any`
- Functional components with hooks
- Zod for runtime validation
- Drizzle ORM for database access
## Architecture
- Shared packages in /packages (ui, db, config)
- API routes in /apps/api/src/routes/
- Database schema in /packages/db/src/schema.ts
```
### Multi-Modal Capabilities
```bash
# Analyze a screenshot
gemini "What's wrong with this UI?" --image screenshot.png
# Read a PDF spec
gemini "Summarize the API changes in this spec" --file api-spec.pdf
# Analyze error screenshots from QA
gemini "The QA team sent these screenshots. What bugs do you see?" --image bug1.png --image bug2.png
```
### MCP Tool Integration
```json
// .gemini/settings.json — MCP servers
{
"mcpServers": {
"filesystem": {
"command": "npx",
"args": ["-y", "@modelcontextprotocol/server-filesystem", "/path/to/project"]
},
"database": {
"command": "npx",
"args": ["-y", "@modelcontextprotocol/server-postgres", "postgresql://localhost/mydb"]
},
"github": {
"command": "npx",
"args": ["-y", "@modelcontextprotocol/server-github"],
"env": { "GITHUB_TOKEN": "${GITHUB_TOKEN}" }
}
}
}
```
### Large Codebase Analysis
```bash
# Gemini's 1M+ token window can process entire codebases
gemini "Read the entire src/ directory and create a dependency graph. Identify circular dependencies and suggest how to break them."
gemini "Analyze all test files. Which modules have low coverage? Generate tests for the 5 least-covered modules."
gemini "Review the entire API layer. Are there any endpoints that don't validate input? Fix them all."
```
## Installation
```bash
npm install -g @google/gemini-cli
# Requires: GOOGLE_API_KEY or Google Cloud auth
# Free tier: 1M tokens/day with Gemini API
```
## Best Practices
1. **GEMINI.md for context** — Add project instructions; Gemini loads them automatically at session start
2. **Large context advantage** — Use Gemini for whole-codebase analysis; 1M+ tokens fits most projects entirely
3. **Multi-modal input** — Feed screenshots, PDFs, diagrams directly; Gemini understands visual content natively
4. **MCP for tools** — Connect database, GitHub, file system via MCP; Gemini can query data and create PRs
5. **Pipe workflows** — Pipe `git diff`, `npm test`, `cat error.log` directly into prompts for contextual assistance
6. **Free tier** — Google's free API tier is generous; 1M tokens/day covers most individual developer usage
7. **Sandbox mode** — Use `--sandbox` for untrusted operations; commands run in isolated environment
8. **Extension system** — Create custom tools with the extension API; Gemini calls them as needed during tasksRelated Skills
vertex-ai-gemini
Google Cloud Vertex AI for enterprise Gemini deployments — production scaling, fine-tuning, and MLOps. Use when deploying Gemini in GCP-native environments, running fine-tuning jobs, needing enterprise IAM controls, VPC isolation, batch prediction at scale, or production ML pipelines on Google Cloud.
zustand
You are an expert in Zustand, the small, fast, and scalable state management library for React. You help developers manage global state without boilerplate using Zustand's hook-based stores, selectors for performance, middleware (persist, devtools, immer), computed values, and async actions — replacing Redux complexity with a simple, un-opinionated API in under 1KB.
zoho
Integrate and automate Zoho products. Use when a user asks to work with Zoho CRM, Zoho Books, Zoho Desk, Zoho Projects, Zoho Mail, or Zoho Creator, build custom integrations via Zoho APIs, automate workflows with Deluge scripting, sync data between Zoho apps and external systems, manage leads and deals, automate invoicing, build custom Zoho Creator apps, set up webhooks, or manage Zoho organization settings. Covers Zoho CRM, Books, Desk, Projects, Creator, and cross-product integrations.
zod
You are an expert in Zod, the TypeScript-first schema declaration and validation library. You help developers define schemas that validate data at runtime AND infer TypeScript types at compile time — eliminating the need to write types and validators separately. Used for API input validation, form validation, environment variables, config files, and any data boundary.
zipkin
Deploy and configure Zipkin for distributed tracing and request flow visualization. Use when a user needs to set up trace collection, instrument Java/Spring or other services with Zipkin, analyze service dependencies, or configure storage backends for trace data.
zig
Expert guidance for Zig, the systems programming language focused on performance, safety, and readability. Helps developers write high-performance code with compile-time evaluation, seamless C interop, no hidden control flow, and no garbage collector. Zig is used for game engines, operating systems, networking, and as a C/C++ replacement.
zed
Expert guidance for Zed, the high-performance code editor built in Rust with native collaboration, AI integration, and GPU-accelerated rendering. Helps developers configure Zed, create custom extensions, set up collaborative editing sessions, and integrate AI assistants for productive coding.
zeabur
Expert guidance for Zeabur, the cloud deployment platform that auto-detects frameworks, builds and deploys applications with zero configuration, and provides managed services like databases and message queues. Helps developers deploy full-stack applications with automatic scaling and one-click marketplace services.
zapier
Automate workflows between apps with Zapier. Use when a user asks to connect apps without code, automate repetitive tasks, sync data between services, or build no-code integrations between SaaS tools.
zabbix
Configure Zabbix for enterprise infrastructure monitoring with templates, triggers, discovery rules, and dashboards. Use when a user needs to set up Zabbix server, configure host monitoring, create custom templates, define trigger expressions, or automate host discovery and registration.
yup
Validate data with Yup schemas. Use when adding form validation, defining API request schemas, validating configuration, or building type-safe validation pipelines in JavaScript/TypeScript.
yt-dlp
Download video and audio from YouTube and other platforms with yt-dlp. Use when a user asks to download YouTube videos, extract audio from videos, download playlists, get subtitles, download specific formats or qualities, batch download, archive channels, extract metadata, embed thumbnails, download from social media platforms (Twitter, Instagram, TikTok), or build media ingestion pipelines. Covers format selection, audio extraction, playlists, subtitles, metadata, and automation.