Edge Deployment Skill
ML model optimization and deployment on robot edge devices (Jetson, embedded)
Best use case
Edge Deployment Skill is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
ML model optimization and deployment on robot edge devices (Jetson, embedded)
Teams using Edge Deployment Skill should expect a more consistent output, faster repeated execution, less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in
.claude/skills/edge-deployment/SKILL.mdinside your project - Restart your AI agent — it will auto-discover the skill
How Edge Deployment Skill Compares
| Feature / Agent | Edge Deployment Skill | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
ML model optimization and deployment on robot edge devices (Jetson, embedded)
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
SKILL.md Source
# Edge Deployment Skill ## Overview Expert skill for optimizing and deploying machine learning models on robot edge devices including NVIDIA Jetson and embedded systems. ## Capabilities - Configure TensorRT optimization for NVIDIA Jetson - Set up ONNX model conversion and optimization - Implement INT8 and FP16 quantization - Configure DeepStream for video analytics - Set up CUDA graph optimization - Implement model pruning and distillation - Configure DLA (Deep Learning Accelerator) deployment - Set up multi-stream inference - Implement ROS2 inference nodes - Profile and benchmark on target hardware ## Target Processes - nn-model-optimization.js - object-detection-pipeline.js - rl-robot-control.js - field-testing-validation.js ## Dependencies - TensorRT - ONNX Runtime - NVIDIA Jetson SDK - DeepStream ## Usage Context This skill is invoked when processes require deploying ML models on edge devices with optimized inference performance. ## Output Artifacts - TensorRT engine files - ONNX optimized models - Quantization configurations - DeepStream pipeline configs - Inference benchmark reports - ROS2 inference node implementations
Related Skills
knowledge-analytics
Knowledge base analytics, usage reporting, and effectiveness measurement
knowledge-extractor
Extract tribal knowledge from code, documentation, and commit history to preserve institutional memory
knowledge-curation
Context priming before work (bd prime) and self-reflection after completion to extract patterns, gotchas, and decisions into the knowledge base.
knowledge-graph-management
Capture, validate, query, and sync architectural patterns and design decisions in the knowledge graph
cog-knowledge-consolidation
Build structured knowledge frameworks from scattered vault notes with source attribution
process-builder
Scaffold new babysitter process definitions following SDK patterns, proper structure, and best practices. Guides the 3-phase workflow from research to implementation.
babysitter
Orchestrate via @babysitter. Use this skill when asked to babysit a run, orchestrate a process or whenever it is called explicitly. (babysit, babysitter, orchestrate, orchestrate a run, workflow, etc.)
yolo
Run Babysitter autonomously with minimal manual interruption.
user-install
Install the user-level Babysitter Codex setup.
team-install
Install the team-pinned Babysitter Codex workspace setup.
retrospect
Summarize or retrospect on a completed Babysitter run.
resume
Resume an existing Babysitter run from Codex.