Edge Deployment Skill

ML model optimization and deployment on robot edge devices (Jetson, embedded)

509 stars

Best use case

Edge Deployment Skill is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

ML model optimization and deployment on robot edge devices (Jetson, embedded)

Teams using Edge Deployment Skill should expect a more consistent output, faster repeated execution, less prompt rewriting.

When to use this skill

You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

You only need a quick one-off answer and do not need a reusable workflow.
You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/edge-deployment/SKILL.md --create-dirs "https://raw.githubusercontent.com/a5c-ai/babysitter/main/library/specializations/robotics-simulation/skills/edge-deployment/SKILL.md"

Manual Installation

Download SKILL.md from GitHub
Place it in .claude/skills/edge-deployment/SKILL.md inside your project
Restart your AI agent — it will auto-discover the skill

How Edge Deployment Skill Compares

Feature / Agent	Edge Deployment Skill	Standard Approach
Platform Support	Not specified	Limited / Varies
Context Awareness	High	Baseline
Installation Complexity	Unknown	N/A

Frequently Asked Questions

What does this skill do?

ML model optimization and deployment on robot edge devices (Jetson, embedded)

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

SKILL.md Source

# Edge Deployment Skill

## Overview

Expert skill for optimizing and deploying machine learning models on robot edge devices including NVIDIA Jetson and embedded systems.

## Capabilities

- Configure TensorRT optimization for NVIDIA Jetson
- Set up ONNX model conversion and optimization
- Implement INT8 and FP16 quantization
- Configure DeepStream for video analytics
- Set up CUDA graph optimization
- Implement model pruning and distillation
- Configure DLA (Deep Learning Accelerator) deployment
- Set up multi-stream inference
- Implement ROS2 inference nodes
- Profile and benchmark on target hardware

## Target Processes

- nn-model-optimization.js
- object-detection-pipeline.js
- rl-robot-control.js
- field-testing-validation.js

## Dependencies

- TensorRT
- ONNX Runtime
- NVIDIA Jetson SDK
- DeepStream

## Usage Context

This skill is invoked when processes require deploying ML models on edge devices with optimized inference performance.

## Output Artifacts

- TensorRT engine files
- ONNX optimized models
- Quantization configurations
- DeepStream pipeline configs
- Inference benchmark reports
- ROS2 inference node implementations

Related Skills

knowledge-analytics

509

from a5c-ai/babysitter

Knowledge base analytics, usage reporting, and effectiveness measurement

knowledge-extractor

509

from a5c-ai/babysitter

Extract tribal knowledge from code, documentation, and commit history to preserve institutional memory

knowledge-curation

509

from a5c-ai/babysitter

Context priming before work (bd prime) and self-reflection after completion to extract patterns, gotchas, and decisions into the knowledge base.

knowledge-graph-management

509

from a5c-ai/babysitter

Capture, validate, query, and sync architectural patterns and design decisions in the knowledge graph

cog-knowledge-consolidation

509

from a5c-ai/babysitter

Build structured knowledge frameworks from scattered vault notes with source attribution

process-builder

509

from a5c-ai/babysitter

Scaffold new babysitter process definitions following SDK patterns, proper structure, and best practices. Guides the 3-phase workflow from research to implementation.

Workflow & Productivity

babysitter

509

from a5c-ai/babysitter

Orchestrate via @babysitter. Use this skill when asked to babysit a run, orchestrate a process or whenever it is called explicitly. (babysit, babysitter, orchestrate, orchestrate a run, workflow, etc.)