tensorflow-trainer

TensorFlow/Keras model training skill with callbacks, distributed strategies, and TensorBoard integration.

509 stars

Best use case

tensorflow-trainer is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

TensorFlow/Keras model training skill with callbacks, distributed strategies, and TensorBoard integration.

Teams using tensorflow-trainer should expect a more consistent output, faster repeated execution, less prompt rewriting.

When to use this skill

  • You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

  • You only need a quick one-off answer and do not need a reusable workflow.
  • You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/tensorflow-trainer/SKILL.md --create-dirs "https://raw.githubusercontent.com/a5c-ai/babysitter/main/library/specializations/data-science-ml/skills/tensorflow-trainer/SKILL.md"

Manual Installation

  1. Download SKILL.md from GitHub
  2. Place it in .claude/skills/tensorflow-trainer/SKILL.md inside your project
  3. Restart your AI agent — it will auto-discover the skill

How tensorflow-trainer Compares

Feature / Agenttensorflow-trainerStandard Approach
Platform SupportNot specifiedLimited / Varies
Context Awareness High Baseline
Installation ComplexityUnknownN/A

Frequently Asked Questions

What does this skill do?

TensorFlow/Keras model training skill with callbacks, distributed strategies, and TensorBoard integration.

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

SKILL.md Source

# tensorflow-trainer

## Overview

TensorFlow/Keras model training skill with callbacks, distributed strategies, TensorBoard integration, and production-ready model export capabilities.

## Capabilities

- Keras model training with callbacks
- Custom training loops with tf.GradientTape
- Distribution strategy configuration (MirroredStrategy, MultiWorkerMirroredStrategy, TPUStrategy)
- TensorBoard logging and visualization
- SavedModel export for TF Serving
- TFLite conversion for edge deployment
- Mixed precision training

## Target Processes

- Model Training Pipeline with Experiment Tracking
- Distributed Training Orchestration
- Model Deployment Pipeline

## Tools and Libraries

- TensorFlow
- Keras
- TensorBoard
- TensorFlow Serving
- TensorFlow Lite

## Input Schema

```json
{
  "type": "object",
  "required": ["modelConfig", "dataConfig", "trainingConfig"],
  "properties": {
    "modelConfig": {
      "type": "object",
      "properties": {
        "modelPath": { "type": "string" },
        "modelType": { "type": "string", "enum": ["sequential", "functional", "subclassed"] }
      }
    },
    "dataConfig": {
      "type": "object",
      "properties": {
        "trainPath": { "type": "string" },
        "valPath": { "type": "string" },
        "batchSize": { "type": "integer" },
        "prefetch": { "type": "boolean" }
      }
    },
    "trainingConfig": {
      "type": "object",
      "properties": {
        "epochs": { "type": "integer" },
        "optimizer": { "type": "string" },
        "learningRate": { "type": "number" },
        "loss": { "type": "string" },
        "metrics": { "type": "array", "items": { "type": "string" } },
        "callbacks": { "type": "array", "items": { "type": "string" } },
        "distributionStrategy": { "type": "string" }
      }
    },
    "exportConfig": {
      "type": "object",
      "properties": {
        "savedModelPath": { "type": "string" },
        "tflitePath": { "type": "string" },
        "servingSignatures": { "type": "array", "items": { "type": "string" } }
      }
    }
  }
}
```

## Output Schema

```json
{
  "type": "object",
  "required": ["status", "metrics", "modelPath"],
  "properties": {
    "status": {
      "type": "string",
      "enum": ["success", "error", "early_stopped"]
    },
    "metrics": {
      "type": "object",
      "properties": {
        "loss": { "type": "number" },
        "valLoss": { "type": "number" },
        "accuracy": { "type": "number" },
        "valAccuracy": { "type": "number" },
        "epochsTrained": { "type": "integer" }
      }
    },
    "modelPath": {
      "type": "string"
    },
    "savedModelPath": {
      "type": "string"
    },
    "tensorboardLogDir": {
      "type": "string"
    },
    "history": {
      "type": "object",
      "description": "Training history with all metrics per epoch"
    }
  }
}
```

## Usage Example

```javascript
{
  kind: 'skill',
  title: 'Train TensorFlow model',
  skill: {
    name: 'tensorflow-trainer',
    context: {
      modelConfig: {
        modelPath: 'models/cnn_model.py',
        modelType: 'functional'
      },
      dataConfig: {
        trainPath: 'data/train',
        valPath: 'data/val',
        batchSize: 64,
        prefetch: true
      },
      trainingConfig: {
        epochs: 50,
        optimizer: 'adam',
        learningRate: 0.001,
        loss: 'sparse_categorical_crossentropy',
        metrics: ['accuracy'],
        callbacks: ['early_stopping', 'model_checkpoint', 'tensorboard']
      }
    }
  }
}
```

Related Skills

vqc-trainer

509
from a5c-ai/babysitter

Variational quantum classifier training skill with gradient optimization

tensorflow-physics-ml

509
from a5c-ai/babysitter

TensorFlow machine learning skill specialized for physics applications including neural network potentials and surrogate models

calibration-trainer

509
from a5c-ai/babysitter

Probability calibration training skill for improving forecast accuracy and reducing overconfidence

sklearn-model-trainer

509
from a5c-ai/babysitter

Scikit-learn model training skill with cross-validation, hyperparameter tuning, pipeline construction, and model serialization. Enables automated ML model development using scikit-learn's comprehensive toolkit.

ray-distributed-trainer

509
from a5c-ai/babysitter

Distributed computing skill using Ray for parallel training, hyperparameter search, and resource management.

pytorch-trainer

509
from a5c-ai/babysitter

PyTorch model training skill with custom training loops, gradient management, and GPU optimization.

process-builder

509
from a5c-ai/babysitter

Scaffold new babysitter process definitions following SDK patterns, proper structure, and best practices. Guides the 3-phase workflow from research to implementation.

Workflow & Productivity

babysitter

509
from a5c-ai/babysitter

Orchestrate via @babysitter. Use this skill when asked to babysit a run, orchestrate a process or whenever it is called explicitly. (babysit, babysitter, orchestrate, orchestrate a run, workflow, etc.)

yolo

509
from a5c-ai/babysitter

Run Babysitter autonomously with minimal manual interruption.

user-install

509
from a5c-ai/babysitter

Install the user-level Babysitter Codex setup.

team-install

509
from a5c-ai/babysitter

Install the team-pinned Babysitter Codex workspace setup.

retrospect

509
from a5c-ai/babysitter

Summarize or retrospect on a completed Babysitter run.