foundation-models
On-device LLM integration using Apple's Foundation Models framework. Use when implementing AI text generation, structured output, or tool calling.
Best use case
foundation-models is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
On-device LLM integration using Apple's Foundation Models framework. Use when implementing AI text generation, structured output, or tool calling.
Teams using foundation-models should expect a more consistent output, faster repeated execution, less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in
.claude/skills/foundation-models/SKILL.mdinside your project - Restart your AI agent — it will auto-discover the skill
How foundation-models Compares
| Feature / Agent | foundation-models | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
On-device LLM integration using Apple's Foundation Models framework. Use when implementing AI text generation, structured output, or tool calling.
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
SKILL.md Source
# Foundation Models
Integrate Apple's on-device LLM into your apps for privacy-preserving AI features.
## When This Skill Activates
- User wants AI text generation features
- User needs structured data from natural language
- User asks about prompting or LLM integration
- User wants to implement AI assistants
- User needs content summarization or extraction
## Quick Start
### 1. Check Availability
```swift
import FoundationModels
struct IntelligentView: View {
private var model = SystemLanguageModel.default
var body: some View {
switch model.availability {
case .available:
ContentView()
case .unavailable(.deviceNotEligible):
UnsupportedDeviceView()
case .unavailable(.appleIntelligenceNotEnabled):
EnableIntelligenceView()
case .unavailable(.modelNotReady):
ModelDownloadingView()
case .unavailable(let reason):
ErrorView(reason: reason)
}
}
}
```
### 2. Create a Session
```swift
// Simple session
let session = LanguageModelSession()
// Session with instructions
let session = LanguageModelSession(instructions: """
You are a helpful cooking assistant.
Provide concise, practical advice for home cooks.
""")
```
### 3. Generate Response
```swift
let response = try await session.respond(to: "What's a quick dinner idea?")
print(response.content)
```
## Prompt Engineering Best Practices
### The Instruction Formula
Instructions set the model's persona and constraints. They're prioritized over prompts.
```
[Role] + [Task] + [Style] + [Safety]
```
**Example:**
```swift
let instructions = """
You are a fitness coach specializing in home workouts.
Help users create exercise routines based on their equipment and goals.
Keep responses under 100 words and use bullet points for exercises.
Decline requests for medical advice and suggest consulting a doctor.
"""
```
### Instruction Components
| Component | Purpose | Example |
|-----------|---------|---------|
| **Role** | Define persona | "You are a travel expert" |
| **Task** | What to do | "Help plan itineraries" |
| **Style** | Output format | "Use bullet points, be concise" |
| **Safety** | Boundaries | "Don't provide medical advice" |
### Effective Prompts
Prompts are user inputs. Make them:
| Principle | Bad | Good |
|-----------|-----|------|
| **Specific** | "Help with cooking" | "Suggest a 30-minute vegetarian dinner" |
| **Constrained** | "Tell me about dogs" | "Describe Golden Retrievers in 3 sentences" |
| **Focused** | "I need help with many things" | "What ingredients substitute for eggs in baking?" |
### Prompt Patterns
**Question Pattern:**
```swift
let prompt = "What are three ways to reduce food waste at home?"
```
**Command Pattern:**
```swift
let prompt = "Create a weekly meal plan for a family of four, budget-friendly."
```
**Extraction Pattern:**
```swift
let prompt = """
Extract the following from this email:
- Sender name
- Meeting date
- Action items
Email: \(emailContent)
"""
```
**Transformation Pattern:**
```swift
let prompt = "Rewrite this text to be more formal: \(casualText)"
```
## Structured Output with @Generable
Get typed Swift data instead of raw strings.
### Define Generable Types
```swift
@Generable(description: "A recipe suggestion")
struct Recipe {
var name: String
@Guide(description: "Cooking time in minutes", .range(5...180))
var cookingTime: Int
@Guide(description: "Difficulty level", .options(["Easy", "Medium", "Hard"]))
var difficulty: String
@Guide(description: "List of ingredients", .count(3...15))
var ingredients: [String]
@Guide(description: "Step-by-step instructions")
var instructions: [String]
}
```
### @Guide Constraints
| Constraint | Use Case | Example |
|------------|----------|---------|
| `.range(min...max)` | Numeric bounds | `.range(1...100)` |
| `.options([...])` | Enum-like choices | `.options(["Low", "Medium", "High"])` |
| `.count(n)` | Exact array length | `.count(5)` |
| `.count(min...max)` | Array length range | `.count(3...10)` |
### Generate Structured Data
```swift
let session = LanguageModelSession(instructions: """
You are a recipe assistant. Generate practical, home-cook friendly recipes.
""")
let recipe = try await session.respond(
to: "Suggest a quick pasta dish",
generating: Recipe.self
)
print("Recipe: \(recipe.content.name)")
print("Time: \(recipe.content.cookingTime) minutes")
print("Ingredients: \(recipe.content.ingredients.joined(separator: ", "))")
```
### Complex Nested Structures
```swift
@Generable(description: "A travel itinerary")
struct Itinerary {
var destination: String
@Guide(description: "Daily activities for the trip")
var days: [DayPlan]
}
@Generable(description: "Activities for one day")
struct DayPlan {
var dayNumber: Int
@Guide(description: "Morning activity")
var morning: String
@Guide(description: "Afternoon activity")
var afternoon: String
@Guide(description: "Evening activity")
var evening: String
}
```
## Tool Calling
Let the model call your code to access data or perform actions.
### Define a Tool
```swift
struct WeatherTool: Tool {
let name = "getWeather"
let description = "Get current weather for a location"
struct Arguments: Codable {
var location: String
}
func call(arguments: Arguments) async throws -> ToolOutput {
let weather = await WeatherService.shared.fetch(for: arguments.location)
return .string("Temperature: \(weather.temp)°F, Conditions: \(weather.conditions)")
}
}
```
### Use Tools in Session
```swift
let weatherTool = WeatherTool()
let session = LanguageModelSession(
instructions: "You help users plan outdoor activities based on weather.",
tools: [weatherTool]
)
// Model automatically calls tool when needed
let response = try await session.respond(
to: "Should I go hiking in San Francisco today?"
)
```
### Tool Error Handling
```swift
do {
let response = try await session.respond(to: prompt)
} catch let error as LanguageModelSession.ToolCallError {
print("Tool '\(error.tool.name)' failed: \(error.underlyingError)")
} catch {
print("Generation error: \(error)")
}
```
## Snapshot Streaming
Show responses as they generate for better UX.
### Stream to SwiftUI
```swift
@Generable
struct StoryIdea {
var title: String
@Guide(description: "A brief plot summary")
var plot: String
@Guide(description: "Main characters", .count(2...4))
var characters: [String]
}
struct StreamingView: View {
@State private var partial: StoryIdea.PartiallyGenerated?
@State private var isGenerating = false
var body: some View {
VStack(alignment: .leading) {
if let partial {
if let title = partial.title {
Text(title).font(.headline)
}
if let plot = partial.plot {
Text(plot)
}
if let characters = partial.characters {
ForEach(characters, id: \.self) { char in
Text("• \(char)")
}
}
}
Button("Generate Story Idea") {
Task { await generateStory() }
}
.disabled(isGenerating)
}
}
func generateStory() async {
isGenerating = true
defer { isGenerating = false }
let session = LanguageModelSession()
let stream = session.streamResponse(
to: "Create a sci-fi story idea",
generating: StoryIdea.self
)
for try await snapshot in stream {
partial = snapshot
}
}
}
```
## Multi-Turn Conversations
Reuse sessions to maintain context.
```swift
@Observable
final class ChatViewModel {
private var session: LanguageModelSession?
var messages: [ChatMessage] = []
func startConversation() {
session = LanguageModelSession(instructions: """
You are a helpful assistant. Remember context from earlier in our conversation.
""")
}
func send(_ message: String) async throws {
guard let session else { return }
messages.append(ChatMessage(role: .user, content: message))
let response = try await session.respond(to: message)
messages.append(ChatMessage(role: .assistant, content: response.content))
}
}
```
## Error Handling
```swift
do {
let response = try await session.respond(to: prompt)
} catch LanguageModelSession.GenerationError.exceededContextWindowSize {
// Context too large (>4,096 tokens)
// Solution: Start new session, break into smaller requests
} catch LanguageModelSession.GenerationError.cancelled {
// Request was cancelled
} catch {
print("Unexpected error: \(error)")
}
```
## Context Window Management
### Get Context Size Programmatically
Don't hardcode `4096` — query the model at runtime:
```swift
let model = SystemLanguageModel.default
let contextSize = try await model.contextSize
print("Context size: \(contextSize)") // 4096 (current limit)
```
> `contextSize` is marked `@backDeployed(before: iOS 26.4, macOS 26.4, visionOS 26.4)`.
### Measure Token Usage at Runtime
Use `tokenUsage(for:)` to measure the exact token cost of each component instead of guessing:
```swift
let model = SystemLanguageModel.default
// Measure instruction cost
let instructions = Instructions("You're a helpful assistant that generates haiku.")
let instructionTokens = try await model.tokenUsage(for: instructions)
print(instructionTokens.tokenCount) // 16
// Measure instructions + tools combined
let tools = [MoodTool()]
let combinedTokens = try await model.tokenUsage(for: instructions, tools: tools)
print(combinedTokens.tokenCount) // 79 — tools add significant overhead!
// Measure a prompt
let prompt = Prompt("Generate a haiku about Swift")
let promptTokens = try await model.tokenUsage(for: prompt)
print(promptTokens.tokenCount) // 14
// Track cumulative usage in a multi-turn session
let session = LanguageModelSession(model: model, tools: tools, instructions: instructions)
let response = try await session.respond(to: prompt)
let transcriptTokens = try await model.tokenUsage(for: session.transcript)
print(transcriptTokens.tokenCount)
```
### ⚠️ Tools Multiply Token Consumption
Tool definitions are serialized as JSON schemas, which significantly inflates token usage. In the example above, adding a single tool jumped instructions from **16 → 79 tokens** (nearly 5x). Always measure before adding tools, and consider whether you truly need them for a given session.
### Token Budget Monitoring
Track context usage as a percentage to prevent overflows:
```swift
extension SystemLanguageModel.TokenUsage {
func percent(ofContextSize contextSize: Int) -> Float {
guard contextSize > 0 else { return 0 }
return Float(tokenCount) / Float(contextSize)
}
func formattedPercent(ofContextSize contextSize: Int) -> String {
percent(ofContextSize: contextSize)
.formatted(.percent.precision(.fractionLength(0)).rounded(rule: .down))
}
}
// Usage
let contextSize = try await model.contextSize
print(instructionTokens.formattedPercent(ofContextSize: contextSize)) // "0%"
print(combinedTokens.formattedPercent(ofContextSize: contextSize)) // "1%"
```
### Pre-flight Token Budget Check
Check if a prompt fits before sending to prevent `exceededContextWindowSize` errors:
```swift
func canFit(prompt: Prompt, in session: LanguageModelSession, model: SystemLanguageModel) async throws -> Bool {
let contextSize = try await model.contextSize
let transcriptTokens = try await model.tokenUsage(for: session.transcript)
let promptTokens = try await model.tokenUsage(for: prompt)
let totalNeeded = transcriptTokens.tokenCount + promptTokens.tokenCount
return totalNeeded < contextSize
}
```
### Debug with TranscriptDebugMenu
During development, drop `TranscriptDebugMenu` into your SwiftUI view hierarchy to visually inspect the full conversation transcript and token consumption in real time.
### Strategies for Large Content
```swift
// Break content into chunks
func processLargeDocument(_ document: String) async throws -> [Summary] {
let chunks = document.split(every: 10000) // ~2500 tokens per chunk
var summaries: [Summary] = []
for chunk in chunks {
let session = LanguageModelSession() // New session per chunk
let summary = try await session.respond(
to: "Summarize this section: \(chunk)",
generating: Summary.self
)
summaries.append(summary.content)
}
return summaries
}
```
## Generation Options
```swift
let options = GenerationOptions(
temperature: 0.7 // 0.0 = deterministic, 2.0 = creative
)
let response = try await session.respond(
to: prompt,
options: options
)
```
| Temperature | Use Case |
|-------------|----------|
| 0.0-0.3 | Factual extraction, data processing |
| 0.5-0.7 | Balanced creativity and accuracy |
| 1.0-2.0 | Creative writing, brainstorming |
## Common Patterns
### Content Extraction
```swift
@Generable
struct EventDetails {
var title: String
var date: String?
var time: String?
var location: String?
var attendees: [String]
}
let session = LanguageModelSession(instructions: """
Extract event details from text. Use nil for missing information.
""")
let event = try await session.respond(
to: "Extract: Team lunch next Friday at noon in Conference Room B with John and Sarah",
generating: EventDetails.self
)
```
### Summarization
```swift
let session = LanguageModelSession(instructions: """
Summarize text concisely. Focus on key points and actionable items.
Maximum 3 bullet points.
""")
let response = try await session.respond(
to: "Summarize this email: \(longEmail)"
)
```
### Classification
```swift
@Generable
struct Classification {
@Guide(description: "Category", .options(["Bug", "Feature", "Question", "Other"]))
var category: String
@Guide(description: "Priority", .options(["Low", "Medium", "High"]))
var priority: String
@Guide(description: "Brief summary")
var summary: String
}
let result = try await session.respond(
to: "Classify this support ticket: \(ticketText)",
generating: Classification.self
)
```
## Checklist
Before shipping:
- [ ] Check model availability before showing AI features
- [ ] Provide fallback UI for unsupported devices
- [ ] Handle all error cases gracefully
- [ ] Test with various prompt lengths
- [ ] Verify context window limits aren't exceeded
- [ ] Use @Generable for any structured output
- [ ] Add appropriate loading states for generation
- [ ] Test streaming UX feels responsive
- [ ] Ensure instructions are clear and specific
- [ ] Include safety boundaries in instructions
## References
- [Foundation Models Documentation](https://developer.apple.com/documentation/FoundationModels)
- [Generating content and performing tasks](https://developer.apple.com/documentation/FoundationModels/generating-content-and-performing-tasks-with-foundation-models)
- [Guided generation](https://developer.apple.com/documentation/FoundationModels/generating-swift-data-structures-with-guided-generation)
- [Tool calling](https://developer.apple.com/documentation/FoundationModels/expanding-generation-with-tool-calling)
- [HIG: Generative AI](https://developer.apple.com/design/human-interface-guidelines/technologies/generative-ai)
- [TN3193: Managing the on-device foundation model's context window](https://developer.apple.com/documentation/technotes/tn3193-managing-the-on-device-foundation-model-s-context-window)
- [Tracking token usage in Foundation Models](https://artemnovichkov.com/blog/tracking-token-usage-in-foundation-models) — Artem NovichkovRelated Skills
watchOS
watchOS development guidance including SwiftUI for Watch, Watch Connectivity, complications, and watch-specific UI patterns. Use for watchOS code review, best practices, or Watch app development.
visionos-widgets
visionOS widget patterns including mounting styles, glass/paper textures, proximity-aware layouts, and spatial widget families. Use when creating or adapting widgets for visionOS.
test-data-factory
Generate test fixture factories for your models. Builder pattern and static factories for zero-boilerplate test data. Use when tests need sample data setup.
test-contract
Generate protocol/interface test suites that any implementation must pass. Define the contract once, test every implementation. Use when designing protocols or swapping implementations.
tdd-refactor-guard
Pre-refactor safety checklist. Verifies test coverage exists before AI modifies existing code. Use before asking AI to refactor anything.
tdd-feature
Red-green-refactor scaffold for building new features with TDD. Write failing tests first, then implement to pass. Use when building new features test-first.
tdd-bug-fix
Fix bugs using red-green-refactor — reproduce the bug as a failing test first, then fix it. Use when fixing bugs to ensure they never regress.
snapshot-test-setup
Set up SwiftUI visual regression testing with swift-snapshot-testing. Generates snapshot test boilerplate and CI configuration. Use for UI regression prevention.
integration-test-scaffold
Generate cross-module test harness with mock servers, in-memory stores, and test configuration. Use when testing networking + persistence + business logic together.
characterization-test-generator
Generates tests that capture current behavior of existing code before refactoring. Use when you need a safety net before AI-assisted refactoring or modifying legacy code.
testing
TDD and testing skills for iOS/macOS apps. Covers characterization tests, TDD workflows, test contracts, snapshot tests, and test infrastructure. Use for test-driven development, adding tests to existing code, or building test infrastructure.
webkit-integration
WebKit integration in SwiftUI using WebView and WebPage for embedding web content, navigation, JavaScript interop, and customization. Use when embedding web content in SwiftUI apps.