---
name: axiom-ios-ml
description: Use when deploying ANY machine learning model on-device, converting models to CoreML, compressing models, or implementing speech-to-text. Covers CoreML conversion, MLTensor, model compression (quantization/palettization/pruning), stateful models, KV-cache, multi-function models, async prediction, SpeechAnalyzer, SpeechTranscriber.
license: MIT
---
# iOS Machine Learning Router
You MUST use this skill for ANY on-device machine learning or speech-to-text work.
## When to Use
Use this router when:
- Converting PyTorch/TensorFlow models to CoreML
- Deploying ML models on-device
- Compressing models (quantization, palettization, pruning)
- Working with large language models (LLMs)
- Implementing KV-cache for transformers
- Using MLTensor for model stitching
- Building speech-to-text features
- Transcribing audio (live or recorded)
## Boundary with ios-ai

Know the difference between ios-ml and ios-ai:
| Developer Intent | Router |
|---|---|
| "Use Apple Intelligence / Foundation Models" | ios-ai (Apple's on-device LLM) |
| "Run my own ML model on device" | ios-ml (CoreML conversion + deployment) |
| "Add text generation with @Generable" | ios-ai (Foundation Models structured output) |
| "Deploy a custom LLM with KV-cache" | ios-ml (custom model optimization) |
| "Use Vision framework for image analysis" | ios-vision (not ML deployment) |
| "Use pre-trained Apple NLP models" | ios-ai (Apple's models, not custom) |

Rule of thumb: if the developer is converting/compressing/deploying their own model → ios-ml. If they're using Apple's built-in AI → ios-ai. If they're doing computer vision → ios-vision.
## Routing Logic

### CoreML Work

Implementation patterns → /skill coreml
- Model conversion workflow
- MLTensor for model stitching
- Stateful models with KV-cache
- Multi-function models (adapters/LoRA)
- Async prediction patterns
- Compute unit selection
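The compute unit selection and async prediction patterns above fit in a few lines. This is a minimal sketch, not the full coreml skill: the model path `MyModel.mlmodelc`, the `"input"` feature name, and the tensor shape are all hypothetical placeholders for your own compiled model.

```swift
import CoreML

func runInference() async throws -> MLFeatureProvider {
    // Prefer the Neural Engine, falling back to CPU (skips the GPU path).
    let config = MLModelConfiguration()
    config.computeUnits = .cpuAndNeuralEngine

    // Async load keeps model compilation and caching off the main thread (iOS 16+).
    let model = try await MLModel.load(
        contentsOf: URL(fileURLWithPath: "MyModel.mlmodelc"),  // hypothetical path
        configuration: config
    )

    // Async prediction (iOS 17+) is safe to call from concurrent tasks,
    // so multiple requests can be in flight against one model instance.
    let array = try MLMultiArray(shape: [1, 3, 224, 224], dataType: .float16)
    let input = try MLDictionaryFeatureProvider(dictionary: ["input": array])
    return try await model.prediction(from: input)
}
```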
API reference → /skill coreml-ref
- CoreML Tools Python API
- MLModel lifecycle
- MLTensor operations
- MLComputeDevice availability
- State management APIs
- Performance reports
Diagnostics → /skill coreml-diag
- Model won't load
- Slow inference
- Memory issues
- Compression accuracy loss
- Compute unit problems
### Speech Work

Implementation patterns → /skill speech
- SpeechAnalyzer setup (iOS 26+)
- SpeechTranscriber configuration
- Live transcription
- File transcription
- Volatile vs finalized results
- Model asset management
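The flow above can be sketched roughly as follows. API names come from the iOS 26 SpeechAnalyzer framework, but the exact signatures, option sets, and asset-management calls here are assumptions; verify against current documentation (the speech skill covers the real setup, including feeding audio from an AVAudioEngine tap).

```swift
import Speech

func transcribeLive() async throws {
    // Transcriber module: volatile results stream in before being finalized.
    let transcriber = SpeechTranscriber(
        locale: Locale(identifier: "en-US"),
        transcriptionOptions: [],
        reportingOptions: [.volatileResults],
        attributeOptions: []
    )

    // Download on-device model assets if this locale isn't installed yet.
    if let request = try await AssetInventory.assetInstallationRequest(supporting: [transcriber]) {
        try await request.downloadAndInstall()
    }

    let analyzer = SpeechAnalyzer(modules: [transcriber])
    let (inputSequence, inputBuilder) = AsyncStream.makeStream(of: AnalyzerInput.self)
    try await analyzer.start(inputSequence: inputSequence)

    // Feed converted audio buffers via inputBuilder.yield(AnalyzerInput(buffer:))
    // from your audio tap, then consume results:
    for try await result in transcriber.results where result.isFinal {
        print(result.text)  // AttributedString
    }
    _ = inputBuilder  // keep the stream continuation alive in this sketch
}
```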
## Decision Tree

- Implementing / converting ML models? → coreml
- CoreML API reference? → coreml-ref
- Debugging ML issues (load, inference, compression)? → coreml-diag
- Speech-to-text / transcription? → speech
## Anti-Rationalization
| Thought | Reality |
|---|---|
| "CoreML is just load and predict" | CoreML has compression, stateful models, compute unit selection, and async prediction. coreml covers all. |
| "My model is small, no optimization needed" | Even small models benefit from compute unit selection and async prediction. coreml has the patterns. |
| "I'll just use SFSpeechRecognizer" | iOS 26 has SpeechAnalyzer with better accuracy and offline support. speech skill covers the modern API. |
## Critical Patterns
coreml:
- Model conversion (PyTorch → CoreML)
- Compression (palettization, quantization, pruning)
- Stateful KV-cache for LLMs
- Multi-function models for adapters
- MLTensor for pipeline stitching
- Async concurrent prediction
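For orientation, the stateful KV-cache pattern looks like this at the call site: a model converted with state tensors (iOS 18+) carries its cache between calls in an MLState, so each decode step feeds only the newest token. A minimal sketch, assuming a converted decoder with a hypothetical `"inputIds"` feature; the coreml skill covers the conversion side.

```swift
import CoreML

func decodeTokens(model: MLModel, steps: Int) throws {
    // One MLState per generation sequence; the KV-cache lives inside it.
    let state = model.makeState()
    for _ in 0..<steps {
        let token = try MLMultiArray(shape: [1, 1], dataType: .int32)
        let input = try MLDictionaryFeatureProvider(dictionary: ["inputIds": token])
        let output = try model.prediction(from: input, using: state)
        _ = output  // sample the next token from the logits here
    }
}
```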
coreml-diag:
- Load failures and caching
- Inference performance issues
- Memory pressure from models
- Accuracy degradation from compression
speech:
- SpeechAnalyzer + SpeechTranscriber setup
- AssetInventory model management
- Live transcription with volatile results
- Audio format conversion
## Example Invocations
User: "How do I convert a PyTorch model to CoreML?"
→ Invoke: /skill coreml

User: "Compress my model to fit on iPhone"
→ Invoke: /skill coreml

User: "Implement KV-cache for my language model"
→ Invoke: /skill coreml

User: "Model loads slowly on first launch"
→ Invoke: /skill coreml-diag

User: "My compressed model has bad accuracy"
→ Invoke: /skill coreml-diag

User: "Add live transcription to my app"
→ Invoke: /skill speech

User: "Transcribe audio files with SpeechAnalyzer"
→ Invoke: /skill speech

User: "What's MLTensor and how do I use it?"
→ Invoke: /skill coreml-ref
