alma-scraper
Intelligent scraper for Australian youth justice sources. Discovers, extracts, and learns from government, Indigenous, research, and media sources.
Best use case
alma-scraper is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
Intelligent scraper for Australian youth justice sources. Discovers, extracts, and learns from government, Indigenous, research, and media sources.
Teams using alma-scraper should expect a more consistent output, faster repeated execution, less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in
.claude/skills/alma-scraper/SKILL.mdinside your project - Restart your AI agent — it will auto-discover the skill
How alma-scraper Compares
| Feature / Agent | alma-scraper | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
Intelligent scraper for Australian youth justice sources. Discovers, extracts, and learns from government, Indigenous, research, and media sources.
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
SKILL.md Source
# ALMA Intelligent Scraper
## When to Use
- Finding new youth justice information
- Updating ALMA intelligence
- Discovering new sources
- Analyzing coverage gaps
- Checking what's new in youth justice
## Commands
| Command | Purpose | Duration |
|---------|---------|----------|
| `quick` | Top 10 high-value sources | 5 min |
| `deep` | All 50+ sources with discovery | 30-60 min |
| `discover` | Follow discovered links | Variable |
| `source "QLD"` | Deep dive specific jurisdiction | 15 min |
| `gaps` | Show coverage gaps | 2 min |
| `status` | Current knowledge state | Instant |
## Learning Cycle
```
SCRAPE → EXTRACT → EVALUATE → LEARN → STORE
(Claude) (Quality) (Patterns)
```
## Quality Signals
| Signal | Weight |
|--------|--------|
| Relevance (AU youth justice?) | 30% |
| Novelty (new info?) | 25% |
| Specificity (concrete details?) | 20% |
| Evidence (research backed?) | 15% |
| Actionability (useful?) | 10% |
## Priority Formula
```
priority = (quality × 0.4) + (freshness_need × 0.3) + (coverage_gap × 0.3)
```
## Sacred Boundaries
**Never scrape:** Private info, court records, social media, paywalled
**Always mark:** Community Controlled, Indigenous orgs, cultural knowledge
**Always check:** Consent level, cultural authority, data sovereignty
## File References
| Need | Reference |
|------|-----------|
| Database schema | `references/database-schema.md` |
| Extraction patterns | `references/extraction-patterns.md` |
| Coverage tracking | `references/coverage-tracking.md` |
| Implementation code | `references/implementation.md` |Related Skills
firecrawl-scraper
Deep web scraping, screenshots, PDF parsing, and website crawling using Firecrawl API
x-twitter-scraper
X API & Twitter scraper skill for AI coding agents. Builds integrations with the Xquik REST API, MCP server & webhooks: tweet search, user lookup, follower extraction, engagement metrics, giveaway contest draws, trending topics, account monitoring, reply/retweet/quote extraction, community & Space data, mutual follow checks. Works with Claude Code, Cursor, Codex, Copilot, Windsurf & 40+ agents.
apify-ultimate-scraper
Universal AI-powered web scraper for any platform. Scrape data from Instagram, Facebook, TikTok, YouTube, Google Maps, Google Search, Google Trends, Booking.com, and TripAdvisor. Use for lead gener...
bgo
Automates the complete Blender build-go workflow, from building and packaging your extension/add-on to removing old versions, installing, enabling, and launching Blender for quick testing and iteration.
moai-lang-r
R 4.4+ best practices with testthat 3.2, lintr 3.2, and data analysis patterns.
moai-lang-python
Python 3.13+ development specialist covering FastAPI, Django, async patterns, data science, testing with pytest, and modern Python features. Use when developing Python APIs, web applications, data pipelines, or writing tests.
moai-icons-vector
Vector icon libraries ecosystem guide covering 10+ major libraries with 200K+ icons, including React Icons (35K+), Lucide (1000+), Tabler Icons (5900+), Iconify (200K+), Heroicons, Phosphor, and Radix Icons with implementation patterns, decision trees, and best practices.
moai-foundation-trust
Complete TRUST 4 principles guide covering Test First, Readable, Unified, Secured. Validation methods, enterprise quality gates, metrics, and November 2025 standards. Enterprise v4.0 with 50+ software quality standards references.
moai-foundation-memory
Persistent memory across sessions using MCP Memory Server for user preferences, project context, and learned patterns
moai-foundation-core
MoAI-ADK's foundational principles - TRUST 5, SPEC-First TDD, delegation patterns, token optimization, progressive disclosure, modular architecture, agent catalog, command reference, and execution rules for building AI-powered development workflows
moai-cc-claude-md
Authoring CLAUDE.md Project Instructions. Design project-specific AI guidance, document workflows, define architecture patterns. Use when creating CLAUDE.md files for projects, documenting team standards, or establishing AI collaboration guidelines.
moai-alfred-language-detection
Auto-detects project language and framework from package.json, pyproject.toml, etc.