alma-scraper

Intelligent scraper for Australian youth justice sources. Discovers, extracts, and learns from government, Indigenous, research, and media sources.

16 stars

Best use case

alma-scraper is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Intelligent scraper for Australian youth justice sources. Discovers, extracts, and learns from government, Indigenous, research, and media sources.

Teams using alma-scraper should expect a more consistent output, faster repeated execution, less prompt rewriting.

When to use this skill

  • You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

  • You only need a quick one-off answer and do not need a reusable workflow.
  • You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/alma-scraper/SKILL.md --create-dirs "https://raw.githubusercontent.com/diegosouzapw/awesome-omni-skill/main/skills/content-media/alma-scraper/SKILL.md"

Manual Installation

  1. Download SKILL.md from GitHub
  2. Place it in .claude/skills/alma-scraper/SKILL.md inside your project
  3. Restart your AI agent — it will auto-discover the skill

How alma-scraper Compares

Feature / Agentalma-scraperStandard Approach
Platform SupportNot specifiedLimited / Varies
Context Awareness High Baseline
Installation ComplexityUnknownN/A

Frequently Asked Questions

What does this skill do?

Intelligent scraper for Australian youth justice sources. Discovers, extracts, and learns from government, Indigenous, research, and media sources.

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

SKILL.md Source

# ALMA Intelligent Scraper

## When to Use
- Finding new youth justice information
- Updating ALMA intelligence
- Discovering new sources
- Analyzing coverage gaps
- Checking what's new in youth justice

## Commands

| Command | Purpose | Duration |
|---------|---------|----------|
| `quick` | Top 10 high-value sources | 5 min |
| `deep` | All 50+ sources with discovery | 30-60 min |
| `discover` | Follow discovered links | Variable |
| `source "QLD"` | Deep dive specific jurisdiction | 15 min |
| `gaps` | Show coverage gaps | 2 min |
| `status` | Current knowledge state | Instant |

## Learning Cycle

```
SCRAPE → EXTRACT → EVALUATE → LEARN → STORE
         (Claude)   (Quality)  (Patterns)
```

## Quality Signals

| Signal | Weight |
|--------|--------|
| Relevance (AU youth justice?) | 30% |
| Novelty (new info?) | 25% |
| Specificity (concrete details?) | 20% |
| Evidence (research backed?) | 15% |
| Actionability (useful?) | 10% |

## Priority Formula
```
priority = (quality × 0.4) + (freshness_need × 0.3) + (coverage_gap × 0.3)
```

## Sacred Boundaries

**Never scrape:** Private info, court records, social media, paywalled
**Always mark:** Community Controlled, Indigenous orgs, cultural knowledge
**Always check:** Consent level, cultural authority, data sovereignty

## File References

| Need | Reference |
|------|-----------|
| Database schema | `references/database-schema.md` |
| Extraction patterns | `references/extraction-patterns.md` |
| Coverage tracking | `references/coverage-tracking.md` |
| Implementation code | `references/implementation.md` |

Related Skills

firecrawl-scraper

16
from diegosouzapw/awesome-omni-skill

Deep web scraping, screenshots, PDF parsing, and website crawling using Firecrawl API

x-twitter-scraper

16
from diegosouzapw/awesome-omni-skill

X API & Twitter scraper skill for AI coding agents. Builds integrations with the Xquik REST API, MCP server & webhooks: tweet search, user lookup, follower extraction, engagement metrics, giveaway contest draws, trending topics, account monitoring, reply/retweet/quote extraction, community & Space data, mutual follow checks. Works with Claude Code, Cursor, Codex, Copilot, Windsurf & 40+ agents.

apify-ultimate-scraper

16
from diegosouzapw/awesome-omni-skill

Universal AI-powered web scraper for any platform. Scrape data from Instagram, Facebook, TikTok, YouTube, Google Maps, Google Search, Google Trends, Booking.com, and TripAdvisor. Use for lead gener...

bgo

10
from diegosouzapw/awesome-omni-skill

Automates the complete Blender build-go workflow, from building and packaging your extension/add-on to removing old versions, installing, enabling, and launching Blender for quick testing and iteration.

Coding & Development

moai-lang-r

16
from diegosouzapw/awesome-omni-skill

R 4.4+ best practices with testthat 3.2, lintr 3.2, and data analysis patterns.

moai-lang-python

16
from diegosouzapw/awesome-omni-skill

Python 3.13+ development specialist covering FastAPI, Django, async patterns, data science, testing with pytest, and modern Python features. Use when developing Python APIs, web applications, data pipelines, or writing tests.

moai-icons-vector

16
from diegosouzapw/awesome-omni-skill

Vector icon libraries ecosystem guide covering 10+ major libraries with 200K+ icons, including React Icons (35K+), Lucide (1000+), Tabler Icons (5900+), Iconify (200K+), Heroicons, Phosphor, and Radix Icons with implementation patterns, decision trees, and best practices.

moai-foundation-trust

16
from diegosouzapw/awesome-omni-skill

Complete TRUST 4 principles guide covering Test First, Readable, Unified, Secured. Validation methods, enterprise quality gates, metrics, and November 2025 standards. Enterprise v4.0 with 50+ software quality standards references.

moai-foundation-memory

16
from diegosouzapw/awesome-omni-skill

Persistent memory across sessions using MCP Memory Server for user preferences, project context, and learned patterns

moai-foundation-core

16
from diegosouzapw/awesome-omni-skill

MoAI-ADK's foundational principles - TRUST 5, SPEC-First TDD, delegation patterns, token optimization, progressive disclosure, modular architecture, agent catalog, command reference, and execution rules for building AI-powered development workflows

moai-cc-claude-md

16
from diegosouzapw/awesome-omni-skill

Authoring CLAUDE.md Project Instructions. Design project-specific AI guidance, document workflows, define architecture patterns. Use when creating CLAUDE.md files for projects, documenting team standards, or establishing AI collaboration guidelines.

moai-alfred-language-detection

16
from diegosouzapw/awesome-omni-skill

Auto-detects project language and framework from package.json, pyproject.toml, etc.