project-knowledge-base

Collects, structures and maintains a Project Knowledge Base (PKB.md) in Obsidian for a marketing agency. Aggregates data from Google Drive, Gmail, Telegram (group chat and DMs via MTProto), moo.team tasks/comments, and local Obsidian meeting transcripts. Uses async parallel collection and a two-stage LLM pipeline for init. Use when the user wants to initialize, update or enrich a project's knowledge base, mentions PKB, project knowledge base, синхронизация проекта, база знаний проекта, init_project_knowledge, update_project_knowledge, or ad_hoc_add_context.

7 stars

byai-mindset-org

View on GitHub Installation ↓

Best use case

project-knowledge-base is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Teams using project-knowledge-base should expect a more consistent output, faster repeated execution, less prompt rewriting.

When to use this skill

You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

You only need a quick one-off answer and do not need a reusable workflow.
You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/project-knowledge-base/SKILL.md --create-dirs "https://raw.githubusercontent.com/ai-mindset-org/pos-sprint/main/skills/project-knowledge-base/SKILL.md"

Manual Installation

Download SKILL.md from GitHub
Place it in .claude/skills/project-knowledge-base/SKILL.md inside your project
Restart your AI agent — it will auto-discover the skill

How project-knowledge-base Compares

Feature / Agent	project-knowledge-base	Standard Approach
Platform Support	Not specified	Limited / Varies
Context Awareness	High	Baseline
Installation Complexity	Unknown	N/A

Frequently Asked Questions

What does this skill do?

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

SKILL.md Source

# Project Knowledge Base Skill

Manages structured Markdown knowledge-base cards for marketing agency projects stored in Obsidian.

**Sources (collected in parallel):** Google Drive · Gmail · Telegram group · Telegram DMs · moo.team · Obsidian transcripts

## Privacy warning

This skill processes potentially sensitive client and agency data.

- Data collected from Google Drive, Gmail, Telegram, moo.team, and Obsidian may contain private project information.
- PKB generation and update send source data to the selected external LLM provider (`OpenAI` or `Anthropic`).
- Use this skill only when you are allowed to transfer that data to third-party AI APIs under your internal policy and client agreements.
- Public repositories must contain only the skill code, docs, templates, and tests. Never publish runtime auth artifacts, logs, or real project data.

## Setup (first run)

1. Use Python `3.11` (repo pin: root `.python-version`)
2. Copy `.env.example` → `.env` and fill in all values (local file only, do not commit)
3. Explicitly set `LLM_PROVIDER` to `openai` or `anthropic`
4. Install runtime deps: `pip install -r requirements.txt`
5. Install dev deps (quality gates + tests): `pip install -r requirements-dev.txt`
6. Ensure `_projects_index.yaml` exists at `$OBSIDIAN_VAULT_PATH/$PROJECTS_INDEX_PATH`
7. Run once to complete Google OAuth: `python skill.py init <project_id> --history-days 1`

## Commands

### Initialize a new PKB from scratch
```bash
python skill.py init <project_id> [--history-days 180]
```
Uses **two-stage LLM pipeline**: collect all sources in parallel → summarize each source in parallel → single LLM call to build PKB.

### Incrementally update an existing PKB
```bash
python skill.py update <project_id> [--force-sources tg_group tg_dm ...]
```
Reads `last_sync_at` from frontmatter; collects only newer data → single LLM call.

Available `--force-sources` values: `google_drive`, `gmail`, `tg_group`, `tg_dm`, `mootem`, `obsidian`

### Add ad-hoc context manually
```bash
python skill.py add-context <project_id> \
  --type text|file_path|gmail_message_id|telegram_messages|mootem_task_id \
  --data "<content or identifier>"
```
Does **not** update `last_sync_at`.

## File structure

```
project-knowledge-base/
├── SKILL.md
├── skill.py                      # entry point / CLI
├── pytest.ini                    # asyncio_mode = auto
├── pipelines/
│   ├── init_pipeline.py          # Approach B: collect → summarize per source → build PKB
│   └── update_pipeline.py        # Approach A: collect → single LLM update
├── sources/
│   ├── google_drive.py           # Google Drive API v3
│   ├── gmail.py                  # Gmail API
│   ├── telegram_group.py         # Group chat (async Telethon)
│   ├── telegram_dm.py            # DMs with relevance filter (async Telethon)
│   ├── telegram_utils.py         # Shared: TelegramCredentials, resolve_offset, serialize_message
│   ├── mootem.py                 # moo.team REST API
│   └── obsidian_notes.py         # Local .md transcripts
├── utils/
│   ├── models.py                 # SourceResult dataclass
│   ├── logger.py                 # PKBLogger → logs/{project_id}/{timestamp}.log
│   ├── project_index.py          # _projects_index.yaml parser
│   ├── pkb_writer.py             # Atomic write via os.replace()
│   └── llm_processor.py          # Claude API: summarize_source / build_pkb_from_summaries / update_pkb
├── tests/
│   ├── sources/                  # test_telegram.py, test_sources_async.py
│   ├── pipelines/                # test_init_pipeline.py, test_update_pipeline.py
│   └── utils/                   # test_models.py, test_logger.py, test_pkb_writer.py, test_llm_processor.py
├── logs/                         # {project_id}/{timestamp}.log (auto-created)
├── template/
│   └── pkb_template.md
├── .env.example
└── requirements.txt
```

## LLM pipeline: init vs update

| | `init` (180 days) | `update` (since last_sync_at) |
|---|---|---|
| Data volume | Large — full history | Small — only new data |
| Stage 1 | Parallel async collection | Parallel async collection |
| Stage 2 | Parallel LLM summarize per source | — |
| Stage 3 | `build_pkb_from_summaries()` | `update_pkb()` — single call |
| moo.team | Formatted as Markdown table (no LLM) | Same |

## External data processing

When `init`, `update`, or `add-context` calls the LLM layer, the skill sends project content to the configured provider API.

- `init`: raw source payloads are summarized per source, then merged into a full PKB document
- `update`: new raw source payloads are sent together with the current PKB to generate an updated document
- `add-context`: the provided context is sent together with the current PKB

Do not use production client data with this skill unless that transfer is explicitly allowed.

## Error handling

Each source is wrapped in try/except. Failures are logged but do not abort the pipeline.
The run always ends with a summary printed to stdout:

```
─── Сводка синхронизации ───
  ✅ google_drive        — 23 items
  ✅ gmail               — 8 items
  ❌ tg_group            — session expired
  ✅ mootem              — 41 items

⚠️  PKB обновлён без: tg_group
   Повтор: python skill.py update <id> --force-sources tg_group
```

## Source → PKB section mapping

| Source | PKB sections populated |
|--------|----------------------|
| Google Drive | Описание проекта, Цели и KPI, Текущая реклама, Ссылки и артефакты |
| Gmail | История решений, Хронология, Контакты |
| Telegram group | История решений, Хронология, Задачи |
| Telegram DMs | История решений, Тонкости работы с клиентом |
| moo.team | Задачи (таблица), История решений |
| Obsidian transcripts | История решений, Хронология, Тонкости |

## Telegram DM relevance filter

A DM message is included if it matches **any** of:
1. Any word from `project_name` (stem-prefix matching for Russian morphology)
2. Any domain from `domains`
3. Any keyword from `keywords`
4. moo.team task URL regex: `new-app\.moo\.team/{workspace_path_part}/projects/{project_id}/`

Where:
- `workspace_path_part` — path segment from moo.team URL after domain (e.g. `WSbawtbSmV` in `new-app.moo.team/WSbawtbSmV/projects/16390`)
- Config key: `channels.moo_team_workspace_path_part`
- Backward compatibility: `channels.moo_team_workspace_slug` is still supported

## Running tests

```bash
pip install -r requirements-dev.txt
pytest tests/ -v
```

## Quality gates (local)

```bash
ruff check .
ruff format --check .
mypy .
pytest -v
```

## Dependency strategy (pip-tools)

- Source files:
  - `requirements.in` — runtime direct dependencies
  - `requirements-dev.in` — dev tooling + `-r requirements.in`
- Locked files:
  - `requirements.txt`
  - `requirements-dev.txt`
- Regenerate locks:

```bash
PIP_TOOLS_CACHE_DIR=.pip-tools-cache pip-compile --resolver=backtracking requirements.in
PIP_TOOLS_CACHE_DIR=.pip-tools-cache pip-compile --resolver=backtracking requirements-dev.in
```

## Security and secret handling

- Keep only `.env.example` in VCS; real `.env` stays local.
- Keep `google_token.json` and `*.session` local only; never commit runtime auth artifacts.
- Keep `.venv`, `logs/`, `.mypy_cache/`, `.pytest_cache/`, `.ruff_cache/`, and `.pip-tools-cache/` out of VCS.
- Keep the root `.gitignore` from this folder in place; it is part of the security boundary for publication.
- Before first git publication, rotate all active credentials from `.env` and revoke stale OAuth/session tokens.
- Rotation checklist: `docs/plans/2026-03-07-p0-secret-rotation-checklist.md`

## Public repo checklist

Before opening a PR to a public repository:

1. Confirm only code, tests, templates, and documentation are included.
2. Confirm `.env`, `google_token.json`, `*.session`, logs, caches, and local virtualenv files are absent.
3. Confirm no real project cards, meeting notes, mail dumps, Telegram exports, or customer data are present.
4. Rotate active credentials if any secret was ever stored in the skill folder locally.
5. Run local quality gates and review the diff once more for private names, URLs, and workspace-specific paths.

Related Skills

writing-content

from ai-mindset-org/pos-sprint

Интерактивный процесс написания текстов для вайб-маркетинга на основе Julian Shapiro framework. **Новые возможности (v2.0):** - Research & Gap Analysis (Perplexity → WebSearch fallback) - Scoring 0-5 вместо binary (Novelty + Resonance + Hook + Clarity) - AI-Slop Detection на всех этапах (10 типов patterns) - 3 варианта intro с self-scoring - Markdown export всех промежуточных результатов **Русские triggers:** "напиши пост по шапиро", "написать статью по фреймворку шапиро", "создай текст в стиле julian shapiro", "помоги написать контент по методу shapiro", "контент по julian shapiro фреймворку", "пост по julian shapiro", "напиши в стиле шапиро" **English triggers:** "write content using julian shapiro framework", "create post with shapiro method", "write article shapiro style", "help with julian shapiro writing" **Generic triggers:** "напиши статью", "помоги написать контент", "создай текст", "начать писать", "хочу написать пост", "нужна помощь с текстом", "write content", "write article", "создай контент", "придумай идею для статьи", or requests help with content creation process.

Content & DocumentationClaude

YT Transcribe — YouTube → Whisper → Obsidian

from ai-mindset-org/pos-sprint

Транскрибирует YouTube-видео через mlx-whisper (Apple Silicon, Metal-native) с параллельными чанками.

/tg-saved v2 — Telegram Saved Messages → Deep Analysis → Obsidian

from ai-mindset-org/pos-sprint

## Назначение

summarize-comments

from ai-mindset-org/pos-sprint

Делает LLM-выжимку из комментариев менеджеров об одном или нескольких подрядчиках. Используй этот скилл когда нужно понять что говорят менеджеры о конкретном подрядчике, или получить JSON с выжимкой для дальнейшей обработки.

skill-security

from ai-mindset-org/pos-sprint

This skill activates when the user mentions "security audit", "skill audit", "проверка безопасности скилла", "аудит скилла", "skill-security", "проверить скилл", "пересобрать скилл", "rebuild skill", "security check", "dual memory audit", "credential isolation check". Also activates on /skill-security command. Use this skill when the user wants to audit, validate, or rebuild any Claude Code skill for security compliance.

session-status

from ai-mindset-org/pos-sprint

Statusline shown in Claude Code UI status bar via settings.json. No action needed in responses.

session-save

from ai-mindset-org/pos-sprint

Compress and save current session context for handoff to next session. Use when: (1) context pressure >50%, (2) user says "сохрани сессию", "session save", "checkpoint", (3) before ending a long productive session, (4) switching to a different task mid-session. Supports named sessions: /session-save vpn-fix

continue-session

from ai-mindset-org/pos-sprint

Restore context from a named or latest session checkpoint. Use when: (1) user says "продолжи", "continue", "что было в прошлой сессии", (2) starting work after a crash or context overflow, (3) "resume", "восстанови контекст", "где я остановился". Supports named sessions: /continue vpn-fix

compress

from ai-mindset-org/pos-sprint

Info-Compressor: compress text/context by 60-70% without losing meaning. Use when: (1) context pressure >50%, (2) user says "сжать", "compress", "compact", (3) need to fit more context into remaining window, (4) preparing handoff blob for next session.

seo-strategist

from ai-mindset-org/pos-sprint

Strategic SEO planning and analysis toolkit for site-wide optimization, keyword research, technical SEO audits, and competitive positioning. Complements content-creator's on-page SEO with strategic planning, topic cluster architecture, and SEO roadmap generation. Use for keyword strategy, technical SEO audits, SERP analysis, site architecture planning, or when user mentions SEO strategy, keyword research, technical SEO, or search rankings.

roi-razvitie-draft

from ai-mindset-org/pos-sprint

Generates a draft meeting document for the weekly "Roi Развитие" (Wednesday, product Roi Navigator). Use when the user asks for a draft for the meeting, for Wednesday's doc, for "Roi Развитие", or for the weekly team meeting agenda.

product-strategist

from ai-mindset-org/pos-sprint

Strategic product leadership toolkit for Head of Product including OKR cascade generation, market analysis, vision setting, and team scaling. Use for strategic planning, goal alignment, competitive analysis, and organizational design.