Add AI agent documentation (CLAUDE.md & AGENTS.md) (#295)

* add contributing md and coding agents md docs

* remove contributing md from this branch

* minor updates

* resolve PR comments and add tests instructions
This commit is contained in:
Ylber Gashi
2026-01-30 17:12:17 +01:00
committed by GitHub
parent 4217eedad5
commit 6590a0db0f
2 changed files with 481 additions and 0 deletions

47
AGENTS.md Normal file
View File

@@ -0,0 +1,47 @@
# Repository Guidelines
## Architecture Overview
Colanode is a local-first collaboration workspace. Clients keep a local SQLite cache and sync to the server (Fastify + Postgres + Redis) in the background for offline-first behavior. Real-time editing uses CRDTs (Yjs) to merge concurrent changes. Shared packages provide core types, sync logic, and UI; see `README.md` for product and hosting context.
## Project Structure & Module Organization
- `apps/`: client and server apps (`server`, `web`, `desktop`, `mobile`).
- `packages/`: shared libraries (`core`, `client`, `ui`, `crdt`).
- `scripts/`: asset and seed tooling (postinstall runs from here).
- `hosting/`: Docker Compose and Kubernetes (Helm) deploy configs.
- `assets/`: repository images used in docs.
## Development Guide (Quick Start)
- Install dependencies: `npm install`.
- Prefer running tasks in individual app/package directories; repo-level scripts run the entire monorepo.
- Run apps directly:
- `apps/server`: `cp .env.example .env && npm run dev`
- `apps/web`: `npm run dev` (Vite on port 4000)
- `apps/desktop`: `npm run dev`
- Local dependencies: `docker compose -f hosting/docker/docker-compose.yaml up -d`.
## Coding Guidelines
- Ground changes in the existing codebase. Start from the closest feature and mirror its folders, naming, and flow.
- Keep shared behavior in `packages/`; keep `apps/` thin and focused on wiring and UI.
- Server routes use Fastify plugins with Zod schemas from `@colanode/core`. Update schemas and error codes before handlers.
- Client operations follow the query/mutation pattern: define typed `type: 'feature.action'` inputs/outputs in `packages/client/src/queries` or `packages/client/src/mutations`, then wire handlers in `packages/client/src/handlers`.
- Use Kysely (`database`) for SQL access and limit raw SQL.
- UI styling uses Tailwind utilities, shared styles in `packages/ui/src/styles`, and shadcn components in `packages/ui/src/components/ui`. Prefer shared components over one-off styling.
- Use `@colanode/*` imports and follow ESLint import grouping; keep filenames consistent with nearby code.
## Server Configuration
- Config file location: set via `CONFIG` environment variable (e.g., `CONFIG=/path/to/config.json`).
- If `CONFIG` is not set, server uses schema defaults from `apps/server/src/lib/config/`.
- Template: `apps/server/config.example.json`.
- Reference resolution in JSON:
- `env://VAR_NAME` - resolves to environment variable (required, fails if not set).
- `file://path/to/file` - reads and inlines file contents (useful for certificates).
- Direct values - plain strings/numbers/booleans for non-sensitive settings.
- Only `postgres.url` is required (defaults to `env://POSTGRES_URL`); all other settings have schema defaults.
- For production: copy `config.example.json`, update values, mount it, and set `CONFIG` + required env vars.
- Storage config via `storage.provider.type`: `file` (default), `s3`, `gcs`, or `azure`.
- See `apps/server/src/lib/config/` for full schemas and validation.
## Testing
- Tests live in `apps/server` and `apps/web` and run with Vitest.
- Run `npm run test` in the relevant app directory; `npm run test` at the repo root runs them via Turbo.
- Validate changes manually where tests do not apply and note verification steps.

434
CLAUDE.md Normal file
View File

@@ -0,0 +1,434 @@
# CLAUDE.md
This file provides guidance to Claude Code (claude.ai/code) when working with code in this repository.
## Project Overview
Colanode is an open-source, local-first collaboration platform supporting real-time chat, rich text editing, customizable databases, and file management. It uses a sophisticated CRDT-based architecture (powered by Yjs) to enable offline-first operation with automatic conflict resolution.
## Commands
### Development
```bash
# Install dependencies (also runs postinstall script to generate emoji/icon assets)
npm install
```
Prefer running dev/build/compile/format commands inside the specific app or package directory.
**Note:** Tests exist for `apps/server` and `apps/web` and are run with Vitest. Prefer running tests in the relevant app directory; `npm run test` at the repo root runs the same suites via Turbo.
### Individual App Development
**Server:**
```bash
cd apps/server
cp .env.example .env # Configure environment variables
npm run dev # Start server with hot reload
# Start dependencies (Postgres, Redis, Mail server) via Docker Compose
docker compose -f hosting/docker/docker-compose.yaml up -d
# Include MinIO (S3-compatible storage) for testing
docker compose -f hosting/docker/docker-compose.yaml --profile s3 up -d
```
**Web:**
```bash
cd apps/web
npm run dev
```
**Desktop:**
```bash
cd apps/desktop
npm run dev
```
### Utility Scripts
Located in `scripts/`:
```bash
cd scripts
# Generate emoji assets (run from scripts directory)
npm run generate:emojis
# Generate icon assets (run from scripts directory)
npm run generate:icons
# Seed database with test data (run from scripts directory)
npm run seed
```
Note: Generated emoji and icon assets are git-ignored. These are regenerated on `npm install` via the postinstall hook.
## Architecture
### Monorepo Structure
This is a Turborepo monorepo with npm workspaces:
- **`packages/core`** - Shared types, validation schemas (Zod), business rules, and node type registry. Foundation for all other packages.
- **`packages/crdt`** - CRDT implementation wrapping Yjs. Provides type-safe document updates and conflict-free merging.
- **`packages/client`** - Client-side services, local SQLite database schema, and API communication layer. Handles mutations, synchronizers, and offline support.
- **`packages/ui`** - React components using TailwindCSS, Radix UI primitives, and TipTap editor. Shared between web and desktop apps.
- **`apps/server`** - Fastify-based API server with WebSocket support. Manages Postgres database, Redis events, and background jobs.
- **`apps/web`** - Web application (Vite + React + TanStack Router).
- **`apps/desktop`** - Electron desktop application.
- **`apps/mobile`** - React Native mobile app (experimental, not production-ready).
- **`scripts/`** - Utility scripts for generating emojis, icons, and seed data.
### Local-First Architecture
**Core Principle:** All data operations happen locally first, then sync to the server in the background.
**Client Write Path:**
1. User makes a change (e.g., edits a document)
2. Change is immediately applied to local SQLite database
3. CRDT update is generated using Yjs (as binary `Uint8Array`)
4. Update is stored in `mutations` table as a pending operation
5. `MutationService` batches and sends mutations to server via HTTP POST
6. Server validates, applies updates, and stores in Postgres
7. Server broadcasts changes to other clients via WebSocket
**Client Read Path:**
- All reads happen from local SQLite (instant response)
- `Synchronizer` services pull updates from server via WebSocket
- Updates are applied to local database in background
- UI reactively updates when local data changes
**Key Files:**
- `packages/client/src/services/workspaces/mutation-service.ts` - Mutation batching/syncing
- `packages/client/src/services/workspaces/synchronizer.ts` - Real-time sync via WebSocket
- `packages/client/src/databases/workspace/schema.ts` - Local SQLite schema
- `apps/server/src/api/client/routes/workspaces/mutations/mutations-sync.ts` - Server mutation endpoint
- `apps/server/src/services/socket-connection.ts` - WebSocket connection handler
### CRDT Integration (Yjs)
**Purpose:** Enable conflict-free collaborative editing with automatic merge resolution.
**Implementation:**
- `packages/crdt/src/index.ts` contains the `YDoc` class wrapping Yjs documents
- Each node (page, database record, etc.) has a corresponding Yjs document
- Updates are encoded as binary blobs (Base64 for storage, Uint8Array for processing)
- Server merges concurrent updates automatically using Yjs CRDT semantics
**Storage Strategy:**
Client storage maintains multiple layers:
1. **Current State** - JSON representation of latest merged state (for querying/UI)
2. **Merged CRDT State** - Binary Yjs state in `node_states` / `document_states`
3. **Pending Updates** - Local, unsynced CRDT updates in `node_updates` / `document_updates` kept separately so they can be reverted; once synced they are merged into `*_states` and removed
Server storage keeps current JSON state (`nodes` / `documents`) plus CRDT update history (`node_updates` / `document_updates`); background jobs merge older updates.
**Tables:**
- `nodes` / `documents` - Current state as JSON/JSONB
- `node_states` / `document_states` - Merged CRDT state (client-side)
- `node_updates` / `document_updates` - Pending local CRDT updates (client-side) and CRDT update history (server-side)
**Background Merging:**
Server jobs periodically merge old updates to reduce storage (`apps/server/src/jobs/node-updates-merge.ts`).
### Database Synchronization
**Local (SQLite) ↔ Server (Postgres):**
Cursor-based streaming synchronization:
- Each data stream (users, nodes, documents, collaborations, etc.) has a `Synchronizer`
- Synchronizers track a cursor (last synced revision number)
- Client requests updates via WebSocket: `synchronizer.input { cursor: 12345 }`
- Server responds with batch: `synchronizer.output { updates: [...], cursor: 12350 }`
- Client applies updates to local database and persists new cursor
**Synchronizer Types:**
- `users` - User list changes
- `collaborations` - Access control updates
- `node.updates` - Node CRDT updates (per workspace root)
- `document.updates` - Document CRDT updates (per workspace root)
- `node.reactions` - Emoji reactions
- `node.interactions` - Read receipts and activity tracking
**Key Files:**
- `packages/client/src/services/workspaces/synchronizer.ts`
- `apps/server/src/synchronizers/*` - Server-side data fetchers
- `apps/server/src/lib/event-bus.ts` - Event system for triggering syncs
### Node Type Registry
**Location:** `packages/core/src/registry/nodes/`
Each node type (space, page, database, message, etc.) defines:
- **Attribute Schema** - Zod schema for node metadata
- **Document Schema** (optional) - Zod schema for collaborative content
- **Permission Checks** - `canCreate`, `canUpdate`, `canDelete`, `canRead`
- **Text Extraction** - For search indexing
- **Mention Extraction** - For @mentions and notifications
**Example Node Types:**
- `space` - Top-level container
- `page` - Rich text document
- `database` - Structured data with custom fields and views
- `record` - Database row
- `database_view` - Saved database view
- `chat` - Chat container
- `channel` - Chat channel
- `message` - Chat message
- `file` - File attachment
- `folder` - Organizational container
**Important:** When adding a new node type, register it in the appropriate registry file and ensure both client and server import it.
### Configuration System
**Server Configuration:**
The server uses a JSON-based configuration system with smart reference resolution:
- **Config File Location**: Set via `CONFIG` environment variable (e.g., `CONFIG=/path/to/config.json`)
- **Default Behavior**: If `CONFIG` is not set, server uses schema defaults from `apps/server/src/lib/config/`
- **Example Config**: See `apps/server/config.example.json` for a complete template
**Reference Resolution:**
The config system supports special prefixes for dynamic value loading:
- `env://VAR_NAME` - Resolves to environment variable at runtime (required, fails if not set)
- `file://path/to/file` - Reads and inlines file contents at runtime (useful for certificates/secrets)
- Direct values - Plain strings/numbers/booleans in JSON
**Example:**
```json
{
"postgres": {
"url": "env://POSTGRES_URL",
"ssl": {
"ca": "file:///secrets/postgres-ca.pem"
}
},
"storage": {
"provider": {
"type": "s3",
"endpoint": "env://S3_ENDPOINT",
"accessKey": "env://S3_ACCESS_KEY",
"secretKey": "env://S3_SECRET_KEY"
}
}
}
```
**Required Configuration:**
- Only `postgres.url` is truly required (defaults to `env://POSTGRES_URL`)
- All other settings have sensible defaults in the Zod schemas
- Storage defaults to `file` type with `./data` directory
- Redis defaults to `env://REDIS_URL` but has fallback behavior
**For Docker/Production:**
1. Copy `apps/server/config.example.json` to your own config file
2. Update values (use `env://` for secrets, direct values for non-sensitive settings)
3. Mount your config file and set `CONFIG=/path/to/mounted/config.json`
4. Set required environment variables referenced by `env://` pointers
**For Local Development:**
- Use `.env` file in `apps/server/` directory
- Server will use schema defaults if no config file is provided
- See `apps/server/.env.example` for available environment variables
**Config Validation:**
- All config is validated using Zod schemas at startup
- Validation errors show clear messages about missing/invalid values
- Server exits immediately if config is invalid
**Client Apps:**
- Use standard `.env` files for build-time configuration
- Runtime configuration fetched from server
## Testing
**Automated Tests (Vitest):**
- `apps/server`: `npm run test`
- `apps/web`: `npm run test`
- Repo root: `npm run test` (runs app tests via Turbo)
- Focus manual verification on areas without test coverage
- Include clear verification steps in pull request descriptions
## Development Tips
### Working with CRDTs
When modifying node or document schemas:
1. Update Zod schema in `packages/core/src/registry/nodes/<type>.ts`
2. The CRDT layer automatically handles schema validation via `YDoc.update()`
3. Test with multiple clients to verify conflict resolution
4. Remember: updates are append-only, deletions use tombstones
### Debugging Synchronization
**Client-side:**
- Check `mutations` table for pending operations
- Check `cursors` table for sync position
- Use browser DevTools WebSocket tab to inspect messages
**Server-side:**
- Logs are in JSON format (Pino logger)
- Look for `synchronizer.input` / `synchronizer.output` messages
- Check `node_updates` table for stored updates
- Verify revision numbers are incrementing
### Database Migrations
**Server (Postgres):**
- Schema defined in `apps/server/src/data/schema.ts`
- Migrations live in `apps/server/src/data/migrations` and run via the Kysely migrator in `apps/server/src/data/database.ts`
- Add a migration for schema changes; avoid manual production edits. For local dev, dropping the DB is a last resort.
**Client (SQLite):**
- Schema versioning handled in `packages/client/src/databases/workspace/schema.ts`
- Migration logic in `packages/client/src/databases/workspace/migrations.ts`
- Migrations run automatically on app startup
### Adding a New Node Type
1. Create schema in `packages/core/src/registry/nodes/<type>.ts`
2. Define attribute schema, document schema (if collaborative), and permissions
3. Register in `packages/core/src/registry/nodes/index.ts`
4. Update server-side node creation logic in `apps/server/src/lib/nodes.ts`
5. Add client-side service in `packages/client/src/services/`
6. Create UI components in `packages/ui/src/components/`
### Storage Backends
Server supports multiple storage backends for files:
- **File** (default) - Local filesystem storage in `./data` directory
- **S3** - AWS S3 or compatible (MinIO, DigitalOcean Spaces, etc.)
- **GCS** - Google Cloud Storage
- **Azure** - Azure Blob Storage
**Configuration:**
Configure via `storage.provider.type` in your config file. Examples:
**Filesystem (default):**
```json
{
"storage": {
"provider": {
"type": "file",
"directory": "./data"
}
}
}
```
**S3-compatible:**
```json
{
"storage": {
"provider": {
"type": "s3",
"endpoint": "env://S3_ENDPOINT",
"accessKey": "env://S3_ACCESS_KEY",
"secretKey": "env://S3_SECRET_KEY",
"bucket": "env://S3_BUCKET",
"region": "env://S3_REGION",
"forcePathStyle": false
}
}
}
```
**GCS:**
```json
{
"storage": {
"provider": {
"type": "gcs",
"bucket": "env://GCS_BUCKET",
"projectId": "env://GCS_PROJECT_ID",
"credentials": "file:///secrets/gcs-credentials.json"
}
}
}
```
**Azure:**
```json
{
"storage": {
"provider": {
"type": "azure",
"account": "env://AZURE_STORAGE_ACCOUNT",
"accountKey": "env://AZURE_STORAGE_ACCOUNT_KEY",
"containerName": "env://AZURE_CONTAINER_NAME"
}
}
}
```
See `apps/server/src/lib/storage/` for implementations and `apps/server/src/lib/config/storage.ts` for full schema.
## Common Patterns
### Service Layer Pattern
Services encapsulate business logic and coordinate between database and API:
- `packages/client/src/services/` - Client-side services
- `apps/server/src/services/` - Server-side services
### Repository Pattern
Database access abstracted through clear interfaces:
- `packages/client/src/databases/` - Client database layer
- `apps/server/src/data/` - Server database layer
### Event-Driven Updates
- Client: Uses TanStack DB for reactive data fetching alongside TanStack Query
- Server: Uses EventBus (Redis-backed) for cross-instance communication
- Background jobs via BullMQ for asynchronous processing
### Optimistic Updates
All mutations are optimistic:
1. Update local state immediately
2. Show UI change instantly
3. Send mutation to server in background
4. On failure (after 10 retries), show error and optionally revert
5. CRDT ensures eventual consistency even if server order differs
## Important Considerations
- **Generated Assets:** Emoji and icon files are generated during `npm install`. Don't commit these to git.
- **TypeScript Source Imports:** Packages use TypeScript source directly (not compiled outputs) during development for faster iteration.
- **Local-First Mindset:** Always assume network can fail. Design features to work offline first.
- **CRDT Limitations:** Not all data types use CRDTs (e.g., messages and files use simpler database tables).
- **Mobile App:** The `apps/mobile` is experimental and not production-ready.
- **Performance:** For large workspaces, synchronizers per root node can cause memory pressure. Monitor closely.