Checklists

SaaS Applications implementation checklist

A production-readiness checklist for SaaS applications focusing on multi-tenant security, billing integrity, and operational scaling for modern tech stacks.

Progress0 / 25 complete (0%)

Multi-Tenancy and Data Isolation

0/5

Enforce Row-Level Security (RLS)
critical
Verify that every database table containing tenant data has RLS enabled with a policy that checks the authenticated user's organization_id.
Cross-Tenant Access Testing
critical
Execute a test suite that attempts to read/write a resource ID from Tenant A using an authentication token belonging to Tenant B; ensure it returns a 403 or 404.
JWT Claim Validation
recommended
Ensure your auth provider (Clerk/Auth.js) includes the current organization_id in the JWT claims to avoid extra database lookups for permissions.
Tenant-Aware Logging
recommended
Configure your logging middleware to append organization_id to every log entry to simplify troubleshooting during customer support tickets.
Soft-Delete Implementation
optional
Implement a deleted_at timestamp on primary entities to allow for data recovery in case of accidental tenant-wide deletions.

Billing and Subscription Lifecycle

0/5

Webhook Idempotency
critical
Implement a check against an 'event_logs' table to ensure Stripe or LemonSqueezy webhook events are only processed once based on their unique event ID.
Subscription State Sync
critical
Verify that the application UI and API middleware correctly handle 'past_due' and 'canceled' statuses by restricting access to premium features immediately.
Usage-Based Metering Accuracy
recommended
If using metered billing, implement a reconciliation script that compares internal usage logs against the billing provider's recorded usage every 24 hours.
Grace Period Logic
recommended
Configure a 3-7 day grace period for failed payments where the user sees a notification banner but retains access before the account is locked.
Tax Compliance Configuration
critical
Enable automatic tax calculation (Stripe Tax/TaxJar) for all active jurisdictions to ensure VAT and Sales Tax are collected at checkout.

AI Infrastructure and Cost Control

0/5

Token Usage Hard Caps
critical
Set a maximum token limit per request in the OpenAI/Anthropic SDK calls to prevent a single runaway recursive loop from depleting your credits.
Per-Tenant Cost Attribution
recommended
Tag every AI API call with metadata containing the tenant_id to generate internal reports on which customers are most/least profitable.
Model Fallback Strategy
optional
Implement a try/catch block that switches from a primary high-cost model to a lower-cost fallback model (e.g., GPT-4 to GPT-3.5-Turbo) if rate limits are hit.
Response Timeout Handling
recommended
Set a strict 30-second timeout for AI streaming responses to prevent Vercel/Edge function execution limits from being exceeded.
Prompt Versioning
optional
Move prompt strings out of hardcoded logic and into a database or config file to allow updates without requiring a full code redeploy.

Security and Compliance

0/5

Environment Variable Audit
critical
Scan the repository to ensure no production keys (STRIPE_SECRET, DATABASE_URL) are committed to Git; verify they are only in the CI/CD environment.
PII Encryption at Rest
critical
Identify fields containing Personally Identifiable Information (PII) and ensure they are encrypted using a standard like AES-256 before being stored.
Rate Limiting Policy
critical
Apply a global rate limit (e.g., 100 requests per minute per user) using Upstash or Redis to prevent API abuse and DDoS attacks.
GDPR Data Export Tool
recommended
Create an internal script or admin endpoint that aggregates all data for a specific user_id into a JSON file to satisfy 'Right to Access' requests.
Security Headers Configuration
recommended
Verify that the production server sends CSP, X-Content-Type-Options, and Strict-Transport-Security headers in every response.

Observability and Reliability

0/5

Error Grouping and Alerting
critical
Configure Sentry to alert the engineering team via Slack only when an error occurs more than 5 times in a 10-minute window to avoid noise.
Database Connection Pooling
critical
If using Serverless (Vercel/AWS Lambda), ensure a connection pooler like Prisma Accelerate or Supabase Pooling is active to prevent 'Too many connections' errors.
Health Check Endpoint
recommended
Expose a /api/health endpoint that performs a 'SELECT 1' on the database and a ping to the cache to verify the full stack is operational.
Background Job Monitoring
recommended
Set up a Dead Letter Queue (DLQ) for failed transactional emails or background tasks to allow for manual retries of failed messages.
Performance Baseline
optional
Record the LCP (Largest Contentful Paint) and API response time for the main dashboard to serve as a benchmark for future regressions.