Incident response plan for Weblate¶
Scope and objectives¶
This IRP covers incidents impacting the confidentiality, integrity, or availability of Weblate-operated deployments.
Informacja
This plan is specifically designed for deployments operated by Weblate s.r.o. Other deployments need to adapt provider-specific and organizational steps to their own environment.
Roles and responsibilities¶
Incident Response Lead (IRL): Coordinates all phases of the response process.
System Administrator: Executes containment and recovery measures.
Security Officer: Evaluates security impact and regulatory consequences.
Data Protection Officer (DPO): Evaluates if personal data (PII) was compromised and manages mandatory GDPR notifications.
Communications Lead: Manages notifications to internal stakeholders and external parties if required.
Communication logistics¶
- Internal Communication:
Primary channel is Signal for human-to-human coordination.
Technical alerts remain outside of Signal to avoid noise.
- External Communication:
E-mail is used to reach customers.
Customer contact lists are maintained in several locations to ensure access during service outages.
- Public Disclosure:
If a security vulnerability is discovered, follow Vulnerability and incident handling.
Incident categories and severity¶
Incident activation¶
Declare an incident when an event is confirmed or strongly suspected to affect the confidentiality, integrity, or availability of the service beyond routine operational noise.
The Security Officer declares the incident, assigns the initial severity, and appoints the Incident Response Lead (IRL).
If the Security Officer is unavailable, any available senior operator may declare the incident and hand over ownership as soon as practical.
Reclassify the incident if the scope or impact changes during investigation.
Incident categories¶
Category 1 – Unauthorized Access
Category 2 – Data Integrity Violation
Category 3 – Service Outage or Degradation
Category 4 – Misconfiguration or Deployment Error
Severity levels and SLAs¶
Istotność |
Definition |
Target Acknowledge |
Target Initial Action |
|---|---|---|---|
Critical |
Total outage; Admin compromise; Active data breach; requires immediate containment. |
< 30 Minutes |
< 4 Hours |
Wysoki |
Core feature failure; PII leak of single user. |
< 2 Hours |
12 Hours |
Średni |
Performance degradation; Minor security issue. |
1 Business Day |
3 Business Days |
Niski |
UI bugs; Staging issues; Non-security errors. |
Best Effort |
Best Effort |
Incident response lifecycle¶
Preparation¶
Ensure regular daily backups of the PostgreSQL database and the data directory using Weblate’s built-in backup with rotation, see Tworzenie kopii zapasowych i przenoszenie weblate.
Ensure Weblate uses a properly configured reverse proxy (e.g., NGINX) with HTTPS (TLS 1.2+).
Enable 2FA for all admin-level accounts.
Keep the Weblate instance and its dependencies (Python, Django, Celery, database, etc.) up to date.
Integrate with SIEM systems using the GELF protocol for audit and application log forwarding.
Identification¶
Monitor system and application logs (
journalctl, reverse proxy logs, Weblate application and audit logs).Analyze login events, webhook executions, and push/pull failures.
Configure alerting (via Prometheus, Zabbix, or SIEM) for multiple login failures, unexpected restarts, or irregular VCS actions.
Containment¶
Create an incident record with a case ID and record timeline updates as actions are taken.
Coordinate human response in Signal and keep technical alerting in the existing monitoring systems.
For Category 1 or 2 incidents, create a manual Hetzner Cloud Snapshot before taking disruptive action when it is safe to do so.
Name format:
IRP-[CaseID]-[YYYYMMDD]-Evidence.These are separate from standard rotating backups and must be preserved for analysis.
Isolate the affected host or service as needed (for example by firewall rules or service isolation).
Disable external integrations (Git/webhooks) if they are part of the attack vector.
Suspend affected user accounts immediately.
Revoke or rotate affected administrative, API, VCS, and webhook credentials as applicable.
Preserve relevant evidence, including system logs, reverse proxy logs, Weblate application and audit logs, affected configuration state, and the list of impacted credentials or integrations.
Eradication¶
Remove any unauthorized code or data.
Patch known vulnerabilities by upgrading Weblate or server components.
Validate binary and repository integrity using SHA-256 checksums or Git logs.
Recovery¶
Restore affected services or data from the latest known-good Weblate backups.
PII Assessment: DPO determines if the breach requires a 72-hour GDPR notification.
Reintroduce services in a phased approach.
Confirm the root cause has been removed or a compensating control is in place before restoring normal traffic.
Rotate affected credentials and verify integrity of the restored system, repositories, and configuration.
The Security Officer and IRL approve returning to normal operations.
Monitor logs and system behavior continuously for at least 72 hours post-recovery.
Post-incident review¶
Timeline: Hold a review meeting within 5 business days of incident closure.
Compile a full incident timeline and actions taken.
Perform Root Cause Analysis (RCA) and document it within 10 business days.
Update security policies and IRP documentation based on findings.
Review the effectiveness of detection and containment mechanisms.
Verify whether escalation, alerting, and external communication followed Vulnerability and incident handling as expected.