All Systems Operational

Developer Self-service ? Operational
90 days ago
99.99 % uptime
Today
Capability self-service portal (Blaster) ? Operational
90 days ago
99.99 % uptime
Today
Capability provisioning Operational
90 days ago
99.99 % uptime
Today
Slack communication ? Operational
90 days ago
100.0 % uptime
Today
Topic management Operational
90 days ago
99.99 % uptime
Today
Authentication/Authorisation Operational
90 days ago
100.0 % uptime
Today
Azure AD Apps (OAuth) ? Operational
90 days ago
100.0 % uptime
Today
Azure AD Sync ? Operational
90 days ago
100.0 % uptime
Today
AWS ECR image push Operational
90 days ago
100.0 % uptime
Today
AWS IAM Identity Centre (AWS SSO) ? Operational
90 days ago
100.0 % uptime
Today
AAD-AWS-Sync ? Operational
90 days ago
100.0 % uptime
Today
Kubernetes critical components Operational
90 days ago
99.96 % uptime
Today
Kubernetes [Hellman] - Capacity/Scheduling Operational
90 days ago
99.91 % uptime
Today
Kubernetes [Hellman] - Ingress/Networking Operational
90 days ago
99.91 % uptime
Today
Kubernetes [Hellman] - Storage Operational
90 days ago
100.0 % uptime
Today
Kubernetes [Hellman] - Developer access ? Operational
90 days ago
100.0 % uptime
Today
Kubernetes [Hellman] - GitOps/Flux CD ? Operational
90 days ago
100.0 % uptime
Today
Observability Operational
90 days ago
99.99 % uptime
Today
Kafka metrics exporter ? Operational
90 days ago
99.96 % uptime
Today
Prometheus ? Operational
90 days ago
100.0 % uptime
Today
Kubernetes [Hellman] - Logging ? Operational
90 days ago
100.0 % uptime
Today
Grafana Cloud ? Operational
90 days ago
100.0 % uptime
Today
Azure DevOps ? Operational
90 days ago
100.0 % uptime
Today
Amazon Web Services (3rd party) Operational
90 days ago
100.0 % uptime
Today
AWS ec2-eu-west-1 Operational
90 days ago
100.0 % uptime
Today
AWS ec2-eu-central-1 Operational
90 days ago
100.0 % uptime
Today
AWS elb-eu-west-1 Operational
90 days ago
100.0 % uptime
Today
AWS elb-eu-central-1 Operational
90 days ago
100.0 % uptime
Today
AWS s3-eu-central-1 Operational
90 days ago
100.0 % uptime
Today
AWS s3-eu-west-1 Operational
90 days ago
100.0 % uptime
Today
Global Operational
GitHub (3rd party) Operational
90 days ago
99.88 % uptime
Today
GitHub API Requests Operational
90 days ago
99.94 % uptime
Today
GitHub Git Operations Operational
90 days ago
99.92 % uptime
Today
GitHub Issues Operational
90 days ago
99.8 % uptime
Today
GitHub Pull Requests Operational
90 days ago
99.82 % uptime
Today
GitHub Webhooks Operational
90 days ago
99.93 % uptime
Today
Confluent Cloud (3rd party) Operational
90 days ago
100.0 % uptime
Today
General ? Operational
90 days ago
100.0 % uptime
Today
Kafka - Dev cluster Operational
90 days ago
100.0 % uptime
Today
Kafka - Prod cluster Operational
90 days ago
100.0 % uptime
Today
1Password Operational
90 days ago
99.99 % uptime
Today
1Password AWS CloudFront Operational
90 days ago
100.0 % uptime
Today
1Password 1Password.com website Operational
90 days ago
100.0 % uptime
Today
1Password Sign in Operational
90 days ago
99.97 % uptime
Today
1Password Saving password and other items Operational
90 days ago
100.0 % uptime
Today
1Password Syncing items between your devices Operational
90 days ago
100.0 % uptime
Today
1Password Multi-factor Authentication (MFA) Operational
90 days ago
100.0 % uptime
Today
1Password Command Line Interface (CLI) Operational
90 days ago
100.0 % uptime
Today
1Password Service Accounts Operational
90 days ago
100.0 % uptime
Today
1Password Events API and Reporting Operational
90 days ago
100.0 % uptime
Today
Messaging Operational
90 days ago
100.0 % uptime
Today
Slack Apps/Integrations Operational
90 days ago
100.0 % uptime
Today
Slack Workspace/Org Administration Operational
90 days ago
100.0 % uptime
Today
Slack Search Operational
90 days ago
100.0 % uptime
Today
Slack Posts/Files Operational
90 days ago
100.0 % uptime
Today
Slack Notifications Operational
90 days ago
100.0 % uptime
Today
Slack Messaging Operational
90 days ago
100.0 % uptime
Today
Slack Login/SSO Operational
90 days ago
100.0 % uptime
Today
Slack Link Previews Operational
90 days ago
100.0 % uptime
Today
Slack Apps/Integrations/APIs Operational
90 days ago
100.0 % uptime
Today
Slack Apps/Integrations/APIs Operational
90 days ago
100.0 % uptime
Today
Slack Apps/Integrations/APIs Operational
90 days ago
100.0 % uptime
Today
Slack Connections Operational
90 days ago
100.0 % uptime
Today
Slack Calls Operational
90 days ago
100.0 % uptime
Today
Wiki Operational
90 days ago
100.0 % uptime
Today
Wiki Frontend Operational
90 days ago
100.0 % uptime
Today
Backend database on AWS RDS Operational
90 days ago
100.0 % uptime
Today
Operational
Degraded Performance
Partial Outage
Major Outage
Maintenance
Major outage
Partial outage
No downtime recorded on this day.
No data exists for this day.
had a major outage.
had a partial outage.

Scheduled Maintenance

Significant change in Kubernetes outbound networking Apr 8, 2025 08:30-10:30 UTC

The change involves routing all outbound traffic from Kubernetes through AWS NAT Gateways with static IP addresses. This means that seen from ouside, all our traffic will come from the same 3 IP addresses instead of over 35 random IP addresses like today. A benefit is that these 3 IP addresses can be whitelisted by 3rd party companies or government agencies. This has been a feature request from several T&I teams over the years.

No downtime is expected.

Posted on Mar 19, 2025 - 08:53 UTC

Grafana access to self-hosted Prometheus on Hellman will be closed down for good Apr 23, 2025 09:30 - Apr 24, 2025 15:30 UTC

This is the last chance to either migrate to Grafana Cloud or point your self-hosted Grafana dashboards to Grafana Cloud as described in https://wiki.dfds.cloud/en/playbooks/observability/point-existing-grafana-dashboard-to-cloud
Posted on Jan 22, 2025 - 12:56 UTC

Self-hosted Prometheus stack will be removed for good Apr 28, 2025 09:30-10:30 UTC

From this date we got fully in on Grafana Cloud
Posted on Jan 22, 2025 - 13:01 UTC
Apr 2, 2025

No incidents reported today.

Apr 1, 2025

No incidents reported.

Mar 31, 2025

No incidents reported.

Mar 30, 2025

No incidents reported.

Mar 29, 2025

No incidents reported.

Mar 28, 2025

No incidents reported.

Mar 27, 2025

No incidents reported.

Mar 26, 2025

No incidents reported.

Mar 25, 2025
Completed - The scheduled maintenance has been completed.
Mar 25, 09:34 UTC
In progress - Scheduled maintenance is currently in progress. We will provide updates as necessary.
Mar 25, 08:30 UTC
Scheduled - After 01.04.2025 Docker Hub requires authentication when pulling more than 10 images per hour. We will setup gloabl authentication to ensure your images from Docker Hub will not get throttled.

This requires a restart of all Kubernetes nodes. The nodes will at the same time be updated with new AWS AMI images (software updates).

Furthermore, we need to create new route tables for VPC peerings. This is not expected to cause issues for existing connections to databases, but there might be a sub second glitch for new connections.

Mar 3, 14:29 UTC
Mar 24, 2025

No incidents reported.

Mar 23, 2025

No incidents reported.

Mar 22, 2025

No incidents reported.

Mar 21, 2025

No incidents reported.

Mar 20, 2025

No incidents reported.

Mar 19, 2025

No incidents reported.