
Transforming Media Analytics with a Unified Data Platform at Zeit Online
Transforming analytics from Excel-based workflows to automated dashboards and enabling advanced insights like engagement scoring across all business levels.
The Challenge
Zeit Online, one of Germany's leading digital news publications, faced a critical data infrastructure challenge that many media organizations encounter. Their analytics ecosystem was fragmented and inefficient: data sharing relied on Excel files, dashboards were created in spreadsheets and distributed as static PDFs, and website tracking data was locked within third-party provider tools with limited access to raw data.
This fragmented approach severely limited Zeit Online's ability to gain comprehensive insights into user engagement, content performance, and business metrics. Without a unified view of their data, creating meaningful dashboards for C-level executives and company-wide reports was nearly impossible, hindering data-driven decision making across the organization.
Our Solution: A Three-Phase Data Platform Transformation
HCON partnered with Zeit Online to build a comprehensive data and ML platform from the ground up, transforming their entire analytics capability through a strategic three-phase approach.
Phase 1: Requirements Gathering and Strategy
We collaborated extensively with stakeholders across all organizational levels to understand their data needs and define key metrics such as engagement scoring. This comprehensive requirements analysis ensured our platform would deliver actionable insights aligned with business objectives.
Phase 2: Centralized Data Platform Architecture
Platform Foundation:
Google BigQuery as the central data warehouse
Multi-source data integration: Website tracking data, marketing campaigns, user behavior, content analytics
Containerized data pipelines built as Docker containers for scalability and maintainability
Automated deployment via GitHub Actions for reliable, continuous integration
Engineering Collaboration: Working closely with Zeit Online's engineering team during their Kubernetes migration, we designed all data pipelines as cloud-native applications, ensuring seamless integration with their modern infrastructure approach.
Phase 3: ML Platform Integration
We extended the data platform to support machine learning workflows:
Feature engineering pipelines using the same compute infrastructure
Model training and deployment integrated into the centralized platform
ML results integration with BigQuery for unified reporting
Automated ML workflows following the same containerized approach as data pipelines
Technical Architecture Highlights
Unified Data Ecosystem:
Centralized data warehouse replacing fragmented Excel-based workflows
Real-time and batch data processing capabilities
Scalable containerized pipeline architecture
Integration with modern Kubernetes infrastructure
End-to-End ML Operations:
Feature engineering using centralized data sources
Model training with shared compute resources
Results integrated directly into business dashboards
Automated deployment and monitoring
Results & Impact
The platform transformation delivered significant improvements across Zeit Online's analytics capabilities:
Data Accessibility & Quality
Eliminated Excel dependency: Transitioned from manual file sharing to automated data workflows
Unified data access: All stakeholders now work from the same centralized data source
Real-time insights: Replaced static PDF reports with dynamic, up-to-date dashboards
Data ownership: Gained full control over tracking data previously locked in third-party tools
Business Intelligence
Executive dashboards: C-level leadership now has access to comprehensive, real-time business metrics
Company-wide reporting: Standardized reporting across all departments
Advanced analytics: Engagement scoring and sophisticated user behavior analysis
Data-driven culture: Enabled organization-wide shift toward data-driven decision making
Technical Excellence
Scalable architecture: Containerized pipelines that grow with the business
Automated operations: Reduced manual overhead through CI/CD automation
ML-ready infrastructure: Seamless integration of machine learning capabilities
Future-proof platform: Modern, cloud-native architecture supporting continued innovation
Project Gallery

.png)

