1. Conducted an analysis of the customer's business processes and needs for reporting and analytics
2. Developed a data warehouse architecture and defined the data storage and integration structure:
3. Developed integration solutions between the customer's disparate information systems:
used open-source technologies, providing the ability for flexible scaling and integration
deployed a data warehouse, which became the basis for storing heterogeneous data sources
mechanisms for retrieving data via HTTP and JDBC protocols
data quality check modules
modules for checking the input data schema and validating the integration contract
unified data extraction processes based on Airflow + Python
4. Developed a set of functions based on PostgreSQL for generating data marts based on indicators in various sections
5. We developed a data provisioning module based on Python FastAPI, which provides data to external systems on request: CRM (showcases on the communication map and recommendations) and CDP (segments)
6. We developed a data quality management subsystem that includes tools for visualizing quality metrics for routine processes, data errors, etc. based on Grafana, a quality metrics storage component based on PostgreSQL, and a metrics generation component based on a Python module.
7. Based on BI solutions, we created a set of dashboards that allow for quick and transparent management decisions based on data products.