The target architecture is based on open-source technologies.
The solution is based on the Data Lakehouse approach, separating the data processing layer from the storage layer.
The architecture proposes the use of the following technologies: Apache Airflow, Iceberg, Trino, S3, Spark, Clickhouse, OpenMetadata, Apache Superset, and others.