Weekly refresh · Updated —

Cefic Comext ETL Status

Eurostat Comext → PostgreSQL → Parquet export. Imports the chemistry-relevant CN codes (chapters 20–39) every Monday morning, then republishes a single Parquet file for Power BI.

Rows in DB
Months loaded
Years covered
Parquet rows
Pipeline
Last cron run
scheduled Monday 06:00
Latest period in DB
Parquet refreshed
Last run errors

Latest run

Most recent execution of the weekly cron — what it tried, what it imported, what it skipped.

Started
Finished
Duration
Files processed
Rows added
Errors
Skipped
Parquet rows

No summary available.

Published Parquet outputs

Artifacts produced by the weekly export — the fact table filtered on chemistry CN codes, plus the partner-country dimension rebuilt from Eurostat's SDMX codelist. Both are regenerated in the same export_parquet.py run.

Fact table
Rows
Size
Refreshed
period, declarant, partner, product_nc, cpa2015, chapter_cn, flow, flow_label, value_in_euros, quantity_in_kg. Filtered on ~1369 CN2025 codes from SubstanceId.csv, EU27 aggregated on both sides.

Recent imports

Last successful monthly imports recorded in etl_log.

Period Filename Rows loaded Finished at Status

Loaded rows over time

Total rows inserted into PostgreSQL per monthly file, from the oldest period to the most recent.

First period
Last period
Average / month
Largest month

Coverage matrix

Year × month grid. Filled cells = period present in etl_log (darker = more rows). Dashed cells = missing.

Fewer rows
More rows Dashed = missing month

Latest log

Tail of the most recent cron log — INFO in default, WARNING in orange, ERROR in red.

Log file