
Hello, I’m Saeid

Data engineer with an MLOps focus, building reliable, production-grade batch and streaming data pipelines.


About Me

I am a Data Engineer based in Brussels, with a diverse engineering background spanning over a decade. I am currently finalizing my MSc in Applied Computer Science while completing an intensive Data Engineering Masterclass focused on AWS, Big Data, and Modern Data Architectures.

My background in maintaining critical medical systems taught me the value of reliability, compliance (FDA/GDPR), and zero-downtime operations. Today, I apply that same discipline to building robust data pipelines. I specialize in real-time streaming (Kafka/Flink), cloud infrastructure, and data security.

Open to work

Skills

Technologies I use to build reliable data pipelines.

Languages

Python (Pandas, PySpark, PyFlink), SQL (PostgreSQL, ClickHouse), Java, Bash.

Orchestration & Transformation

Apache Airflow, Docker, Kubernetes.

Streaming & Processing

Apache Kafka, Apache Flink, Spark Streaming, Debezium (CDC).

Cloud & Platforms

AWS (Glue, Athena, Lambda), GCP (BigQuery).

Monitoring & Visualization

Grafana (dashboards/alerts), Tableau.

Services

I help teams design, build, and monitor data systems from source to dashboard.

Data Pipelines

Building reliable batch and streaming pipelines that move data from sources to storage, using Kafka and Flink.

Data Warehousing

Designing clean data models to ensure data is organized, fast, and ready for analytical queries.

Observability

Setting up monitoring and alerts in Grafana to catch issues early and keep pipelines running smoothly 24/7.

Data Quality

Implementing automated tests to ensure data is accurate, consistent, and trustworthy.
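
A minimal sketch of what such automated checks can look like, in plain Python. The column names (`order_id`, `amount`) and the three rules (completeness, uniqueness, validity) are assumptions chosen for illustration, not taken from a specific project.

```python
from collections import Counter

REQUIRED_FIELDS = ("order_id", "amount")  # hypothetical schema for the example


def find_quality_issues(rows):
    """Return a list of human-readable data-quality violations.

    Each row is expected to be a dict with the (assumed) keys in REQUIRED_FIELDS.
    """
    issues = []

    # Completeness: required fields must be present and not None.
    for i, row in enumerate(rows):
        for field in REQUIRED_FIELDS:
            if row.get(field) is None:
                issues.append(f"row {i}: missing {field}")

    # Uniqueness: order_id must not repeat across rows.
    counts = Counter(r["order_id"] for r in rows if r.get("order_id") is not None)
    for key, n in counts.items():
        if n > 1:
            issues.append(f"duplicate order_id {key} ({n} rows)")

    # Validity: amount must be non-negative when present.
    for i, row in enumerate(rows):
        amount = row.get("amount")
        if amount is not None and amount < 0:
            issues.append(f"row {i}: negative amount {amount}")

    return issues


sample = [
    {"order_id": 1, "amount": 10.0},
    {"order_id": 1, "amount": 5.0},
    {"order_id": 2, "amount": -3.0},
    {"order_id": 3, "amount": None},
]

for issue in find_quality_issues(sample):
    print(issue)
```

In a pipeline, a check like this would typically run as its own task after each load and fail the run when the list of issues is not empty.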

Data Visualization

Creating clear Tableau dashboards that help stakeholders see trends and make decisions.

Experience & Training

Combining engineering discipline with modern data skills.

Training & Projects

Data Engineering Masterclass – Trainee

Dr Mohammad Fozouni · Nov 2024 – Present · Remote

Intensive, project-based Data Engineering & MLOps bootcamp. Working hands-on with PostgreSQL, ClickHouse, Apache Kafka, Apache Spark, Apache Flink, Delta Lake, Docker, Kubernetes, Apache Airflow, MLflow, Jenkins and AWS.

Capstone projects include:
– Real-time fraud detection pipeline (Kafka, Spark, PostgreSQL, MySQL, Redis, Grafana, AWS)
– End-to-end ML model deployment with monitoring and security on the cloud
– Secure Rust-based message broker (CipherMQ), built as a product-focused project

Professional Experience

Data Visualization Analyst (Intern)

Orange Business · Jul 2025 – Aug 2025 · Brussels

Worked with Google Cloud Platform (GCP) and Power BI to visualize complex datasets. Supported ETL processes and created clear technical reports to help business stakeholders make decisions.

Senior Technical Service Engineer

Sina Parto Jam · Apr 2019 – Dec 2021 · Tehran

Maintained complex medical imaging systems with 99% uptime. Performed root-cause analysis on system logs and ensured strict compliance with FDA and local healthcare regulations.

Radiology Technician

Bu Ali Sina Hospital · Aug 2014 – Feb 2017 · Hamedan

Operated high-tech radiology equipment in a fast-paced clinical environment, delivering accurate results while following strict safety protocols.

Portfolio


Real-Time Food Delivery

Scalable streaming architecture handling high-throughput events via Kafka & Flink.

Postgres Stream Anonymizer

Real-time GDPR masking with Debezium & PostgreSQL.
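
The core masking step can be sketched as follows. This is an illustration under stated assumptions, not the project's actual code: it assumes change events arrive as JSON with Debezium's `payload` envelope (`before`, `after`, `op`), and the PII column names, the salt, and the helper names are hypothetical. It uses deterministic salted hashing, which is pseudonymization rather than full anonymization.

```python
import hashlib

PII_FIELDS = ("email", "phone")  # hypothetical PII column names
SALT = b"demo-salt"  # assumption: a real deployment would load this from a secret store


def mask_value(value):
    """Replace a value with a truncated, salted SHA-256 digest (deterministic)."""
    digest = hashlib.sha256(SALT + str(value).encode("utf-8")).hexdigest()
    return digest[:16]


def mask_row(row):
    """Return a copy of a row image with PII columns masked; None stays None."""
    if row is None:
        return None
    return {
        key: mask_value(val) if key in PII_FIELDS and val is not None else val
        for key, val in row.items()
    }


def anonymize_event(event):
    """Mask PII in the 'before' and 'after' row images of a change event."""
    payload = dict(event["payload"])  # shallow copy so the input is not mutated
    payload["before"] = mask_row(payload.get("before"))
    payload["after"] = mask_row(payload.get("after"))
    return {**event, "payload": payload}


# Example insert event ("op": "c") with made-up data.
event = {
    "payload": {
        "op": "c",
        "before": None,
        "after": {"id": 1, "email": "a@example.com", "phone": None},
    }
}

masked = anonymize_event(event)
print(masked["payload"]["after"])
```

Because the hash is deterministic, the same input always maps to the same token, so joins on masked columns still work downstream.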

Breast Cancer ML Project

Medical classification model · Healthcare

Fraud Detection

Fault-tolerant anomaly detection pipeline using Spark Streaming & Grafana.

Banking System V2

Robust Java backend designed with Clean Architecture & OOP principles.

Certifications

Selected courses and certificates that support my data engineering path.

Data Engineering: From Local Development to Server Deployment

Data Engineering School · Issued Mar 2025

CI/CD, PostgreSQL, Git & GitHub, Kubernetes, Apache Airflow, Docker, AWS, Apache Kafka, ETL.

Hands-On Essentials: Data Warehousing Workshop

Snowflake · Issued Jun 2025

Data warehousing fundamentals and best practices on the Snowflake platform.

Introduction to dbt

DataCamp · Issued Jun 2025

Analytics engineering with dbt for modular SQL transformations and testing.

Data Ingestion with Delta Lake

Databricks · Issued Apr 2025

Building reliable ingestion pipelines with Delta Lake and the Lakehouse architecture.

Data Management and Governance

Databricks · Issued Apr 2025

Data governance, access control and quality on top of the Databricks Lakehouse.

Foundational Cloud Practitioner

Maktabkhooneh · Issued Mar 2025

Core cloud concepts, security and cost-awareness for modern cloud platforms.

Machine Learning

Coursera · Issued Jun 2021

Classical ML algorithms, logistic regression, neural networks and model evaluation.

Advanced Python Programming

Maktabkhooneh · Issued Apr 2021

Advanced Python for data processing, scripting and backend development.

LinkedIn Posts

28 Nov

Soft skills that really matter in tech

27 Nov

Data Engineering for Solution Architecture

04 Nov

ETL → ELT in 2025: What changed, what’s now, and what actually works


Get in Touch with Saeid

Email: saeidshahriari1@gmail.com


© 2025 Saeid Shahriari. All Rights Reserved