logo
  • Home
  • About
  • Skills
  • Services
  • Experience
  • Portfolio
  • Blog
  • Contact
post

Hello, I’m Saeid

Data engineer & MLOps, focused on reliable, production-grade batch and streaming data pipelines.

Get in touch

About Me

I am a Data Engineer based in Brussels. with a diverse engineering background spanning over a decade. I am currently finalizing my MSc in Applied Computer Science while completing an intensive Data Engineering Masterclass focused on AWS, Big Data, and Modern Data Architectures.

My background in maintaining critical medical systems taught me the value of reliability, compliance (FDA/GDPR), and zero-downtime operations. Today, I apply that same discipline to build robust data pipelines. I specialize in Real-time Streaming (Kafka/Flink), Cloud Infrastructure, and Data Security.

Open to work

Languages

Python (Pandas, PySpark, PyFlink), SQL (PostgreSQL, ClickHouse),Java, Bash.

Orchestration & Transformation

Apache Airflow, dbt, Docker, Kubernetes.

Streaming & Processing

Apache Kafka, Apache Flink (SQL/Table API), Spark Structured Streaming, Debezium (CDC).

Cloud & Platforms

AWS (Glue, Athena, Lambda), GCP (BigQuery), Snowflake (Data Warehousing), Databricks (Lakehouse/Delta Lake).

Monitoring & Visualization

Grafana (dashboards/alerts), Power BI, Tableau.

Data Pipelines

Building reliable Batch and Streaming pipelines to move data from sources to storage using Kafka & Flink.

Data Warehousing

Designing clean data models (like Star Schema) to ensure data is organized, fast, and ready for analytical queries.

Observability

Setting up Monitoring (Grafana) to catch issues early and ensure pipelines are running smoothly 24/7.

Data Quality

Implementing automated tests (using dbt concepts) to ensure data is accurate, consistent, and trustworthy.

Data Visualization

Creating clear Dashboards (Power BI/Tableau) to help stakeholders visualize trends and make decisions.

Professional History

Combining engineering discipline with modern data skills.

Data Engineer

Nov 2024 - Present | Remote

Built a real-time fraud detection system using Apache Kafka and Spark Streaming. Setup Grafana dashboards for live monitoring and ensured seamless data integration.

Data Visualization & Analyst (Intern)

Orange Business | Jul 2025 - Aug 2025

Worked with Google Cloud Platform (GCP) and Power BI to visualize complex datasets. Handled ETL processes and created technical reports to support business decisions.

Senior Technical Service Engineer

Sina Parto Jam | Apr 2019 - Dec 2021

Maintained complex medical imaging systems with 99% uptime. Analyzed system logs for root-cause analysis and ensured strict compliance with FDA regulations.

Radiology Technician

Bu Ali Sina Hospital | Aug 2014 - Feb 2017

Operated high-tech equipment in a fast-paced environment, requiring precision and adherence to Safety Protocols.

Portfolio

  • all

Real-Time Food Delivery

Scalable streaming architecture handling high-throughput events via Kafka & Flink.

Postgres Stream Anonymizer

Real-time GDPR masking with Debezium & PostgreSQL.

Breast Cancer ML Project

Medical classification model · Healthcare

Fraud Detection

Fault-tolerant anomaly detection pipeline using Spark Streaming & Grafana.

Banking System V2

Robust Java backend designed with Clean Architecture & OOP principles.

LinkedIn Posts

28 Nov

Soft skills that really matter in tech

Read More
27 Nov

Data Engineering for Solution Architecture

Read More
04 Nov

ETL → ELT in 2025: What changed, what’s now, and what actually works

Read More
Read All Blogs

Get in Touch with Saeid

Privacy

By sending this form you agree to my Privacy & Cookie Policy.

© 2025 Saeid Shahriari. All Rights Reserved