Paris, Ile-de-France region

George Siachamis

Data Systems Engineer | Postdoctoral Researcher at Inria

Building scalable data systems across stream processing, graph data management, data integration, and transactional workflows.

I am currently a Postdoctoral Researcher at Inria, in the CEDAR team (October 2024 - Present), working with Ioana Manolescu. I received my PhD from Delft University of Technology, with thesis title Adaptivity for Streaming Dataflow Engines, supervised by Asterios Katsifodimos. At TU Delft, I was part of the Web Information Systems group and AI for Fintech Research. Before my PhD, I completed a research internship at CY Cergy Paris University. I hold an Integrated M.Eng. in Electrical and Computer Engineering from National Technical University of Athens (NTUA).

Core Skills

Expert Proficient Familiar

Java Python SQL Kafka Apache Flink Docker Kubernetes Git Apache Spark Rust Go C/C++

Experience

Postdoctoral Researcher / Research Engineer, Inria CEDAR team

October 2024 - Present | Saclay, France

  • Designed and implemented software components for ontology-based data access, graph data lakes, and keyword-search query evaluation.
  • Built software components for the Data Exchange Platform (DXP) project, including access-control mechanisms for RDF/OBDA systems and ontology/schema-mapping pipelines for airline-sector data integration.

Academic Consultant, ING Group (Global Analytics and Tech Infra)

January 2020 - December 2023 | Amsterdam, Netherlands

  • Member of the AI for FinTech Research collaboration between ING and TU Delft.
  • Contributed to industry-academia collaboration on data systems topics in financial technology and infra management.
  • Explored approaches for infrastructure asset management and real-time internal processing tools.
  • Facilitated knowledge transfer through seminars and technical exchanges across academic and industrial stakeholders.

PhD Candidate, TU Delft (Web Information Systems)

November 2019 - April 2024 | Delft, Netherlands

  • Designed and evaluated adaptive mechanisms for distributed stream processing systems, focusing on autoscaling, checkpointing, and skew-aware similarity joins.
  • Built experimental frameworks and workloads in Java/Python for benchmarking stream processing engines under dynamic conditions.
  • Conducted large-scale empirical evaluations of fault-tolerance and autoscaling strategies.
  • Co-authored publications in ICDE, DEBS, SIGMOD, and other related venues.

Research Intern, Universite de Cergy-Paris

April 2019 - June 2019 | Paris, France

  • Extended the publicly available code of OrpheusDB to include all the data versioning strategies discussed in the paper, and designed and performed an extended experimental evaluation.

Software Engineer, FOCUS ON DIGITAL SERVICES LTD.

January 2015 - February 2016 | Athens, Greece

  • Part of a team that designed and implemented the digital content of an English learning book publisher.
  • Worked with Flash, HTML5 and video/audio processing tools.

Projects

Valentine: (Schema-) Matching DataFrames Made Easy

Built a Python package for capturing potential relationships among columns of different tabular datasets represented as pandas DataFrames.

Impact: Used by many researchers in data integration and adopted by companies.

Extracting Ontologies out of JSON Schema Specifications

Built a Python toolkit that transforms JSON Schema specifications into an RDFS ontology, reports ontology statistics, and generates synthetic airline-sector datasets, using sentence embeddings (SBERT), hierarchical clustering, and LLMs.

Impact: Delivered a practical ontology-engineering pipeline for the Inria-Amadeus Data Exchange Platform (DXP) project.

Rust Text Editor

A Rust project focused on building a lightweight text editor, exploring low-level systems concepts, terminal interaction, and idiomatic Rust design patterns.

Impact: Hands-on growth in systems programming and Rust engineering practices.

Self-hosted Homelab

Built a fully functional home server on reused hardware with Docker-based deployments, strong observability, and maintainable operations.

Deployed services include on-prem cloud storage, media servers, and productivity tooling.

Impact: Continuous experimentation and hands-on improvement in real-world operations.

Selected Publications

Contact

Best way to reach me is by email: george@gsiachamis.dev

You can also connect via LinkedIn or browse my code on GitHub.