I build scalable data + AI systems that deliver measurable impact.
AI/ML Engineer and PhD candidate in Data Engineering (Aalborg University & University of Athens), specializing in data-intensive LLM applications, RAG architectures, and production-scale deep learning.
My work blends research innovation with production-grade engineering, delivering outcomes like 80× faster ML pipelines, 18× smaller vector indexes, and retrieval systems that power intelligent, context-aware applications.
Programming & Core Foundations
Python C++ SQL Linux GitMachine Learning & AI
LLMs Agentic AI Tools RAG Pipelines LangChain Deep Learning Statistical Learning Experiment Tracking (MLflow)Data Engineering
ETL / ELT Pipelines Data Modeling dbt SQL (PostgreSQL) NoSQL (Redis, MongoDB, Elasticsearch) Data Lakes Apache Spark Workflow Orchestration (Airflow) Hadoop EcosystemMLOps & Deployment
CI/CD Pipelines Containerization (Docker) Model Serving (API-based) FastAPI Streamlit Cloud (AWS, GCP)Platforms, Tools & Delivery
GitHub JiraCertifications & Networking
CCNA CCNP Networking FundamentalsFebruary 2022 - Present
&
Athena Research and Innovation Center
- Developed advanced indexing techniques for vector-based table representation learning
- Optimized table discovery and search systems for data integration and augmentation
- Contributed to design of expressive exploratory workflows and scalability improvements
- Publications:
-
Table Search in Data Lakes: Methods, Indexing Techniques, and Research Challenges
I. Taha, M. Lissandrini, A. Simitsis, T. B. Pedersen, Y. Ioannidis
In Data Engineering for Data Science, Springer, 2026 -
Comparative Analysis of Indexing Techniques for Table Search in Data Lakes
I. Taha, M. Lissandrini, A. Simitsis, Y. Ioannidis
International Journal of Semantic Computing, 2025 -
A Study on Efficient Indexing for Table Search in Data Lakes
I. Taha, M. Lissandrini, A. Simitsis, Y. Ioannidis
In IEEE International Conference on Semantic Computing (ICSC), 2024
-
Table Search in Data Lakes: Methods, Indexing Techniques, and Research Challenges
December 2016 - August 2018
- Writing verification environment scripts using Python, NLP, Bash, Linux and Git
- Verifying design function using System Verilog simulators
- Received best possible performance reviews and ranked among top engineers
February 2024 - May 2024
- Designed and implemented robust web scraping platform for static and dynamic content
- Developed configurable XPath and regex-based extractors for affiliation metadata
- Mapped extracted data to publication IDs across heterogeneous publisher websites
- Explored ML models (transformer-based) for automated entity recognition
- Successfully extracted and linked affiliation data for majority of publisher sources
March 2020 - September 2020
- Applied RNNs and deep learning for signal processing in optical communications
- Reduced preprocessing time from 4 hours to 3 minutes through optimization
- Data engineering, preparation, optimization, processing, and analysis
- Improved BER (Bit Error Rate) using regression-based RNN models
January 2017 - November 2021
Created a learning environment and facilitated network and systems courses including Python, C++ and CCNA Routing and Switching.
June 2022 - Present
Aalborg University, Denmark & National and Kapodistrian University of Athens, Greece
PhD research on interactive data exploration and analytics in large-scale, heterogeneous data lakes.Academic Engagement: Participated in 5 specialized research and technical schools focused on data engineering, big data, and AI.
Industry Experience: Completed data engineering internship at OpenAIRE.
October 2018 - September 2020
Institut Polytechnique de Paris, France & University of Athens, Greece
Specialized in Big Data Management, Data Mining, Data Science, Machine Learning, AI, Deep Learning, and 5G Networks.- Developed expertise in scalable architectures, predictive analytics, and neural networks
- Gained practical knowledge in distributed processing and AI-driven decision-making
- Master's thesis on machine learning for optical signal processing
- Completed hands-on projects and internship at Orange Labs
August 2014 - June 2015
Uppsala University, Sweden
One academic year exchange program covering advanced technical subjects:- High Performance Computing and Programming
- Operating Systems I and Distributed Systems
- Computer Networks I and Computer Graphics
- Collaborated with Master's and PhD students, gaining exposure to research-oriented learning
June 2011 - August 2016
An-Najah National University, Nablus, Palestine
Comprehensive ABET-accredited degree covering algorithms, data structures, operating systems, computer architecture, networks, microprocessors, and digital systems.- Completed one-year exchange at Uppsala University
- Finalized both software and hardware graduation projects
- Gained strong foundations in computing theory and real-world application
- Three Erasmus Mundus Scholarships
- Marie Skłodowska-Curie PhD Fellowship
- Reduced-fee scholarship for VLDB Summer School 2025
