PROFILE

I build scalable data + AI systems that deliver measurable impact.

AI/ML Engineer and PhD candidate in Data Engineering (Aalborg University & University of Athens), specializing in data-intensive LLM applications, RAG architectures, and production-scale deep learning.

My work blends research innovation with production-grade engineering, delivering outcomes like 80× faster ML pipelines, 18× smaller vector indexes, and retrieval systems that power intelligent, context-aware applications.

PERSONAL INFORMATION
Phone:
Language:
English, Arabic
PROFESSIONAL SKILLS
Programming & Core Foundations
Python C++ SQL Linux Git
Machine Learning & AI
LLMs Agentic AI Tools RAG Pipelines LangChain Deep Learning Statistical Learning Experiment Tracking (MLflow)
Data Engineering
ETL / ELT Pipelines Data Modeling dbt SQL (PostgreSQL) NoSQL (Redis, MongoDB, Elasticsearch) Data Lakes Apache Spark Workflow Orchestration (Airflow) Hadoop Ecosystem
MLOps & Deployment
CI/CD Pipelines Containerization (Docker) Model Serving (API-based) FastAPI Streamlit Cloud (AWS, GCP)
Platforms, Tools & Delivery
GitHub Jira
Certifications & Networking
CCNA CCNP Networking Fundamentals
PROFESSIONAL EXPERIENCE

February 2022 - Present

Aalborg University
&
Athena Research and Innovation Center
Researcher in Data Engineering for Data Science
PhD research focusing on enabling efficient and intelligent interactive data exploration and analytics in large-scale, heterogeneous data lakes.
  • Developed advanced indexing techniques for vector-based table representation learning
  • Optimized table discovery and search systems for data integration and augmentation
  • Contributed to design of expressive exploratory workflows and scalability improvements
  • Publications:
    • Table Search in Data Lakes: Methods, Indexing Techniques, and Research Challenges
      I. Taha, M. Lissandrini, A. Simitsis, T. B. Pedersen, Y. Ioannidis
      In Data Engineering for Data Science, Springer, 2026
    • Comparative Analysis of Indexing Techniques for Table Search in Data Lakes
      I. Taha, M. Lissandrini, A. Simitsis, Y. Ioannidis
      International Journal of Semantic Computing, 2025
    • A Study on Efficient Indexing for Table Search in Data Lakes
      I. Taha, M. Lissandrini, A. Simitsis, Y. Ioannidis
      In IEEE International Conference on Semantic Computing (ICSC), 2024

December 2016 - August 2018

Cadence Design Systems
Product Validation & Verification Engineer
Contributed to software development of the cutting-edge Xcelium Parallel Simulator.
  • Writing verification environment scripts using Python, NLP, Bash, Linux and Git
  • Verifying design function using System Verilog simulators
  • Received best possible performance reviews and ranked among top engineers

February 2024 - May 2024

OpenAIRE AMKE
Data Engineer
Worked on enriching the OpenAIRE knowledge graph with scholarly publication data.
  • Designed and implemented robust web scraping platform for static and dynamic content
  • Developed configurable XPath and regex-based extractors for affiliation metadata
  • Mapped extracted data to publication IDs across heterogeneous publisher websites
  • Explored ML models (transformer-based) for automated entity recognition
  • Successfully extracted and linked affiliation data for majority of publisher sources

March 2020 - September 2020

Orange Labs
Machine Learning Engineer
Implemented smart non-linear equalizers for big data long-haul fiber optics transmission systems.
  • Applied RNNs and deep learning for signal processing in optical communications
  • Reduced preprocessing time from 4 hours to 3 minutes through optimization
  • Data engineering, preparation, optimization, processing, and analysis
  • Improved BER (Bit Error Rate) using regression-based RNN models

January 2017 - November 2021

HiTechA Academy
Co-Founder and Instructor

Created a learning environment and facilitated network and systems courses including Python, C++ and CCNA Routing and Switching.

EDUCATION

June 2022 - Present

Dual PhD Degree
Data Engineering for Data Science

Aalborg University, Denmark & National and Kapodistrian University of Athens, Greece

PhD research on interactive data exploration and analytics in large-scale, heterogeneous data lakes.
Academic Engagement: Participated in 5 specialized research and technical schools focused on data engineering, big data, and AI.
Industry Experience: Completed data engineering internship at OpenAIRE.

October 2018 - September 2020

Dual Master's Degree (Erasmus Mundus)
Big Data Analytics & 5G

Institut Polytechnique de Paris, France & University of Athens, Greece

Specialized in Big Data Management, Data Mining, Data Science, Machine Learning, AI, Deep Learning, and 5G Networks.
  • Developed expertise in scalable architectures, predictive analytics, and neural networks
  • Gained practical knowledge in distributed processing and AI-driven decision-making
  • Master's thesis on machine learning for optical signal processing
  • Completed hands-on projects and internship at Orange Labs

August 2014 - June 2015

Erasmus Mundus Exchange
Computer Science

Uppsala University, Sweden

One academic year exchange program covering advanced technical subjects:
  • High Performance Computing and Programming
  • Operating Systems I and Distributed Systems
  • Computer Networks I and Computer Graphics
  • Collaborated with Master's and PhD students, gaining exposure to research-oriented learning

June 2011 - August 2016

Bachelor of Science
Computer Engineering

An-Najah National University, Nablus, Palestine

Comprehensive ABET-accredited degree covering algorithms, data structures, operating systems, computer architecture, networks, microprocessors, and digital systems.
  • Completed one-year exchange at Uppsala University
  • Finalized both software and hardware graduation projects
  • Gained strong foundations in computing theory and real-world application

HONORS & AWARDS
Scholarships & Fellowships
  • Three Erasmus Mundus Scholarships
  • Marie Skłodowska-Curie PhD Fellowship
  • Reduced-fee scholarship for VLDB Summer School 2025
INTERESTS
What I enjoy
Building AI projects Exploring emerging technologies Reading Tech meetups & community events Outdoor running
Contact Me
Feel free to contact me