Alessandro Di Stefano, PhD

Cloud-Native & Distributed Systems Engineer - DevOps, MLOps, Platform Engineering, SRE
+44 (0) 747 64 386 43 [email protected] aleskandro aleskandro
I'm passionate about building intelligent, adaptable, cloud-native systems at the intersection of distributed computing, DevOps, MLOps, and AI, in and on Kubernetes. As a Principal Software Engineer in the Performance and Scalability for AI Applications team at Red Hat, I work on distributed AI/LLM inference with vLLM/LLM-D and Openshift AI.
My deeper mission is to bridge the rigor of academic research with the fast-paced, practical demands of industry—translating ideas into robust infrastructure that delivers value at scale. I hold a PhD in Distributed Computing, where I specialized in AIOps for PaaS systems, focusing on how AI-driven automation can support infrastructure decision-making and operational resilience.
Before joining Red Hat, I spent over five years as an independent consultant specializing in software architecture and design while pursuing my studies. I've mentored students at the University of Catania's Distributed Computing Lab, helping them build microservices-based Cloud-Native applications to run in Kubernetes clusters—sharing the same spark that inspired me as a kid, when I played a Commodore 64.
I'm driven by a deep belief in open collaboration, education, and a decentralized, censorship-resistant, and free internet. To me, Free and Open Source Software is more than code—it's a philosophy of transparency, empowerment, and collective progress.
When I'm not engineering systems, you'll find me hiking, climbing, at a live music concert, or reading sci-fi and non-fiction books.

Experience

Principal Software Engineer
Red Hat Inc.
09/2025 - Now
UK (Remote)
Technologies:  vLLM, LLM-D, Open Data Hub, Openshift AI, Kubernetes, eBPF, Open Telemetry, Python AI & Data Science Ecosystem.
Senior Software Engineer
Red Hat Inc.
08/2021 - 08/2025
UK (Remote)
Technologies:  GNU/Linux, Docker, Kubernetes, OpenShift, Golang, Python, Rust.
Research Engineer (Contractor)
Aucta Cognitio srl
08/2020 - 07/2021
Italy (Remote)
Technologies:  Linux, Kubernetes, GitLab, Golang, Python, Angular, Kafka, Prometheus, Machine Learning (LSTMs, ARIMA, Regression Models), Time-Series Analysis, SCADA Systems.
Self-Employed
Software Architecture & DevOps Consulting
01/2012 - 07/2021
Italy (Remote)
Technologies:  GNU/Linux, Docker, Kubernetes, Golang, Python, Rust, Ruby on Rails, Ansible, Terraform, Proxmox, pfSense, Active Directory, Kafka, Elasticsearch, MinIO, Prometheus.

Education

PhD in Distributed and Parallel Computing
University of Catania
10/2018 - 11/2021
Italy
Thesis: AIOps: communication-aware management of SLAs for Cloud-Native Applications.
MSc in Computer Engineering
University of Catania
10/2016 - 10/2018
Italy
Thesis: Raphtory: building distributed online graph processing system.
BSc in Computer Engineering
University of Catania
01/2011 - 07/2016
Italy
Thesis: ONOS and JFlowLight. Quality of Service Management for Software Defined Networking.
Research Engineer
Queen Mary University of London
03/2018 - 10/2018
UK

Publications

Certifications

3rd International Summer School on Deep Learning , Warsaw, Poland
01/2019
VI Mediterranean school of complex networks , Salina, Italy
07/2019
Lipari School on Network and Computer Sciences , Lipari, Italy
07/2017
Angular.JS certificate , University of Catania
01/2016
Degree in Music Theory , Conservatory of Music "Vincenzo Bellini", Catania, Italy
01/2009

Societies

Scout in the Italian Scout Association "Agesci" , Italy
01/2000 - 01/2010
Co-founder of the Scordia Linux User Group , Italy
01/2008 - 01/2012
Hacktivist at Catania GNU/Linux User Group , Italy
01/2008 - 01/2017
Hacktivist at Freaknet Medialab , Italy
01/2008 - 01/2017
Scoutmaster in the Italian Scout Association "Agesci" , Italy
01/2012 - 01/2020
Volunteer at MOCA Olografix Camp Hackmeeting , Italy
08/2016 - 08/2016

Skills

Programming And systems:

Python, Golang, Rust, C/C++, GNU/Linux, Docker, Kubernetes, OpenShift

DevOps and Cloud:

Ansible, Terraform, Jenkins, GitLab, AWS, Azure, Google Cloud, Bare Metal

Database and Observability:

PostgreSQL, MySQL, MongoDB, Redis, Prometheus, Grafana, ELK Stack, Jaeger

Soft Skills:

Team Leadership, Mentoring, Public Speaking, Technical Writing