WELCOME TO MY UNIVERSE

Crafting Digital
Masterpieces

I'm Ayoub Bouz, a professional AI Engineer & Data Scientist
dedicated to building high-performance, user-centric AI applications.

ayoub_bouz.py
const developer = {
  name: 'Ayoub Bouz',
  focus: 'AI & Data Engineering',
  stack: ['Python', 'LLMs', 'Pandas', 'Langchain'],
  shipping: true,
  motto: "From Data to Product"
};
developer.showcase();

Discovery

About The Architect

I'm Ayoub Bouz, an AI Engineer and Data Scientist with over 5 years of experience designing and deploying intelligent systems that combine large language models, machine learning, optimization algorithms, and big data processing. I build end-to-end data pipelines and AI solutions in Python, SQL, and cloud-native stacks (AWS, Azure, GCP) that improve decision-making and operational efficiency across business domains. From LLM-powered chatbots and speech recognition pipelines to climate-risk modeling and geospatial analytics, I focus on turning complex data into production systems that ship.
5+ Years Experience
10+ AI / ML Projects
20+ Tech Mastered
Ayoub Bouz

Built with Passion

Professional Journey
Oct 2024 - Present

Data Scientist / AI Engineer

Tersea — Casablanca, Morocco

Developed an intelligent chatbot powered by large language models (LLMs) to assist call center agents with FAQ retrieval, intent classification, and response generation.

Implemented a full automatic speech recognition (ASR) pipeline that converts voice calls into text, integrating Speech-to-Text APIs and optimizing audio preprocessing.

Continuously improved ML models for prediction and recommendation (+20% performance gain) through fine-tuning, cross-validation, and production monitoring.

Sep 2021 - Jun 2024

Data Scientist / Engineer

Augurisk — New York, USA (Remote)

Built environmental and societal risk models for floods, hurricanes, wildfires, earthquakes, crime, and air pollution — used by individuals and businesses to assess property-level climate risk.

Processed and analyzed large-scale geospatial data with Python, GeoPandas, GDAL, and PyQGIS.

Built and optimized classifiers using machine learning techniques (Scikit-Learn, TensorFlow, LightGBM).

Deployed scientific models on big data infrastructure with clusters of virtual servers (AWS EC2, EMR, S3, DynamoDB).

Generated vector tilesets from large GeoJSON collections using tippecanoe, OpenLayers, and GDAL.

Extended company data with third-party sources (US Census Bureau, ACS, NIH, USGS, CODE) and built automated anomaly detection.

Feb 2021 - Aug 2021

Data Scientist, Intern

Mobiblanc — Casablanca, Morocco

Built a recommendation system for 2M, a Moroccan TV channel, based on a collaborative filtering model designed in Python with Scikit-Learn.

Created the full data pipeline (ETL with Python and MongoDB) feeding the recommender.

Served predictions via a REST API built with Flask.

Designed dashboards and additional data pipelines for adjacent projects using Python and Power BI.
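
The collaborative filtering idea behind the recommender can be illustrated with a minimal item-based sketch. It uses only the Python standard library rather than Scikit-Learn, and the viewing data, the programme names, and the `recommend` helper are hypothetical placeholders, not the production model.

```python
from collections import defaultdict
from math import sqrt

# Hypothetical viewing data: user -> {programme: implicit rating}.
ratings = {
    "u1": {"news": 5.0, "drama": 3.0},
    "u2": {"news": 4.0, "drama": 2.0, "sports": 5.0},
    "u3": {"drama": 4.0, "sports": 1.0},
}


def item_vectors(ratings):
    """Invert user -> item ratings into item -> user ratings."""
    items = defaultdict(dict)
    for user, user_ratings in ratings.items():
        for item, score in user_ratings.items():
            items[item][user] = score
    return items


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    shared = set(a) & set(b)
    if not shared:
        return 0.0
    dot = sum(a[k] * b[k] for k in shared)
    norm_a = sqrt(sum(v * v for v in a.values()))
    norm_b = sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b)


def recommend(user, ratings, top_n=3):
    """Rank unseen items by the similarity-weighted ratings of seen items."""
    items = item_vectors(ratings)
    seen = ratings[user]
    scores = {}
    for candidate, vec in items.items():
        if candidate in seen:
            continue
        weighted = sum(cosine(vec, items[s]) * r for s, r in seen.items())
        total = sum(cosine(vec, items[s]) for s in seen)
        if total > 0:
            scores[candidate] = weighted / total
    return sorted(scores, key=scores.get, reverse=True)[:top_n]
```

Calling `recommend("u1", ratings)` ranks the programmes that `u1` has not yet watched. A service layer such as the Flask API mentioned above could wrap a function of this shape.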

Jun 2020 - Aug 2020

Data Engineer, Intern

Leyton Morocco — Casablanca, Morocco

Worked for Leyton's Data Factory & Labs team.

Built a relational database from multiple sources via web scraping and PDF parsing (Python, PostgreSQL, BeautifulSoup).

Predicted missing emails from Salesforce France and verified their existence via SMTP probing in Python.

Jul 2019 - Aug 2019

Python Developer, Intern

Leyton Morocco — Casablanca, Morocco

Developed Python bots and scripts for the Data Labs team to scrape and process company information at scale.

Built Selenium-based scrapers to extract data from multiple corporate websites.

Downloaded and parsed thousands of XML files; distributed processing using a PySpark cluster architecture.

Persisted results to PostgreSQL for downstream analytics.
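
As a rough illustration of the XML parsing step, the sketch below turns one document into flat records using the standard library's `xml.etree.ElementTree`. The `<company>` layout, the field names, and `parse_companies` are assumptions made for the example; the real files' schema is not described here.

```python
import xml.etree.ElementTree as ET

# Hypothetical document layout, used only for illustration.
SAMPLE_XML = """
<companies>
  <company id="42">
    <name>Example SARL</name>
    <city>Casablanca</city>
  </company>
  <company id="43">
    <name>Sample SA</name>
  </company>
</companies>
""".strip()


def parse_companies(xml_text):
    """Turn one XML document into a list of flat dict records."""
    root = ET.fromstring(xml_text)
    records = []
    for node in root.iter("company"):
        records.append({
            "id": node.get("id"),
            # findtext returns the default when the child element is absent.
            "name": node.findtext("name", default=""),
            "city": node.findtext("city", default=None),
        })
    return records


records = parse_companies(SAMPLE_XML)
```

Because the function takes a single document and returns plain records, it is the kind of unit that could be mapped over many files in a distributed job and then written to a database table.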

Inventory

The Tech Stack

Projects Showcase

Portfolio

Featured Creations

A selection of high-impact digital solutions, built with a focus on scalability, performance, and exceptional user experience.

LLM Chatbot & ASR Pipeline (Tersea)

Production AI system for call centers: an LLM-powered chatbot that assists agents with FAQ retrieval, intent classification, and response generation — paired with a full automatic speech recognition (ASR) pipeline that converts live voice calls into text. Continuously improved ML prediction and recommendation models in production, achieving a +20% performance gain through fine-tuning, cross-validation, and monitoring.

Python, LLMs, Transformers, Speech-to-Text, FastAPI
Augurisk — Climate & Societal Risk Platform

Platform that helps individuals and businesses assess the climate and societal risks associated with their properties. I built environmental and societal risk models covering floods, hurricanes, wildfires, earthquakes, air pollution, crime, socioeconomic risk, and health infrastructure — processing large-scale geospatial data and deploying models on cloud infrastructure to deliver property-level risk scores at scale.

Python, GeoPandas, GDAL, PyQGIS, Scikit-Learn
Power Consumption Forecasting in Tetouan

End-to-end ML system that predicts power consumption across 3 zones of Tetouan, Morocco. Covers the full lifecycle: data preprocessing, feature engineering, model training and evaluation, experiment tracking with MLflow, containerization with Docker, and production deployment on AWS for scalable predictions exposed via a Flask API.
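
One common way to approach the feature engineering and evaluation steps for a forecasting problem like this is to build lag features and split the data chronologically. The sketch below assumes that approach; the project description does not say which features were used, and the readings and helper names are hypothetical.

```python
def make_lag_features(series, n_lags):
    """Build (features, target) pairs; features are the previous n_lags values."""
    rows = []
    for t in range(n_lags, len(series)):
        rows.append((series[t - n_lags:t], series[t]))
    return rows


def chronological_split(rows, test_fraction=0.25):
    """Split without shuffling so the test rows are strictly later in time."""
    cut = int(len(rows) * (1 - test_fraction))
    return rows[:cut], rows[cut:]


# Hypothetical consumption readings for one zone (arbitrary units).
zone_load = [10.0, 12.0, 11.0, 13.0, 15.0, 14.0, 16.0, 18.0]
rows = make_lag_features(zone_load, n_lags=2)
train, test = chronological_split(rows)
```

Keeping the split in time order avoids leaking future readings into training, which a random shuffle would do for time series data.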

Python, NumPy, Pandas, Scikit-Learn, Flask
Communication

Let's Connect

Have a project in mind or just want to say hi? I'm always open to discussing new opportunities and creative ideas.

Send a Message

I'll get back to you within 24 hours.