You're seeing this page as if you were . The main menu is still yours, though. Exit from immersion
Mohamed Achraf BaizMA

Mohamed Achraf Baiz

Software Data Engineer

750 €/jour
Paris, FR
3-7 ans

Délai de réponse moyen : 1h

À propos de Mohamed Achraf

Experienced Data Engineer and Software Engineering with a strong background in Python, REST API development, and data integration across cloud platforms. Skilled in designing and implementing robust python libraries, scalable ETL/ELT pipelines using Databricks and PySpark or custom python. Proficient in Azure and AWS ecosystems, including Azure Functions, Azure ML, ADLS, AWS Glue, Athena, RDS, and DMS, with some expertise in CI/CD pipelines, containerization (Docker, ACR, ECR), and automated deployments.

Demonstrated experience in software architecture, pipeline orchestration with Airflow, data validation using Pydantic and Pandera, and monitoring dashboards with Chronograf and Grafana. Strong understanding of cloud-native REST APIs, multi-agent systems, and production model serving, with hands-on exposure to MLflow. Experienced in version control and automation using Git, GitHub Actions, pre-commit hooks, and security scanning tools (Blackduck, Coverity, Trivy, JFrog Xray).

Core Technologies: Python, REST API, Databricks, PySpark, Azure, AWS, Docker, Git, Delta Lake, Airflow, MLflow, SQL, Pandera, Pydantic, Containerization, Software Architecture.
  • Anglais

    Bilingue ou natif

  • Français

    Bilingue ou natif

  • Arabe

    Bilingue ou natif

Accepte de travailler sur site
Paris (jusqu’à 50 km)

Expériences

  • Schneider-electric
    Software Data Engineer - Services
    ENERGIE
    mars 2023 - Aujourd'hui (3 ans et 3 mois)
    92500 Rueil-Malmaison, France
    Developed a Python REST API using FastAPI packaged within an Azure Function, containerized it as a Docker image, and deployed it on Azure Function App for scalable serverless execution.



    Built a LangChain / LangGraph–based multi-agent system (3-agent graph) leveraging OpenAI to perform Text-to-SQL generation, including SQL creation via Databricks Genie.



    Integrated SQLGlot for sql parsing, validation, correction, and adaptation, enabling automated remediation of malformed queries before execution.



    Implemented logic to execute validated SQL against Databricks Genie API or custom queries on Databricks SQL Warehouse, followed by a downstream agent to interpret query results against the appropriate table/view.



    Built a Databricks Bundle pipeline to register a custom embedding model into MLflow Unity Catalog, and automated model serving by creating/updating serving endpoints.



    Created an additional Databricks Vector Search endpoint to build and maintain vector indexes using delta_sync on target Delta tables.



    Implemented data integration pipelines on Databricks 15.4 LTS using PySpark 3.5.5, automating ingestion, transformation, and optimization workflows across Delta Lake storage layers (bronze, silver, gold).



    Engineered a custom data catalog to track pipeline lineage, schema evolution, and transformation logic, enabling transparent governance and auditability across the multi-layer Delta architecture.



    Built scalable, fault-tolerant ETL/ELT processes leveraging Delta tables, optimized cluster configurations, and Databricks-native features (e.g., Auto Loader, Delta Live Tables patterns) to ensure data quality, consistency, and high performance.
    Databricks Azure Functions Docker Python Vector Embeddings
  • Schneider-electric
    Software Data Engineer - Energy Management
    ENERGIE
    mars 2023 - Aujourd'hui (3 ans et 3 mois)
    Rueil-Malmaison, France
    Designed and deployed Azure Machine Learning pipelines and Streamlit applications, enabling scalable, reproducible model deployments and automated workflows via Azure Container Registry.

    Developed a FastAPI/Azure Functions backend interfacing with MATLAB Production Server, supporting simulations, API management, OpenAPI documentation, and ADLS Gen2 integration.

    Built high-performance Python daemons and weather-driven data pipelines for industrial system integration, real-time telemetry ingestion into InfluxDB, and DER forecasting.

    Created dashboards with Chronograf, implemented structured logging (Loguru), and enforced rigorous data validation with Pydantic and Pandera.

    Automated testing, CI/CD (GitHub Actions), dependency management (JFrog Artifactory), and container security (Trivy, JFrog Xray) to ensure quality, reproducibility, and secure deployments.

    Engineered modular, configuration-driven architectures (YAML + Python validation), optimized columnar data handling with PyArrow, and maintained documentation and changelogs using Sphinx and git-cliff.
    Multiprocessing Packaging Docker Python Azure Functions
  • BMW Group France
    Data Engineer Consultant
    AUTOMOBILE
    mai 2020 - mars 2023 (2 ans et 10 mois)
    Montigny-le-Bretonneux, France
    Developed Python-based ETL pipelines to ingest, integrate, and process data from Salesforce, Google Ads API, Adobe Omniture API 1.4, and Oracle Siebel CRM (PyODBC), handling schema normalization, deduplication, historical tracking, and high-volume datasets for marketing and sales analytics. Implemented CI/CD pipelines with AWS CodeCommit, CodeBuild, Lambda, ECR, and CodeArtifact, containerized ETL services with Docker, and orchestrated batch workflows using Airflow for automated, reliable, and reproducible deployments. Leveraged AWS Glue Crawlers and Data Catalog for metadata management, Amazon Athena for querying and aggregation, AWS RDS (PostgreSQL) for transactional support, and AWS DMS for daily data migrations across heterogeneous sources. Optimized AWS S3 data lake storage with partitioning, lifecycle policies, and integration with downstream analytics; automated operations with Python and Shell scripts, and monitored pipelines with dashboards (Chronograf). Applied domain expertise in automotive marketing and sales, collaborating with cross-functional teams to validate data quality, align pipelines with business KPIs, and ensure actionable insights for reporting and analytics.
    PostgreSQL AWS Athena SQL Airflow Python

Recommandations

Soyez le premier à recommander Mohamed Achraf

Contribuez à la réussite de ce freelance en partageant votre expérience de collaboration avec lui.

Ces profils de freelance correspondent également à vos critères

AgathaA

Agatha Frydrych

Backend Java Software Engineer

4.7

(3)

2

BaptisteB

Baptiste Duhen

Fullstack developer

4.6

(4)

5

AmedA

Amed Hamou

Senior Lead Developer

4

(2)

7

AudreyA

Audrey Champion

Web developer

4.3

(3)

4

Formations

  • Software and Systems of Information Engineer
    Faculty of Science and Technology
    Software and Systems of Information Engineer
  • MSc Data Science
    Ecole Centrale de Lyon
    MSc Data Science

Compétences

Catégories