You're seeing this page as if you were . The main menu is still yours, though. Exit from immersion
Arnaud HureauxAH

Arnaud Hureaux

GenAI Expert / Data Engineer / Data Scientist

670 €/jour
3 projets
Paris, FR
3-7 ans

Délai de réponse moyen : 1h

À propos de Arnaud

8 ans d'expériences en tant que Data Engineer & Data Scientist, avec des expériences axés sur les pipelines de données et les usecases LLM/NLP


Expériences (non exhaustif, cf mon CV) :

Decathlon (2 years)​ - GenAI Expert
Context: the Customs had a process to verify 20 000 "container's PDFs" to avoid fines fromEuropean Customs Office, which mobilised 20 Full-Time-Equivalent​
As a LLM Expert, I automated the process with a workflow of 3 steps:​
Categorise => Classic NLP with TFIDF/LogisticRegression deploy on AWS Lambda​
Extract => GenAI / LLM with AWS Bedrock Anthropic models and Bedrock Batch/Airflow​
Compare => Databricks SQL/Spark Data Pipeline and AWS Lambda Python workflow​
Display => Front in Streamlit/React and deployment on EKS​
Huge success of the POC (1 month), success of the POV, proof of value (4 months) andsuccessful deployment on 12 countries (Bangladesh, India, Vietnam...), 1M of value added​
2 years of developpement, 1 alone and 1 as a Tech Lead with an Data Scientist to manage​
2 parralel projects of RAG development : to help Customs to find the best HS code for theirproduct (Pinecone & Amazon OpenSearch) + to chat with transport documentation)

Groupe Automobile - Management de 4 Développeurs Palantir Foundry
Pipeline en production de 15 datasources, 70 intermediates datasets, 20 dashboards, et 20 ontologies
Techs : Spark, Palantir Foundry, Typescript
Groupe dans l'Energie (6 mois) - Développement d’un pipeline sous Palantir Foudry & AWS
Techs : Spark, Palantir Foundry, Typescript, AWS (Glue Jobs, Lambda, Eventbridge, S3)

Luxe - Clusterisation de points de ventes
Production de features clients à partir des données CRM et du langage Python. Utilisation de ces features pour produire une clusterisation à partir d’algorithmes non supervisés.

Banque - Classification automatique de PDFs via du Deep learning
Développement d’un outil de classification de PDF
Computer Vision et NLP (Spacy, Tensorflow, OpenCV, Flask, SHAP)

  • Français

    Bilingue ou natif

  • Anglais

    Capacité professionnelle complète

Accepte de travailler sur site
Paris (jusqu’à 50 km)

Expériences

  • BNP PARIBAS PACE
    Logo MaltSur Malt
    Data Scientist
    BANQUE & ASSURANCES
    juin 2024 - juillet 2024 (2 mois)
    Paris, France
    Comprehensive modelization of the dynamic relationship between Bank Relationship Managers’ commercial performance, customer advocacy measured through Net Promoter Score (NPS), and the concrete actions they perform across different stages of the client journey — including prospecting, onboarding, portfolio management, and after-sales follow-up — with the objective of identifying the behavioral patterns and managerial levers that most effectively drive both revenue growth and long-term client satisfaction.
    Python MySQL PySpark Banque
  • Decathlon
    GenAI Expert & Data Engineer
    E-COMMERCE
    octobre 2023 - octobre 2025 (2 ans)
    Paris, France

    DECATHLON Global Sport Retail Group (2 years)​

    Context: the Customs had a process to verify 20 000 "container's PDFs" to avoid fines fromEuropean Customs Office, which mobilised 20 Full-Time-Equivalent​

    As a LLM Expert, I automated the process with a workflow of 3 steps:​

    Categorise => Classic NLP with TFIDF/LogisticRegression deploy on AWS Lambda​

    Extract => GenAI / LLM with AWS Bedrock Anthropic models and Bedrock Batch/Airflow​

    Compare => Databricks SQL/Spark Data Pipeline and AWS Lambda Python workflow​

    Display => Front in Streamlit/React and deployment on EKS​

    Huge success of the POC (1 month), success of the POV, proof of value (4 months) andsuccessful deployment on 12 countries (Bangladesh, India, Vietnam...), 1M of value added​

    2 years of developpement, 1 alone and 1 as a Tech Lead with an Data Scientist to manage​

    2 parralel projects of RAG development : to help Customs to find the best HS code for theirproduct (Pinecone & Amazon OpenSearch) + to chat with transport documentation)
    LLM GenAI Databricks Bedrock Gemini
  • DataImmoSolution
    DBT/GCP/AI Tech Lead
    IMMOBILIER
    janvier 2023 - janvier 2024 (1 an)
    Paris, France
    Full Stack Tech Lead on DBT / GCP / BigQuery / CloudComposer / Python / C#​
    Machine Learning API of price's prediction for reel estate​
    Data Pipeline creation on french reel estate opendata (DVF, PCI, RNC, BDNB...)​
    API creation and integration on their React website (https://dataimmosolutions.com/)
    DBT GCP BigQuery ML Python

Recommandations

Soyez le premier à recommander Arnaud

Contribuez à la réussite de ce freelance en partageant votre expérience de collaboration avec lui.

Ces profils de freelance correspondent également à vos critères

AgathaA

Agatha Frydrych

Backend Java Software Engineer

4.7

(3)

2

BaptisteB

Baptiste Duhen

Fullstack developer

4.6

(4)

5

AmedA

Amed Hamou

Senior Lead Developer

4

(2)

7

AudreyA

Audrey Champion

Web developer

4.3

(3)

4

Formations

  • Master 2 (M2), Strategy & Consulting
    EDHEC Business School
    2021
    Master 2 (M2), Strategy & Consulting
  • CPGE PCSI-PSI, Maths -Physique-Sciences Industrielles
    Académie de Paris
    2017
    CPGE PCSI-PSI, Maths -Physique-Sciences Industrielles

Compétences

Catégories