Soufiane Aazizi

Senior Data Scientist, Ph.D.

€950/day
2 projects
Paris, FR
8-15 years

Average response time: 1h

About Soufiane

Lead AI Engineer with 12+ years of experience in data science, quantitative finance, and machine learning — now specialized
in Generative AI and LLM systems. Designed and deployed agentic AI architectures, RAG pipelines, and multi-agent
frameworks in regulated industries (pharma, banking, transport). Background in quantitative strategies (Société Générale,
Thomson-Reuters, RBC Dexia). Fluent in English, French, and Arabic. Based in France, open to international opportunities.
AWS ML Certified.
  • English

    Full professional proficiency

  • French

    Native or bilingual

  • Arabic

    Native or bilingual

Available to work on-site
Paris (up to 50 km), Paris (up to 100 km), Lille (up to  km), Toulouse (up to 100 km)

Experience

  • Servier Laboratory
    Lead Gen AI
    PHARMACEUTICAL INDUSTRY
    December 2024 - Present (1 year and 6 months)
    Suresnes, France
    • Developed Agentic ICF system that transforms complex Clinical Study Protocol tables (spanning multiple pages, dozens of columns and rows) into concise ICF Summary Tables — reducing generation time from 1–2 days (manual) to under 5 minutes, leveraging Skills Cards architecture with scoped MCP tool restrictions.
    • Developed Vision Agentic RAG system with DSPy serving a global team of medical writers; improved retrieval hit-rate@6 from 60% to 85% across thousands of indexed documents and images.
    • Built custom document parser using Docling to extract tables, figures, and images from complex PDFs, RTF, and DOCX into structured Markdown and metadata; indexed into Weaviate (text) and Google Cloud Storage (thousands of images).
    • Designed a comprehensive evaluation framework assessing parsing quality, retriever performance, anti-hallucination robustness, and answer generation effectiveness.
    • Re-engineered a monolithic application into a fully streaming architecture, reducing long-running task response time from 2 minutes to ~4 seconds.
    Technologies: DSPy, Agentic-RAG, Compound AI, Docling, Weaviate, Vertex AI, GCS/GCP
    DSPy, Weaviate, Docling, Google Cloud, RAG
  • KPMG
    Lead Data Scientist
    CONSULTING & AUDIT
    April 2024 - November 2024 (7 months)
    Paris, France
    • Led a team of 5 Data Scientists; delivered POC in 2 months and production-ready RAG chatbot with full UI in 3 months, parsing thousands of documents (PDF, PPTX, images) for the audit department.
    • Designed a compound AI architecture with DSPy optimizers (query decomposition, chain-of-thought reasoning, and multi-hop document traversal), improving answer accuracy from 70% to 94%.
    • Implemented Azure Search reranking and enhanced recursive retrieval with DSPy for dynamic keyword generation; achieved ~4-second streaming response time.
    • Integrated LangFuse for real-time monitoring, performance evaluation, and feedback collection.
    Technologies: DSPy, Azure Search, LangFuse, Structure.io, Pytesseract, GPT
    DSPy, Azure Cloud, LangFuse, GPT-4, Azure Search
  • SNCF-Connect
    Lead Data Scientist
    TRANSPORTATION
    November 2023 - March 2024 (4 months)
    Paris, France
    • Designed and implemented QA ChatBot leveraging LlamaIndex RAG and LangChain, covering dozens of FAQ topics with sub-50ms retrieval latency, eliminating manual searches for support agents.
    • Employed auto-retriever composition on Vespa.ai for enhanced passage retrieval; implemented LangFuse monitoring for performance evaluation and continuous FAQ enrichment.
    Technologies: LangChain, LlamaIndex, DSPy, Amazon Bedrock, Vespa.ai, OpenAI, LangFuse
    LangChain, Vespa.ai, LlamaIndex

Education

  • Ph.D. in Applied Mathematics
    Université Cadi Ayyad
    2016
    • Discrete approximation of backward stochastic differential equations
    • Contributions to the study of Lévy processes and fractional processes via Malliavin calculus, with applications in statistics
    • The central limit theorem in probability and statistics for sub-fractional and bi-fractional Brownian motions
    • Portfolio problem with stochastic constraints
    • Switching problem with constraints
  • Research Master's (MASEF): Applied Mathematics for Finance, Economics & Insurance
    Université Paris Dauphine - ENSAE
    2008