OELLM's Deliverables

Find the project's structure and organization plan.

  1. 31 JUL 2025
    Initial catalogue and analytics reports for existing training datasets

    A per-language catalogue of training datasets, with automatically generated analytics reports.

  2. 31 JUL 2025
    Communication, Dissemination and Exploitation Strategy

    Full communication, dissemination and exploitation strategy, with a schedule of activities to implement over the duration of the project.

  3. 31 JUL 2026
    Initial dataset release

    Texts (with metadata) used to train the OpenEuroLLM model that is available at the project's midpoint.

  4. 31 JUL 2026
    First models

    Initial release of LLM models (tokenizers and model weights).

  5. 31 JUL 2026
    Evaluation Code package

    A Python code package containing model evaluation procedures. The package will be open-sourced on the proposed due date, after iterating on feedback from the other work packages (WPs).

  6. 31 JUL 2027
    Final dataset release

    Texts (with metadata) used to train the final OpenEuroLLM model(s).

  7. 31 JAN 2028
    Stakeholder Report

    Written report on the strategic advice of the OSPB and on community feedback regarding the development of OpenEuroLLM.

  8. 31 JAN 2028
    Final models

    Final release of LLM models (tokenizers and model weights).

  9. 31 JAN 2028
    LLM training report (other tasks)

    Final report on model training, including all necessary details for open publishing and regulatory compliance.

  10. 31 JAN 2028
    Evaluation Report

    Technical report on the work carried out in the Evaluation work package. It will include our findings on evaluating LLMs with respect to multilingual and regulatory aspects.

  11. 31 JAN 2028
    Evaluation Report of Communication, Dissemination and Exploitation Strategy

    Evaluation of the impact of the overall strategy in terms of dissemination, exploitation and communication.