FINOS Events

AI EVALUATION & BENCHMARKING SUITE - TECHSPRINT

September 20, 2025, 9:00am BST
London

Turn yesterday’s ideas into working code for the Evaluation and Benchmarking Suite community.

Why a TechSprint?

The scoping workshop (Day 1) will finish with a Must‑Have feature backlog and roadmap for the Evaluation & Benchmarking Suite. Day 2 is where we build. In just eight focused hours we aim to:

  1. Ship the first proof‑of‑concept for the “add‑your‑own‑benchmark data” feature.

  2. Prototype integrations with leading open‑source eval tools such as LightEval, EleutherAI's LM Evaluation Harness, HELM and BIG‑bench.

  3. Automate FINOS‑style governance checks (licensing, dataset metadata, risk tags) on every pull request.

By the closing demo we expect multiple merged PRs and a clear follow‑on workstream for the wider community.
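
To make the third goal concrete, here is a minimal sketch of what a governance check on a dataset's metadata might look like. It is an illustration only, not the suite's actual design: the field names, the licence allow-list and the function name `check_dataset_card` are all assumptions made for this example.

```python
from typing import Any

# Hypothetical policy values, chosen only for illustration.
ALLOWED_LICENSES = {"Apache-2.0", "MIT", "CC-BY-4.0"}
REQUIRED_FIELDS = ("name", "license", "description", "risk_tags")


def check_dataset_card(card: dict[str, Any]) -> list[str]:
    """Return a list of human-readable governance problems (empty if none)."""
    problems = []

    # Every required metadata field must be present and non-empty.
    for field in REQUIRED_FIELDS:
        if not card.get(field):
            problems.append(f"missing or empty field: {field}")

    # A declared licence must appear on the (assumed) allow-list.
    license_id = card.get("license")
    if license_id and license_id not in ALLOWED_LICENSES:
        problems.append(f"license not on allow-list: {license_id}")

    # Risk tags, when given, must be a list.
    risk_tags = card.get("risk_tags")
    if risk_tags and not isinstance(risk_tags, list):
        problems.append("risk_tags must be a list")

    return problems


if __name__ == "__main__":
    example = {"name": "demo-benchmark", "license": "GPL-3.0", "risk_tags": []}
    for problem in check_dataset_card(example):
        print(problem)
```

A pull-request job (for example in GitHub Actions) could run a script like this against each changed metadata file and fail the check whenever the returned list is not empty.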

 

Who should attend?

  1. Backend & data engineers keen to work with Hugging Face, FastAPI and GitHub Actions

  2. Data and ML scientists who maintain or create finance‑specific tasks and datasets

  3. Model‑risk, compliance & DevSecOps specialists translating EU AI Act controls into code

  4. UI/UX designers & documentation writers to polish the demo deliverables


What you’ll leave with

  1. Publicly visible contributions merged (or ready to merge) into FINOS‑Labs repos.

  2. Reusable tooling for your own AI governance pipelines.

  3. Networking & bragging rights – best demo wins sponsor swag and a FINOS blog feature.

 

Logistics

  1. Date: 20 Sept 2025 – the day after the Scoping Workshop

  2. Time: 09:00 – 18:00 BST

  3. Format: Hybrid – in person at Red Hat, Peninsular House, 30-36 Monument Street, 4th floor, London EC3R 8NB, United Kingdom, or virtual

  4. Cost: Free – registration required (capacity limited to ensure mentor access).

 

Supported by