Summary and Setup

Workshop overview


Machine learning and AI are now routine tools in astronomy. They are used to classify objects, generate catalogues, estimate physical parameters, and prioritise observations across increasingly large and complex datasets.

This workshop focuses on how to use these tools responsibly, reproducibly, and transparently in a research context. Rather than teaching ML techniques themselves, the emphasis is on how ML fits into the broader scientific workflow: how results are produced, documented, reused, and interpreted over time.

The intent of the workshop is practical. It aims to help participants recognise common failure modes in ML‑based astronomy, understand how reproducibility and FAIR practices apply to data and models, and adopt lightweight habits that make their work easier to validate, defend, and reuse. Ethical issues are discussed in terms of scientific responsibility, scope control, and downstream reuse, rather than policy or regulation.

This workshop is designed to be discussion‑driven and reflective. Participants are encouraged to think about their own projects and workflows, rather than aiming for idealised or perfect practice.

Assumed background and skills


This workshop assumes that participants:

  • are familiar with basic astronomical data analysis
  • have some exposure to machine learning or AI methods, either through use or collaboration
  • have experience working with real survey or simulation data

We assume no prior knowledge of:

  • formal research ethics frameworks
  • FAIR data principles
  • software engineering best practices

The workshop does not teach:

  • how to build ML models
  • specific ML algorithms or libraries
  • advanced software tooling

Instead, it focuses on the decisions and documentation that sit around ML use in astronomy research.

Who this workshop is for


This material is particularly aimed at:

  • PhD students using ML or AI in their research
  • early‑career researchers working with ML‑derived results
  • astronomers collaborating with data scientists or ML specialists
  • researchers preparing ML‑based results for publication or reuse

The examples and activities are drawn from astronomy research contexts and are intended to be directly applicable to real projects.

What you should expect to take away


By the end of the workshop, participants should be able to:

  • articulate what reproducibility means in ML‑based astronomy
  • recognise where ML workflows commonly break reproducibility and FAIR practices
  • identify ethical risks related to scope, reuse, and over‑interpretation
  • apply a small set of practical actions to improve their own workflows

You should leave with a clearer sense of what is reasonable to do now, and what can be improved incrementally over time.

This workshop is not about doing everything right. It is about making ML‑based astronomy easier to trust, reuse, and understand.

There will be no coding or computer based exercises used in this workshop. You need only have a browser to view the content, and maybe a notepad to take some notes.