An overview of the bootcamp objectives, logistics, and how machine learning is reshaping chemical discovery.
Recording
Recording will be available after the bootcamp.
August 2026Learning Objectives
Key Takeaways
Takeaway 1. ML models learn statistical patterns from data — they do not encode physical laws, so the quality of training data is the single biggest lever on model quality.
Takeaway 2. The ML workflow is a loop: data curation → featurisation → training → evaluation → iteration. Skipping evaluation or treating it as a formality is the most common source of over-optimistic results.
Takeaway 3. Chemistry imposes domain constraints (molecular validity, physical plausibility) that purely data-driven models can violate — always sanity-check predictions against chemical intuition.
Takeaway 4. No model family dominates all tasks. Random forests, neural networks, and Gaussian processes each have regimes where they excel; choose based on dataset size, interpretability requirements, and uncertainty needs.