-
Notifications
You must be signed in to change notification settings - Fork 1
Expand file tree
/
Copy pathindex.qmd
More file actions
495 lines (354 loc) · 18.5 KB
/
index.qmd
File metadata and controls
495 lines (354 loc) · 18.5 KB
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
---
title: Machine Learning in Materials Processing & Characterization
subtitle: Course Curriculum and Materials
author:
- name: Philipp Pelz
corresponding: true
roles:
- Course Instructor
- Content Development
affiliations:
- Materials Science and Engineering
keywords:
- Machine Learning
- Materials Science
- Materials Processing
- Materials Characterization
- Deep Learning
- Microstructure Analysis
- Process Optimization
abstract: |
This course teaches how machine learning can be applied to experimental data
from materials processing and characterization. The focus lies on images,
spectra, time-series, and processing parameters, and on understanding how
physical data formation interacts with learning algorithms. Students learn to
build robust, uncertainty-aware ML pipelines for real experimental workflows,
avoiding common pitfalls such as data leakage, overfitting, and spurious
correlations.
plain-language-summary: |
Students learn how to use machine learning on real materials data such as
microscopy images, spectra, and processing logs. The course emphasizes
understanding the physics behind the data, choosing suitable ML models, and
assessing uncertainty and reliability rather than just maximizing accuracy.
key-points:
- ML for experimental materials data
- Vision-based ML for microstructures
- Time-series ML for processing
- Uncertainty-aware regression and optimization
- Physics-informed and constrained ML
date: last-modified
bibliography: references.bib
number-sections: true
execute:
eval: false
---
```{=html}
<style>
.course-hero {
background: linear-gradient(135deg, rgba(59,130,246,0.18), rgba(139,92,246,0.14));
border: 1px solid rgba(255,255,255,0.08);
border-radius: 24px;
padding: 2.5rem 3rem;
margin: 2rem 0 3rem 0;
text-align: left;
}
.course-hero h1 { font-family: "Outfit", sans-serif; font-weight: 800; margin-bottom: 0.5rem; border: none; }
.course-kicker {
text-transform: uppercase;
letter-spacing: 0.08em;
font-size: 0.8rem;
color: #94a3b8;
margin-bottom: 0.5rem;
}
.course-subtitle {
color: #cbd5e1;
font-size: 1.05rem;
line-height: 1.6;
margin-bottom: 1rem;
}
.course-meta-grid {
display: grid;
grid-template-columns: repeat(2, minmax(0, 1fr));
gap: 0.75rem 1rem;
margin: 1rem 0 1.25rem 0;
}
.course-meta-item {
background: rgba(255,255,255,0.03);
border: 1px solid rgba(255,255,255,0.05);
border-radius: 12px;
padding: 0.85rem 1rem;
}
.course-meta-label {
font-size: 0.78rem;
text-transform: uppercase;
letter-spacing: 0.06em;
color: #94a3b8;
margin-bottom: 0.2rem;
}
.course-meta-value { color: #f8fafc; }
.course-actions { margin-top: 1rem; display:flex; gap:0.75rem; flex-wrap:wrap; }
.course-guide {
background: rgba(20, 25, 40, 0.45);
border-left: 4px solid #3b82f6;
border-radius: 12px;
padding: 1rem 1.1rem;
margin: 1.25rem 0 2rem 0;
}
@media (max-width: 768px) {
.course-hero { padding: 2rem; }
.course-meta-grid { grid-template-columns: 1fr; }
}
</style>
```
```{=html}
<div class="course-hero">
<div class="course-kicker">ECLIPSE Lab Teaching</div>
<h1>Machine Learning in Materials Processing & Characterization</h1>
<p class="course-subtitle">Application-focused course on ML for experimental materials data, from images and spectra to process signals.</p>
<div class="course-meta-grid">
<div class="course-meta-item"><div class="course-meta-label">Semester</div><div class="course-meta-value">Summer Semester 2026</div></div>
<div class="course-meta-item"><div class="course-meta-label">Format</div><div class="course-meta-value">2h lecture + 2h exercises</div></div>
<div class="course-meta-item"><div class="course-meta-label">Credits</div><div class="course-meta-value">5 ECTS</div></div>
<div class="course-meta-item"><div class="course-meta-label">Audience</div><div class="course-meta-value">Students in Materials Science, data science, and computational engineering</div></div>
<div class="course-meta-item" style="grid-column: 1 / -1;"><div class="course-meta-label">Prerequisites</div><div class="course-meta-value">Helpful: Mathematical Foundations of AI & ML or equivalent background</div></div>
</div>
<div class="course-actions">
<a href="https://www.studon.fau.de/campo/course/538616" class="btn btn-primary">StudOn</a>
<a href="https://github.com/ECLIPSE-Lab/MachineLearningForCharacterizationAndProcessing" class="btn btn-outline-secondary">GitHub / Materials</a>
<a href="https://pelzlab.science/teaching.html" class="btn btn-outline-secondary">All Teaching</a>
<a href="https://www.mat.studium.fau.de/studiengaenge/neu-ki-materialtechnologie/" class="btn btn-outline-secondary">KI in Materialtechnologie</a>
</div>
</div>
<div class="course-guide">
<strong>How to use this course site.</strong> Use this page as the central hub for syllabus, lecture structure, reading, notebooks, and course materials. Formal announcements and enrollment remain on StudOn; code and openly shared resources live in the linked GitHub repository.
</div>
```
# Machine Learning in Materials Processing & Characterization
**5th Semester – 5 ECTS · 2h lecture + 2h exercises per week**
*Coordinated with “Mathematical Foundations of AI & ML” (MFML)
and “Materials Genomics” (MG)*
---
## Synergy Map
- **MFML** provides the mathematical spine:
loss functions, neural networks, generalization, uncertainty, Gaussian Processes.
- **This course (ML-PC)** applies these concepts to *experimental* data:
images, spectra, and processing signals.
- **Materials Genomics** focuses on crystal structures, databases, and discovery.
ML-PC is therefore **application-driven**, not algorithm-driven.
---
## Companion books
- @sandfeld2024materials: Materials Data Science
## Week-by-Week Curriculum (14 weeks)
### Unit I — Experimental Data as a Learning Problem (Weeks 1–3)
#### Week 1 – What makes materials data special?
*Lecture: Tuesday, 14.04.2026, 14:15-15:45 | Exercise: Thursday, 16.04.2026, 16:15-17:45*
**Slides:** [Open](https://pelzlab.science/public_presentations/ml_for_characterization_and_processing/unit01_intro/01_intro.html)
- Types of experimental data:
micrographs, EBSD, EDS, EELS, XRD, process logs, thermal histories.
- PSPP (Processing–Structure–Property–Performance) as a *data dependency graph*.
- Why ML failure modes are common in experimental science.
**Summary:**
- Transition from physics-based to data-driven modeling
- Experimental data challenges: multi-modal, high acquisition cost, sparse
- **PSPP** (Processing → Structure → Property → Performance) as a data dependency graph
- Data scales and measurement uncertainty
- **CRISP-DM** workflow adapted for scientific labs
**Exercise:**
Inspect real microscopy and process datasets; identify sources of bias and noise.
---
#### Week 2 – Physics of data formation
*Lecture: Tuesday, 21.04.2026, 14:15-15:45 | Exercise: Thursday, 23.04.2026, 16:15-17:45*
**Slides:** [Open](https://pelzlab.science/public_presentations/ml_for_characterization_and_processing/unit02_physics_of_data/01_intro.html)
- Image and signal formation in characterization:
resolution, contrast, artifacts.
- Sampling, aliasing, noise as *physical priors* (not preprocessing tricks).
- Relation to MFML refresher on PCA and covariance.
**Summary:**
- Physical signal formation as a learning prior
- Resolution, noise, sampling as physical (not algorithmic) constraints
- **PCA** and **SVD** for low-dimensional structure in high-dimensional data
**Exercise:**
Fourier inspection of micrographs; effects of sampling and filtering.
---
#### Week 3 – Data quality, labels, and leakage
*Lecture: Tuesday, 28.04.2026, 14:15-15:45 | Exercise: Thursday, 30.04.2026, 16:15-17:45*
**Slides:** [Open](https://pelzlab.science/public_presentations/ml_for_characterization_and_processing/unit03_data_quality/01_intro.html)
- Annotation uncertainty and inter-annotator variance.
- Train/test leakage in materials workflows.
- Why “good accuracy” often means a broken pipeline.
**Summary:**
- Measurement chain → **data cleaning**: missing values, outliers, duplicates ("fix at source")
- **Transformation toolbox**: centering, min–max / z-score scaling, non-dimensionalization, log, differentiation, FFT, triggering
- **Labels and uncertainty**: inter-annotator variance, probabilistic labels, Bayesian view (priors, likelihoods, posteriors)
- **Bias–variance** tradeoff with parsimony and regularization
- **Data leakage** in materials workflows: pre-processing, temporal, group/spatial
- **Validation**: holdout, K-fold, LOOCV, stratified
- **Error measures**:
- Regression: MAE, MSE, RMSE, $R^2$
- Classification / segmentation: confusion matrix, precision/recall, F1/Dice, IoU, categorical cross-entropy
**Exercise:**
Construct a deliberately flawed ML pipeline and diagnose its failure.
---
### Unit II — Representation Learning for Microstructures (Weeks 4–6)
*(Aligned with early neural networks in MFML)*
#### Week 4 – From classical microstructure metrics to learned representations
*Lecture: Tuesday, 05.05.2026, 14:15-15:45 | Exercise: Thursday, 07.05.2026, 16:15-17:45*
**Slides:** [Open](https://pelzlab.science/public_presentations/ml_for_characterization_and_processing/unit04_microstructure_representations/01_intro.html)
- Grain size, phase fractions, orientation maps.
- Limits of hand-crafted microstructure features.
- Transition to learned representations.
**Summary:**
- Classical stereological metrics (grain size, phase fractions) and their limits
- Transition to **learned representations**
- The **artificial neuron**: weights, biases, non-linear activations
- **Multi-Layer Perceptrons (MLPs)** as automatic feature learners
**Exercise:**
Compare classical features vs simple NN-based features for microstructure tasks.
---
#### Week 5 – Neural networks for microstructure images
*Lecture: Tuesday, 12.05.2026, 14:15-15:45 | Exercise: Thursday, 14.05.2026, 16:15-17:45 (cancelled - Himmelfahrt)*
**Slides:** [Open](https://pelzlab.science/public_presentations/ml_for_characterization_and_processing/unit05_unsupervised_learning/01_intro.html)
- CNN intuition: filters as structure detectors.
- Example tasks:
phase segmentation, defect detection, porosity identification.
- Overfitting risks with small datasets.
**Summary:**
- **Convolutional Neural Networks (CNNs)** for materials characterization
- Hierarchical structure detectors: edges → textures → phase morphologies
- Filters and pooling; parameter efficiency vs. MLPs
- Case studies: phase segmentation, defect detection
- Practical challenges: high-resolution, noisy micrographs
**Exercise:**
Train a small CNN on microstructure images; analyze failure cases.
---
#### Week 6 – Data scarcity & transfer learning
*Lecture: Tuesday, 19.05.2026, 14:15-15:45 | Exercise: Thursday, 21.05.2026, 16:15-17:45*
**Slides:** [Open](https://pelzlab.science/public_presentations/ml_for_characterization_and_processing/unit06_transfer_learning/01_intro.html)
- Why materials datasets are small.
- Transfer learning from natural images vs self-supervised pretraining.
- When transfer learning helps—and when it does not.
**Summary:**
- **Data scarcity** as the materials informatics bottleneck
- **Transfer learning** from natural-image pretrained models
- Self-supervised pretraining as an alternative
- **Data augmentation** tailored to scientific data
- When cross-domain transfer succeeds vs. fails
**Exercise:**
Fine-tune a pretrained model; compare against training from scratch.
---
#### Week 7 – No lecture (26.05.2026, public holiday)
::: {.callout-warning appearance="simple"}
**Cancelled** — no ML-PC lecture or exercise takes place in Week 7 (Pfingstmontag / Pfingstdienstag public holidays, 25–26.05.2026). The *time-series & process monitoring* unit moves to Week 8 (02.06.2026); every later unit shifts one week, taking up the former Week-14 buffer slot.
:::
---
### Unit III — Learning from Processing Data (Weeks 8–9)
#### Week 8 – Time-series and process monitoring
*Lecture: Tuesday, 02.06.2026, 14:15-15:45 (in class) | Exercise: Thursday, 04.06.2026, 16:15-17:45 (self-study — Fronleichnam public holiday)*
**Slides:** [Open](https://pelzlab.science/public_presentations/ml_for_characterization_and_processing/unit07_time_series/01_intro.html)
> **Self-study exercise:** the Thursday slot is cancelled (Fronleichnam). The exercise is provided for independent work; a solution is released afterwards. The Tuesday lecture takes place in class.
- Processing signals:
temperature cycles, AM melt pool signals, SPS, rolling.
- Regression and sequence models as surrogates.
- Relation to MFML concepts of generalization.
**Summary:**
- **Time-series ML** for process monitoring and prediction
- **RNNs** and **LSTMs** for sequential dependencies
- Preprocessing: signal smoothing, triggering on noisy logs
- Case studies: additive manufacturing, process stability
- Real-time anomaly detection from processing history
**Exercise:**
Predict a process outcome from time-series data using regression or simple RNNs.
---
#### Week 9 – Inverse problems and process maps
*Lecture: Tuesday, 09.06.2026, 14:15-15:45 | Exercise: Thursday, 11.06.2026, 16:15-17:45*
**Slides:** [Open](https://pelzlab.science/public_presentations/ml_for_characterization_and_processing/unit08_inverse_problems/09_inverse_problems.html)
- Process → structure inverse problems.
- ML-guided process maps (e.g. AM laser power vs scan speed).
- Physics-informed vs unconstrained regression.
**Summary:**
- **Inverse problems**: target microstructure / performance → processing parameters
- Forward (causal) vs. inverse (often ill-posed, multi-valued)
- **Physics-informed learning**: physical transformations and constraints
- **Process maps** and **process corridors** for safe operating regions
**Exercise:**
Construct a simple ML-based process map; compare constrained vs unconstrained models.
---
### Unit IV — Characterization, Transformers, and Uncertainty (Weeks 10–12)
#### Week 10 – ML for characterization signals
*Lecture: Tuesday, 16.06.2026, 14:15-15:45 | Exercise: Thursday, 18.06.2026, 16:15-17:45*
**Slides:** [Open](https://pelzlab.science/public_presentations/ml_for_characterization_and_processing/unit09_characterization_signals/10_characterization_signals.html)
- Spectral data: XRD, EELS, EDS.
- Denoising, peak finding, dimensionality reduction.
- Using ML without destroying physical meaning.
**Summary:**
- Unsupervised ML on high-dimensional spectra (XRD, EDS, EELS)
- **K-Means** and **t-SNE** for phase identification and visualization
- **Autoencoders**: compressing spectra into a low-dimensional latent space
- Denoising and feature extraction at high throughput without losing physics
**Exercise:**
Apply PCA/NMF to spectral datasets; interpret components physically.
---
#### Week 11 – Transformers for materials characterization
*Lecture: Tuesday, 23.06.2026, 14:15-15:45 | Exercise: Thursday, 25.06.2026, 16:15-17:45*
**Slides:** [Open](https://pelzlab.science/public_presentations/ml_for_characterization_and_processing/unit10_transformers_for_materials/transformers_for_materials.html)
- Why attention: long-range correlations beyond CNN receptive fields.
- Scaled dot-product attention and the Vision Transformer (ViT).
- Flash Attention for tractable long sequences.
**Summary:**
- **Self-attention** and the **Vision Transformer (ViT)** for materials imaging
- **Flash Attention**: long sequences without the L×L memory blow-up
- Applications: **ViT on 4D-STEM** diffraction; cross-attention across LPBF layer stacks
- Scaling alternatives (**Mamba / state-space models**) — and when *not* to reach for a transformer
**Exercise:**
Apply a small ViT / attention model to a characterization dataset (e.g. 4D-STEM patches); compare against a CNN baseline.
---
#### Week 12 – Uncertainty-aware regression & Gaussian Processes
*Lecture: Tuesday, 30.06.2026, 14:15-15:45 | Exercise: Thursday, 02.07.2026, 16:15-17:45*
**Slides:** [Open](https://pelzlab.science/public_presentations/ml_for_characterization_and_processing/unit11_uncertainty_gp/12_uncertainty_gp.html)
- Aleatoric vs epistemic uncertainty in experiments.
- Gaussian Processes as uncertainty-aware surrogates.
- Exploration vs exploitation in experimental design.
- Connection to materials acceleration platforms.
**Exercise:**
Compare GP regression and NN ensembles for a process-parameter problem.
---
### Unit V — Physics, Trust, and Synthesis (Weeks 13–14)
#### Week 13 – Physics-informed and constrained ML
*Lecture: Tuesday, 07.07.2026, 14:15-15:45 | Exercise: Thursday, 09.07.2026, 16:15-17:45*
**Slides:** [Open](https://pelzlab.science/public_presentations/ml_for_characterization_and_processing/unit12_pinns/13_pinns.html)
- Embedding physical constraints into ML models.
- Penalty terms, soft constraints, hybrid approaches.
- Failure modes of unconstrained models.
**Exercise:**
Train a constrained model for a processing or characterization task.
---
#### Week 14 – Integration, limits, and reflection
*Lecture: Tuesday, 14.07.2026, 14:15-15:45 | Exercise: Thursday, 16.07.2026, 16:15-17:45*
**Slides:** [Open](https://pelzlab.science/public_presentations/ml_for_characterization_and_processing/unit13_reflection/14_reflection.html)
- Explainability for experimental ML (CAMs, SHAP).
- Why ML fails in real labs.
- Where ML genuinely changes materials processing.
**Exercise:**
Mini-project presentations and critical discussion.
---
## Learning Outcomes
Students completing this course will be able to:
- Interpret materials processing and characterization data as learning problems.
- Build ML pipelines for microstructure analysis, process prediction, and spectral data.
- Understand the physics of data formation to avoid common ML pitfalls.
- Evaluate generalization, robustness, and uncertainty in experimental ML models.
- Apply Gaussian Processes and neural networks as surrogate models.
- Integrate physical constraints into ML workflows.
- Critically assess claims about ML in materials processing and characterization.
---
## Lab Possibilities
- Microscopy datasets: noise, metadata, units, and artifacts.
- Fourier inspection of SEM/TEM images.
- Broken vs correct ML pipelines (data leakage case studies).
- Feature extraction vs learned representations.
- Fine-tuning pretrained CNNs on microstructures.
- Process–property regression with uncertainty.
- GP-based process maps.
- Spectral decomposition (NMF) of EELS/XRD data.
- ML-assisted autofocus or EBSD pattern classification.
- Multi-modal fusion of images, spectra, and process parameters.