Rj01322897 -
A occurs if the Euclidean distance to any obstacle (o \in \mathcalO t) falls below a safety margin (d \textsafe = 2~\textm).
The risk estimator is updated using on a binary collision label, yielding a differentiable surrogate that can be back‑propagated through both policies. rj01322897