Why Token-Level IS Correction Is Insufficient
Click any token position to see what the loss term depends on, and what each correction covers.
Correction mode:
Token-level ρ
j
Sequence-level ρ
j
H =
8
Click a token position (a
1
, a
2
, ...) to explore the dependency structure.