▼ BERT encode + MLP project
▼
Decoder-Only LLM (Causal Attention)
▼
▼ concat
Embedding = [h_C ; h_EOS]
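BERT encodes the global context of the input into a single Contextual token (C), which an MLP projects into the LLM's embedding space and which is prepended to the text. The decoder-only LLM then processes the sequence with causal attention; because C sits at the first position, every later token can attend to it. The final embedding is the concatenation of the hidden states of C and EOS.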
Used by Causal2Vec (Lin et al., 2025).
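
The data flow in the diagram can be sketched in a few lines of NumPy. This is a toy illustration under stated assumptions, not the Causal2Vec implementation: the BERT output is replaced by a random vector, the LLM by a single weight-free causal attention layer, and all names (`mlp_project`, `causal_self_attention`, `embed`) and dimensions are hypothetical.

```python
import numpy as np

def mlp_project(x, w1, w2):
    """Two-layer MLP with ReLU, mapping the encoder vector to the LLM width."""
    return np.maximum(x @ w1, 0.0) @ w2

def causal_self_attention(x):
    """Single-head causal self-attention without learned weights (toy stand-in for the LLM)."""
    n, d = x.shape
    scores = x @ x.T / np.sqrt(d)
    mask = np.tril(np.ones((n, n), dtype=bool))  # position i sees positions <= i
    scores = np.where(mask, scores, -np.inf)
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ x, weights

def embed(text_embs, eos_emb, bert_vec, w1, w2):
    c = mlp_project(bert_vec, w1, w2)         # Contextual token C
    seq = np.vstack([c, text_embs, eos_emb])  # [C, x_1 .. x_n, EOS]
    hidden, weights = causal_self_attention(seq)
    h_c, h_eos = hidden[0], hidden[-1]
    return np.concatenate([h_c, h_eos]), weights  # [h_C ; h_EOS]

rng = np.random.default_rng(0)
d_bert, d_llm, n_tokens = 8, 4, 5          # assumed toy sizes
bert_vec = rng.normal(size=d_bert)         # stand-in for the BERT output
w1 = rng.normal(size=(d_bert, 16))
w2 = rng.normal(size=(16, d_llm))
text = rng.normal(size=(n_tokens, d_llm))  # stand-in for token embeddings
eos = rng.normal(size=(1, d_llm))

embedding, weights = embed(text, eos, bert_vec, w1, w2)
print(embedding.shape)  # (8,), i.e. 2 * d_llm
```

In this sketch, column 0 of `weights` is nonzero in every row, which is the "all tokens can attend to C" property. The reverse does not hold: under a causal mask the first position sees only itself, so h_C carries no information from later tokens, while h_EOS can draw on C and the whole text.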