figshare
Browse

s1K probe training data

Version 3 2025-06-05, 21:23
Version 2 2025-06-05, 17:08
Version 1 2025-06-04, 17:45
dataset
posted on 2025-06-05, 21:23 authored by Menghua WuMenghua Wu
<p dir="ltr">Data required to replicate the training of thought calibration probes. Files provided include:</p><ul><li>s1K per-step embeddings by DeepSeek-R1 distilled Qwen 2.5 32B and Llama 3.3 70B, and QwQ 32B</li><li>Labels for supervised correctness, consistency, novelty, and leafness</li></ul><p dir="ltr">These files are used in notebook <b>2-probe.ipynb</b>.</p>

History

Usage metrics

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC