colizz commited on
Commit
af832d6
·
verified ·
1 Parent(s): 11f0488

Update ROC curves

Browse files
Files changed (1) hide show
  1. README.md +15 -8
README.md CHANGED
@@ -42,10 +42,10 @@ The **SophonAK4** model, together with the [**Sophon**](https://huggingface.co/j
42
 
43
  ## Evaluation
44
 
45
- The performance of SophonAK4 is evaluated using the standard model \\(t\bar{t}\\) events to enable direct comparison with performance benchmarks from ATLAS and CMS.
46
- Details are provided in the [[Appendix B of the paper]](https://arxiv.org/html/2503.00118#:~:text=B.2,Performance%20of%20SophonAK4), and are summarized below.
47
 
48
- For *b*- and *c*-tagging, genuine *b*, *c*, and light-flavor jets are selected via jet-parton matching as implemented in Delphes. Jets are required to satisfy *p*<sub>T</sub> > 30 GeV and |*η*| < 2.5, consistent with CMS configurations.
49
 
50
  The following *b*-tagging discriminant is constructed from **SophonAK4**'s the raw output scores to evaluate *b* vs. light and *b* vs. *c* jet performance:
51
 
@@ -58,16 +58,23 @@ The following *c*-tagging discriminants are defined for *c* vs. light and *c* vs
58
 
59
  1. The ROC performance for *b* vs. light/*c* jets and *c* vs. light/*b* jets is shown below and can be compared to [CMS benchmarks](https://cds.cern.ch/record/2904702) (Figs. 1 and 3 for the *tt̅* process).
60
 
61
- ![image/png](https://cdn-uploads.huggingface.co/production/uploads/6369850885fef3ca96e6dd63/H5FPFm4ZnwVbNshrPQDg_.png)
62
 
63
  > Conclusion
64
- > - The *b* vs. light jet performance is slightly below that of the widely-adopted DeepJet tagger in CMS.
65
- > - The *b* vs. *c* and *c* vs. light/*b* jet performances fall between DeepJet and ParticleNet taggers in CMS.
66
- > - Similar trends are found by comparing with ATLAS's widely-adopted DL1r tagger, see [Appendix B of the paper](https://arxiv.org/html/2503.00118#:~:text=B.2,Performance%20of%20SophonAK4).
 
 
 
 
 
 
 
67
 
68
  2. Performance across different *p*<sub>T</sub> and |*η*| regions is benchmarked below and can be compared with [CMS benchmarks](https://cds.cern.ch/record/2904702) (Figs. 17, 19, 21, 23, 25, 27, 29, and 31).
69
 
70
- ![image/png](https://cdn-uploads.huggingface.co/production/uploads/6369850885fef3ca96e6dd63/XTAce2zjvwzLhe-_OL29r.png)
71
 
72
  > Conclusion
73
  > - Tagging performance degrades in the low-*p*<sub>T</sub> and high-|*η*| regions but reaches the plateau beyond the turn-on point, indicating that the **SophonAK4** tagger exhibits realistic flavor-tagging behavior across kinematic regimes.
 
42
 
43
  ## Evaluation
44
 
45
+ The performance of SophonAK4 is evaluated using the standard model *tt̅* events to enable direct comparison with performance benchmarks from ATLAS and CMS.
46
+ Details are summarized below.
47
 
48
+ For *b*- and *c*-tagging, genuine *b*, *c*, and light-flavor jets are selected via ghost-association following the CMS convention. Jets are required to satisfy *p*<sub>T</sub> > 30 GeV and |*η*| < 2.5, consistent with CMS configurations.
49
 
50
  The following *b*-tagging discriminant is constructed from **SophonAK4**'s the raw output scores to evaluate *b* vs. light and *b* vs. *c* jet performance:
51
 
 
58
 
59
  1. The ROC performance for *b* vs. light/*c* jets and *c* vs. light/*b* jets is shown below and can be compared to [CMS benchmarks](https://cds.cern.ch/record/2904702) (Figs. 1 and 3 for the *tt̅* process).
60
 
61
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/6369850885fef3ca96e6dd63/VgwRLCuQP2wJoKaPhNhrV.png)
62
 
63
  > Conclusion
64
+ > - The *b* and *c* tagging performance of SophonAK4, evaluated on the Delphes simulation, is very compatible with that of the CMS taggers. Its performance falls between **DeepJet** and **UParT**, and is generally comparable to **ParticleNet**.
65
+
66
+ <details>
67
+
68
+ <summary>Click here to show the ROC curves with CMS results overlaid.</summary>
69
+
70
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/6369850885fef3ca96e6dd63/0hdH8ADxSZwKOmIluca_L.png)
71
+
72
+ </details>
73
+
74
 
75
  2. Performance across different *p*<sub>T</sub> and |*η*| regions is benchmarked below and can be compared with [CMS benchmarks](https://cds.cern.ch/record/2904702) (Figs. 17, 19, 21, 23, 25, 27, 29, and 31).
76
 
77
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/6369850885fef3ca96e6dd63/wUWfD_D-Y8phSfsc4-ey2.png)
78
 
79
  > Conclusion
80
  > - Tagging performance degrades in the low-*p*<sub>T</sub> and high-|*η*| regions but reaches the plateau beyond the turn-on point, indicating that the **SophonAK4** tagger exhibits realistic flavor-tagging behavior across kinematic regimes.