BSM H→ττ on Early Run 3 data

Zhiyuan Li¹, Xiaohu Sun¹, Leyan Li¹, Licheng Zhang¹, Botao Guo¹, Mingxuan Zhang¹, Jin Wang¹, Zhenyu Dong¹

Markus Klute², Artur Gottmann², Roger Wolf²

David Colling³, Daniel Winterbottom³, Irene Andreou³

Andrea Cardini⁴, Aliaksei Raspiareza⁴, Jacopo Malvaso⁴

¹ Peking University (CN)

² Karlsruhe Institute of Technology (DE)

³ Imperial College London (UK)

⁴ Deutsches Elektronen-Synchrotron DESY (DE)

May 1^st, 2026 [TODO: Need to update]

Introduction
Event Selection & Categorization
Background Modelling
Corrections
Neural Network Discriminant
Systematic Uncertainties
Control Plots
Expected Results & Summary

Introduction

CADI line: HIG-25-006
Analysis Note: AN-2025/067 (V8)
Paper Draft: Finished - v2.pdf [TODO: Update to latest version]
TWiki: BSMHtautauEarlyRun3
Datacards: cms-analysis/hig/hig-25-006/datacards (Under review)
Pre-approval questionnaire: Completed ✓
POG sign-offs: HIG-25-006 CMS Pub Talk
- MuonPOG ✓ TauPOG ✓ EgammaPOG ✓ JetMET ✓
- BTV: in progress (agree for pre-approval)
- Stats questionnaire: in progress

Search for additional neutral Higgs bosons $\phi$ in $pp$ collisions at $\sqrt{s}=13.6$ TeV using 62.3 fb$^{-1}$ of early Run 3 data
The MSSM Higgs sector, inspired by a 2HDM type II, introduces two Higgs doublets → five physical Higgs bosons: $H^{\pm}$, $h$, $H$, $A$
- Tree-level properties described by two free parameters: $m_A$ and $\tan\beta$ (ratio of VEVs)
BR($H/A \to \tau\tau$) enhanced at high $\tan\beta$ → di-$\tau$ final state is a prime search channel
Two production modes: $gg\phi$ (dominant at low $\tan\beta$) and $bb\phi$ (dominant at high $\tan\beta$)
Model-independent interpretation: $60 \leq m_\phi \leq 3500$ GeV
Study four final states: $\mu\tau_h$, $e\tau_h$, $\tau_h\tau_h$, $e\mu$
Hybrid discriminant: PNN score ($m_\phi \leq 1000$ GeV) / $m_{\mathrm{T}}^{\text{tot}}$ ($m_\phi > 1000$ GeV)
CMS H→γγ [HIG-20-002] and H→ττ [HIG-21-001] Run 2 analyses indicate local excess ~3σ at $m_\phi$ = 95 and 100 GeV

(a)

(b)

(c)

Figure 1: Diagrams for the production of neutral Higgs bosons $\phi$ (left) via gluon fusion, labelled as $gg\phi$, and (middle and right) in association with b quarks, labelled as $bb\phi$ in the text. In the middle diagram, a pair of b quarks is produced from the fusion of two gluons, one from each proton. In the right diagram, a b quark from one proton scatters from a gluon from the other proton. In both cases $\phi$ is radiated off one of the b quarks.

CMS $pp$ collisions at $\sqrt{s}=13.6$ TeV, $\mathcal{L}=62.3$ fb$^{-1}$
Using HLepRare skimmed samples NanoAODv14
Only fully certified data (golden JSON) included
- 2022preEE (Eras C–D) and 2022postEE (Eras E–G)
- 2023preBPix (Era C) and 2023postBPix (Era D)

MC simulation used to model most backgrounds
- DY: $Z \to \tau\tau$ — Drell–Yan (aMC@NLO FxFx, 0/1/2-jet merged)
- Top quark: $t\bar{t}$ (Powheg) + single top (Powheg, 4FS t-channel & tW)
- Diboson: $WW/WZ/ZZ$ (Powheg)
- Other: $W$+jets (aMC@NLO FxFx) + SM Higgs (ggH, VBF, WH, ZH $\to \tau\tau$ Powheg)
Fake backgrounds (jet$\to\tau_h$/$\ell$ fakes from QCD multijet, $W$+jets, $t\bar{t}$) are estimated via the data-driven fake factor method
All MC yields normalized to best available cross sections × integrated luminosity per era

BSM signal: model-independent scalar $\Phi \to \tau\tau$
- $gg \to \Phi$ — gluon gluon fusion (Powheg, 2HDM-II)
- $bb \to \Phi$ — $b$-associated production
Mass range: 60–3500 GeV, $\sigma \times \mathrm{BR}$ set to 1 pb (model-independent)
Signal events generated with Powheg + Pythia8 (TuneCP5) at $\sqrt{s}=13.6$ TeV
Two production modes fitted with independent signal strengths, via the other production
mode is floating
SM Higgs ($ggH$, VBF, $WH$, $ZH$, $H \to \tau\tau / bb$) treated as background

(a)

(b)

(c)

Figure 1: Diagrams for the production of neutral Higgs bosons $\phi$ (a) via gluon fusion, labelled as $gg\phi$, and (b, c) in association with b quarks, labelled as $bb\phi$ in the text. In diagram (b), a pair of b quarks is produced from the fusion of two gluons, one from each proton. In diagram (c), a b quark from one proton scatters from a gluon from the other proton. In both cases $\phi$ is radiated off one of the b quarks.

Event Selection and Categorization

Event vertex: selected by maximizing $\Sigma p_{\mathrm{T}}^2$ of associated physics objects
Muons & Electrons: standard PF-based IDs with impact parameter cuts ($|d_{xy}|<0.045$ cm, $|d_z|<0.2$ cm); overlap removal $\Delta R> 0.5$ (lep-lep), $> 0.4$ (jet-lep)
Hadronic taus ($\tau_h$): reconstructed with hadrons-plus-strips algorithm; identified via DeepTau v2p5 (WPs channel-dependent)
Jets & b-tagging: anti-$k_T$ $R=0.4$ with PUPPI pileup mitigation; ParticleNet Medium WP for b-tagging; HF horn mitigation: jet $p_{\mathrm{T}} > 50$ GeV for $2.5 < |\eta| < 3.0$
MET: PUPPI MET

Object	Identification & Isolation	Kinematics
Muon	Medium ID, $I_{rel}<0.15$ ($\mu\tau$) / $<0.20$ ($e\mu$)	$\|\eta\|<2.1$ / $<2.4$
Electron	MVA WP90 (Isolation included)	$\| \eta \|<2.1$ ($e\tau$) / $<2.4$ ($e\mu$)
$\tau_h$	DeepTau vs jet: Medium vs e/μ: chan. dependent (Tight/VLoose)	$\| \eta \|<2.3$ ($\mu\tau/e\tau$) / $<2.1$ ($\tau\tau$)
Jets	anti-$k_T$ $R=0.4$, Tight ID	$p_{\mathrm{T}} > 30$ GeV, $\| \eta \| < 4.7$
b-jets	ParticleNet Medium WP	$p_{\mathrm{T}} > 20$ GeV, $\| \eta \| < 2.5$
MET	PUPPI MET Type-I corrected	—

Table I: Summary of object identification, isolation working points, and kinematic requirements used in the analysis.

Trigger & Kinematics: Combines single-lepton, di-$\tau$, and cross triggers. Offline $p_{\mathrm{T}}$ is set strictly above trigger turn-ons (see table for exact bounds).
Object & Pair Selection: High-purity ID required (MVA WP90 / Medium / DeepTau). Selected pairs must be oppositely charged (OS), separated by $\Delta R > 0.5$ (0.3 for $e\mu$), and are ranked by highest isolation.
Topological Background Suppression (Extra Cuts):
- $W$+jets ($e\tau_h, \mu\tau_h$): require $m_{\text{T}} \lt 50$ GeV, based on $m_{\text{T}}(e/\mu, p_{\text{T}}^{\text{miss}}) = \sqrt{2p_{\text{T}}^{\ell}p_{\text{T}}^{\text{miss}}[1 - \cos(\Delta\phi)]}$
- $t\bar{t}$ ($e\mu$): $D_\zeta = p_\zeta^{\text{miss}} - 0.85\,p_\zeta^{\text{vis}} > -35$ GeV (where $p_\zeta^{\text{miss/vis}}$ are projections onto bisector $\hat{\zeta}$)
MET Filters: Standard $E_T^{\text{miss}}$ filters recommended by JetMET POG are applied

Definition of the $D_{\zeta}$ observable in $e\mu$ channel

	$e\tau_h$	$\mu\tau_h$	$\tau_h\tau_h$	$e\mu$
$p_{\mathrm{T}}$	$e > 25(31)$ GeV $\tau_h > 35$ GeV depending on trigger	$\mu > 21(25)$ GeV $\tau_h > 32$ GeV depending on trigger	$\tau_h > 35(40)$ GeV depending on trigger	$e > 15(24,31)$ GeV $\mu > 15(24,25)$ GeV depending on trigger
$\eta$	$\|e\| < 2.1, \|\tau_h\| < 2.3$	$\|\mu\| < 2.1, \|\tau_h\| < 2.3$	$\|\tau_h\| < 2.1$	$\|e\| < 2.4, \|\mu\| < 2.4$
DeepTau WP vs (jet, e, μ)	(Medium, Tight, VLoose)	(Medium, VVLoose, Tight)	(Medium, VVLoose, VLoose)	—
Lepton ID & Iso	MVA WP90	Medium & Iso $< 0.15$	—	MVA WP90 $e$ & Medium $\mu$, $I_{rel} < 0.20$
Trigger	Single-$e$ or $e\tau$ cross or single-$\tau$	Single-$\mu$ or $\mu\tau$ cross or single-$\tau$	Di-$\tau$ or Di-$\tau$+jet or single-$\tau$	Single-$e$ or Single-$\mu$ or $e\mu$-cross
$d_z, d_{xy}$ (cm)	$\ell$: $\|d_{xy}\| \lt 0.045, \|d_z\| \lt 0.2$ $\tau_h$: $\|d_z\| \lt 0.2$	$\ell$: $\|d_{xy}\| \lt 0.045, \|d_z\| \lt 0.2$ $\tau_h$: $\|d_z\| \lt 0.2$	$\ell$: $\|d_{xy}\| \lt 0.045, \|d_z\| \lt 0.2$ $\tau_h$: $\|d_z\| \lt 0.2$	$\ell$: $\|d_{xy}\| \lt 0.045, \|d_z\| \lt 0.2$ $\tau_h$: $\|d_z\| \lt 0.2$
Extra	$m_{\mathrm{T}} < 50$ GeV	$m_{\mathrm{T}} < 50$ GeV	—	$D_\zeta > -35$ GeV

Table II: Event selection summary per channel.

Fundamental split by b-jet multiplicity:
- b-tag ($N_\text{b-jets} \ge 1$): Highly enriched in $bb\phi$ signal events
- no b-tag ($N_\text{b-jets} = 0$): Dominantly targets $gg\phi$ production
- $e\tau_h$, $\mu\tau_h$: $m_{\mathrm{T}} \lt 50$ GeV; $e\mu$: $D_\zeta > -35$ GeV
Fit variable strategy (transition at $m_\phi = 1000$ GeV):
- $m_\phi \leq 1000$ GeV: Fit the PNN score distribution directly
- $m_\phi > 1000$ GeV: Fit $m_{\mathrm{T}}^{\text{tot}}$ (PNN sensitivity gain is marginal above ~1 TeV)

                                    $m_{\mathrm{T}}^{\text{tot}} = \sqrt{m_{\mathrm{T}}^2(\tau_1^{\mathrm{vis}},
                                    p_{\mathrm{T}}^{\text{miss}}) + m_{\mathrm{T}}^2(\tau_2^{\mathrm{vis}},
                                    p_{\mathrm{T}}^{\text{miss}}) + m_{\mathrm{T}}^2(\tau_1^{\mathrm{vis}},
                                    \tau_2^{\mathrm{vis}})}$
                                

Low-mass no b-tag sub-binning:
- Further split into 4 bins using $p_{\mathrm{T}}^{\tau\tau}$ to enhance $gg\phi$ sensitivity:
  $\lt 50$, $[50, 100)$, $[100, 200)$, $\ge 200$ GeV

High-mass categorization

Low-mass categorization

Background Modelling

Prompt backgrounds (MC-based):
- $Z/\gamma^*\to\tau\tau$ (Drell-Yan): dominant in no b-tag
- $t\bar{t}$: dominant in b-tag categories
- $Z/\gamma^*\to\ell\ell$ ($\ell=e,\mu$), diboson (WW,WZ,ZZ), single top, $W$+jets, SM $H(125)$
Lepton misidentification:
- $\ell\to\tau_h$ misID: negligible contribution, estimated from MC
- $e\mu$: $\mu\to e$ misID (from $Z\to\mu\mu$) also MC-based
Channel dependencies: Composition varies heavily: $Z\to\tau\tau$ leads in $\mu\tau_h$, $e\tau_h$, $e\mu$ (~40–55%), while fakes dominate $\tau_h\tau_h$ (~80%)
Data-driven backgrounds:
- $\mu\tau_h$, $e\tau_h$, $\tau_h\tau_h$: Fake Factor (FF) method for $j\to\tau_h$ fakes
- $e\mu$: SS/OS method for QCD multijet estimation
- Method heritage: background estimation follows closely the related $H\to\tau\tau$ analyses (HIG-19-010, HIG-21-001)

Background composition in the $\mu\tau_h$ channel

The FF method maps background from jets misidentified as hadronic $\tau$ decays
Determination Region (DR): enriched in target fake process, orthogonal to SR
- W+jets: $m_{\mathrm{T}}^{e/\mu} > 50$ GeV and $N_{b\text{-jets}}=0$
- QCD:
  - $e\tau_h$ and $\mu\tau_h$: same-sign taus, $m_{\mathrm{T}}^{e/\mu} < 50$ and $I_{\text{rel}}>0.05$
  - $\tau_h\tau_h$: same-sign taus
- $t\bar{t}$: $m_{\mathrm{T}}^{e/\mu} > 50$ GeV and $N_{b\text{-jets}}>0$
$F_F$ weighted by estimated AR fraction:
$F_F = \displaystyle\sum_{i} w_i \cdot F_F^i = F_F^{t\bar{t}} \times f_{t\bar{t}} + F_F^{\mathrm{QCD}} \times f_{\mathrm{QCD}} + F_F^{\mathrm{W+jets}} \times f_{\mathrm{W+jets}}$
For $\tau_h\tau_h$: QCD dominant, only $F_{\text{QCD}}$ measured
Application Region (AR): SR selection but $\tau_h$ passes VVVLoose and fails Medium DeepTau WP
Genuine $\tau$ decays from MC subtracted

FF workflow: DR → AR → SR

$AR fraction μτh nob 2022postEE$ $w_i$ in AR ($\mu\tau_h$, no b-tag, 2022postEE,more in backup)

In each DR the $F_F$ are binned in $N_{\text{pre b-jets}}$ and $p_{\mathrm{T}}^{\text{jet}}/p_{\mathrm{T}}^{\tau_h}$ and fit to $p_{\mathrm{T}}^{\tau_h}$
- QCD, W+jets: 2 $N_{\text{pre b-jets}}$ bins $(0, \geq1)$ × 3 $p_{\mathrm{T}}^{\text{jet}}/p_{\mathrm{T}}^{\tau_h}$ bins
- $t\bar{t}$: 3 $p_{\mathrm{T}}^{\text{jet}}/p_{\mathrm{T}}^{\tau_h}$ bins only (derived from MC)
- Fitted using Landau + first-order polynomial
High-$p_{\mathrm{T}}$ tail: binned values used where statistics are limited ($p_T > 140$ GeV)
- If frac. error of [140, 200] < 0.5 and [200, ∞] < 0.5 → take binned values [140, 200, ∞]
- If frac. error of [140, 200] > 0.5 and [200, ∞] < 0.5 → take binned value [200, ∞]
- Else use fit for all $p_T$
$\tau_h\tau_h$: only QCD $F_F$ measured (QCD fraction $\sim$100% in AR); done for leading $\tau_h$, subleading jet fakes added using MC

QCD FF in $\mu\tau_h$
($N_{\text{pre b-jets}}=0, p_T^{\text{jet}}/p_T^{\tau_h} < 1.25$)

QCD FF in $e\tau_h$
($N_{\text{pre b-jets}}=0, p_T^{\text{jet}}/p_T^{\tau_h} < 1.25$)

W+jets FF in $\mu\tau_h$
($N_{\text{pre b-jets}}=0, p_T^{\text{jet}}/p_T^{\tau_h} < 1.25$)

W+jets FF in $e\tau_h$
($N_{\text{pre b-jets}}=0, p_T^{\text{jet}}/p_T^{\tau_h} < 1.25$)

$t\bar{t}$ FF in $\mu\tau_h$
($N_{\text{pre b-jets}}\geq 1, p_T^{\text{jet}}/p_T^{\tau_h} < 1.25$)

$t\bar{t}$ FF in $e\tau_h$
($N_{\text{pre b-jets}}\geq 1, p_T^{\text{jet}}/p_T^{\tau_h} < 1.25$)

A DR closure correction is determined in each region
- Non-closures observed in $E_{\mathrm{T}}^{\text{miss}}$ and $\eta^{\tau}$
- Corrections fitted using polynomials
From alternate DRs a DR→AR correction is also calculated
No obvious non-closure observed in $t\bar{t}$ DR
$\tau_h\tau_h$: only $\eta^{\tau_h}$ correction applied (no $E_T^{\text{miss}}$ correction needed)

QCD $E_T^{\text{miss}}$ ($e\tau_h$)

QCD $\eta^{\tau}$ ($e\tau_h$)

QCD $E_T^{\text{miss}}$ ($\mu\tau_h$)

QCD $\eta^{\tau}$ ($\mu\tau_h$)

DR closure corrections (2022postEE representative)

DR closure validation: data vs background estimation in the FF derivation regions (Good agreement validates the FF method)
Representative $\mu\tau_h$ plots shown (2022postEE, $N_{\text{pre b-jets}}=0$)
Full set of closure plots for all channels in backup

$E_T^{\text{miss}}$

$\eta_{\mu}$

$\eta_{\tau_h}$

$p_T^{\tau_h}$

QCD DR

$E_T^{\text{miss}}$

$\eta_{\mu}$

$\eta_{\tau_h}$

$p_T^{\tau_h}$

W+jets DR

$\mu\tau_h$ closure plots (2022postEE)

QCD multijet: dominant misID background in $e\mu$, where both leptons are mimicked by hadronic jets; non-QCD fakes (W/Z+jets, $t\bar{t}$) estimated from MC
Based on the strategy developed in HIG-19-010 and HIG-21-001
QCD background estimated from sideband with SS $e\mu$ pair
OS/SS extrapolation factor: $R_{\text{OS/SS}}(\Delta R) = \dfrac{N_{\text{anti-iso}\;\mu}^{\text{OS}}(\Delta R)}{N_{\text{anti-iso}\;\mu}^{\text{SS}}(\Delta R)}$
Determination region: nominal electron and anti-isolated muon ($0.20 < I_{\text{rel}}^{\mu} < 0.50$)
Dependence of OS/SS on $\Delta R(e,\mu)$ fitted with 2^nd ~ 4^th order polynomial

2022postEE OS/SS factor in no b-tag region

2022postEE OS/SS factor in b-tag region

OS/SS scale factors depend not only on $\Delta R(e,\mu)$ and jet multiplicity, but also on $p_T^{e}$ and $p_T^{\mu}$
Residual 2D correction $C_{p_T}(p_T^{e},\, p_T^{\mu})$ derived in the anti-isolated $\mu$ control region
Apply the $\Delta R$-dependent factors to SS events, then adjust the 2D QCD distribution to match OS data
Final QCD estimate: $N_{\text{QCD}}^{\text{OS,SR}} = R \times C_{p_T}(p_T^{e},\, p_T^{\mu}) \times \big(N_{\text{data}}^{\text{SS}} - N_{\text{prompt,MC}}^{\text{SS}}\big)$
2D weights derived independently for each data-taking era

(a) 2022preEE

(b) 2022postEE

(d) 2023postBPix

Corrections of the QCD OS/SS scale factors determined in the region with an anti-isolated muon as a function of the $p_T$ of the electron and the muon, using data collected in 2022preEE(a), 2022postEE(b), 2023preBPix(c), 2023postBPix(d)

Closure tests in the $\text{DR}_{\text{OS}}$ control region: isolated $e$ + anti-isolated $\mu$ (OS), with 2D $p_T$ correction applied
Representative $e\mu$ plots shown (2022postEE); good data-MC agreement validates the QCD estimation

$\Delta R(e,\mu)$

$E_T^{\text{miss}}$

$p_T^{e}$

$p_T^{\mu}$

$\Delta R(e,\mu)$

$E_T^{\text{miss}}$

$p_T^{e}$

$p_T^{\mu}$

Corrections

Pileup reweighting:
- Minimum-bias cross section 69.2 mb ± 4.6% [TWiki]
Electron corrections: [EGamma]
- ID/isolation scale factors (tag-and-probe)
- Energy scale and smearing ($p_T$-dependent)
Muon corrections: [MuonPOG]
- ID/isolation scale factors (tag-and-probe)
$\tau_h$ corrections: [TauPOG]
- DeepTau v2.5 ID SF (genuine $\tau_h$, per WP)
- $e \to \tau_h$, $\mu \to \tau_h$ mis-ID rate corrections
- Energy scale: decay-mode dependent (DM 0, 1, 10, 11)

Jet corrections: [JERC]
- JES: $p_{\mathrm{T}}$, $\eta$, pileup dependent
- JER: Hybrid smearing to match data resolution
b-tagging SF: [BTV Wiki]
- Method 1a event reweighting (flavor, $p_T$, $\eta$)
- Efficiency maps: DY, $t\bar{t}$, ggH, bbH
DY reweighting: [HLepRare]
- $Z$ $p_T$-mass reweighting + MET recoil correction
Trigger SF: [HLepRare]
- Dedicated SFs calculated with OR logic
- Per-leg tag-and-probe
Top $p_T$ reweighting (for $t\bar{t}$ MC)
Jet-fake $\tau_h$ residual corrections

$Z$ $p_T$-mass reweighting for DY+jets MC [HLepRare] [Dennis Roy, 22 Oct 2025]
Ratio of (Data − non-DY MC) / DY MC in $\mu\mu$ region over reco-level $p_{T,\ell\ell}$
Normalized — should not affect overall DY normalization
Fit splits into 3 regions with floating boundaries; ~8–10 uncorrelated shape uncertainties

Before $Z$ $p_T$ reweighting [NLO CR Z→μμ]

After $Z$ $p_T$ reweighting [NLO CR Z→μμ]

Bosonic recoil correction derived from $Z \to \mu\mu$ final state, binned by $N_{\text{jet}}$ and $p_{T,\ell\ell}$ [HLepRare] [Dennis Roy, 22 Oct 2025]
Quantile mapping & Rescaling applied to $U_{\parallel}$, $U_{\perp}$ to recompute PUPPI MET
Derived after nominal $Z$ $p_T$ reweighting
Corrections from $Z \to \mu\mu$; validation & uncertainties from $Z \to ee$
Applied to $Z/\gamma^*$+jets, $W$+jets, and Higgs signal

Before recoil correction [NLO CR Z→μμ]

After recoil correction [NLO CR Z→μμ]

All triggers combined via OR logic per channel [HLepRare]
- $\mu\tau_h$: single $\mu$ (IsoMu24) OR $\mu\tau$ cross (Mu20Tau27)
- $e\tau_h$: single $e$ (Ele30) OR $e\tau$ cross (Ele24Tau30)
- $\tau_h\tau_h$: di-$\tau$ (DoubleTau35) OR di-$\tau$ + jet (DoubleTau30 + Jet60)
- $e\mu$: 4 triggers — IsoMu24, Ele30, Mu8Ele23, Mu23Ele12
For 2-trigger channels ($\mu\tau_h$, $e\tau_h$, $\tau_h\tau_h$), taking single $\mu$ OR $\mu\tau$ cross as example:

$\begin{aligned} \text{OR_eff}_{i} &= \text{passSingle} \times \text{single_}\mu^{\text{eff}}_{i} \\ &\quad - \text{passCross} \times \text{passSingle} \times \min(\text{single_}\mu^{\text{eff}}_{i},\; \mu\text{_leg}^{\text{eff}}_{i}) \times \tau\text{_leg}^{\text{eff}}_{i} \\ &\quad + \text{passCross} \times \mu\text{_leg}^{\text{eff}}_{i} \times \tau\text{_leg}^{\text{eff}}_{i} \end{aligned}$
For $e\mu$ channel (4 triggers), simplified via inclusion-exclusion (see Backup S61–S62):
$\varepsilon_{\text{union}} = \varepsilon(\mu{>}24) + \varepsilon(e{>}30) + \varepsilon(\mu{>}15,\,e{>}23) - \varepsilon(\mu{>}24,\,e{>}23) - \varepsilon(\mu{>}15,\,e{>}30)$
Per-leg efficiencies measured via tag-and-probe following EGM POG / MUO POG recommendations
Uncertainties: decorrelated across eras; lepton-leg unc. fully correlated with single-lepton trigger unc.

Electron leg SF measured via EGamma tag-and-probe: 4-fit average ± RMS as SF and uncertainty
Tag-and-probe using HLT_Ele23_Ele12_CaloIdL_TrackIdL_IsoVL as recommended
$p_T$ bins: [15, 20, 25, 50, 100, 200, ∞]; $|\eta|$ bins optimized for offline selection
For more measurement details, see EGM POG presentation: [Leyan Li, 11/21 2025] [EGM gitlab issue]
For Combined OR SF validations, see Backup S65.

SF vs $p_T$ (2022postEE)

SF vs $|\eta|$ (2022postEE)

SF vs $p_T$ (2023)

SF vs $|\eta|$ (2023)

Efficiency extracted on the Z peak using standard MUO POG tag-and-probe
Since HLT_Mu8_TrkIsoVVL is highly prescaled, the alternative cross-trigger
HLT_Mu17_TrkIsoVVL_Mu8_TrkIsoVVL_DZ_Mass8 is used as recommended
Probe selection: TightID && LoosePFIso
Systematics: tag isolation, fit mass range & binning, alternative background
For more measurement details, see [Full plots]

SF vs $p_T$ (2022postEE)

SF vs $|\eta|$ (2022postEE)

SF vs $p_T$ (2023preBPix)

SF vs $|\eta|$ (2023preBPix)

Neural Network Discriminant

A PNN introduces mass hypothesis $m_\Phi$ as an explicit network input, replacing many separate classifiers across mass points [arXiv:1601.07913]
The model learns $f_\theta(\mathbf{x}, m_\Phi)$ — conditioned on both event features $\mathbf{x}$ and mass $m_\Phi$
Signal events paired with their true mass; background events randomly assigned mass → smooth interpolation across full mass range
Architecture: [550, 2550, 550] fully-connected layers
- LeakyReLU ($\alpha\!=\!0.1$), Dropout (0.5), BatchNorm, $L_2$ reg ($\lambda\!=\!5\!\times\!10^{-3}$)
- SGD + Nesterov momentum; Sparse categorical cross-entropy
- Early stopping mechanism and dynamic LR decay

Left: Individual networks each trained at a single fixed parameter $\theta$ (suboptimal, non-smooth interpolation).Right: A unified network trained with input features and $\theta$, learning smooth interpolation across several values of $\theta$ [arXiv:1601.07913]

Kinematic features + $m_\Phi$ → [550, 2550, 550] → Softmax(S, T, F)

Training datasets: ggH + bbH signals (unified single-signal-node) + all SR backgrounds (MC + data-driven fakes with weights)
- Unified training improves limits and avoids prohibitive template scaling; ggH/bbH separated as independent POIs at fit stage (see Backup S66)
Variable selection: 44 candidates → ~25 best variables per channel, selected based on:
- Separation power: rank variables by $\langle S^2 \rangle$ (following TMVA)
- Decorrelation: remove highly correlated pairs with $|r_{ij}| > 0.85$
- Same candidate set for all 4 channels; channel-specific ranking optimizes final selection
$\langle S^2 \rangle = \frac{1}{2}\int\frac{(\hat{y}_S - \hat{y}_B)^2}{\hat{y}_S + \hat{y}_B}\,dy$

$r_{ij} = \frac{\sum_k w_k (x_i^{(k)} - \bar{x}_i)(x_j^{(k)} - \bar{x}_j)}{\sqrt{\sum_k w_k (x_i^{(k)} - \bar{x}_i)^2} \sqrt{\sum_k w_k (x_j^{(k)} - \bar{x}_j)^2}}$

**Table 2:** Representative categories of PNN input variables. The full feature set includes channel-dependent variations and composite observables.
Category	Representative Variables	Main Discrimination Target
Mass	$m_\mathrm{T}^\mathrm{tot}$, $m_\mathrm{vis}$, $m_{\tau\tau}$, $m_\mathrm{T}^{\ell_1}$, $m_\mathrm{T}^{\ell_2}$	Resonance scale; $W$+jets / $t\bar{t}$ rejection
Momentum	$p_\mathrm{T}^{\ell_1}$, $p_\mathrm{T}^{\ell_2}$, $p_\mathrm{T}^{\tau\tau}$, $p_\mathrm{T}^\mathrm{vis}$, MET, $D_\zeta$	Boost of the system; recoil and MET topology
Angular	$\Delta R(\ell_1,\ell_2)$, $\Delta\phi(\ell_1,\ell_2)$, $\Delta\eta(\ell_1,\ell_2)$, $\phi_\mathrm{MET}$	Event geometry; reducible bkg separation
FastMTT	$m_\mathrm{fastmtt}$, $p_\mathrm{T}^\mathrm{fastmtt}$, $\eta^\mathrm{fastmtt}$, $\phi^\mathrm{fastmtt}$	Improved parent-boson kinematics
b-tag / Jet	leading b-jet kinematics, $p_T^{b_1}/p_T^{\mathrm{fastmtt}}$, $\Delta\eta(b_1,\tau\tau)$	$gg\phi$ vs $bb\phi$ topology; top rejection
Mass hypothesis	$m_\Phi$ (explicit network input)	Continuous parameterization across signal masses

3-class softmax output: S (Signal), T (True bkg), F (Fake bkg)
- Background split into genuine $\tau$/lepton (T) vs misidentified (F) for improved modeling
- Ratio-type variables: explicit denominator protection; NaN/inf events removed
Multiple formulas tested: $S$, $\dfrac{S}{S+F}$, $\dfrac{S}{S+T}$, $\dfrac{S}{T}$, $\dfrac{S}{F}$, etc.
- No visible performance difference
Final discriminant:
$\text{PNN}_{\text{score}} = S$
- Raw signal node probability used directly for simplicity

PNN score at $m_\Phi = 250$ GeV ($e\mu$ channel):
signal peaks near 1, background near 0

Two dedicated models: Low-mass (60–250 GeV) and High-mass (300–3500 GeV)
- Mass normalized to [0,1]: $m_{\text{scaled}} = (m - m_{\min})/(m_{\max} - m_{\min})$; background events randomly assigned mass from signal mass points
- Low-mass PNN serves as nominal discriminant for 60–250 GeV
- High-mass PNN trained to extend PNN coverage to 300–3500 GeV; systematic comparison with $m_T^{\text{tot}}$ shows PNN advantage diminishes at high mass — score degenerates (S→1, B→0), reducing shape fit to counting experiment
- Final strategy: PNN score for $m_\Phi \le 1$ TeV, $m_T^{\text{tot}}$ for $m_\Phi > 1$ TeV (see Backup S47)
Inference: for each mass $M$, PNN score evaluated at $m_\Phi = M$ for all data & MC
- Even events → odd model, odd events → even model → full statistics
The scores from even and odd models, applied to the same even event number events, show no significant deviation and overfitting

(a) PNN score at 100 GeV

(b) PNN score at 160 GeV

(c) PNN score at 200 GeV

PNN scores calculated with even and odd models, applied on even event number events. Shown for $\mu\tau$ channel and $gg\phi$ signal. The ratio of background prediction is shown in the bottom pad.

Systematic Uncertainties

Tau ID efficiency & TES: [TauPOG]
- Statistical: decorrelated by DM and by year
- Systematic: 3-way split (fully corr. / corr. across years but not DM / fully decorr.)
Trigger efficiency: [HLepRare]
- OR logic (single lepton + cross triggers)
- Lepton leg corr. with single lepton; tau leg uncorr.
- All decorrelated between years
Electron energy scale: [EGamma]
- Scale + resolution from EGamma POG; fully corr. between eras
Jet energy scale: [JES]
- Reduced grouping: 11 nuisances/era
- Propagated to MET
Jet energy resolution: [JER]
- Uncorr. between years; propagated to MET

$l \to \tau_h$ fake energy scale:
- 1.0% for muons; 0.5–6.6% for electrons
- Uncorr. between years
MET recoil corrections: [HLepRare]
- Response and resolution varied; uncorr. between eras
Top $p_T$ reweighting:
- 0 to 2× correction; fully corr. between eras
DY mass, $p_T$ reweighting: [HLepRare]
- 10 NLO fit parameters; fully uncorr. from each other
Fake-factor shape uncertainties: [HIG-19-010] [HIG-21-001]
- $\tau_h\tau_h$ / $\mu\tau_h$ / $e\tau_h$: FF stat., closure, non-closure, process subtraction
- $e\mu$: OS/SS stat., 2D $p_T$ weight correction, data-driven stat. (see Backup 72)
Bin-by-bin: autoMCStats (Barlow-Beeston lite)

Luminosity: [LumiPOG]
- 2022: 34.7 fb$^{-1}$ (1.4%); 2023: 27.6 fb$^{-1}$ (1.3%)
- Combined: 62.3 fb$^{-1}$ (~1.0%); partially correlated (2 nuisances)
Electron, muon ID efficiency:
- 2% per lepton; fully correlated between years
Lepton to tau fake rate: [TauPOG]
- $\eta$-dependent; negligible shape → lnN
- Uncorrelated between eras
b-tagging efficiency: [BTV]
- btagSFbc (b/c): 1 corr. + 1 decorr. across years
- btagSFlight: 1 corr. + 1 decorr. across years
- Decorrelated per channel (phase-space dependent)

Background normalization: [SM Xsec]
- DY: scale 1.1% + PDF 1.0%
- $t\bar{t}$: scale 3.6% + PDF 2.5% [ttbar]
- Diboson: scale 2.4% + PDF 0.1%
- W+jets: scale 5.4% + PDF 0.2%
- $e\mu$ QCD: $r_{\text{b-tag}}$ non-closure unc., uncorr. across years
SM Higgs theory: [LHCHXSWG]
- ggH: scale 6.7%, PDF+$\alpha_s$ 3.2%
- VBF: scale 0.5%, PDF+$\alpha_s$ 2.1%
- BR($H\to\tau\tau$): 1.7% (h.o.) + 0.99% ($m_q$) + 0.62% ($\alpha_s$) [CERN YR4]

Control Plots

Expected Results

Simultaneous binned maximum-likelihood fit across all channels, categories, and control regions
Shape uncertainties: template morphing; normalization: log-normal constraints; finite MC: automatic bin-by-bin statistical terms
95% CL upper limits on $\sigma \times \mathcal{B}(\phi \to \tau\tau)$ via $CL_s$ method [CMS:COMBINE2024]
Hybrid discriminant strategy:
- $m_\phi \le 1000$ GeV: PNN score (evaluated at each mass hypothesis)
- $m_\phi > 1000$ GeV: $m_{\mathrm{T}}^{\text{tot}}$
Transition at 1000 GeV: PNN gain decreases at high mass; $m_{\mathrm{T}}^{\text{tot}}$ provides robust and competitive performance
Independent signal-strength parameters for gg$\phi$ and bb$\phi$ in dedicated fits
Blinded analysis: all results shown are expected limits from Asimov dataset

Expected 95% CL upper limits on $\sigma(\text{gg}\phi/\text{bb}\phi) \times \mathcal{B}(\phi \to \tau\tau)$, combined 2022 + 2023
Exclusion limits range from a few pb at low Higgs boson masses to $\sim 0.9$ fb at the highest Higgs boson masses

Expected upper limits on gg$\phi$ production for individual final states

Expected upper limits on bb$\phi$ production for individual final states

The model-independent expected limits for HIG-21-001 are calculated based on their provided datacards.
The limits shown in the comparisons are the original expected limits without any luminosity scaling, evaluated at their absolute integrated luminosities.
Expanded the mass scan from 30 to 42 points by computing additional expected limits for [65, 70, 75, 85, 90, 95, 105, 110, 115, 135, 450, 1100] GeV.
For both gg$\phi$ and bb$\phi$ productions, the PNN technique applied in the $< 1$ TeV range effectively improves upon the $m_{\mathrm{T}}^{\text{tot}}$ performance, generally outperforming the HIG-21-001 limits:
- gg$\phi$: improvements are seen across the entire 60–1000 GeV range.
- bb$\phi$: improvements are slightly smaller but remain superior in the 60–600 GeV range, except for the 80–120 GeV region where a background peak is naturally expected due to the dominant DY background.

30 leading impacts for the gg$\phi$ production mode at $m_{\phi} = 100$ GeV fitted by PNN score

30 leading impacts for the bb$\phi$ production mode at $m_{\phi} = 100$ GeV fitted by PNN score

Presented BSM $H \to \tau\tau$ analysis on early Run 3 data (62.3 fb$^{-1}$ at $\sqrt{s} = 13.6$ TeV): model-independent search for additional neutral Higgs bosons in the $\tau_h\tau_h$, $\mu\tau_h$, $e\tau_h$, and $e\mu$ final states over $m_\phi = 60$–$3500$ GeV, targeting both gg$\phi$ and bb$\phi$ production
Two-regime strategy: PNN-score templates for $m_\phi \le 1000$ GeV; $m_{\mathrm{T}}^{\text{tot}}$ templates for $m_\phi > 1000$ GeV. A common boundary at $m_\phi = 1000$ GeV selected from a dedicated Run 3 strategy comparison
Expected upper limits on $\sigma \times \mathcal{B}(\phi \to \tau\tau)$ derived with full treatment of systematic uncertainties for both production modes
The PNN technique in the $m_\phi \lt 1$ TeV range resolves the known limitation of $m_{\mathrm{T}}^{\text{tot}}$ being suboptimal at low mass [HIG-21-001], providing improved or comparable sensitivity for both gg$\phi$ and bb$\phi$ productions under absolute luminosity

Status: Analysis is
                        blinded. All POG sign-offs obtained (BTV in progress). Pre-approval questionnaire completed.
                        We are seeking permission to unblind & pre-approval of the analysis.

Backup

PNN extended to cover full $60$–$3500$ GeV range (two models: low-mass + high-mass)
Compare expected upper limits: PNN score vs $m_{\mathrm{T}}^{\text{tot}}$ in the $300$–$3500$ GeV region
PNN advantage decreases with mass, becomes marginal around $1000$ GeV → chosen as common boundary

All-channel combined expected upper-limit comparison in the high-mass region between fits using PNN score and fits using $m_{\mathrm{T}}^{\text{tot}}$ for $gg\phi$

All-channel combined expected upper-limit comparison in the high-mass region between fits using PNN score and fits using $m_{\mathrm{T}}^{\text{tot}}$ for $bb\phi$

Expected limits are derived from the PNN score for $60 \le m_{\phi} \le 1000$ GeV, and from $m_{\mathrm{T}}^{\text{tot}}$ for $m_{\phi} > 1000$ GeV
Evaluated on a finer mass grid than HIG-21-001 (union of both sets): 60, 80, 100, 120, 125, 130, 140, 160, 180, 200, 250, 300, 350, 400, 500, 600, 700, 800, 900, 1000, 1200, 1400, 1600, 1800, 2000, 2300, 2600, 2900, 3200, 3500 GeV

The bb/gg$\phi$ All-channel combined expected upper limits results with full systematics at 95% confidence level of HIG-21-001 and this analysis

Channel	Trigger type	Trigger threshold (GeV)	Offline $p_{\mathrm{T}}$ (GeV)
$\tau_h\tau_h$	di-$\tau$	$\tau(35),\tau(35)$	$\tau(40),\tau(40)$
	di-$\tau$+jet	$\tau(30),\tau(30)$, jet(60)	$\tau(35),\tau(35)$, jet(60)
$\mu\tau_h$	single-$\mu$	$\mu(24)$	$\mu(25)$
	$\mu\tau$ cross	$\mu(20),\tau(27)$	$\mu(21),\tau(32)$
$e\tau_h$	single-$e$	$e(30)$	$e(31)$
	$e\tau$ cross	$e(24),\tau(30)$	$e(25),\tau(35)$
$e\mu$	single-$e$/single-$\mu$	$e(30)$/$\mu(24)$	$e(31)$/$\mu(25)$
	$e\mu$ cross	$e(12)\mu(23)$ / $e(23)\mu(8)$	$e(15)\mu(24)$ / $e(24)\mu(15)$

Table III: Trigger and offline $p_{\mathrm{T}}$ thresholds per channel (2022–2023).

Offline $p_{\mathrm{T}}$: 1 GeV above trigger threshold for $e/\mu$ (minimum 15 GeV)
Offline $p_{\mathrm{T}}$: 5 GeV above trigger threshold for $\tau_h$ (minimum 30 GeV)
Trigger objects matched offline within $\Delta R < 0.5$

$e\mu$

$e\tau_h$

$\mu\tau_h$

$\tau_h\tau_h$

Fractions of QCD, W+jets, and $t\bar{t}$ in the AR as a function of $m_{\mathrm{T}}^{\text{tot}}$, used as weights $f_k$ in the combined $F_F$

$μτh nob fraction$ $\mu\tau_h$ — no b-tag

$μτh btag fraction$ $\mu\tau_h$ — b-tag

$eτh nob fraction$ $e\tau_h$ — no b-tag

$eτh btag fraction$ $e\tau_h$ — b-tag

QCD (2022postEE)

($N_{\text{pre b-jets}}=0, p_T^{\text{jet}}/p_T^{\tau_h} < 1.25$)

QCD (2023preBPix)

($N_{\text{pre b-jets}}=0, p_T^{\text{jet}}/p_T^{\tau_h} < 1.25$)

QCD (2022postEE, $N_{\text{pre b-jets}}=0$)

W+jets (2022postEE, $N_{\text{pre b-jets}}=0$)

$t\bar{t}$ (2022postEE, $N_{\text{pre b-jets}}\ge 1$)

QCD (2023preBPix)

W+jets (2023preBPix)

$t\bar{t}$ (2023preBPix)

All plots shown for lowest $p_T^{\text{jet}}/p_T^{\tau_h} < 1.25$ bin

QCD (2022postEE, $N_{\text{pre b-jets}}=0$)

W+jets (2022postEE, $N_{\text{pre b-jets}}=0$)

$t\bar{t}$ (2022postEE, $N_{\text{pre b-jets}}\ge 1$)

QCD (2023preBPix)

W+jets (2023preBPix)

$t\bar{t}$ (2023preBPix)

All plots shown for lowest $p_T^{\text{jet}}/p_T^{\tau_h} < 1.25$ bin

2022preEE

2022postEE

2023preBPix

2023postBPix

Only $\eta^{\tau_h}$ correction applied in $\tau_h\tau_h$ channel (no $E_T^{\text{miss}}$ correction needed)

$E_T^{\text{miss}}$ 2022preEE

$E_T^{\text{miss}}$ 2022postEE

$E_T^{\text{miss}}$ 2023preBPix

$E_T^{\text{miss}}$ 2023postBPix

$\eta^{\tau_h}$ 2022preEE

$\eta^{\tau_h}$ 2022postEE

$\eta^{\tau_h}$ 2023preBPix

$\eta^{\tau_h}$ 2023postBPix

$E_T^{\text{miss}}$ 2022preEE

$E_T^{\text{miss}}$ 2022postEE

$E_T^{\text{miss}}$ 2023preBPix

$E_T^{\text{miss}}$ 2023postBPix

$\eta^{\tau_h}$ 2022preEE

$\eta^{\tau_h}$ 2022postEE

$\eta^{\tau_h}$ 2023preBPix

$\eta^{\tau_h}$ 2023postBPix

$E_T^{\text{miss}}$

$\eta$ (leading $\tau_h$)

$\eta$ (sub-leading $\tau_h$)

$p_T$ (sub-leading $\tau_h$)

2022postEE

$E_T^{\text{miss}}$

$\eta$ (leading $\tau_h$)

$\eta$ (sub-leading $\tau_h$)

$p_T$ (sub-leading $\tau_h$)

2023preBPix

QCD: $E_T^{\text{miss}}$

QCD: $\eta_e$

QCD: $\eta_{\tau_h}$

QCD: $p_T^{\tau_h}$

QCD DR

W: $E_T^{\text{miss}}$

W: $\eta_e$

W: $\eta_{\tau_h}$

W: $p_T^{\tau_h}$

W+jets DR

QCD: $E_T^{\text{miss}}$

QCD: $\eta_e$

QCD: $\eta_{\tau_h}$

QCD: $p_T^{\tau_h}$

QCD DR

W: $E_T^{\text{miss}}$

W: $\eta_e$

W: $\eta_{\tau_h}$

W: $p_T^{\tau_h}$

W+jets DR

$\mu\tau_h$: $E_T^{\text{miss}}$

$\mu\tau_h$: $\eta_\mu$

$\mu\tau_h$: $\eta_{\tau_h}$

$\mu\tau_h$: $p_T^{\tau_h}$

$\mu\tau_h$ $t\bar{t}$ DR

$e\tau_h$: $E_T^{\text{miss}}$

$e\tau_h$: $\eta_e$

$e\tau_h$: $\eta_{\tau_h}$

$e\tau_h$: $p_T^{\tau_h}$

$e\tau_h$ $t\bar{t}$ DR

Both 2022 and 2023 Data and MC events are required to pass the following $E_T^{\text{miss}}$ filters recommended by the JetMET POG [TWiki]:
- Primary vertex filter (Data, MC)
- Beam halo filter (Data, MC)
- ECAL TP filter (Data, MC)
- Bad PF Muon Filter (Data, MC)
- Bad PF Muon Dz Filter (Data, MC)
- HF noisy hits filter (Data, MC)
- ee badSC noise filter (Data, MC)
- ECAL bad calibration filter update (Data, MC)

Notation — We make following notation to simply the derivation:
- $T_1 = \text{Mu24}$: $\mu: p_T > 24$ (single muon trigger)
- $T_2 = \text{Ele30}$: $e: p_T > 30$ (single electron trigger)
- $T_3 = \text{Mu8Ele23}$: $\mu: p_T > 8$, $e: p_T > 23$ (dilepton trigger)
- $T_4 = \text{Mu23Ele12}$: $\mu: p_T > 23$, $e: p_T > 12$ (dilepton trigger)
Subset relations — Due to the $p_T$ threshold, we have:
1. $\mu > 24 \subset \mu > 23 \subset \mu > 8$ (so $T_1 \subset T_4 \subset T_3$ for muon legs)
2. $e > 30 \subset e > 23 \subset e > 12$ (so $T_2 \subset T_3 \subset T_4$ for electron legs)
3. $T_1 \cap T_2 \subset T_3 \cap T_4$, and $T_1 \cap T_2 = T_1 \cap T_2 \cap T_3 \cap T_4$ (four-fold intersection $\equiv T_1T_2$ double intersection).
Considering the lepton ID and isolation, this subset relation is still valid. This is becasue the single-lepton triggers $\text{Mu24}$ and $\text{Ele30}$ use tighter identification and isolation criteria than the dilepton triggers $\text{Mu8Ele23}$ and $\text{Mu23Ele12}$.
Inclusion-exclusion — The total union efficiency for the OR combination of four triggers is:
\begin{equation*} \begin{split} \varepsilon_{\rm union} =& \underbrace{(\varepsilon_1+\varepsilon_2+\varepsilon_3+\varepsilon_4)}_{\text{Single-trigger terms}} - \underbrace{(\varepsilon_{12}+\varepsilon_{13}+\varepsilon_{14} +\varepsilon_{23}+\varepsilon_{24}+\varepsilon_{34})}_{\text{Pairwise intersections (6 terms)}} \\ &+ \underbrace{(\varepsilon_{123}+\varepsilon_{124}+\varepsilon_{134}+\varepsilon_{234})}_{\text{Triple intersections (4 terms)}} - \underbrace{\varepsilon_{1234}}_{\text{Four-fold intersection (1 term)}} \end{split} \end{equation*}

Where: $\varepsilon_i$ = efficiency of trigger $i$ firing alone; $\varepsilon_{ij}$ = triggers $i,j$ firing simultaneously; $\varepsilon_{ijk}$ = triggers $i,j,k$ simultaneously; $\varepsilon_{1234}$ = all four triggers simultaneously.

Step 1 — Replace $\varepsilon_1 \sim \varepsilon_4$ with physical leg criteria:
\[ \varepsilon_1+\varepsilon_2+\varepsilon_3+\varepsilon_4 = \varepsilon(\mu{>}24) + \varepsilon(e{>}30) + \varepsilon(\mu{>}8, e{>}23) + \varepsilon(\mu{>}24,e{>}12) \]
Step 2 — Eliminate redundancy via subset inclusion:
\begin{align*} &\varepsilon_{12}+\varepsilon_{13}+\varepsilon_{14}+\varepsilon_{23}+\varepsilon_{24}+\varepsilon_{34} \\ &= \varepsilon(\mu{>}24,e{>}30) + \varepsilon(\mu{>}24,e{>}23) + \varepsilon(\mu{>}24,e{>}12) + \varepsilon(\mu{>}8,e{>}30) + \varepsilon(\mu{>}24,e{>}30) + \varepsilon(\mu{>}24,e{>}23) \\ &= 2\varepsilon(\mu{>}24,e{>}30) + 2\varepsilon(\mu{>}24,e{>}23) + \varepsilon(\mu{>}24,e{>}12) + \varepsilon(\mu{>}8,e{>}30) \end{align*}
Step 3 — Collapse triple intersections to core double intersections:
\begin{align*} \varepsilon_{123}+\varepsilon_{124}+\varepsilon_{134}+\varepsilon_{234} &= \varepsilon(\mu{>}24,e{>}30) + \varepsilon(\mu{>}24,e{>}30) + \varepsilon(\mu{>}24,e{>}23) + \varepsilon(\mu{>}24,e{>}30) \\ &= 3\varepsilon(\mu{>}24,e{>}30) + \varepsilon(\mu{>}24,e{>}23) \end{align*}
Step 4 — By global subset inclusion ($T_1\cap T_2 \subset T_3\cap T_4$): $\varepsilon_{1234} = \varepsilon(\mu{>}24,e{>}30)$
\begin{align*} \varepsilon_{\rm union} &= \left[\varepsilon(\mu{>}24) + \varepsilon(e{>}30) + \varepsilon_3 + \varepsilon(\mu{>}24,e{>}12)\right] \\ &- \left[2\varepsilon(\mu{>}24,e{>}30) + 2\varepsilon(\mu{>}24,e{>}23) + \varepsilon(\mu{>}24,e{>}12) + \varepsilon(\mu{>}8,e{>}30)\right] \\ &+ \left[3\varepsilon(\mu{>}24,e{>}30) + \varepsilon(\mu{>}24,e{>}23)\right] - \varepsilon(\mu{>}24,e{>}30) \end{align*}
Cancel redundant terms to obtain the minimal formula:
$\varepsilon_{\rm union} = \varepsilon(\mu{>}24) + \varepsilon(e{>}30) + \varepsilon(\mu {>} 8, e{>}23) - \varepsilon(\mu{>}24,e{>}23) - \varepsilon(\mu{>}8,e{>}30)$
Comply with the offline lepton $p_T > 15$ GeV requirement, the final formula becomes:
$\varepsilon_{\rm union} = \varepsilon(\mu{>}24) + \varepsilon(e{>}30) + \varepsilon(\mu {>} 15, e{>}23) - \varepsilon(\mu{>}24,e{>}23) - \varepsilon(\mu{>}15,e{>}30)$

Combined Trigger OR scale factors are calculated on the fly in the analysis framework
Evaluates combinatorial behavior of single vs. cross trigger efficiencies across $p_T$ and $\eta$

OR SF vs $p_T^e$ (2022postEE)

OR SF vs $p_T^{\mu}$ (2022postEE)

OR SF vs $p_T^e$ (2023preBPix)

OR SF vs $p_T^{\mu}$ (2023preBPix)

OR SF vs $\eta$ ($e$ bins, 22postEE)

OR SF vs $\eta$ ($\mu$ bins, 22postEE)

OR SF vs $\eta$ ($e$ bins, 23preBPix)

OR SF vs $\eta$ ($\mu$ bins, 23preBPix)

Separate ggH-only and bbH-only models tested → slightly inferior expected limits
Dual models → double templates across 40+ mass × 4 channels × 4 eras × 2 regions → operationally prohibitive
3-class softmax effectively binary-like: bkg → score~0, signal → score~1
Dedicated b-jet variables ($p_T^{b_1}/p_T^{\text{fastmtt}}$, $\Delta\eta(b_1,\tau\tau)$) adaptively leverage region-dependent features
At fit stage, ggH and bbH are fully separated via generator-level flags (independent POI)

$e\mu$

$e\tau_h$

$\mu\tau_h$

$\tau_h\tau_h$

PNN score at $m_\Phi = 250$ GeV: ggH and bbH achieve comparable discrimination

At high mass, PNN score degenerates: signal concentrates near ~1, background near ~0 → few effective bins
$m_T^{\text{tot}}$ retains shape power: signal and background peaks naturally well-separated
Final strategy: PNN for 60–1000 GeV, $m_T^{\text{tot}}$ for 1000–3500 GeV

$m_T^{\text{tot}}$ at 250 GeV

$m_T^{\text{tot}}$ at 1000 GeV

$m_T^{\text{tot}}$ at 3500 GeV

PNN score at 250 GeV

PNN score at 1000 GeV

PNN score at 3500 GeV

Loss function: Sparse categorical cross-entropy (3-class: True bkg / Fake bkg / Signal)
- Weighted by per-event sample weights (MC × SF × lumi); negative weights preserved
Accuracy metrics: unweighted and weighted sparse_categorical_accuracy
- Weighted accuracy accounts for physical cross-section differences among processes

Early stopping: monitor val_loss, patience = 170 epochs
- Best model saved by ModelCheckpoint on val_weighted_sparse_categorical_accuracy
Dynamic LR decay: ReduceLROnPlateau (factor = 0.99, patience = 25) + per-epoch schedule (×0.995), floor at $10^{-6}$
Optimizer: SGD + Nesterov momentum (0.9), initial LR = 0.1

Loss

Accuracy

Weighted Accuracy

Learning Rate

TensorBoard diagnostics — $e\mu$ low-mass model example

Correlation matrix of the PNN input variables of the ggH signal (60–250 GeV all mass points) sample for the low-mass model ($e\mu$ channel).

Correlation matrix of the PNN input variables of the background sample for the low-mass model ($e\mu$ channel).

Theory uncertainties for bb process known to affect the $N_{\text{bjet}}$ distributions → different predictions for NoBtag and Btag categories
We estimate theory uncertainties due to: Hdamp variations, QCD ($\mu_R$, $\mu_F$) scale variations, and PDF variations
- Other contributions e.g. parton shower uncertainties sub-dominant so neglected
Hdamp and QCD scale uncertainties added linearly as recommended in YR4
Hdamp + QCD scale uncertainties range between ~1.5–5%; PDF uncertainty ~1–2%
We also compare predictions to Madgraph5_aMC@NLO but find good agreement with POWHEG — no additional uncertainty introduced

$j \to \tau_h$ fakes ($\tau_h\tau_h$, $\mu\tau_h$, $e\tau_h$):
- FF statistical unc. — from fitted functions, split no-btag / btag; per $N_{\text{jets}}$/$p_T^{\text{jet}}$ bin; uncorr. across years
- Closure correction stat. unc. — split no-btag / btag; per $N_{\text{jets}}$ bin; uncorr. across years
- Non-closure corrections: applied 2× for up, not applied for down
- W+jets subtraction ($\mu\tau_h$/$e\tau_h$): varied ±50%; fully corr. across years
- $t\bar{t}$ subtraction ($\mu\tau_h$/$e\tau_h$): varied ±50%; fully corr. across years
- W+jets / QCD FF ($\mu\tau_h$/$e\tau_h$): unc. from subtraction of other processes

$e\mu$ QCD fakes (OS/SS method):
- OS/SS scale factor stat. unc.: from fitted functions, ~10–30% in no-btag regions; separate unc. per no-btag/$\Delta R(e,\mu)$ bin; uncorr. across years
- 2D $p_T$ weight correction syst.: ~20% up/down in no-btag regions; covers residual $p_T(e/\mu)$ correlation mismodeling
  - Validated via closure tests in anti-iso muon region
- Data-driven stat. unc.: ~10%; from limited statistics in data control regions used for fake estimation

The exact mass points simulated are:

[60, 65, 70, 75, 80, 85, 90, 95, 100, 105, 110, 115, 120, 125, 130, 135, 140, 160, 180, 200, 250, 300, 350, 400, 450, 500, 600, 700, 800, 900, 1000, 1100, 1200, 1400, 1600, 1800, 2000, 2300, 2600, 2900, 3200, 3500] GeV

30 leading impacts for the gg$\phi$ production mode at $m_{\phi} = 1000$ GeV fitted by PNN score

30 leading impacts for the bb$\phi$ production mode at $m_{\phi} = 1000$ GeV fitted by PNN score

30 leading impacts for the gg$\phi$ production mode at $m_{\phi} = 1000$ GeV fitted by $m_{\mathrm{T}}^{\text{tot}}$

30 leading impacts for the bb$\phi$ production mode at $m_{\phi} = 1000$ GeV fitted by $m_{\mathrm{T}}^{\text{tot}}$

Search for additional Higgs bosons decaying to a pair of τ leptons with the CMS experiment in pp collisions at $\sqrt{s} = 13.6~\text{TeV}$

HIG-25-006 Pre-approval