DC clustering updates #419

tongtongcao · 2025-01-08T16:03:31Z

Main updates include:

Fix a bug in the splitter for complicated hit clumps, and update resolver for hit-overlapped clusters from the splitter.
Loose limit for clusters with consideration of real hit lost due to dead strips, AI-denoising, edge effect, etc.
Fix issues for hits shared by multiple clusters.
See details from
New DC Clustering.pdf

…ination and selection of clusters from cluster splitter

…y it twice

zieglerv · 2025-01-08T20:28:27Z

isExceptionalCluster and isExceptionalFittedCluster are basically copies of each other.
It may be better to use only one method for the algorithm to decide if the cluster is exceptional and apply it to Hit or Fittedhit, something like:
/**

Check if one or more layers are skipped in the cluster
@param hitsInClus the hits in a cluster (can be either Hit or FittedHit)
@param nlayr the number of layers
@return true if one or more layers are skipped in the cluster
*/
private boolean isExceptionalClusterHelper(List<? extends Hit> hitsInClus, int nlayr) {
// Initialize array to count hits in each layer
int[] nlayers = new int[nlayr];

// Count hits for each layer in a single pass through the hits
for (Hit hit : hitsInClus) {
int layer = hit.get_Layer() - 1; // layer numbering starts from 1
if (layer >= 0 && layer < nlayr) {
nlayers[layer]++;
}
}

// Check for skipped layers (special cases for layer pairs)
if ((nlayers[0] == 0 && nlayers[1] == 0) || (nlayers[4] == 0 && nlayers[5] == 0)) {
return true;
}

// Check for skipped layers in the middle (layers 0 to 3)
for (int l = 0; l < 4; l++) {
if (nlayers[l] > 0 && nlayers[l + 1] == 0) {
return true;
}
}

return false;
}

/**

Wrapper for checking if a cluster of Hit objects is exceptional.
*/
public boolean isExceptionalCluster(List hitsInClus) {
return isExceptionalClusterHelper(hitsInClus, 6); // 6 layers for Hit objects
}

/**

Wrapper for checking if a cluster of FittedHit objects is exceptional.
*/
public boolean isExceptionalFittedCluster(List hitsInClus) {
return isExceptionalClusterHelper(hitsInClus, 6); // 6 layers for FittedHit objects
}

zieglerv · 2025-01-08T22:33:50Z

A lot of 3-hit clusters are noise. Which is why on the plot of the cluster size on page 11 of the document, there is a spike at 3. What is the fraction of these 3-hit clusters on track after AI inference? Also the problem with using clusters with only 3 layers is that the left-right ambiguity can be hard to resolve. For example, on page 4, Exc1 it is 100 ambiguous unless it is in region3 which has the minstagger; Exc2 has ambiguities for the small docas, and it does not matter that much in this case. It would be nice to include in this document a MC study of the left-right ambiguity accuracy as a function of cluster size for special cluster type1 (exc1) and type2 (exc2).
In the exceptional clusters you may want to add a constraint that the left-right ambiguity (LR) is solvable. For instance, for type1 clusters, you may want to require a double hit in a layer for regions 1 and 2 to pin down LR.
It may be better to toss out a 3-layer cluster with wrong left-right ambiguity and rely on 5 super layer tracking than to keep it and possibly bias the track.

zieglerv

See my comments in the conversation section about 3-hit clusters. I suggest requiring a double hit in a layer for type 1 exceptional clusters (where the first or last 2 layers are lost) that are in region 1 or 2 to assist resolving the left-right ambiguity for those cases. In region 3 the mini stagger does that job.

Suggestions for code tidiness: isExceptionalCluster and isExceptionalFittedCluster are basically copies of each other.
It may be better to use only one method for the algorithm to decide if the cluster is exceptional and apply it to Hit or Fittedhit, something like:
/**

Check if one or more layers are skipped in the cluster

@param hitsInClus the hits in a cluster (can be either Hit or FittedHit)

@param nlayr the number of layers

@return true if one or more layers are skipped in the cluster
*/
private boolean isExceptionalClusterHelper(List<? extends Hit> hitsInClus, int nlayr) {
// Initialize array to count hits in each layer
int[] nlayers = new int[nlayr];

// Count hits for each layer in a single pass through the hits
for (Hit hit : hitsInClus) {
int layer = hit.get_Layer() - 1; // layer numbering starts from 1
if (layer >= 0 && layer < nlayr) {
nlayers[layer]++;
}
}

// Check for skipped layers (special cases for layer pairs)
if ((nlayers[0] == 0 && nlayers[1] == 0) || (nlayers[4] == 0 && nlayers[5] == 0)) {
return true;
}

// Check for skipped layers in the middle (layers 0 to 3)
for (int l = 0; l < 4; l++) {
if (nlayers[l] > 0 && nlayers[l + 1] == 0) {
return true;
}
}

return false;
}

/**

Wrapper for checking if a cluster of Hit objects is exceptional.
/
public boolean isExceptionalCluster(List hitsInClus) {
return isExceptionalClusterHelper(hitsInClus, 6); // 6 layers for Hit objects
}
/*

Wrapper for checking if a cluster of FittedHit objects is exceptional.
*/
public boolean isExceptionalFittedCluster(List hitsInClus) {
return isExceptionalClusterHelper(hitsInClus, 6); // 6 layers for FittedHit objects
}

tongtongcao · 2025-01-09T15:36:09Z

Nice suggestion to use wrapper to combine two similar functions.
Page 11 in the shared document shows distributions of clusters directly from DC clustering before AI prediction of cluster combo.
It is not surprise that there are plenty of low-size clusters. So we just looser cluster limit for so-called exceptional clusters.
Indeed, there is LR ambiguous for some of exceptional clusters.
Most likely, low-size clusters have more chance to get LR ambiguous than high-size clusters.
Regards to LR ambiguity of clusters, we actually handle it during TB tracking by build two-folder clusters for a cluster with LR ambiguity

tongtongcao added 16 commits May 9, 2024 16:04

update DC clustering

0aa56c2

update class PatternRec corresponding to updates of DC clustering

06988cf

restore pruner for DC clustering

abbd5d0

update ClusterCleanerUtilities::OverlappingClusterResolver() for elim…

6b36008

…ination and selection of clusters from cluster splitter

fix an issue in HitReader::read_NNHits()

89607e4

fix bug for hits shared by clusters

44575d1

clean up comments

00905ee

update ClusterCleanerUtilities::OverlappingClusterResolver()

4522187

Merge branch 'updateDCClustering' into fixBugHitsSharedbyClusters

cfc851d

Add a new item called as indexTDC into HitBasedTrkg::Hits

1f905f4

cancel pruner in clustering

6839898

recover pruner for DC clustering and apply j4ml v1.0

9b22d06

back to use j4ml 0.9-SNAPSHOT

e0dc4fc

fix issues in ClusterCleanerUtilities::OverlappingClusterResolver()

6914f6e

update ClusterCleanerUtilities::OverlappingClusterResolver() and appl…

6381e60

…y it twice

Merge branch 'development' into updateDCClustering

17d1843

tongtongcao requested review from zieglerv and raffaelladevita January 8, 2025 16:03

zieglerv requested changes Jan 8, 2025

View reviewed changes

use wrapper to treat similar functions of isExceptionalCluster

ae75107

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DC clustering updates #419

DC clustering updates #419

tongtongcao commented Jan 8, 2025

zieglerv commented Jan 8, 2025

zieglerv commented Jan 8, 2025

zieglerv left a comment

tongtongcao commented Jan 9, 2025 •

edited

Loading

DC clustering updates #419

Are you sure you want to change the base?

DC clustering updates #419

Conversation

tongtongcao commented Jan 8, 2025

zieglerv commented Jan 8, 2025

zieglerv commented Jan 8, 2025

zieglerv left a comment

Choose a reason for hiding this comment

tongtongcao commented Jan 9, 2025 • edited Loading

tongtongcao commented Jan 9, 2025 •

edited

Loading