By Johannes Langer
Review Details
Reviewer has chosen not to be Anonymous
Overall Impression: Weak
Content:
Technical Quality of the paper: Good
Originality of the paper: Yes, but limited
Adequacy of the bibliography: No
Presentation:
Adequacy of the abstract: No
Introduction: background and motivation: Bad
Organization of the paper: Needs improvement
Level of English: Unsatisfactory
Overall presentation: Weak
Detailed Comments:
The submission proposes a hybrid architecture in which the classification head of a CNN is replaced with a TAO-learned decision tree. The authors extend previous work by adding a hyperparameter to the TAO algorithm that controls node sparsity. In experiments, they show that this approach maintains classification performance compared to the original MLP classification head and demonstrate how it can be used to make class feature relevance interpretable.
In terms of methodology, this paper is solid, with room for improvement. Reviews of the shorter conference paper already pointed out that the approach should be tested on a more complicated dataset than FashionMNIST, since classification on this dataset is fairly easy, and the reported errors are still surprisingly high (over 10% on the test set). Further improvements could include conducting a performance analysis using k-fold cross-validation for statistical support, and properly evaluating the faithfulness of the generated explanations rather than only claiming that they must be faithful by construction (even though I agree with the authors that this is the case).
While this is an incremental improvement over existing work, the submission is theoretically sound and relevant to the NeSy community.
Reproducibility is not easily possible as no code for the experiments or models has been made available. The descriptions are detailed enough that I believe this work could be feasibly replicated, but only with significant implementation effort.
However, the submission as it stands has several major issues:
The abstract summarises the content of the submission well. However, the descriptions lack precision, which makes the findings sound vaguer than they actually are and makes the abstract difficult to understand without having read the whole paper beforehand.
I am concerned about the language in the introduction, especially the choice of words. The descriptions could also be clearer for a scientific audience, and some parts read more like science communication than scientific writing. In other sections, some decision tree terms are introduced briefly without much explanation or a link to later parts of the paper. For example, the authors mention counterfactual explanations here and never again, merely referencing their own previous work.
This brings me to the biggest issue: the language. The writing is not up to publication standard. I will not list every instance, but just from the introduction, phrases like "say, …", "surgical precision," or the term "overkill" in the footnote stand out. The authors should also check for missing articles. To be accepted, the paper needs a major language overhaul so that it presents its content more clearly and accurately.
In addition, there are several formatting, structure, and convention issues:
- The reference list includes conference proceedings as standalone entries and then cites these entries within other bibliography entries.
- Footnotes use various special characters instead of numbers.
- In-text citations are used in a way that does not make sense (attempting to include too many sources at once), which leads to awkward collections of references in parentheses (see the last paragraph of page 3 for an example).
- Abbreviations are often not introduced (like SGD) or not explained well (like CNN).
- Titles don’t follow title case for capitalisation.
- Figures in the experimental section are not ordered properly: Figure 5 is mentioned before Figure 3. A consistent order would make the paper easier to read.
Some important references are missing. The paragraph "The Tree Alternating Optimisation (TAO) algorithm: review" lacks a reference to the original TAO paper. While it is good that the original paper is mentioned in the introduction and that its author is involved, the paper should also be cited here. The current version implies that the described algorithm is a brand-new contribution, which is not accurate. The interpretability approaches mentioned in the first paragraph of the experiments section are also not referenced (references appear only much later in the section; the first claim remains unreferenced).
The related work section starts with a historical overview of heuristic search-based approaches for incorporating trees into neural networks. While informative, this overview does not give a precise picture of the state of the art in that area, and it is largely unnecessary for the rest of the submission.
On the other hand, related work on explainability (evaluation, faithfulness, and especially methods like LRP) is completely missing.
Lastly, I would like to see an improved description of the section "Where is a specific neuron, and a specific decision node, looking at". Neurons are specifically not arranged in a grid, nor do they receive input from only a fixed subset of the outputs of the previous layer, because convolution layers operate with sliding kernels. Instead, the authors trace localised activations back through the layers to the original image. This distinction is especially important since no code is available to further clarify how the receptive fields (RFs) were generated.
Disclaimer: I used local AI tools to reformulate parts of this review. I never supplied the submission, or any part of it, as context, and I reviewed the changes thoroughly.