By Alessandra Mileo
Review Details
Reviewer has chosen not to be Anonymous
Overall Impression: Average
Content:
Technical Quality of the paper: Average
Originality of the paper: Yes, but limited
Adequacy of the bibliography: Yes, but see detailed comments
Presentation:
Adequacy of the abstract: Yes
Introduction: background and motivation: Limited
Organization of the paper: Needs improvement
Level of English: Satisfactory
Overall presentation: Average
Detailed Comments:
This paper identifies the key components that should be part of a neural-symbolic architecture for complex reasoning. It also suggests, for each component, what functionalities it should have, how it should interact with the others, and what existing solutions from both the neural and the symbolic side could be used to implement it. The future directions could be more indicative of the different opportunities across all the components identified.
Some of the references for certain components (e.g. entity extraction from textual corpora in Section 3.1) are not very recent.
The English presentation could be improved (see comments below): there are several incorrect or improper grammar forms, misused terms (e.g. "initiative" vs "initial"), repeated words, and other language issues to be fixed throughout.
Detailed comments (per section) as follows:
INTRODUCTION
In general, the relevance and importance of the proposal in this paper should be more strongly emphasised.
There is a very simplistic claim that connectionist AI relies on the idea of computerising neural networks that mimic those in living beings. This is simplistic in that we know today that neural networks are based on very complex statistical and mathematical models, yet they are still not able to capture the complexity of the human brain. This should also be one of the reasons why the combination of cognitive and neural approaches is, and should be, gaining attention.
The success of modern approaches to connectionist AI is due not only to better models but, most importantly, to more data (to train the models) and more computational resources (to do the training).
Some claims are introduced without enough context: what is System 2 deep learning, and what are the other systems? References should be provided to contextualise this sentence.
The context of complex reasoning and question answering should be defined better and earlier. The neuro-symbolic combination for reasoning has specific characteristics (e.g. can we say that a neural network is not able to do REASONING but can instead do LEARNING?). In QA, for example, an LLM uses attention mechanisms and statistical properties to generate the answer, but no causality, comparison, or other reasoning tasks are performed. In this case, therefore, the combination is more a fusion of two different capabilities. I think this should be better highlighted, as it is a bit confusing in the introduction at the moment.
The structure of the paper is a bit confusing too: the components are described, and then the state of the art is used to discuss their implementations. Are you suggesting reusing existing components in a new way within the architecture? What other approaches combining Learning and Reasoning are there that leverage a neuro-symbolic combination?
The human-in-the-loop role seems minor; perhaps it should not be when it comes to reasoning, and it should be part of the architecture.
SECTION 2
- I do not think LLMs understand complex semantics. This is the reason why they hallucinate: there is no understanding of the semantics, and no constraints or knowledge guidance.
- Some sentences in Section 2 are not well formed; I suggest grammar checking.
- The lack of clarity regarding the concepts of System 1 and System 2 is present here as well as in the introduction.
- In Fig. 2, what is neural and what is symbolic? From the boxes, everything reads as symbolic; but in the paragraphs, where each component is described and possible implementations are suggested, it is clear that some functionalities of the components can be either neural or symbolic in nature. This duality of the components should be made clearer. Also, the process is described backwards: it would be better to start from the block where everything starts and proceed from there.
- Section 2.2 should also provide examples of manipulators that are neural or hybrid.
- The role of the human in the components is under-defined and added simplistically, in terms of the high-level cycles the human could or should be involved in.
I think that if the human role is critical, it should be added to the framework's conceptual diagram and better defined in terms of what interventions the human can introduce, which components the cycles involve, and how each intervention affects the framework (and how this can be measured). If the human role is instead in addition to the main framework's abilities, then it should be treated as external and perhaps discussed later, in Section 4.
SECTION 3 and SECTION 4
In general, I would have expected this section to be considered not state-of-the-art, but rather a list of concrete suggestions (perhaps using the most recent techniques) on how each component/functionality could be implemented.
A SOTA section, to me, should rather compare other approaches where neural and symbolic AI have been used to combine learning abilities on the neural side with complex reasoning abilities on the symbolic side (even just looking at Question Answering or Language Understanding). Comparing those approaches with a framework like the one proposed should focus on highlighting what specific advantages this framework would have (e.g. transparency at multiple levels, better flexibility, or …).
An alternative (or additional) way of clarifying related work versus SOTA and open challenges is to indicate, for each component in the possible implementations (Section 3), where the gaps in existing solutions for certain functionalities are: for example, in the interplay between the symbolic and the neural approaches to translating questions into sub-tasks in the reasoning planner, or in the way humans are involved in some of the tasks and where we can do better, or what challenges are still to be addressed. At the macroscopic level, one example is provided in Section 4 for the former, and a general consideration of the role of the human in the loop is also provided for the latter. However, I think this can be done more systematically (e.g. for the different functionalities discussed in Section 3 at the microscopic level, and for the different components/interactions at the macroscopic level). An account of more limitations would serve as the basis for developing future research ideas in many directions.