Cognitive LLMs: Toward Human-Like Artificial Intelligence by Integrating Cognitive Architectures and Large Language Models for Manufacturing Decision-making

Tracking #: 791-1782

Flag : Review Received

Authors:

Siyu Wu

Alessandro Oltramari

Jonathan Francis

C. Lee Giles

Frank E. Ritter

Responsible editor:

Guest Editors Trustworthy Neurosymbolic AI 2024

Submission Type:

Article in Special Issue (note in cover letter)

Full PDF Version:

nai-paper-791.pdf

Cover Letter:

Dear Editor Team, We are submitting the paper titled "Cognitive LLMs: Toward Human-Like Artificial Intelligence by Integrating Cognitive Architectures and Large Language Models for Manufacturing Decision-Making" to the Special Issue on Trustworthy Neurosymbolic AI. Thank you very much, Siyu, Alessandro, Jon, Lee, and Frank

Approve Decision:

Approved

Revised Version:

Cognitive LLMs: Toward Human-Like Artificial Intelligence by Integrating Cognitive Architectures and Large Language Models for Manufacturing Decision-making

Tags:

Reviewed

Decision:
Minor Revision

Solicited Reviews:

Review #1 submitted on 09/Nov/2024

By Anonymous User
Review Details

Reviewer has chosen to be Anonymous

Overall Impression: Weak

Content:
Technical Quality of the paper: Weak
Originality of the paper: Yes, but limited
Adequacy of the bibliography: No

Presentation:
Adequacy of the abstract: No
Introduction: background and motivation: Bad
Organization of the paper: Needs improvement
Level of English: Satisfactory
Overall presentation: Weak

Detailed Comments:

This paper introduces a neuro-symbolic architecture that integrates the ACT-R cognitive architecture with LLMs. It claims to resolve “the dichotomy between the human-like yet constrained reasoning processes of Cognitive Architectures and the broad but often noisy inference behavior of Large Language Models (LLMs)”. It does so through an approach that “extracts and embeds knowledge of ACT-R’s internal decision-making process as latent neural representations, injects this information into trainable LLM adapter layers, and fine-tunes the LLMs for downstream prediction”.

The topic is of course interesting. However, the paper has a number of serious problems, some of which are listed below.

The writing is unnecessarily wordy and often repetitive, which makes the authors' points harder to read and understand. For instance, the abstract is too long and too wordy for its content.
I suggest that the authors streamline the writing of the whole paper, avoiding repeated or scattered explanations.

The paper reads like an exhaustive lab report, not a paper ready for publication. Many sections lack a clear take-home message. Despite the paper's length, critical details are poorly explained or missing altogether.
The authors should therefore also restructure the paper, reorganizing the most relevant material and removing less relevant or irrelevant material. For example, the following has very little relevance: “Dopaminergic signals are believed to transmit reinforcement information to the corpus striatum [71], traditionally signaling reward-related activities.” The same applies to the passage beginning “Neurologically, as cognitive strategies evolve …”, among others.
Some less relevant but useful material may be relegated to appendices.

On p. 8, the temporal difference (TD) algorithm is mentioned, but the equation that follows (Eq. 1) does not appear to be a TD update; it looks like a time-weighted update of U over time steps. The notation is very confusing.
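For reference, the distinction drawn in this comment can be sketched in standard textbook notation. The symbols below are assumptions for illustration and are not taken from the paper: Eq. 1 of the paper is not reproduced here, and the second line is only the commonly stated form of ACT-R's utility learning rule, which the paper's equation may or may not match.

```latex
% TD(0) value update: bootstraps from the estimated value of the next state.
% V = state-value estimate, \alpha = learning rate, \gamma = discount factor,
% r_{t+1} = reward observed after leaving state s_t.
V(s_t) \leftarrow V(s_t) + \alpha \bigl[ r_{t+1} + \gamma V(s_{t+1}) - V(s_t) \bigr]

% Commonly stated ACT-R utility learning rule (assumed here for comparison):
% U_i(n) = utility of production i after its n-th application,
% R_i(n) = reward credited to production i on that application.
U_i(n) = U_i(n-1) + \alpha \bigl[ R_i(n) - U_i(n-1) \bigr]
```

The first update contains the bootstrapping term \gamma V(s_{t+1}); the second moves U_i toward the received reward only, which is why an equation of that form reads as an incremental average rather than as a TD update in the usual sense.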

In terms of the results, is there any performance advantage in fine-tuning LLMs with ACT-R traces, compared with the original ACT-R model from which the traces were obtained? This needs to be analyzed and discussed in detail. The paper states that the results “show both improved task performance as well as improved grounded decision-making capability of our approach, compared to LLM-only baselines that leverage chain-of-thought reasoning strategies”, but it reports no comparison with the original ACT-R model.

Methodologically, are there any advances in this paper compared with existing work such as Trieu H. Trinh, Yuhuai Wu, Quoc V. Le, He He & Thang Luong (2023)?

In terms of scholarship and citation of relevant previous work, the authors should make improvements in several respects. Here are some examples:

On integrating cognitive architectures and LLMs, the authors need to cite highly relevant existing work, such as:
• Integrating LLMs with Soar: arXiv:2310.06846v1, among others.
• Integrating LLMs with Clarion: arXiv:2401.10444; arXiv:2410.20037, among others.
• Work integrating LLMs with other cognitive architectures.
So, the authors’ claim that “unlike these previous efforts that incorporate LLMs into CAs, there is currently no research focusing on assimilating the advantages of CAs into LLMs” is not accurate.

For a more thorough understanding of dual-process theories (beyond just their popularization), see, for example, https://content.iospress.com/articles/neurosymbolic-artificial-intellige... and references cited therein. The authors need to present a more balanced view of such theoretical issues at the beginning of the paper (even though, understandably, this is not the focus of the paper in question), as well as of their relation to neuro-symbolic systems.

For reference, here are some further readings:
• Evans, J. & K. Frankish (eds.), (2009). In Two Minds: Dual Processes and Beyond. Oxford University Press, Oxford, UK.
• Macchi, L., M. Bagassi, & R. Viale, (eds.), (2016). Cognitive Unconscious and Human Rationality. MIT Press, Cambridge, MA.
Each of the two presents a variety of views.

Finally, with regard to cognitive architectures, focusing on only two of them misses the big picture. The authors should provide a more complete picture by covering many more. One way to accomplish that is to cite existing comprehensive reviews of cognitive architectures, such as Taatgen & Anderson (2023) or Kotseruba & Tsotsos (2020).

Review #2 submitted on 28/Nov/2024

By Anonymous User
Review Details

Reviewer has chosen to be Anonymous

Overall Impression: Good

Content:
Technical Quality of the paper: Excellent
Originality of the paper: Yes
Adequacy of the bibliography: Yes

Presentation:
Adequacy of the abstract: Yes
Introduction: background and motivation: Good
Organization of the paper: Satisfactory
Level of English: Satisfactory
Overall presentation: Good

Detailed Comments:

First, my broad overall impression: Attempting to embed ACT-R decision-making knowledge for a domain into the representations of an LLM to get more human-like decisions -- that's something worth looking into. I think the actual ACT-R model used is a little bespoke, but that's not a problem for the questions this work attempts to address.

No one thing below, if not addressed, would alone be a basis for rejection to me, but please at least give this another pass just to make things a little simpler/fix grammar oddities.

There are a couple of places where sentences are difficult to parse; I think these need only minor grammar fixes. Examples:

* page 7 line 8 or 9 "refer to VSM-ACTR below" -- it's not clear what you're trying to say there. If you mean something like "referred to as VSM-ACTR below", well, that's a confusing clarification because it's not like this is a shortened name. You seem to always refer to it that way, and furthermore, you actually say VSM-ACTR 2.0 quite often.
* page 16 line 46/47 "Followed results that validate VSM-ACTR reasoning process can be mapped through the neural network"

Also, it would help if the authors clarified the scope of their work earlier, making clear that they are not concerned broadly with implementing a system that alleviates the issues with LLMs, but specifically with the approach of infusing CA knowledge into the LLM structure. Otherwise, methods such as RAG, approaches that attempt to recreate CAs using LLMs, or approaches that simply use both a CA and an LLM to produce behavior would probably need extended discussion as related work. Note: This does seem like a reasonable reduced scoping. I'm not being snarky or disagreeing with this scoping. This approach is concerned specifically with mitigating the issues by changing the LLM itself rather than by implementing a system that is a hybrid of an LLM and additional software (which means leaving the LLM with the same flaws it had and just mitigating them). I don't know, maybe that's something to lean into. Take it or leave it.

The paper ends with a mostly textual discussion of results that almost seems to hide some of the prediction accuracy results. I'd try to get to the point more clearly and overtly here. Even some additional subsection labeling or bolding, or similar formatting, could help me see things here.

One big thing I’d like to see is more discussion of the overall approach's theoretical limitations, including a broader discussion of which cognitive capabilities should be available simultaneously in vanilla ACT-R versus in LLM-ACTR. For example, cognitive architectures seem to afford persistent one-shot learning in real time. Can LLM-ACTR do that? I don't think this change is necessary for acceptance, but it would help the reader understand what potential benefits *not* to expect from the authors' approach (so they can decide whether that's an important part of alignment with human decision making). However, that's probably a lot of space and work, and it's not a basis for rejection.
