By Anonymous User
Review Details
Reviewer has chosen to be Anonymous
Overall Impression: Bad
Content:
Technical Quality of the paper: Weak
Originality of the paper: No
Adequacy of the bibliography: No
Presentation:
Adequacy of the abstract: No
Introduction: background and motivation: Bad
Organization of the paper: Needs improvement
Level of English: Satisfactory
Overall presentation: Weak
Detailed Comments:
This paper's excellent surface-level presentation makes it difficult for outsiders to see how problematic it is. None of its claims appear to be novel, and given the author's past work, the author must be aware of this fact and willfully ignoring prior work that established all these ideas before. Between Eisner's Dyna, the well-established pipeline of interpreting (probabilistic/neural) logic programs (which include datalog) as probabilistic circuits which then run as tensorized computation using einsum operators on the GPU, as well as the work on semiring datalog, there is really nothing covered by this paper that is not already well-known in the field. There is simply a lack of novelty that cannot be addressed by a revision with related work discussion. Given the extremely poor scholarship, I believe the editors should take measures to avoid this type of submission from happening again in future.
Writing up in detail the related work that has all the ideas already would amount to me writing a survey of the field, which I don't have time for. Luckily the issue is so blatant that even AI can spot it easily and identify all the relevant work. See below for a summary.
—
Differentiable Datalog / neurosymbolic systems
TensorLog (Cohen 2016; Cohen, Yang & Mazaitis, JAIR 2020). https://scholar.google.com/scholar?q=TensorLog+differentiable+deductive+... — Compiles Datalog clauses into differentiable tensor operations, with predicates as sparse matrices and inference as matrix products. This is the central correspondence the paper claims as its foundational observation, published a decade earlier and with the same name. Reduces novelty of the "Representation" and "Inference" sections to near zero.
Neural Theorem Provers (Rocktäschel & Riedel, NeurIPS 2017). https://scholar.google.com/scholar?q=end-to-end+differentiable+proving+R... — Differentiable backward chaining over embedded symbols, with soft unification via vector similarity. Directly anticipates the "reasoning in embedding space" section, including the analogical-reasoning-via-embedding-similarity story.
NeuralLP (Yang, Yang & Cohen, NeurIPS 2017). https://scholar.google.com/scholar?q=Differentiable+learning+logical+rul... — End-to-end differentiable learning of first-order rules built on TensorLog operators. Reduces the novelty of the rule-learning claim.
DeepProbLog (Manhaeve, Dumančić, Kimmig, Demeester & De Raedt, NeurIPS 2018; AIJ 2021). https://scholar.google.com/scholar?q=DeepProbLog+neural+probabilistic+lo... — Integrates neural predicates into probabilistic logic programming with exact, sound probabilistic semantics. Directly addresses "sound treatment of uncertainty" combined with neural learning, which the paper claims as a distinguishing feature.
Scallop (Huang et al., NeurIPS 2021; PLDI 2023). https://scholar.google.com/scholar?q=Scallop+neurosymbolic+programming+D... — Provenance-semiring-based differentiable Datalog with a working compiler and GPU backend. Directly competes with the proposed system and has actually been implemented.
NeurASP (Yang, Ishay & Lee, IJCAI 2020). https://scholar.google.com/scholar?q=NeurASP+neural+networks+answer+set+... — Combines ASP with neural networks. Another neurosymbolic system the paper would need to position against.
Differentiable ILP / ∂ILP (Evans & Grefenstette, JAIR 2018). https://scholar.google.com/scholar?q=Learning+explanatory+rules+noisy+da... — Learns Datalog programs by gradient descent through a differentiable relaxation. Predates the paper's claim that tensor decomposition in tensor logic is a "generalization of predicate invention."
Semiring Datalog and weighted logic programming
Provenance semirings (Green, Karvounarakis & Tannen, PODS 2007). https://scholar.google.com/scholar?q=Provenance+semirings+Green+Karvouna... — Foundational result that positive Datalog evaluation generalizes to any commutative semiring. The general framework predates the paper by ~20 years.
Dyna (Eisner & Filardo, Datalog Reloaded 2011; earlier Eisner et al. 2005). https://scholar.google.com/scholar?q=Dyna+weighted+logic+programming+Eisner — Weighted Datalog with semiring aggregation, designed explicitly for ML and inference. Has the "single-construct equational" feel tensor logic aspires to, including recursion and aggregation. Probably the closest prior art on the language-design side.
FAQ — Functional Aggregate Queries (Abo Khamis, Ngo & Rudra, PODS 2016). https://scholar.google.com/scholar?q=FAQ+questions+asked+frequently+Kham... — Unified framework for sum-product queries over arbitrary semirings, subsuming CSPs, probabilistic inference, matrix chain multiplication, and database joins. Provides the worst-case-optimal algorithms tensor logic would need for its "scaling up" section to be more than a sketch.
Datalog° (Abo Khamis, Ngo, Pichler, Suciu & Wang). https://scholar.google.com/scholar?q=Datalogo+recursive+aggregates+semir... — Recursive Datalog with aggregation over (pre-)semirings, with clean fixed-point semantics. The proper formal setting for recursive real-valued tensor equations, which tensor logic uses (e.g., RNN and message-passing examples) without addressing the semantic issues. Also identified "a sum-product expression is a tensor expression, sometimes called an Einsum".
Tensor implementations of database queries / tensor query languages
Tensor Relational Algebra (Yuan, Jankov, Cai, Gao, Jermaine, PVLDB 2021). https://scholar.google.com/scholar?q=Tensor+relational+algebra+distribut... — Defines a relational algebra whose tuples are tensors, designed explicitly to unify tensor computation and relational query processing for ML systems. Directly anticipates the "tensors as relations / einsum as join" framing as the basis of a programming system, with an actual implementation and optimizer.
LaraDB / Lara (Hutchison, Howe & Suciu, 2017). https://scholar.google.com/scholar?q=Lara+key-value+algebra+arrays+relat... — A three-operator algebra that subsumes both linear algebra and relational algebra over associative arrays. Same unification goal as tensor logic, with formal expressiveness results.
Expressive power of linear algebra query languages (Geerts, Muñoz, Riveros & Vrgoč, PODS 2021). https://scholar.google.com/scholar?q=Expressive+power+linear+algebra+que... — Formal characterization of which relational queries are expressible as linear/tensor algebra. Directly relevant to the paper's claims about Datalog-as-tensor-equations, with precise expressiveness results the paper does not engage with.
MATLANG (Brijder, Geerts, Van den Bussche, Weerwag). https://scholar.google.com/scholar?q=MATLANG+expressive+power+matrix+que... — A matrix-algebra query language with worked-out expressiveness vs. relational calculus. Same conceptual territory.
In-database learning with sparse tensors (Abo Khamis, Ngo, Nguyen, Olteanu & Schleich, PODS 2018). https://scholar.google.com/scholar?q=In-database+learning+sparse+tensors... — Exactly the proposed "treat sparse tensors as relations, dense as GPU tensors" idea, worked out with provable complexity bounds. Makes the paper's "scaling up" sketch look underdeveloped.
SPORES (Wang, Hutchison, Howe & Suciu, SIGMOD 2020). https://scholar.google.com/scholar?q=SPORES+sum-product+optimization+rel... — Optimizes sum-product (i.e., einsum-style) linear algebra by translating to relational algebra and back via equality saturation. Concretely demonstrates that the einsum/relational duality is a tool for query optimization, not just a notational coincidence.
AC/DC and F-IVM (Schleich, Olteanu et al.). https://scholar.google.com/scholar?q=AC%2FDC+in-database+learning+gradie... — In-database gradient computation and factorized ML, treating learning as sum-product queries.
Compilation of (neural) logic programs to probabilistic circuits → einsums on GPU
Knowledge compilation for weighted model counting (Darwiche & Marquis, JAIR 2002; Chavira & Darwiche, AIJ 2008). https://scholar.google.com/scholar?q=knowledge+compilation+map+Darwiche+... — Foundational framework: compile logical theories to tractable circuit forms (d-DNNF, SDD, etc.) on which weighted sum-product becomes linear-time. The paper's claim that tensor logic "preserves sound treatment of uncertainty" via tree-structured programs is a special case of this much more general theory, with sharper tractability results.
ProbLog → SDD/d-DNNF compilation (Fierens, Van den Broeck, Renkens et al., TPLP 2015). https://scholar.google.com/scholar?q=Inference+learning+probabilistic+lo... — Compiles probabilistic logic programs to weighted Boolean circuits for inference and learning. The canonical "logic program → circuit → tractable sum-product" pipeline. Directly anticipates the paper's framing.
Semiring programming (Belle & De Raedt, AIJ 2020). https://scholar.google.com/scholar?q=Semiring+programming+Belle+De+Raedt — Generalizes the compile-and-evaluate pipeline to arbitrary semirings, recovering WMC, MPE, gradient computation, expectations, and sensitivity analysis as instances. The unification tensor logic gestures at, formalized.
aProbLog and algebraic model counting (Kimmig, Van den Broeck & De Raedt, ECAI 2011; JAL 2017). https://scholar.google.com/scholar?q=Algebraic+model+counting+Kimmig+Van... — Logic program → circuit → semiring evaluation. Same architecture as tensor logic, with cleaner theory.
Einsum Networks (Peharz, Lang, Vergari, Stelzner, …, ICML 2020). https://scholar.google.com/scholar?q=Einsum+networks+fast+scalable+proba... — Implements probabilistic circuits as a single batched einsum operation per layer on GPU. The paper's vision of "tensor logic programs as GPU-executable sum-product computations" is what EiNets do, with implementations, scaling experiments, and a public library.
PyJuice / sparse PC GPU evaluation (Liu, Ahmed & Van den Broeck, ICML 2024). https://scholar.google.com/scholar?q=Scaling+tractable+probabilistic+cir... — State-of-the-art GPU implementation of sparse probabilistic circuits using block-sparse einsum kernels. Directly addresses the "scale up sparse tensors on GPUs" problem the paper presents as open.
Probabilistic circuits as a unifying framework (Choi, Vergari & Van den Broeck 2020). https://scholar.google.com/scholar?q=Probabilistic+circuits+unifying+fra... — Surveys the entire compile-to-circuit-evaluate-as-tensor-contraction pipeline.