By Anonymous User
Review Details
Reviewer has chosen to be Anonymous
Overall Impression: Average
Content:
Technical Quality of the paper: Average
Originality of the paper: Yes, but limited
Adequacy of the bibliography: Yes
Presentation:
Adequacy of the abstract: Yes
Introduction: background and motivation: Limited
Organization of the paper: Poor
Level of English: Satisfactory
Overall presentation: Average
Detailed Comments:
This submission presents a tool for generating (factual or contrastive) explanations from decision trees, which may or may not derive from a black-box model. The tool ultimately translates the problem of finding such explanations into a CLP problem, to be solved via Prolog's CLP(R) library.
I think that this is a potentially interesting tool that deserves to be presented in this journal; however, I also believe that the quality of the presentation needs to be improved considerably before acceptance can be recommended. The paper spends, in my opinion, too many pages describing implementational details of comparatively little interest (the discussion of the meta-interpreter at pages 20-23 being perhaps an especially egregious example); instead, the "evaluation" part of pages 31-onwards, which should be the most important one since the paper is about presenting a new tool, contains only a limited discussion of the results and leaves many important details in the appendix, to the point of being borderline unparseable as it is.
Some more specific comments (some of which very minor) follow:
* Page 2: "...encodes a linear constraints" - a linear constraint
* Page 4: "---whether the explanation method exploits knowledge about the internals of the black-box or not". If we can see "the internal of the black box", isn't that *not* a black-box by definition?
* Page 6: "Bayes rule". What is meant by this? Bayes' rule isn't about choosing the label with the highest estimated probability, as the phrase would seem to suggest; and it doesn't seem to be relevant to the framework described here, in which the leaves simply assign probabilities to labels.
* Pages 11 and following: a criticism of the authors' notion of "factual explanation" that should be worth addressing, I think, is that depending on the number of variables and on the structure of the decision tree the explanation generated by the algorithm might involve a number of variables that are not *actually* relevant to the decision made and might make it difficult to parse for a human user. One can answer this in a few different ways (for example, by arguing that if so then the problem lies in the fact that the decision tree is badly constructed, or suggesting that this factual explanation might be simplified later on if necessary but at any rate it still need to be constructed first); but in any case, I believe that this should be addressed.
* Also pages 11 and following: something should also be said about how to address the fact that a set of constraints can often be made non-redundant in different ways. Presumably, we want to give the user the constraints in the most readable form possible: but, to make a fairly trivial example, how are we to choose between {x1 >=3, x2 >=2, x1 + x2 <= 5} and {x1 = 3, x2 = 2}? Intuitively in this case it is
clear that the second set of constraints is preferable, but in general one needs a well-defined notion of what this means and a description of what the system will return.
* Page 17: "the factual rules are produced only for c's that appear as c and c' \in Th(S) for some c'". It is not clear to me what is meant here. What is S? Does it refer to the previous point?
* Page 19: "Each declared instance is encoded by a list of CLP(R) variables, one for each feature". Shouldn't it be encoded by a list of variable *values*?
* Pages 21-23: as I was saying, I think that the discussion of these implementational details could have been safely moved to the supplementary material.
* Page 32: I think that more discussion of Table 7 is necessary. Here we do not even see an explanation of what its various columns mean, never mind some insight about what these results tell us about the performance of the proposed tool. Similarly for the other tables in this section (I see that there is some material along these lines in the supplementary material, but it really should be part of the main paper).
* Page 35: "REASONX advances other methods in qualitative terms." Such as?
* Page 48: Much of the "Contrasting to sufficient reasons" section should be moved in the main text, I think; and the same applies also to the "Quantitative Evaluation" section later on.