Allow extended filtering on relationships#524
Allow extended filtering on relationships#524ml-evs merged 38 commits intoMaterials-Consortia:developfrom
Conversation
|
I'm confused. Even if we disregard that the |
ml-evs
left a comment
There was a problem hiding this comment.
I wholeheartedly support this, and indeed have been breaking the spec a bit myself to allow filtering on DOIs of references via this exact mechanism. This would be straightforward to enable in optimade-python-tools, as we currently hvae to disable filtering on fields other than ID in joins.
The only issue I see here is touched on by @rartino, previously we would allow filtering on the relationship description (which nobody ever used, and the filter was never implemented afaik), but somehow this is a breaking change that removes this functionality. Since we do not have a top-level description field of an entry, this could be kept in unambiguously, though slightly annoying for implementers, I think it makes sense (and we could never standardize a field called description for each entry in this case).
A somewhat drastic move would be to say that this design choice was a bug 😅 In a sense this would not be false, as
|
|
I don't think filtering on the human readable description of a relationship is particularly crucial. However, the "role" of a relationship discussed in #523 is crucial to be able to filter on. (E.g. "does this calculation have an output structure that contains sodium"). |
|
Since this PR asks to change how the dot operator works, I think we should try to peek forward enough so we do not paint ourselves into another corner. Are there good reasons to not let the relationship still be the 'relationship object' (or strictly in OPTIMADE a list of relationship objects) and then let the target of the relationship object explicitly live under, e.g., 'target'? To clarify, we could adopt a model of the structures relationship in an optimade calculation to work like: This would mean that a query for all calculations that have an output structure that contains sodium could be expressed as: (As I type this up, I realize that we are actually missing the construct with the "inner" HAS for zip list constructs in our grammar, but that seems like an oversight that we should fix, given that we support e.g. the full set of fuzzy string operators in that position.) Edit: note, this means that @ml-evs's query on DOIs now would be, e.g.: |
I am not sure if I am getting this right, so bear with me. Do you mean retaining entry's relationships as they are (v1.2.0) and then including explicit copies of the related entries inside entry's |
|
I have had a query related to this and think it would be worth discussing again at an upcoming meeting. For my implementations I am still breaking compliance by allowing filtering on |
|
From the web meeting: how do one filter on |
ml-evs
left a comment
There was a problem hiding this comment.
This quick review just works up the target that we discussed in the meeting. It feels a little cumbersome and feels strange to be introducing a filter-specific syntax inside a "field" name, but it achieves our goals in a backwards-compatible way and there seemed to be consensus (plus it is easy for servers to implement). I tried to come up with some other options (e.g., relationships.structures.nsites = 20 or included.structures.nsites=20) that match other existing keywords, but they are also not satisfactory (and would be more of a breaking change).
Co-authored-by: Matthew Evans <7916000+ml-evs@users.noreply.github.com>
Co-authored-by: Matthew Evans <7916000+ml-evs@users.noreply.github.com>
Co-authored-by: Matthew Evans <7916000+ml-evs@users.noreply.github.com>
Co-authored-by: Matthew Evans <7916000+ml-evs@users.noreply.github.com>
Co-authored-by: Matthew Evans <7916000+ml-evs@users.noreply.github.com>
|
I have updated the PR according to comments on Friday's web meeting. The PR now retains full backwards compatibility and allows for extended filtering on relationships through I did not implement |
Co-authored-by: Matthew Evans <git@ml-evs.science>
|
The consensus here seemed to be that people are happy to merge this functionality, but that we may tidy up some of the language before the 1.3 release. As @rartino has already approved a previous version of this (with only light changes), I feel empowered to merge and create the release candidate today, if there are no objections in the next few hours. |
Can review it this evening. |
ndaelman-hu
left a comment
There was a problem hiding this comment.
I added some suggestions where the formulation could be clearer. Still, the specification is correct, just somewhat dense for fresh eyes. This should not delay the merger of this PR.
Thanks @ndaelman-hu, I have to resolve your comments to be able to merge, but will open an issue tracking the clarifications you suggested. |
|
I have not forgotten to make the adjustments suggested by @ndaelman-hu. Please bear with me, I will make a PR tomorrow or early next week. |
This PR addresses #437 by allowing fully-fledged filtering on relationships. Prior to it, filtering was allowed, but only on
idandmeta.descriptionof relationships. Now all properties of related entries should in principle be allowed to filter on, to arbitrary depth (OPTIONAL).This introduces a slight backwards-incompatibility by abandoning special treatment fordescription. Thus queries which previously matched/mismatched, but were perfectly legal, now will return HTTP 400 for all standard entry types exceptfileswheredescriptionproperty is defined since v1.2.0. I am not sure how severe this incompatibility is - if we want to avoid it, we will have to mandate the special handling ofdescriptionby re-routing it tometa.descriptionof a relationship instead of entry's property.Edit: Full backwards compatibility is now restored, entry properties are now accessible through
target.Fixes #437.