Skip to content

Conversation

@delucchi-cmu
Copy link
Collaborator

@delucchi-cmu delucchi-cmu commented Oct 21, 2025

Adds section on registry and HTTP capabilities. Closes #31. Closes #32.

Adds new properties and file names. Closes #27. Closes #28. Closes #29.

@delucchi-cmu delucchi-cmu marked this pull request as ready for review November 7, 2025 20:24
Copy link

@tsdower tsdower left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Purely for the registry component:

The standardID is straightforward and makes sense and mirrors HiPS support.
Do we want the columns/filters parameters to be optional or required?
Do we care about having two different delimiters?

Copy link
Collaborator

@bsipocz bsipocz left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Super quick check for the registry content, IMO it needs more.

Let me know if you want me to add text instead of just saying it needs more :)

\label{fig:exampleCollectionPropertiesFile}
\end{figure}

\section{Service Capabilities} \label{sec:capabilities}
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

How do you feel about having this section (or a different section) as "HATS registration" analogue for how it's done in the hips document?

Comment on lines +686 to +689
\begin{figure}
\begin{minipage}{\linewidth}
\begin{lstlisting}
<capability standardID="ivo://ivoa.net/std/hats#hats-1.0">
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

OK, I revert my comment above, as it should be "how about a new section for HATS registration"? As this example below misses a lot of key elements.

hats_order = 6
moc_sky_fraction = 0.00618
hats_builder = hats-import v0.4.4
hats_creation_date = 2025-03-20T03\:38UTC
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
hats_creation_date = 2025-03-20T03\:38UTC
hats_creation_date = 2025-03-20T03:38UTC

hats_creation_date = 2025-03-20T03\:38UTC
hats_estsize = 23137775
hats_release_date = 2024-09-18
hats_version = v0.1
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Should it be v1.0 now?

Suggested change
hats_version = v0.1
hats_version = v1.0

Comment on lines +758 to +759
The service is not required to return the columns in the same order they were requested in the query parameters.
The service may return more columns than the user requested, e.g. the spatial index column described in Section~\ref{sec:spatialIndex}, and positional (radec) columns.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

If I remember correctly, current lsdb-server implementation always returns all the columns, but put Null values to filtered columns, so 1) parquet files size is much smaller, but 2) the schema still matches _common_metadata

The service is not required to return the columns in the same order they were requested in the query parameters.
The service may return more columns than the user requested, e.g. the spatial index column described in Section~\ref{sec:spatialIndex}, and positional (radec) columns.

\subsubsection{Row Filters}
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

There is also a question on how row and column filters work together: must client/server select all the columns we are filtering rows on, or not? If I'm not wrong, the current lsdb-server adds columns used in row filters to the output. This is different from pandas/pyarrow behavior, which supports row-filtering on columns not specified in the column list, while not returning these columns in the final table.


\subsubsection{Metadata}

For VOParquet files, each leaf parquet file may contain the same VO metadata inside the file-level metadata.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Should the VOParquet note be referenced in some way?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

5 participants