Add check for existence of requested data to Query init #346

ddobie · 2022-03-08T06:00:42Z

This code will check that the requested data exists at the initialisation of a Query object. It checks that the relevant directories exist and assumes that if those directories exist then the catalogues/fits files it should contain also exist.

The goal is a quick first-pass check to see if the data is available (since, e.g., nimbus doesn't have any Stokes Q/U/V data) with a nice exit if it doesn't. Adding a check that all files exist is quite cumbersome and probably unnecessary, as the query will still fail somewhat nicely (i.e. it includes the filepath in the traceback) if they don't.

This solution doesn't directly fix #339, mostly because the jupyterhub instances don't have unique hostnames and therefore a simple hostname check is not feasible. Restricting access to Stokes Q/U/V data to ada is not ideal as some users may have downloaded full stokes data to their own machines. However, this solution is more general and produces a similar outcome while also handling any other instances of missing data (e.g. there is currently no COMBINED data for some epochs on nimbus, there are a number of missing epochs on ada)

ajstewart

A minor thing on the error message to fix.

I think the approach is good here. I wouldn't go down the route of hostname checks, a generic check like this is a much better general approach. And yes, can only check so much before a query, at some point the error just has to be triggered at the point of actually using the file.

A future idea could be a function somewhere in VAST Tools that is a list_available_data() so users could run that to see what is detected on the system they are working on.

ajstewart · 2022-03-08T16:09:51Z

vasttools/query.py

+                "Not all requested data is available!"
+                "Please address and try again."


These strings are added so there won't be a space in there at the moment.

When the space is added the test checks will also need to be updated.

ddobie added 5 commits March 8, 2022 14:16

Added data availability check

5dd84ab

Change data availability check handling

0433d0a

Added testing

1f5c669

PEP8

f95338f

Continue if epoch or data dirs don't exist

d50f91f

ddobie added the enhancement New feature or request label Mar 8, 2022

Updated changelog

ff5a4be

ddobie marked this pull request as ready for review March 8, 2022 06:12

ddobie requested a review from ajstewart March 8, 2022 06:12

ddobie added 2 commits March 8, 2022 17:38

Reword logging message

4e36982

Added debug logging

e740dbb

ajstewart reviewed Mar 8, 2022

View reviewed changes

Fix error message

b5da0be

ddobie mentioned this pull request Mar 8, 2022

Add data list function #347

Open

ddobie requested a review from ajstewart March 8, 2022 23:43

ajstewart approved these changes Mar 9, 2022

View reviewed changes

ddobie merged commit c8a09c5 into dev Mar 9, 2022

ddobie deleted the iss339 branch March 10, 2022 06:50

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add check for existence of requested data to Query init #346

Add check for existence of requested data to Query init #346

ddobie commented Mar 8, 2022 •

edited

Loading

ajstewart left a comment

ajstewart Mar 8, 2022 •

edited

Loading

		"Not all requested data is available!"
		"Please address and try again."

Add check for existence of requested data to Query init #346

Add check for existence of requested data to Query init #346

Conversation

ddobie commented Mar 8, 2022 • edited Loading

ajstewart left a comment

Choose a reason for hiding this comment

ajstewart Mar 8, 2022 • edited Loading

Choose a reason for hiding this comment

ddobie commented Mar 8, 2022 •

edited

Loading

ajstewart Mar 8, 2022 •

edited

Loading