xp65 on ARE VDI - unable to call /opt/conda/*

Hi all,

I am experiencing an issue when trying to run xp65 analysis3 within ARE VDI.

I have followed the helpful instructions (including adding gdata/xp65 under Storage), but when I attempt to execute python from the VDI terminal I get a No such file or directory error.

It seems that /opt/conda/* is not available for me to call. Have I overlooked a step that makes this available on ARE VDI (it works fine on ARE JupyterLab)?

Thanks for your help.

It’s a technical issue: VDI runs inside a container, and the xp65 environment is also delivered as a container. You can’t run a container from inside another container.

The simple fix is to exit the VDI container first, which can be done with ssh localhost. You should then be able to use conda inside the ssh session.
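For example, a minimal terminal session (a sketch; the module lines follow the usual xp65 setup instructions, so adjust the module name to match your analysis3 release):

ssh localhost
module use /g/data/xp65/public/modules
module load conda/analysis3
python    # now resolves to the xp65 environment's python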

Thanks @Scott

Sounds like something we should add to the docs.

Would you have time to make an issue for this @mblack?

done, thanks

No, thank you!

For anyone following along at home:

At the risk of derailing the useful discussion above, I think I have a similar or near-identical issue.

When I try to create a dask PBS cluster from a Jupyter notebook in ARE, the cluster errors and exits prematurely. The dask-worker.e… log file says:

/local/spool/pbs/mom_priv/jobs/140245218.gadi-pbs.SC: line 12: /g/data/xp65/public/apps/med_conda/envs/analysis3-24.07/bin/python: No such file or directory

While Scott’s suggestion above may be applicable here, I am not sure how I’d exit the (parent?) container in a situation like this.

I’m initialising the PBS cluster like this in my notebook (storages line snipped):

from dask_jobqueue import PBSCluster
from dask.distributed import Client

cores = 2
memory = "9GB"
processes = 2
walltime = "1:00:00"
storages = "gdata/xp65+ ... <snipped> ... +gdata/ia39"

cluster = PBSCluster(
    walltime=walltime,
    cores=cores,
    memory=memory,
    processes=processes,
    job_extra_directives=[
        "-q normalbw",
        "-l ncpus=" + str(cores),
        "-l mem=" + memory,
        "-l storage=" + storages,
        "-l jobfs=10GB",
        "-P ai05",
    ],
    job_directives_skip=["select"],
)

cluster.scale(jobs=1)  # Scale the resource to this many nodes
client = Client(cluster)
print(f"Dask Client started. Dashboard URL: {client.dashboard_link}")

This worked in hh5 so I presume it’s roughly correct.

Thanks in advance for suggestions.

Regards,
Aurel.

Try setting the path to python, as described in Using dask_jobqueue in the new xp65 environment - Technical - ACCESS Hive Community Forum
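A minimal sketch of what that might look like, using dask_jobqueue's python keyword (the interpreter used to launch the workers, which otherwise defaults to sys.executable from inside the ARE container). The path below is a placeholder, not the real one; take the exact launcher path for your analysis3 release from the linked post:

from dask_jobqueue import PBSCluster

# python= tells the PBS job which interpreter to start the workers with,
# instead of sys.executable (which points inside the JupyterLab container
# and doesn't exist on the compute node).
# Illustrative placeholder path only; see the linked post for the real one.
cluster = PBSCluster(
    walltime="1:00:00",
    cores=2,
    memory="9GB",
    processes=2,
    python="/g/data/xp65/public/apps/.../analysis3.../bin/python",  # placeholder
    job_extra_directives=["-q normalbw", "-l storage=gdata/xp65", "-P ai05"],
    job_directives_skip=["select"],
)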
