Dashboards for managing NCI resources

Paul.Gregory · 3 April 2025 01:49

Hello all.

I’m interested in developing simple tools to help monitor NCI resources across the various projects associated with 21st Century Weather.

Before I build some simple bash and python scripts, does the community have any existing solutions?

I note this thread has been locked:

That thread suggested some possibilities.

Rui at NCI has set up a ‘voila’ thingie on ARE that can do jupyter-based dashboards (do these still exist?)
Resurrecting the old CLEX dashboard
GitHub - coecms/dashboard
Replicating an tools used by NRI to monitor their resources.

Suggestions are welcome

Aidan · 3 April 2025 03:42

@rui.yang might be able to comment.

NCI systems folks might set up a regular dump of NCI resource commands if you ask nicely (help@nci.org.au). At least they have done so in the past. Otherwise you need a user who is a member of all the groups you want to capture and set up a scheduled job to do it. This is what we’re doing at ACCESS-NRI (from GitHub).

ACCESS-NRI is using a grafana cloud account (free tier) for the grafana dashboarding and a postgresql DB back-end on a NECTAR VM. Happy to share what we’ve done, but note we’re going to refactor the part that updates the DB to use the Django server/apps @tmcadam and @CharlesTurner have developed.

Scott · 3 April 2025 04:03

If you have access to your scheme’s info in mancini you can get csv files for historical compute and storage use for all of the scheme’s projects in the scheme dashboard. It’s a bit of a hassle to set up authentication to download automatically though - if you’re interested I can provide a script.

We have some python scripts that collect this as well as nci_account and nci_files_report output that dump the output into a /g/data space that’s mounted on a web server, then have a basic dashboard using plotly to present the data.

We may at some stage look at using PowerBI to set up a dashboard instead, since we have access to that through sharepoint.

Paul.Gregory · 3 April 2025 04:30

Thanks for the replies.

Grafana cloud accounts with postgresql DB back-end on a NECTAR VM moving to Django server/apps is beyond my skills and time constraints at this stage.

But .csv files plotted using python and plotly would be in scope.

I do have memberships to all the relevant projects.

How does one obtain scheme info via Mancini?

Otherwise I can follow the NRI path and just set up a cron to capture output of nci accounting command-line tasks.

Scott · 3 April 2025 04:47

You need to be listed as a scheme manager to see the scheme level info - whoever’s responsible in your org for distributing NCI compute resources

Paul.Gregory · 4 April 2025 01:17

Ok I’m now delegated CI for all CoE projects - giving me access to project accounting data via

https://my.nci.org.au/mancini/project/<project-id>/accounting

Do I now follow up with NCI to get the .csv data? Is that the underlying data the plotly image on the Mancini page is using?

Scott · 4 April 2025 02:43

It’s a different interface https://my.nci.org.au/mancini/scheme/{scheme}/compute/csv, https://my.nci.org.au/mancini/scheme/{scheme}/storage/csv, but probably the same underlying data.

You should be able to get the data source that plotly is displaying by looking at your browser developer tools. You can authenticate with mancini in a python session with

from contextlib import contextmanager
from bs4 import BeautifulSoup as BS
import requests

@contextmanager
def mancini_session():
    with requests.Session() as s:
        s.headers['Origin'] = 'https://my.nci.org.au'

        r = s.get("https://my.nci.org.au/mancini/login")
        r.raise_for_status()

        # Collect xsrf tokens required to log in
        soup = BS(r.text, 'html.parser')
        login_form = soup.find(id='login-form')
        form = {}
        for i in login_form.find_all('input'):
            if i.get('type', None) == 'submit':
                continue
            form[i['name']] = i.get('value', None)

        # Do the login
        form['username'] = os.environ['SCHEME_USER']
        form['password'] = os.environ['SCHEME_PASS']
        headers = {'Referer': 'https://my.nci.org.au/mancini/login'}

        r = s.post('https://my.nci.org.au/mancini/login', data=form, headers=headers)
        r.raise_for_status()

        yield s

with mancini_session() as s:
    # Replace with whatever URL you have access to
    r = s.get(f'https://my.nci.org.au/mancini/scheme/{scheme}/compute/csv')

Scott · 4 April 2025 02:58

If you’re already a member of all required projects however it’s probably simplest to just use nci_account’s info and not worry about mancini:

import requests
import pymunge

token = pymunge.encode().decode('utf-8')

url = f"http://gadi-pbs-01.gadi.nci.org.au:8811/v0/nciaccount/project/{project}"
headers = {'Authorization': f"MUNGE {token}"}

r = requests.get(url, headers=headers)

Paul.Gregory · 9 April 2025 08:40

An update.

It seems access to those .csv resource will take a while.

In the interim, I was going to write a simple cron script to run the various nci command-line resource scripts and pipe them to a text file, i.e.
lquota -v
and
nci_account -P <project> -v
for each project.

Then build some logic to parse the output into a pandas dataframe and then plot them. That’s easy for me and within my wheelhouse.

However, you seem to know more about web-scraping than I do and mentioned you have some scripts which already collate and process this data?

Is it related to this snippet?

import requests
import pymunge

token = pymunge.encode().decode('utf-8')

url = f"http://gadi-pbs-01.gadi.nci.org.au:8811/v0/nciaccount/project/{project}"
headers = {'Authorization': f"MUNGE {token}"}

r = requests.get(url, headers=headers)

Can a python script accessing a url on gadi obtain the same data as nci_account and lquota?

Scott · 9 April 2025 23:13

There’s no need to parse nci_account’s output - nci_account reads data from that url (it returns JSON data) and does some formatting around it, e.g. try printing r.json() to see the result. nci-files-report and nqstat both work similarly, calling an API and formatting the json output. These APIs are only accessible from Gadi itself.

lquota is a bit different, it hooks into the lustre library. NCI’s python interface for doing this is at /opt/nci/lquota/lustre.py.

Scott · 9 April 2025 23:20

Here’s the beginnings of a library for working with the nci tool outputs if you like - ScottWales/nci-tools

Topic		Replies	Views
Grafana post-accessdev Infrastructure	15	282	10 March 2025
Bring back uqstat? Technical help , inscope	7	41	30 June 2025
NRI climate conda environments: information to users General mailinglist , climate-conda-enviro	9	235	11 March 2025
Initial planning meeting 21/10/22 modules	2	231	12 September 2024
Poster: Improving the reusability of ACCESS model input data Posters data , atmosphere , infrastructure	1	229	30 August 2024

Dashboards for managing NCI resources

Related topics