Monitor CVMFS infrastructure #67

Open
bedroge opened this issue Jan 26, 2021 · 6 comments
Labels
enhancement New feature or request

Comments

@bedroge
Collaborator

bedroge commented Jan 26, 2021

We had an issue with one of our Stratum 1 servers this week, which caused it to serve an older tag of the repository. This reminded me that we should think about setting up a monitoring dashboard that gives an overview and statistics of our infrastructure and sends out alerts when something is wrong. One easy way to grab information about the Stratum 1s is to read the .cvmfspublished file (and maybe .cvmfs_last_snapshot too); the structure of that file is explained in the CVMFS docs.
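
As a rough illustration of the idea above, here is a minimal Python sketch that reads the text part of a .cvmfspublished manifest and flags servers whose revision lags behind a reference (for example the Stratum 0). It assumes the manifest layout described in the CVMFS docs: one field per line, the first character being the key (`S` for the revision number, `T` for the publish timestamp, `N` for the repository name), terminated by a `--` line before the signature. The helper names and the URL layout passed to `fetch_manifest` are hypothetical and not part of any existing tooling.

```python
import urllib.request


def parse_manifest(raw: bytes) -> dict:
    """Parse the text part of a .cvmfspublished file into {key: value}.

    Assumption: every line before the '--' separator is a one-letter key
    followed directly by its value; everything after '--' is signature data.
    """
    fields = {}
    for line in raw.split(b"\n"):
        if line == b"--":
            break  # signature section starts here, stop parsing
        if not line:
            continue
        fields[chr(line[0])] = line[1:].decode("ascii", errors="replace")
    return fields


def fetch_manifest(server_url: str, repo: str, timeout: float = 10.0) -> dict:
    """Download and parse the manifest of `repo` from one server.

    Assumption: the server exposes the repository under /cvmfs/<repo>/.
    """
    url = f"{server_url}/cvmfs/{repo}/.cvmfspublished"
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return parse_manifest(resp.read())


def stale_servers(reference_revision: int, revisions: dict) -> list:
    """Return the names of servers whose revision is behind the reference."""
    return sorted(
        name for name, rev in revisions.items() if rev < reference_revision
    )
```

A periodic job could call `fetch_manifest` for the Stratum 0 and each Stratum 1, compare `int(manifest["S"])` across them with `stale_servers`, and raise an alert when a Stratum 1 stays behind for longer than the expected replication interval.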

@bedroge bedroge added the "enhancement" label Jan 26, 2021
@bedroge
Collaborator Author

bedroge commented Jan 27, 2021

Maybe we can register our repo here:
https://cvmfs-monitor-frontend.web.cern.ch/

@terjekv
Member

terjekv commented Jan 27, 2021

https://cvmfs-monitor-frontend.web.cern.ch/alice-ocdb.cern.ch shows that entering proper metadata into your configuration is important! But yes, no need to reinvent the wheel, that looks good!

@bedroge
Collaborator Author

bedroge commented Jan 27, 2021

I don't know whether it can notify anyone when there are issues, but I think we should try to register our (pilot? production?) repo there anyway.

I'll take a look at adding the metadata. I can easily do that manually, but I'll see whether it can be integrated into the Ansible role/playbook.

@terjekv
Member

terjekv commented Jan 27, 2021

I asked on the CERN CVMFS Mattermost what it would take to be included.

@bedroge bedroge changed the title from "Monitoring dashboard" to "Monitor CVMFS infrastructure" Feb 22, 2021
@bedroge
Collaborator Author

bedroge commented Mar 11, 2021

This can also be used (for the Stratum 0):
https://cvmfs.readthedocs.io/en/stable/cpt-repo.html#publisher-statistics

By setting CVMFS_UPLOAD_STATS_DB=true, the statistics database together with a web page with relevant plots will be published to the stratum 0 /stats location. This provides a lightweight monitoring for repository maintainers.

http://cvmfs-stratum-zero.cern.ch/cvmfs/sft-nightlies.cern.ch/stats-ws21/index.html
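
If we also want to look at the statistics database ourselves instead of relying only on the generated web page, something like the following Python sketch could serve as a starting point. It assumes the published database is an ordinary SQLite file that has already been downloaded locally. Because the table and column names are not covered in this thread, it only lists whatever tables exist and counts their rows; `table_row_counts` is a hypothetical helper name.

```python
import sqlite3
from contextlib import closing
from pathlib import Path


def table_row_counts(db_path: str) -> dict:
    """Return {table_name: row_count} for every table in a SQLite file.

    The file is opened read-only so a monitoring job cannot modify it.
    No schema is assumed: table names are discovered from sqlite_master.
    """
    uri = Path(db_path).resolve().as_uri() + "?mode=ro"
    with closing(sqlite3.connect(uri, uri=True)) as conn:
        names = [
            row[0]
            for row in conn.execute(
                "SELECT name FROM sqlite_master "
                "WHERE type = 'table' ORDER BY name"
            )
        ]
        return {
            name: conn.execute(f'SELECT COUNT(*) FROM "{name}"').fetchone()[0]
            for name in names
        }
```

Tracking how these counts change between runs would give a cheap signal that publishes are still happening, without depending on the exact schema.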

@rptaylor

Using https://github.com/cvmfs-contrib/cvmfs-servermon is also a good idea.
You can also use the CERN monitoring system mentioned in that README. You'll get emails which include all stratum servers (including e.g. WLCG ones) but you can use an email filter to exclude ones you're not interested in.

3 participants