Configure RSEs in Rucio #16

Closed
1 of 2 tasks
egazzarr opened this issue Nov 23, 2022 · 9 comments
Labels: cern-vre-infra (things only related to and dependent on our team), component/data-transfer (FTS and GFAL related topics), priority/critical (needs to be done very soon)

Comments

egazzarr commented Nov 23, 2022

CRIC can be used as a catalogue of RSE configuration details (ref. https://github.com/cern-vre/project/issues/15). A script running as a cron job can then pull this information from CRIC and import it into Rucio.
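
A minimal sketch of what such an import script could look like, in Python. It only turns an export into `rucio-admin` command lines and does not contact CRIC or Rucio. The JSON layout in `SAMPLE_CRIC_EXPORT`, the RSE name `EXAMPLE-RSE`, the host `storage.example.org` and the function name `cric_to_rucio_commands` are all assumptions made for illustration; the real CRIC export format has to be checked against the CRIC instance in use. The `--domain-json` argument is left out for brevity.

```python
import json
import shlex
from typing import List

# Hypothetical export: one record per RSE. The real CRIC format may differ.
SAMPLE_CRIC_EXPORT = """
{
  "EXAMPLE-RSE": {
    "hostname": "storage.example.org",
    "scheme": "root",
    "port": 1094,
    "prefix": "//rucio",
    "impl": "rucio.rse.protocols.gfal.Default",
    "attributes": {"fts": "https://fts3-pilot.cern.ch:8446"}
  }
}
"""


def cric_to_rucio_commands(export: dict) -> List[str]:
    """Translate the (assumed) export layout into rucio-admin command lines."""
    commands = []
    for rse, cfg in sorted(export.items()):
        # Register the RSE itself.
        commands.append(shlex.join(["rucio-admin", "rse", "add", rse]))
        # Attach one protocol to it.
        commands.append(shlex.join([
            "rucio-admin", "rse", "add-protocol",
            "--hostname", cfg["hostname"],
            "--scheme", cfg["scheme"],
            "--prefix", cfg["prefix"],
            "--port", str(cfg["port"]),
            "--impl", cfg["impl"],
            rse,
        ]))
        # Set every attribute listed in the record.
        for key, value in sorted(cfg.get("attributes", {}).items()):
            commands.append(shlex.join([
                "rucio-admin", "rse", "set-attribute",
                "--rse", rse, "--key", key, "--value", str(value),
            ]))
    return commands


# Print the commands so they can be reviewed before anything is executed.
for line in cric_to_rucio_commands(json.loads(SAMPLE_CRIC_EXPORT)):
    print(line)
```

In a cron job the printed commands could be reviewed first and executed in a second step, which keeps a faulty export from silently changing the RSE configuration.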

Tasks

  1. cern-escape-infra cern-vre-infra component/rucio
    egazzarr garciagenrique
goseind changed the title from "EOSC CLUSTER STEP 5: RSEs and CRIC" to "Configure RSEs and CRIC in RUCIO" on Dec 16, 2022
goseind assigned garciagenrique and unassigned egazzarr on Jan 20, 2023
goseind added the labels cern-vre-infra, component/data-transfer and priority/critical on Jan 20, 2023

goseind commented Jan 23, 2023

@garciagenrique maybe we could deploy this for testing: https://github.com/goseind/k8s-tutorial/blob/master/xrd.yaml


goseind commented Jan 23, 2023

Update

The following commands have been executed already:

rucio-admin rse add XRD1
rucio-admin rse add XRD2
rucio-admin rse add XRD3
rucio-admin rse add-protocol --hostname xrd1 --scheme root --prefix //rucio --port 1094 --impl rucio.rse.protocols.gfal.Default --domain-json '{"wan": {"read": 1, "write": 1, "delete": 1, "third_party_copy_read": 1, "third_party_copy_write": 1}, "lan": {"read": 1, "write": 1, "delete": 1}}' XRD1
rucio-admin rse add-protocol --hostname xrd2 --scheme root --prefix //rucio --port 1094 --impl rucio.rse.protocols.gfal.Default --domain-json '{"wan": {"read": 1, "write": 1, "delete": 1, "third_party_copy_read": 1, "third_party_copy_write": 1}, "lan": {"read": 1, "write": 1, "delete": 1}}' XRD2
rucio-admin rse add-protocol --hostname xrd3 --scheme root --prefix //rucio --port 1094 --impl rucio.rse.protocols.gfal.Default --domain-json '{"wan": {"read": 1, "write": 1, "delete": 1, "third_party_copy_read": 1, "third_party_copy_write": 1}, "lan": {"read": 1, "write": 1, "delete": 1}}' XRD3
rucio-admin rse set-attribute --rse XRD1 --key fts --value https://fts3-pilot.cern.ch:8446
rucio-admin rse set-attribute --rse XRD2 --key fts --value https://fts3-pilot.cern.ch:8446
rucio-admin rse set-attribute --rse XRD3 --key fts --value https://fts3-pilot.cern.ch:8446
rucio-admin rse add-distance --distance 1 --ranking 1 XRD1 XRD2
rucio-admin rse add-distance --distance 1 --ranking 1 XRD1 XRD3
rucio-admin rse add-distance --distance 1 --ranking 1 XRD2 XRD1
rucio-admin rse add-distance --distance 1 --ranking 1 XRD2 XRD3
rucio-admin rse add-distance --distance 1 --ranking 1 XRD3 XRD1
rucio-admin rse add-distance --distance 1 --ranking 1 XRD3 XRD2
rucio-admin account set-limits root XRD1 -1
rucio-admin account set-limits root XRD2 -1
rucio-admin account set-limits root XRD3 -1
rucio-admin scope add --account root --scope test
dd if=/dev/urandom of=file1 bs=10M count=1
dd if=/dev/urandom of=file2 bs=10M count=1
dd if=/dev/urandom of=file3 bs=10M count=1
dd if=/dev/urandom of=file4 bs=10M count=1
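
The RSE commands above follow one pattern per RSE plus one `add-distance` call per ordered pair of RSEs. As a sketch, the same list can be generated in Python instead of being typed by hand. The function name `rse_setup_commands` is made up for this example, and the rule that the hostname is the lower-cased RSE name is an assumption that only mirrors the XRD1/xrd1 naming used here.

```python
import itertools
import json
import shlex
from typing import List

FTS_URL = "https://fts3-pilot.cern.ch:8446"
DOMAINS = {
    "wan": {"read": 1, "write": 1, "delete": 1,
            "third_party_copy_read": 1, "third_party_copy_write": 1},
    "lan": {"read": 1, "write": 1, "delete": 1},
}


def rse_setup_commands(rses: List[str], account: str = "root") -> List[List[str]]:
    """Build the argument lists for the rucio-admin calls shown in the comment."""
    cmds = []
    for rse in rses:
        cmds.append(["rucio-admin", "rse", "add", rse])
    for rse in rses:
        # Assumption: the storage hostname is the lower-cased RSE name.
        cmds.append(["rucio-admin", "rse", "add-protocol",
                     "--hostname", rse.lower(), "--scheme", "root",
                     "--prefix", "//rucio", "--port", "1094",
                     "--impl", "rucio.rse.protocols.gfal.Default",
                     "--domain-json", json.dumps(DOMAINS), rse])
    for rse in rses:
        cmds.append(["rucio-admin", "rse", "set-attribute",
                     "--rse", rse, "--key", "fts", "--value", FTS_URL])
    # One distance entry per ordered (source, destination) pair.
    for src, dst in itertools.permutations(rses, 2):
        cmds.append(["rucio-admin", "rse", "add-distance",
                     "--distance", "1", "--ranking", "1", src, dst])
    for rse in rses:
        cmds.append(["rucio-admin", "account", "set-limits", account, rse, "-1"])
    return cmds


# Print shell-quoted command lines for review; nothing is executed here.
for cmd in rse_setup_commands(["XRD1", "XRD2", "XRD3"]):
    print(shlex.join(cmd))
```

Keeping the commands as argument lists means they could later be passed to `subprocess.run` without going through a shell, which avoids quoting problems with the `--domain-json` value.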

⚠️ We created the xrd pods and secrets with our own certificate and key, but the pods crash-loop for a reason that is not yet known. The pods start normally when the dummy certificates are used.

garciagenrique commented

Two RSEs have been added to the CERN-VRE cluster: EULAKE-1 and LAPP-DCACHE.

We followed this documentation: https://github.com/cern-vre/cern-vre/wiki/Components#adding-rses-to-the-rucio-instance

garciagenrique commented

Uploads to EULAKE-1 work fine, but we receive the following warning and do not yet know why it appears. The Rucio team has been contacted.

/root/envs-virtualenv/rucio-cli/lib/python3.6/site-packages/urllib3/connectionpool.py:1050: InsecureRequestWarning: Unverified HTTPS request is being made to host 'vre-rucio.cern.ch'. Adding certificate verification is strongly advised. See: https://urllib3.readthedocs.io/en/1.26.x/advanced-usage.html#ssl-warnings
  InsecureRequestWarning,
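
urllib3 emits this warning when an HTTPS request is sent without certificate verification. One way to find out which call triggers it is to turn the warning into an exception, so that a full traceback is printed. The sketch below uses only the standard `warnings` module and matches on the message text. The helper name `raise_on_unverified_https` is made up for this example, and the demonstration uses a stand-in warning rather than a real request.

```python
import warnings


def raise_on_unverified_https() -> None:
    """Turn the 'Unverified HTTPS request' warning into an exception."""
    # The message argument is a regex matched against the start of the
    # warning text, so urllib3 does not need to be imported here.
    warnings.filterwarnings("error", message="Unverified HTTPS request")


# Demonstration with a stand-in warning. In a real session the filter would
# be installed before the Rucio client makes its first request.
raise_on_unverified_https()
try:
    warnings.warn("Unverified HTTPS request is being made to host 'vre-rucio.cern.ch'.")
except UserWarning as exc:
    print("caught:", exc)
```

With the filter in place, the traceback of the raised exception should point at the client call that disabled verification, which can help narrow down where the certificate configuration is missing.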

goseind changed the title from "Configure RSEs and CRIC in RUCIO" to "Configure RSEs in Rucio" on Feb 6, 2023
garciagenrique commented

It seems that the warning above also appeared in the escape-gitops cluster.

garciagenrique commented

  • @egazzarr configured CNAF-STORM in the VRE.

  • CESNET-S3 returns the following error after the DB was upgraded:

$ rucio-admin rse info CESNET-S3
Database exception.
Details: An unknown Database Exception has ocurred.
Rucio exited with an unexpected/unknown error, please provide the traceback below to the developers.
Traceback (most recent call last):
  File "/root/envs-virtualenv/rucio-cli/bin/rucio-admin", line 97, in new_funct
    return function(*args, **kwargs)
  File "/root/envs-virtualenv/rucio-cli/bin/rucio-admin", line 530, in info_rse
    rseinfo = client.get_rse(rse=args.rse)
  File "/root/envs-virtualenv/rucio-cli/lib/python3.6/site-packages/rucio/client/rseclient.py", line 50, in get_rse
    raise exc_cls(exc_msg)
rucio.common.exception.DatabaseException: Database exception.
Details: An unknown Database Exception has ocurred.
  • We tried erasing all replicas and files within the RSE:
$ rucio list-rules --account=<ACCOUNT_NAME>
$ rucio delete-rule <RULE_ID>
  • However, some files are still there:
# Connect to the PSQL DB
rucio=> SELECT * FROM replicas WHERE rse_id = '<RSE_ID>';

# To get the RSE ID, for example:
$ rucio list-rse-usage <RSE_NAME>
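
The list-rules and delete-rule steps above can also be scripted. The sketch below takes a client object and, in dry-run mode, only reports the rules it would delete. The method names `list_account_rules` and `delete_replication_rule` and the rule keys `id` and `rse_expression` are assumed to match the Rucio Python client; the helper name `delete_rules_on_rse` is made up for this example.

```python
from typing import Any, List


def delete_rules_on_rse(client: Any, account: str, rse: str,
                        dry_run: bool = True) -> List[str]:
    """Return the IDs of the account's rules whose RSE expression is exactly
    `rse`, and delete those rules unless dry_run is set."""
    matched = []
    for rule in client.list_account_rules(account):
        if rule["rse_expression"] == rse:
            matched.append(rule["id"])
            if not dry_run:
                client.delete_replication_rule(rule["id"])
    return matched


# Possible usage against a real instance (not executed here):
#   from rucio.client import Client
#   print(delete_rules_on_rse(Client(), "root", "CESNET-S3"))
```

Only rules whose expression is exactly the RSE name are matched, so rules written with a broader RSE expression would still need to be checked by hand.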

The issue is being investigated.


goseind commented Mar 24, 2023

Current status:

garciagenrique commented

IMO this script should be removed, because it might lead to a wrong RSE configuration.
If everybody agrees, we could just add a section to the Wiki explaining the main attributes and points to be configured, without proposing a template.


goseind commented Apr 19, 2023

CRIC will not be used at the moment. New RSE endpoints will be added as they come in.

goseind closed this as completed on Apr 19, 2023
garciagenrique added a commit that referenced this issue May 9, 2023
* rename script file containing typo: cs for sc (StorageClass) - otherwise misleading

* remove script, as described in vre issue #16 - https://github.com/vre-hub/vre/issues/16#issuecomment-1511277938

* comment out diagram CI script - not running properly as it tries to push directly to main